commits

A customer reported that one of their applications started failing to
open files with STATUS_INSUFFICIENT_RESOURCES because the NetApp server
hit the maximum number of opens of the same file that it allows for a
single client connection.

It turned out the client was failing to reuse open handles with
deferred closes: matching ->f_flags directly, without first masking off
the O_CREAT|O_EXCL|O_TRUNC bits, broke the comparison, and the client
ended up with thousands of deferred closes to the same file. Those
bits are already satisfied by the original open, so there is no need to
check them against existing open handles.
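
The comparison described above can be illustrated with a minimal
userspace sketch. This is not the kernel code: the helper name
flags_match_for_reuse() and the CREATE_ONLY_FLAGS macro are made up for
this example, and only the three masked bits are taken from the
description.

```c
#include <fcntl.h>
#include <stdbool.h>

/* Bits that only matter to the open() call that creates the file. */
#define CREATE_ONLY_FLAGS (O_CREAT | O_EXCL | O_TRUNC)

/*
 * Hypothetical helper: returns true if a cached handle that was opened
 * with cached_flags can serve a new open() asking for want_flags.
 * Both sides are masked before comparing, so a leftover O_CREAT,
 * O_EXCL or O_TRUNC on either side does not defeat the match.
 */
static bool flags_match_for_reuse(unsigned int cached_flags,
				  unsigned int want_flags)
{
	return (cached_flags & ~CREATE_ONLY_FLAGS) ==
	       (want_flags & ~CREATE_ONLY_FLAGS);
}
```

With this helper, a handle cached with O_WRONLY|O_APPEND matches a new
open with O_WRONLY|O_CREAT|O_APPEND, whereas a direct equality test on
the raw flag words would reject it.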

Reproducer:

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>
#include <fcntl.h>
#include <pthread.h>

#define NR_THREADS 4
#define NR_ITERATIONS 2500
#define TEST_FILE "/mnt/1/test/dir/foo"

static char buf[64];

/* Each worker repeatedly opens, appends to, and closes the same file. */
static void *worker(void *arg)
{
	int i, j;
	int fd;

	for (i = 0; i < NR_ITERATIONS; i++) {
		fd = open(TEST_FILE, O_WRONLY|O_CREAT|O_APPEND, 0666);
		for (j = 0; j < 16; j++)
			write(fd, buf, sizeof(buf));
		close(fd);
	}
	return NULL;
}

int main(int argc, char *argv[])
{
	pthread_t t[NR_THREADS];
	int fd;
	int i;

	/* Create (or truncate) the test file before the workers start. */
	fd = open(TEST_FILE, O_WRONLY|O_CREAT|O_TRUNC, 0666);
	close(fd);
	memset(buf, 'a', sizeof(buf));
	for (i = 0; i < NR_THREADS; i++)
		pthread_create(&t[i], NULL, worker, NULL);
	for (i = 0; i < NR_THREADS; i++)
		pthread_join(t[i], NULL);
	return 0;
}

Before patch:

$ mount.cifs //srv/share /mnt/1 -o ...
$ mkdir -p /mnt/1/test/dir
$ gcc repro.c && ./a.out
...
number of opens: 1391

After patch:

$ mount.cifs //srv/share /mnt/1 -o ...
$ mkdir -p /mnt/1/test/dir
$ gcc repro.c && ./a.out
...
number of opens: 1

Cc: linux-cifs@vger.kernel.org
Cc: David Howells <dhowells@redhat.com>
Cc: Jay Shin <jaeshin@redhat.com>
Cc: Pierguido Lambri <plambri@redhat.com>
Fixes: b8ea3b1ff544 ("smb: enable reuse of deferred file handles for write operations")
Acked-by: Shyam Prasad N <sprasad@microsoft.com>
Signed-off-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
Signed-off-by: Steve French <stfrench@microsoft.com>

7mo ago

Linus Torvalds

19272b37

Linux 6.16-rc1 v6.16-rc1

7mo ago

Linus Torvalds

6d13760e

Merge tag 'io_uring-6.16-20250614' of git://git.kernel.dk/linux

7mo ago

Jens Axboe

9ce6c987

nvme: always punt polled uring_cmd end_io work to task_work

7mo ago

Philipp Kerling

93310053

smb: client: disable path remapping with POSIX extensions

7mo ago

Linus Torvalds

939f15e6

Merge tag 'turbostat-2025.06.08' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux

7mo ago

Linus Torvalds

588adb24

Merge tag 'rust-fixes-6.16' of git://git.kernel.org/pub/scm/linux/kernel/git/ojeda/linux

7mo ago

Jens Axboe

b62e0efd

io_uring: run local task_work from ring exit IOPOLL reaping

7mo ago

Bagas Sanjaya

db3dfae1

Documentation: ublk: Separate UBLK_F_AUTO_BUF_REG fallback behavior sublists

7mo ago

Linus Torvalds

be54f8c5

Merge tag 'timers-cleanups-2025-06-08' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7mo ago

Len Brown

42fd37dc

tools/power turbostat: version 2025.06.08

7mo ago

Linus Torvalds

27b9989b

Merge tag 'mm-hotfixes-stable-2025-06-13-21-56' of git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm

7mo ago

FUJITA Tomonori

5b2d595e

rust: time: Fix compile error in impl_has_hr_timer macro

7mo ago

Jens Axboe

26ec15e4

io_uring/kbuf: don't truncate end buffer for multiple buffer peeks

7mo ago

Matthew Wilcox (Oracle)

5e223e06

block: Fix bvec_set_folio() for very large folios

7mo ago

Linus Torvalds

0529ef8c

Merge tag 'x86-urgent-2025-06-08' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7mo ago

Ingo Molnar

41cb0855

treewide, timers: Rename from_timer() to timer_container_of()

7mo ago

Zhang Rui

d8c0f5d9

tools/power turbostat: Add initial support for BartlettLake

7mo ago

Linus Torvalds

4774cfe3

Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi

7mo ago

Lorenzo Stoakes

bb666b7c

mm: add mmap_prepare() compatibility layer for nested file systems

Nested file systems, that is those which invoke call_mmap() within their
own f_op->mmap() handlers, may encounter underlying file systems which
provide the f_op->mmap_prepare() hook introduced by commit c84bf6dd2b83
("mm: introduce new .mmap_prepare() file callback").

We have a chicken-and-egg scenario here - until all file systems are
converted to using .mmap_prepare(), we cannot convert these nested
handlers, as we can't call f_op->mmap from an .mmap_prepare() hook.

So we have to do it the other way round - invoke the .mmap_prepare() hook
from an .mmap() one.

In order to do so, we need to convert VMA state into a struct vm_area_desc
descriptor, invoke the underlying file system's f_op->mmap_prepare()
callback with a pointer to it, and then set VMA state accordingly
and safely.

This patch achieves this via the compat_vma_mmap_prepare() function, which
we invoke from call_mmap() if f_op->mmap_prepare() is specified in the
passed in file pointer.
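
The conversion steps described above can be modelled in plain C. The
following is a simplified sketch with made-up types, not the kernel
implementation: struct model_vma, struct model_vma_desc, struct
model_file_ops, the model_*() functions and MODEL_VM_EXAMPLE are all
hypothetical stand-ins, and the real structures carry many more fields.

```c
#include <stddef.h>

/* Simplified stand-ins for the kernel types; all names are hypothetical. */
struct model_vma {
	unsigned long start, end;
	unsigned long flags;
	void *private_data;
};

struct model_vma_desc {
	unsigned long start, end;
	unsigned long flags;
	void *private_data;
};

struct model_file_ops {
	int (*mmap)(struct model_vma *vma);               /* legacy hook */
	int (*mmap_prepare)(struct model_vma_desc *desc); /* newer hook */
};

/* VMA -> descriptor, call the prepare hook, descriptor -> VMA. */
static int model_compat_mmap_prepare(const struct model_file_ops *ops,
				     struct model_vma *vma)
{
	struct model_vma_desc desc = {
		.start = vma->start,
		.end = vma->end,
		.flags = vma->flags,
		.private_data = vma->private_data,
	};
	int err = ops->mmap_prepare(&desc);

	if (err)
		return err;	/* leave the VMA untouched on failure */

	/* Copy back only the fields this model lets the hook change. */
	vma->flags = desc.flags;
	vma->private_data = desc.private_data;
	return 0;
}

/* Dispatch: prefer the prepare hook when the file provides one. */
static int model_call_mmap(const struct model_file_ops *ops,
			   struct model_vma *vma)
{
	if (ops->mmap_prepare)
		return model_compat_mmap_prepare(ops, vma);
	return ops->mmap(vma);
}

#define MODEL_VM_EXAMPLE 0x1UL

/* Example underlying file system hook that sets one flag bit. */
static int example_mmap_prepare(struct model_vma_desc *desc)
{
	desc->flags |= MODEL_VM_EXAMPLE;
	return 0;
}
```

A nested file system in this model would simply call model_call_mmap()
from its own mmap handler and would not need to know which of the two
hooks the underlying file system implements.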

We place the fundamental logic into mm/vma.h where VMA manipulation
belongs. We also update the VMA userland tests to accommodate the
changes.

The compat_vma_mmap_prepare() function and its associated machinery is
temporary, and will be removed once the conversion of file systems is
complete.

We carefully place this code so it can be used with CONFIG_MMU and also
with cutting edge nommu silicon.

[akpm@linux-foundation.org: export compat_vma_mmap_prepare to fix build]
[lorenzo.stoakes@oracle.com: remove unused declarations]
Link: https://lkml.kernel.org/r/ac3ae324-4c65-432a-8c6d-2af988b18ac8@lucifer.local
Link: https://lkml.kernel.org/r/20250609165749.344976-1-lorenzo.stoakes@oracle.com
Fixes: c84bf6dd2b83 ("mm: introduce new .mmap_prepare() file callback").
Signed-off-by: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
Reported-by: Jann Horn <jannh@google.com>
Closes: https://lore.kernel.org/linux-mm/CAG48ez04yOEVx1ekzOChARDDBZzAKwet8PEoPM4Ln3_rk91AzQ@mail.gmail.com/
Reviewed-by: Pedro Falcato <pfalcato@suse.de>
Reviewed-by: Vlastimil Babka <vbabka@suse.cz>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Christian Brauner <brauner@kernel.org>
Cc: Jan Kara <jack@suse.cz>
Cc: Jann Horn <jannh@google.com>
Cc: Liam Howlett <liam.howlett@oracle.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>

7mo ago

Keith Busch

c538f400

io_uring: consistently use rcu semantics with sqpoll thread

7mo ago

Matthew Wilcox (Oracle)

f826ec79

bio: Fix bio_first_folio() for SPARSEMEM without VMEMMAP

7mo ago

Linus Torvalds

4710eacf

Merge tag 'timers-urgent-2025-06-08' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7mo ago

Zeng Heng

dd2922dc

fs/resctrl: Restore the rdt_last_cmd_clear() calls after acquiring rdtgroup_mutex

7mo ago

Linus Torvalds

8630c59e

Merge tag 'kbuild-v6.16' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

7mo ago

Zhang Rui

83075bd5

tools/power turbostat: Add initial support for DMR

7mo ago

Linus Torvalds

25294cb8

Merge tag 'drm-fixes-2025-06-14' of https://gitlab.freedesktop.org/drm/kernel

7mo ago

Rajashekhar M A

5c3ba819

scsi: error: alua: I/O errors for ALUA state transitions

7mo ago

Huacai Chen

66ac1a4d

init: fix build warnings about export.h

7mo ago

Penglei Jiang

ac0b8b32

io_uring: fix use-after-free of sq->thread in __io_uring_show_fdinfo()

syzbot reports:

BUG: KASAN: slab-use-after-free in getrusage+0x1109/0x1a60
Read of size 8 at addr ffff88810de2d2c8 by task a.out/304

CPU: 0 UID: 0 PID: 304 Comm: a.out Not tainted 6.16.0-rc1 #1 PREEMPT(voluntary)
Hardware name: QEMU Ubuntu 24.04 PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
Call Trace:
<TASK>
dump_stack_lvl+0x53/0x70
print_report+0xd0/0x670
? __pfx__raw_spin_lock_irqsave+0x10/0x10
? getrusage+0x1109/0x1a60
kasan_report+0xce/0x100
? getrusage+0x1109/0x1a60
getrusage+0x1109/0x1a60
? __pfx_getrusage+0x10/0x10
__io_uring_show_fdinfo+0x9fe/0x1790
? ksys_read+0xf7/0x1c0
? do_syscall_64+0xa4/0x260
? vsnprintf+0x591/0x1100
? __pfx___io_uring_show_fdinfo+0x10/0x10
? __pfx_vsnprintf+0x10/0x10
? mutex_trylock+0xcf/0x130
? __pfx_mutex_trylock+0x10/0x10
? __pfx_show_fd_locks+0x10/0x10
? io_uring_show_fdinfo+0x57/0x80
io_uring_show_fdinfo+0x57/0x80
seq_show+0x38c/0x690
seq_read_iter+0x3f7/0x1180
? inode_set_ctime_current+0x160/0x4b0
seq_read+0x271/0x3e0
? __pfx_seq_read+0x10/0x10
? __pfx__raw_spin_lock+0x10/0x10
? __mark_inode_dirty+0x402/0x810
? selinux_file_permission+0x368/0x500
? file_update_time+0x10f/0x160
vfs_read+0x177/0xa40
? __pfx___handle_mm_fault+0x10/0x10
? __pfx_vfs_read+0x10/0x10
? mutex_lock+0x81/0xe0
? __pfx_mutex_lock+0x10/0x10
? fdget_pos+0x24d/0x4b0
ksys_read+0xf7/0x1c0
? __pfx_ksys_read+0x10/0x10
? do_user_addr_fault+0x43b/0x9c0
do_syscall_64+0xa4/0x260
entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7f0f74170fc9
Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 8
RSP: 002b:00007fffece049e8 EFLAGS: 00000206 ORIG_RAX: 0000000000000000
RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f0f74170fc9
RDX: 0000000000001000 RSI: 00007fffece049f0 RDI: 0000000000000004
RBP: 00007fffece05ad0 R08: 0000000000000000 R09: 00007fffece04d90
R10: 0000000000000000 R11: 0000000000000206 R12: 00005651720a1100
R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
</TASK>

Allocated by task 298:
kasan_save_stack+0x33/0x60
kasan_save_track+0x14/0x30
__kasan_slab_alloc+0x6e/0x70
kmem_cache_alloc_node_noprof+0xe8/0x330
copy_process+0x376/0x5e00
create_io_thread+0xab/0xf0
io_sq_offload_create+0x9ed/0xf20
io_uring_setup+0x12b0/0x1cc0
do_syscall_64+0xa4/0x260
entry_SYSCALL_64_after_hwframe+0x77/0x7f

Freed by task 22:
kasan_save_stack+0x33/0x60
kasan_save_track+0x14/0x30
kasan_save_free_info+0x3b/0x60
__kasan_slab_free+0x37/0x50
kmem_cache_free+0xc4/0x360
rcu_core+0x5ff/0x19f0
handle_softirqs+0x18c/0x530
run_ksoftirqd+0x20/0x30
smpboot_thread_fn+0x287/0x6c0
kthread+0x30d/0x630
ret_from_fork+0xef/0x1a0
ret_from_fork_asm+0x1a/0x30

Last potentially related work creation:
kasan_save_stack+0x33/0x60
kasan_record_aux_stack+0x8c/0xa0
__call_rcu_common.constprop.0+0x68/0x940
__schedule+0xff2/0x2930
__cond_resched+0x4c/0x80
mutex_lock+0x5c/0xe0
io_uring_del_tctx_node+0xe1/0x2b0
io_uring_clean_tctx+0xb7/0x160
io_uring_cancel_generic+0x34e/0x760
do_exit+0x240/0x2350
do_group_exit+0xab/0x220
__x64_sys_exit_group+0x39/0x40
x64_sys_call+0x1243/0x1840
do_syscall_64+0xa4/0x260
entry_SYSCALL_64_after_hwframe+0x77/0x7f

The buggy address belongs to the object at ffff88810de2cb00
which belongs to the cache task_struct of size 3712
The buggy address is located 1992 bytes inside of
freed 3712-byte region [ffff88810de2cb00, ffff88810de2d980)

This is caused by the task_struct pointed to by sq->thread being
released while it is still in use in __io_uring_show_fdinfo(). Holding
ctx->uring_lock does not prevent the release or exit of sq->thread.

Fix this by assigning and looking up ->thread under RCU, and grabbing a
reference to the task_struct. This ensures that it cannot get released
while fdinfo is using it.
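
The idea of taking a reference while the pointer is still protected can
be shown with a userspace analogy. This sketch is an assumption-laden
model, not the io_uring code: a pthread mutex stands in for the RCU
read-side section, a plain counter stands in for the task_struct
reference count, and struct task, struct sq_data, sq_get_thread(),
task_put() and sq_thread_exit() are hypothetical names.

```c
#include <pthread.h>
#include <stdlib.h>

/* Stand-in for task_struct: a reference count plus some payload. */
struct task {
	int refs;
	long utime;
};

struct sq_data {
	pthread_mutex_t lock;	/* stands in for the RCU read-side section */
	struct task *thread;	/* cleared when the poller thread exits */
};

/* Look up ->thread and pin it before leaving the protected section. */
static struct task *sq_get_thread(struct sq_data *sq)
{
	struct task *t;

	pthread_mutex_lock(&sq->lock);
	t = sq->thread;
	if (t)
		t->refs++;
	pthread_mutex_unlock(&sq->lock);
	return t;
}

/* Drop one reference; the object is freed when the last one goes away. */
static void task_put(struct sq_data *sq, struct task *t)
{
	int last;

	pthread_mutex_lock(&sq->lock);
	last = (--t->refs == 0);
	pthread_mutex_unlock(&sq->lock);
	if (last)
		free(t);
}

/* Exit path: unpublish the pointer, then drop the owner's reference. */
static void sq_thread_exit(struct sq_data *sq)
{
	struct task *t;

	pthread_mutex_lock(&sq->lock);
	t = sq->thread;
	sq->thread = NULL;
	pthread_mutex_unlock(&sq->lock);
	if (t)
		task_put(sq, t);
}
```

A reader in this model calls sq_get_thread(), reads the fields it needs,
and then calls task_put(). Even if sq_thread_exit() runs in between, the
object stays valid until the reader drops its reference.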

Reported-by: syzbot+531502bbbe51d2f769f4@syzkaller.appspotmail.com
Closes: https://lore.kernel.org/all/682b06a5.a70a0220.3849cf.00b3.GAE@google.com
Fixes: 3fcb9d17206e ("io_uring/sqpoll: statistics of the true utilization of sq threads")
Signed-off-by: Penglei Jiang <superman.xpt@gmail.com>
Link: https://lore.kernel.org/r/20250610171801.70960-1-superman.xpt@gmail.com
[axboe: massage commit message]
Signed-off-by: Jens Axboe <axboe@kernel.dk>

7mo ago

Jens Axboe

961296e8

block: use plug request list tail for one-shot backmerge attempt

7mo ago

Linus Torvalds

d9864e7d

Merge tag 'perf-urgent-2025-06-08' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7mo ago

Herbert Xu

434d7f9b

timens: Add struct seq_file forward declaration

7mo ago

Thomas Gleixner

8b68e978

x86/iopl: Cure TIF_IO_BITMAP inconsistencies

7mo ago

Linus Torvalds

b3154a6f

Merge tag 'sh-for-v6.16-tag1' of git://git.kernel.org/pub/scm/linux/kernel/git/glaubitz/sh-linux

7mo ago

Petr Pavlu

c50a04f8

genksyms: Fix enum consts from a reference affecting new values

7mo ago

Zhang Rui

2a535d6c

tools/power turbostat: Dump RAPL sysfs info

7mo ago

Linus Torvalds

18531f4d

Merge tag 'acpi-6.16-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm

7mo ago

Dave Airlie

1364af9c

Merge tag 'drm-misc-fixes-2025-06-12' of https://gitlab.freedesktop.org/drm/misc/kernel into drm-fixes

7mo ago

Dexuan Cui

b2f96656

scsi: storvsc: Increase the timeouts to storvsc_timeout

7mo ago

Barry Song

02fb3650

MAINTAINERS: add Barry as a THP reviewer

7mo ago

Jens Axboe

079afb08

io_uring/futex: mark wait requests as inflight

7mo ago

Christoph Hellwig

cf625013

block: don't use submit_bio_noacct_nocheck in blk_zone_wplug_bio_work

7mo ago

Linus Torvalds

70b7d651

Merge tag 'irq-urgent-2025-06-08' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

7mo ago

Dapeng Mi

86aa94cd

perf/x86/intel: Fix incorrect MSR index calculations in intel_pmu_config_acr()

7mo ago

Linus Torvalds

5abc7438

Merge tag 'nfs-for-6.16-1' of git://git.linux-nfs.org/projects/anna/linux-nfs

Pull NFS client updates from Anna Schumaker:
"New Features:

- Implement the Sunrpc rfc2203 rpcsec_gss sequence number cache

- Add support for FALLOC_FL_ZERO_RANGE on NFS v4.2

- Add a localio sysfs attribute

Stable Fixes:

- Fix double-unlock bug in nfs_return_empty_folio()

- Don't check for OPEN feature support in v4.1

- Always probe for LOCALIO support asynchronously

- Prevent hang on NFS mounts with xprtsec=[m]tls

Other Bugfixes:

- xattr handlers should check for absent nfs filehandles

- Fix setattr caching of TIME_[MODIFY|ACCESS]_SET when timestamps are
delegated

- Fix listxattr to return selinux security labels

- Connect to NFSv3 DS using TLS if MDS connection uses TLS

- Clear SB_RDONLY before getting a superblock, and ignore when
remounting

- Fix incorrect handling of NFS error codes in nfs4_do_mkdir()

- Various nfs_localio fixes from Neil Brown that include fixing an
rcu compilation error found by older gcc versions.

- Update stats on flexfiles pNFS DSes when receiving NFS4ERR_DELAY

Cleanups:

- Add a refcount tracker for struct net in the nfs_client

- Allow FREE_STATEID to clean up delegations

- Always set NLINK even if the server doesn't support it

- Cleanups to the NFS folio writeback code

- Remove dead code from xs_tcp_tls_setup_socket()"

* tag 'nfs-for-6.16-1' of git://git.linux-nfs.org/projects/anna/linux-nfs: (30 commits)
flexfiles/pNFS: update stats on NFS4ERR_DELAY for v4.1 DSes
nfs_localio: change nfsd_file_put_local() to take a pointer to __rcu pointer
nfs_localio: protect race between nfs_uuid_put() and nfs_close_local_fh()
nfs_localio: duplicate nfs_close_local_fh()
nfs_localio: simplify interface to nfsd for getting nfsd_file
nfs_localio: always hold nfsd net ref with nfsd_file ref
nfs_localio: use cmpxchg() to install new nfs_file_localio
SUNRPC: Remove dead code from xs_tcp_tls_setup_socket()
SUNRPC: Prevent hang on NFS mount with xprtsec=[m]tls
nfs: fix incorrect handling of large-number NFS errors in nfs4_do_mkdir()
nfs: ignore SB_RDONLY when remounting nfs
nfs: clear SB_RDONLY before getting superblock
NFS: always probe for LOCALIO support asynchronously
pnfs/flexfiles: connect to NFSv3 DS using TLS if MDS connection uses TLS
NFS: add localio to sysfs
nfs: use writeback_iter directly
nfs: refactor nfs_do_writepage
nfs: don't return AOP_WRITEPAGE_ACTIVATE from nfs_do_writepage
nfs: fold nfs_page_async_flush into nfs_do_writepage
NFSv4: Always set NLINK even if the server doesn't support it
...

7mo ago

Steven Rostedt

99850a1c

x86/fpu: Remove unused trace events

7mo ago

Linus Torvalds

b7191581

Merge tag 'loongarch-6.16' of git://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai/linux-loongson

7mo ago

Mike Rapoport

8a368260

sh: kprobes: Remove unused variables in kprobe_exceptions_notify()

7mo ago

Masahiro Yamada

e21efe83

arch: use always-$(KBUILD_BUILTIN) for vmlinux.lds

7mo ago

Zhang Rui

69078520

tools/power turbostat: Avoid probing the same perf counters

7mo ago

Linux 6.16-rc2 v6.16-rc2

e04c78d8

Linus Torvalds

7mo

Merge tag 'kbuild-fixes-v6.16' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

08215f54

Linus Torvalds

7mo

Merge tag 'v6.16-rc1-smb3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6

8c6bc74c

Linus Torvalds

7mo

gendwarfksyms: Fix structure type overrides

2f6b47b2

Sami Tolvanen

7mo

Merge tag 'iommu-fixes-v6.16-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/iommu/linux

ac91b4de

Linus Torvalds

7mo

smb: improve directory cache reuse for readdir operations

72dd7961

Bharath SM

7mo

kbuild: move warnings about linux/export.h from W=1 to W=2

a6a7946b

Masahiro Yamada

7mo

Merge tag 'block-6.16-20250614' of git://git.kernel.dk/linux

f713ffa3

Linus Torvalds

7mo

iommu/tegra: Fix incorrect size calculation

f9705d66

Jason Gunthorpe

7mo

smb: client: fix perf regression with deferred closes

b64af6bc

Paulo Alcantara

7mo

x86/iopl: Cure TIF_IO_BITMAP inconsistencies

io_bitmap_exit() is invoked from exit_thread() when a task exits or
when a fork fails. In the latter case exit_thread() cleans up
resources which were allocated during fork().

io_bitmap_exit() invokes task_update_io_bitmap(), which in turn ends up
in tss_update_io_bitmap(). tss_update_io_bitmap() operates on the
current task. If current has TIF_IO_BITMAP set, but no bitmap installed,
tss_update_io_bitmap() crashes with a NULL pointer dereference.

There are two issues, which lead to that problem:

1) io_bitmap_exit() should not invoke task_update_io_bitmap() when
the task, which is cleaned up, is not the current task. That's a
clear indicator for a cleanup after a failed fork().

2) A task should not have TIF_IO_BITMAP set and neither a bitmap
installed nor IOPL emulation level 3 activated.

This happens when a kernel thread is created in the context of
a user space thread, which has TIF_IO_BITMAP set as the thread
flags are copied and the IO bitmap pointer is cleared.

Other than in the failed fork() case this has no impact because
kernel threads including IO workers never return to user space and
therefore never invoke tss_update_io_bitmap().

Cure this by adding the missing cleanups and checks:

1) Prevent io_bitmap_exit() from invoking task_update_io_bitmap() if
the task being cleaned up is not the current task.

2) Clear TIF_IO_BITMAP in copy_thread() unconditionally. For user
space forks it is set later, when the IO bitmap is inherited in
io_bitmap_share().

For paranoia's sake, add a warning to tss_update_io_bitmap() to catch
the case where that code is invoked with inconsistent state.
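
The two fixes and the consistency rule can be expressed as a small
model in plain C. This is only a sketch of the logic as described in
this message, not the x86 code: struct task_model and the three helper
functions are hypothetical, and the later re-setting of the flag by
io_bitmap_share() for user space forks is left out.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-task state relevant to this description. */
struct task_model {
	bool tif_io_bitmap;	/* models the TIF_IO_BITMAP thread flag */
	void *io_bitmap;	/* installed bitmap, or NULL */
	int iopl_emul;		/* IOPL emulation level */
};

/* Fix 1: only touch the TSS when cleaning up the current task. */
static bool should_update_tss(const struct task_model *tsk,
			      const struct task_model *current_task)
{
	return tsk == current_task;
}

/*
 * Fix 2: when setting up a new thread, the flags are copied and the
 * bitmap pointer is cleared, so the flag is cleared unconditionally too.
 */
static void copy_thread_model(struct task_model *child,
			      const struct task_model *parent)
{
	*child = *parent;
	child->io_bitmap = NULL;
	child->tif_io_bitmap = false;
}

/*
 * Consistency rule behind the added warning: the flag must not be set
 * unless a bitmap is installed or IOPL emulation level 3 is active.
 */
static bool io_bitmap_state_consistent(const struct task_model *t)
{
	return !t->tif_io_bitmap || t->io_bitmap != NULL || t->iopl_emul == 3;
}
```

In this model, a task with the flag set, no bitmap and an emulation
level other than 3 is exactly the inconsistent state that the crash
description refers to.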

Fixes: ea5f1cd7ab49 ("x86/ioperm: Remove bitmap if all permissions dropped")
Reported-by: syzbot+e2b1803445d236442e54@syzkaller.appspotmail.com
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
Cc: stable@vger.kernel.org
Link: https://lore.kernel.org/87wmdceom2.ffs@tglx