tjh.dev/kernel at 119e1ef80ecfe0d1deb6378d4ab41f5b71519de1

mntput_no_expire() does the calculation of total refcount under mount_lock;
unfortunately, the decrement (as well as all increments) are done outside
of it, leading to false positives in the "are we dropping the last reference"
test. Consider the following situation:
* mnt is a lazy-umounted mount, kept alive by two opened files. One
of those files gets closed. Total refcount of mnt is 2. On CPU 42
mntput(mnt) (called from __fput()) drops one reference, decrementing component
* After it has looked at component #0, the process on CPU 0 does
mntget(), incrementing component #0, gets preempted and gets to run again -
on CPU 69. There it does mntput(), which drops the reference (component #69)
and proceeds to spin on mount_lock.
* On CPU 42 our first mntput() finishes counting. It observes the
decrement of component #69, but not the increment of component #0. As the
result, the total it gets is not 1 as it should've been - it's 0. At which
point we decide that vfsmount needs to be killed and proceed to free it and
shut the filesystem down. However, there's still another opened file
on that filesystem, with reference to (now freed) vfsmount, etc. and we are
screwed.

It's not a wide race, but it can be reproduced with artificial slowdown of
the mnt_get_count() loop, and it should be easier to hit on SMP KVM setups.

Fix consists of moving the refcount decrement under mount_lock; the tricky
part is that we want (and can) keep the fast case (i.e. mount that still
has non-NULL ->mnt_ns) entirely out of mount_lock. All places that zero
mnt->mnt_ns are dropping some reference to mnt and they call synchronize_rcu()
before that mntput(). IOW, if mntput() observes (under rcu_read_lock())
a non-NULL ->mnt_ns, it is guaranteed that there is another reference yet to
be dropped.

Reported-by: Jann Horn <jannh@google.com>
Tested-by: Jann Horn <jannh@google.com>
Fixes: 48a066e72d97 ("RCU'd vsfmounts")
Cc: stable@vger.kernel.org
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

9ea0a46c

Al Viro

7 years ago

root dentries need RCU-delayed freeing

90bad5e0

Al Viro

7 years ago

aio: don't expose __aio_sigset in uapi

9ba546c0

Christoph Hellwig

7 years ago

ocxlflash_getfile(): fix double-iput() on alloc_file() failures

c7e9075f

Al Viro

7 years ago

cxl_getfile(): fix double-iput() on alloc_file() failures

d202797f

Al Viro

7 years ago

drm_mode_create_lease_ioctl(): fix open-coded filp_clone_open()

b4e7a7a8

Al Viro

7 years ago

proc: add proc_seq_release

877f919e

Chunyu Hu

7 years ago

Linux 4.18-rc1

ce397d21

Linus Torvalds

7 years ago

v4.18-rc1

Merge tag 'for-linus-20180616' of git://git.kernel.dk/linux-block

265c5596

Linus Torvalds

7 years ago

Merge tag 'docs-broken-links' of git://linuxtv.org/mchehab/experimental

5e7b9212

Linus Torvalds

7 years ago

bsg: fix race of bsg_open and bsg_unregister

d6c73964

Anatoliy Glagolev

7 years ago

Merge tag 'fsnotify_for_v4.18-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jack/linux-fs

dbb2816f

Linus Torvalds

7 years ago

fix a series of Documentation/ broken file name references

44348e8a

Mauro Carvalho Chehab

7 years ago

branches 3

master 22 hours ago default

compare

nocache-cleanup 3 weeks ago

compare

for-next 1 year ago

compare

tags 928

v7.1-rc1

22 hours ago latest

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the Restructured Text markup notation.
See Documentation/00-INDEX for a list of what is contained in each file.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result by upgrading your kernel.

Configure Feed

Configure Feed

Clone this repository