Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

ubi: Fix race condition between ctrl_cdev_ioctl and ubi_cdev_ioctl

Hulk Robot reported a KASAN report about use-after-free:
==================================================================
BUG: KASAN: use-after-free in __list_del_entry_valid+0x13d/0x160
Read of size 8 at addr ffff888035e37d98 by task ubiattach/1385
[...]
Call Trace:
klist_dec_and_del+0xa7/0x4a0
klist_put+0xc7/0x1a0
device_del+0x4d4/0xed0
cdev_device_del+0x1a/0x80
ubi_attach_mtd_dev+0x2951/0x34b0 [ubi]
ctrl_cdev_ioctl+0x286/0x2f0 [ubi]

Allocated by task 1414:
device_add+0x60a/0x18b0
cdev_device_add+0x103/0x170
ubi_create_volume+0x1118/0x1a10 [ubi]
ubi_cdev_ioctl+0xb7f/0x1ba0 [ubi]

Freed by task 1385:
cdev_device_del+0x1a/0x80
ubi_remove_volume+0x438/0x6c0 [ubi]
ubi_cdev_ioctl+0xbf4/0x1ba0 [ubi]
[...]
==================================================================

The lock held by ctrl_cdev_ioctl is ubi_devices_mutex, but the lock held
by ubi_cdev_ioctl is ubi->device_mutex. Therefore, the two locks can be
concurrent.

ctrl_cdev_ioctl contains two operations: ubi_attach and ubi_detach.
ubi_detach is bug-free because it uses reference counting to prevent
concurrency. However, uif_init and uif_close in ubi_attach may race with
ubi_cdev_ioctl.

uif_init will race with ubi_cdev_ioctl as in the following stack.
cpu1 cpu2 cpu3
_______________________|________________________|______________________
ctrl_cdev_ioctl
ubi_attach_mtd_dev
uif_init
ubi_cdev_ioctl
ubi_create_volume
cdev_device_add
ubi_add_volume
// sysfs exist
kill_volumes
ubi_cdev_ioctl
ubi_remove_volume
cdev_device_del
// first free
ubi_free_volume
cdev_del
// double free
cdev_device_del

And uif_close will race with ubi_cdev_ioctl as in the following stack.
cpu1 cpu2 cpu3
_______________________|________________________|______________________
ctrl_cdev_ioctl
ubi_attach_mtd_dev
uif_init
ubi_cdev_ioctl
ubi_create_volume
cdev_device_add
ubi_debugfs_init_dev
//error goto out_uif;
uif_close
kill_volumes
ubi_cdev_ioctl
ubi_remove_volume
cdev_device_del
// first free
ubi_free_volume
// double free

The cause of this problem is that commit 714fb87e8bc0 make device
"available" before it becomes accessible via sysfs. Therefore, we
roll back the modification. We will fix the race condition between
ubi device creation and udev by removing ubi_get_device in
vol_attribute_show and dev_attribute_show.This avoids accessing
uninitialized ubi_devices[ubi_num].

ubi_get_device is used to prevent devices from being deleted during
sysfs execution. However, now kernfs ensures that devices will not
be deleted before all reference counting are released.
The key process is shown in the following stack.

device_del
device_remove_attrs
device_remove_groups
sysfs_remove_groups
sysfs_remove_group
remove_files
kernfs_remove_by_name
kernfs_remove_by_name_ns
__kernfs_remove
kernfs_drain

Fixes: 714fb87e8bc0 ("ubi: Fix race condition between ubi device creation and udev")
Reported-by: Hulk Robot <hulkci@huawei.com>
Signed-off-by: Baokun Li <libaokun1@huawei.com>
Signed-off-by: Richard Weinberger <richard@nod.at>

authored by

Baokun Li and committed by
Richard Weinberger
3cbf0e39 aa39cc67

+2 -15
+1 -8
drivers/mtd/ubi/build.c
··· 351 351 * we still can use 'ubi->ubi_num'. 352 352 */ 353 353 ubi = container_of(dev, struct ubi_device, dev); 354 - ubi = ubi_get_device(ubi->ubi_num); 355 - if (!ubi) 356 - return -ENODEV; 357 354 358 355 if (attr == &dev_eraseblock_size) 359 356 ret = sprintf(buf, "%d\n", ubi->leb_size); ··· 379 382 else 380 383 ret = -EINVAL; 381 384 382 - ubi_put_device(ubi); 383 385 return ret; 384 386 } 385 387 ··· 975 979 goto out_detach; 976 980 } 977 981 978 - /* Make device "available" before it becomes accessible via sysfs */ 979 - ubi_devices[ubi_num] = ubi; 980 - 981 982 err = uif_init(ubi); 982 983 if (err) 983 984 goto out_detach; ··· 1019 1026 wake_up_process(ubi->bgt_thread); 1020 1027 spin_unlock(&ubi->wl_lock); 1021 1028 1029 + ubi_devices[ubi_num] = ubi; 1022 1030 ubi_notify_all(ubi, UBI_VOLUME_ADDED, NULL); 1023 1031 return ubi_num; 1024 1032 ··· 1028 1034 out_uif: 1029 1035 uif_close(ubi); 1030 1036 out_detach: 1031 - ubi_devices[ubi_num] = NULL; 1032 1037 ubi_wl_close(ubi); 1033 1038 ubi_free_all_volumes(ubi); 1034 1039 vfree(ubi->vtbl);
+1 -7
drivers/mtd/ubi/vmt.c
··· 56 56 { 57 57 int ret; 58 58 struct ubi_volume *vol = container_of(dev, struct ubi_volume, dev); 59 - struct ubi_device *ubi; 60 - 61 - ubi = ubi_get_device(vol->ubi->ubi_num); 62 - if (!ubi) 63 - return -ENODEV; 59 + struct ubi_device *ubi = vol->ubi; 64 60 65 61 spin_lock(&ubi->volumes_lock); 66 62 if (!ubi->volumes[vol->vol_id]) { 67 63 spin_unlock(&ubi->volumes_lock); 68 - ubi_put_device(ubi); 69 64 return -ENODEV; 70 65 } 71 66 /* Take a reference to prevent volume removal */ ··· 98 103 vol->ref_count -= 1; 99 104 ubi_assert(vol->ref_count >= 0); 100 105 spin_unlock(&ubi->volumes_lock); 101 - ubi_put_device(ubi); 102 106 return ret; 103 107 } 104 108