Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

md: don't record new badblocks for faulty rdev

Faulty will be checked before issuing IO to the rdev, however, rdev can
be faulty at any time, hence it's possible that rdev_set_badblocks()
will be called for faulty rdev. In this case, mddev->sb_flags will be
set and some other path can be blocked by updating super block.

Since faulty rdev will not be accesed anymore, there is no need to
record new babblocks for faulty rdev and forcing updating super block.

Noted this is not a bugfix, just prevent updating superblock in some
corner cases, and will help to slice a bug related to external
metadata[1], testing also shows that devices are removed faster in the
case IO error.

[1] https://lore.kernel.org/all/f34452df-810b-48b2-a9b4-7f925699a9e7@linux.intel.com/

Signed-off-by: Yu Kuai <yukuai3@huawei.com>
Tested-by: Mariusz Tkaczyk <mariusz.tkaczyk@linux.intel.com>
Link: https://lore.kernel.org/r/20241031033114.3845582-4-yukuai1@huaweicloud.com
Signed-off-by: Song Liu <song@kernel.org>

authored by

Yu Kuai and committed by
Song Liu
29967332 50e82748

+11
+11
drivers/md/md.c
··· 9791 9791 { 9792 9792 struct mddev *mddev = rdev->mddev; 9793 9793 int rv; 9794 + 9795 + /* 9796 + * Recording new badblocks for faulty rdev will force unnecessary 9797 + * super block updating. This is fragile for external management because 9798 + * userspace daemon may trying to remove this device and deadlock may 9799 + * occur. This will be probably solved in the mdadm, but it is safer to 9800 + * avoid it. 9801 + */ 9802 + if (test_bit(Faulty, &rdev->flags)) 9803 + return 1; 9804 + 9794 9805 if (is_new) 9795 9806 s += rdev->new_data_offset; 9796 9807 else