commit 0b1d647a02c5a1b67d45287eeb6cb3b2219c41c3 · tjh.dev/kernel

tjh.dev / kernel

Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git

kernel os linux

[PATCH] dm: work around mempool_alloc, bio_alloc_bioset deadlocks

This patch works around a complex dm-related deadlock/livelock down in the
mempool allocator.

Alasdair said:

Several dm targets suffer from this.

Mempools are not yet used correctly everywhere in device-mapper: they can
get shared when devices are stacked, and some targets share them across
multiple instances. I made fixing this one of the prerequisites for this
patch:

md-dm-reduce-stack-usage-with-stacked-block-devices.patch

which in some cases makes people more likely to hit the problem.

There's been some progress on this recently with (unfinished) dm-crypt
patches at:

http://www.kernel.org/pub/linux/kernel/people/agk/patches/2.6/editing/
(dm-crypt-move-io-to-workqueue.patch plus dependencies)

and:

I've no problems with a temporary workaround like that, but Milan Broz (a
new Redhat developer in the Czech Republic) has started reviewing all the
mempool usage in device-mapper so I'm expecting we'll soon have a proper fix
for this associated problems. [He's back from holiday at the start of next
week.]

For now, this sad-but-safe little patch will allow the machine to recover.

[akpm@osdl.org: rewrote changelog]
Cc: Alasdair G Kergon <agk@redhat.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

authored by Pavel Mironchik and committed by Linus Torvalds 19 years ago 0b1d647a 1e5f5e5c

+7 -2

1 changed file

expand all

unified split

mempool.c

+7 -2

mm/mempool.c

··· 238 238 init_wait(&wait); 239 239 prepare_to_wait(&pool->wait, &wait, TASK_UNINTERRUPTIBLE); 240 240 smp_mb(); 241 - if (!pool->curr_nr) 242 - io_schedule(); 241 + if (!pool->curr_nr) { 242 + /* 243 + * FIXME: this should be io_schedule(). The timeout is there 244 + * as a workaround for some DM problems in 2.6.18. 245 + */ 246 + io_schedule_timeout(5*HZ); 247 + } 243 248 finish_wait(&pool->wait, &wait); 244 249 245 250 goto repeat_alloc;