commits

Get UML to use the generic bug support rather than the arch-specific one.

If I insert an artificial bug right before loading init, I get this:

Kernel panic - not syncing: Kernel mode signal 4

EIP: 0023:[<0819d501>] CPU: 0 Not tainted ESP: 002b:f7fd4fbc EFLAGS: 00000246
Not tainted
EAX: 00000000 EBX: 00007870 ECX: 00000013 EDX: 00007870
ESI: 0000786d EDI: 00000011 EBP: f7fd4fd8 DS: 002b ES: 002b
08273bec: [<0806e814>] show_regs+0x104/0x106
08273c08: [<08058927>] panic_exit+0x2c/0x4b
08273c18: [<08080ee7>] notifier_call_chain+0x32/0x5b
08273c38: [<08080fbd>] __atomic_notifier_call_chain+0x30/0x32
08273c54: [<08080fee>] atomic_notifier_call_chain+0x2f/0x31
08273c70: [<08073b88>] panic+0x75/0x131
08273c94: [<080586c7>] relay_signal+0x87/0x95
08273cb0: [<0806b9ee>] sig_handler_common_skas+0x9e/0x120
08273cd8: [<08067738>] sig_handler+0x28/0x4f
08273cec: [<0806792e>] handle_signal+0x53/0x89
08273d0c: [<08069f60>] hard_handler+0x18/0x28
08273d1c: [<ffffe500>] transitions+0xf7d598b8/0xfffffff0

With this patch in place, this is how it looks:

BUG: failure at init/main.c:779/init_post()!
Kernel panic - not syncing: BUG!

EIP: 0023:[<081a65d1>] CPU: 0 Not tainted ESP: 002b:f7f0dfbc EFLAGS: 00000246
Not tainted
EAX: 00000000 EBX: 000069db ECX: 00000013 EDX: 000069db
ESI: 000069d8 EDI: 00000011 EBP: f7f0dfd8 DS: 002b ES: 002b
098efedc: [<0806e9a4>] show_regs+0x104/0x106
098efef8: [<080589c7>] panic_exit+0x2c/0x4b
098eff08: [<080818d7>] notifier_call_chain+0x32/0x5b
098eff28: [<080819ad>] __atomic_notifier_call_chain+0x30/0x32
098eff44: [<080819de>] atomic_notifier_call_chain+0x2f/0x31
098eff60: [<08073f28>] panic+0x75/0x131
098eff84: [<080541d5>] init_post+0xcd/0xe8
098eff9c: [<08048ad4>] kernel_init+0x8e/0x9a
098effb4: [<08066dee>] run_kernel_thread+0x41/0x53
098effe0: [<08058e75>] new_thread_handler+0x62/0x8b
098efffc: [<a55a5a5a>] 0xa55a5a5a
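
The first line of the new output has the shape "BUG: failure at <file>:<line>/<function>()!". As a rough illustration only, the user-space sketch below reproduces that format. The patch itself is not shown here, so format_bug_report() and SKETCH_BUG() are hypothetical names, and abort() merely stands in for the kernel's panic().

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper (not from the patch): format a report line in the
 * same shape as the "BUG: failure at ..." message shown above.
 * Returns what snprintf() returns.
 */
static int format_bug_report(char *buf, size_t len,
			     const char *file, int line, const char *func)
{
	assert(buf != NULL && len > 0);
	return snprintf(buf, len, "BUG: failure at %s:%d/%s()!",
			file, line, func);
}

/*
 * User-space stand-in for a BUG()-style macro: report where it was hit,
 * then stop. abort() is an assumption replacing the kernel's panic().
 */
#define SKETCH_BUG() do {						\
	char msg_[256];							\
	format_bug_report(msg_, sizeof(msg_),				\
			  __FILE__, __LINE__, __func__);		\
	fprintf(stderr, "%s\n", msg_);					\
	abort();							\
} while (0)
```

Calling format_bug_report() with "init/main.c", 779 and "init_post" yields the same report line as in the output above; SKETCH_BUG() fills those values in from the call site.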

[ jdike - added BUG_TABLE to linker script ]

Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Jeff Dike <jdike@linux.intel.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

18y ago

Linus Torvalds

3f2c6d0f

Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/cooloney/blackfin-2.6

18y ago

Ingo Korb

b08b5ad9

Char: stallion, fix oops during init with ISA cards

18y ago

Linus Torvalds

4beb2584

Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/roland/infiniband

18y ago

Mike Frysinger

216e39db

Blackfin arch: add proper const volatile to addr argument to the read functions

18y ago

Ivan Kokshaysky

58ed2f9c

alpha: fix alignment problem in csum_ipv6_magic()

18y ago

Linus Torvalds

e2f90a91

Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6

18y ago

Jack Morgenstein

c8681f14

IB/mlx4: Correct max_srq_wr returned from mlx4_ib_query_device()

18y ago

Sonic Zhang

334280ff

Blackfin arch: Add definition of dma_mapping_error

18y ago

Andy Whitcroft

653d4876

update checkpatch.pl to version 0.05

18y ago

Arjan van de Ven

0864a4e2

Allow DEBUG_RODATA and KPROBES to co-exist

18y ago

David Howells

19e6454c

[AF_RXRPC]: Return the number of bytes buffered in rxrpc_send_data()

18y ago

Roland Dreier

13ef5f44

IPoIB/cm: Remove dead definition of struct ipoib_cm_id

18y ago

Mike Frysinger

b9b71276

Blackfin arch: move cond_syscall() behind __KERNEL__ like all other architectures

18y ago

Christoph Lameter

92c4ca5c

sched: fix next_interval determination in idle_balance()

18y ago

Linus Torvalds

79d9a72f

Merge master.kernel.org:/pub/scm/linux/kernel/git/davej/agpgart

18y ago

Neil Horman

cc0191ae

[IPVS]: Fix state variable on failure to start ipvs threads

18y ago

Michael S. Tsirkin

82c3aca6

IPoIB/cm: Fix interoperability when MTU doesn't match

18y ago

Robin Getz

86b73c8c

Blackfin arch: match kernel startup message with new linker script

18y ago

Christoph Lameter

84966343

SLUB: fix behavior if the text output of list_locations overflows PAGE_SIZE

18y ago

Linus Torvalds

9738cbe3

Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/kyle/parisc-2.6

18y ago

Wang Zhenyu

47d46379

[AGPGART] intel_agp: don't load if no IGD and AGP port

18y ago

Patrick McHardy

28121617

[XFRM]: Fix MTU calculation for non-ESP SAs

18y ago

Michael S. Tsirkin

3ec7393a

IPoIB/cm: Initialize RX before moving QP to RTR

18y ago

Mike Frysinger

9c8f1729

Blackfin arch: add missing braces around array bfin serial init

18y ago

Ben Dooks

1e27dbe7

SM501: Check SM501 ID register on initialisation

18y ago

Thomas Gleixner

58229a18

posix-timers: Prevent softirq starvation by small intervals and SIG_IGN

18y ago

Randolph Chung

05dc16d6

[PARISC] unwinder improvements

18y ago

Linus Torvalds

f1518a08

Merge branch 'upstream-linus' of master.kernel.org:/pub/scm/linux/kernel/git/jgarzik/libata-dev

18y ago

Linus Torvalds

fa490cfd

Fix possible runqueue lock starvation in wait_task_inactive()

Miklos Szeredi reported very long pauses (several seconds, sometimes
more) on his T60 (with a Core2Duo) which he managed to track down to
wait_task_inactive()'s open-coded busy-loop.

He observed that an interrupt on one core tries to acquire the
runqueue-lock but does not succeed in doing so for a very long time -
while wait_task_inactive() on the other core loops waiting for the first
core to deschedule a task (which it won't do while spinning in an
interrupt handler).

This rewrites wait_task_inactive() to do all its waiting optimistically
without any locks taken at all, and then just double-check the end
result with the proper runqueue lock held over just a very short
section. If there were races in the optimistic wait, or a preemption
event scheduled the process away, we simply re-synchronize, and start
over.

So the code now looks like this:

	repeat:
		/* Unlocked, optimistic looping! */
		rq = task_rq(p);
		while (task_running(rq, p))
			cpu_relax();

		/* Get the *real* values */
		rq = task_rq_lock(p, &flags);
		running = task_running(rq, p);
		array = p->array;
		task_rq_unlock(rq, &flags);

		/* Check them.. */
		if (unlikely(running)) {
			cpu_relax();
			goto repeat;
		}

		/* Preempted away? Yield if so.. */
		if (unlikely(array)) {
			yield();
			goto repeat;
		}

Basically, that first "while()" loop is done entirely without any
locking at all (and doesn't check for the case where the target process
might have been preempted away), and so it's possibly "incorrect", but
we don't really care. Both the runqueue used, and the "task_running()"
check might be the wrong tests, but they won't oops - they just mean
that we could possibly get the wrong results due to lack of locking and
exit the loop early in the case of a race condition.

So once we've exited the loop, we then get the proper (and careful) rq
lock, and check the running/runnable state _safely_. And if it turns
out that our quick-and-dirty and unsafe loop was wrong after all, we
just go back and try it all again.
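
The same "wait without the lock, then verify under the lock" shape can be tried outside the kernel. The following is a minimal user-space sketch, assuming POSIX threads and C11 atomics; struct sketch_task, sketch_wait_inactive(), sketch_deschedule() and sketch_demo() are hypothetical names, a mutex stands in for the runqueue lock, and sched_yield() stands in for both cpu_relax() and yield(). It is not the kernel code.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a task; writers change state under `lock`. */
struct sketch_task {
	pthread_mutex_t lock;	/* plays the role of the runqueue lock */
	atomic_bool running;	/* analogous to task_running(rq, p) */
	atomic_bool queued;	/* analogous to p->array being non-NULL */
};

static void sketch_wait_inactive(struct sketch_task *t)
{
	assert(t != NULL);
	for (;;) {
		bool running, queued;

		/* Unlocked, optimistic wait: a stale answer is tolerated. */
		while (atomic_load_explicit(&t->running, memory_order_relaxed))
			sched_yield();

		/* Take the lock only briefly to read the real values. */
		pthread_mutex_lock(&t->lock);
		running = atomic_load(&t->running);
		queued = atomic_load(&t->queued);
		pthread_mutex_unlock(&t->lock);

		if (running)
			continue;	/* the unlocked loop was wrong: retry */
		if (queued) {
			sched_yield();	/* still queued: give way, then retry */
			continue;
		}
		return;
	}
}

/* Worker that "deschedules" the task, updating state under the lock. */
static void *sketch_deschedule(void *arg)
{
	struct sketch_task *t = arg;

	pthread_mutex_lock(&t->lock);
	atomic_store(&t->queued, false);
	atomic_store(&t->running, false);
	pthread_mutex_unlock(&t->lock);
	return NULL;
}

/* Returns 0 if the waiter returned only after the task went inactive. */
static int sketch_demo(void)
{
	struct sketch_task t;
	pthread_t worker;
	int ok;

	pthread_mutex_init(&t.lock, NULL);
	atomic_init(&t.running, true);
	atomic_init(&t.queued, true);
	if (pthread_create(&worker, NULL, sketch_deschedule, &t) != 0)
		return -1;
	sketch_wait_inactive(&t);
	ok = !atomic_load(&t.running) && !atomic_load(&t.queued);
	pthread_join(worker, NULL);
	pthread_mutex_destroy(&t.lock);
	return ok ? 0 : -1;
}
```

The point of the design is that the lock is held only for the two reads in the middle, so a thread that needs the lock to make progress is never blocked behind a long spin.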

(The patch also adds a lot of comments, which is the actual bulk of it
all, to make it more obvious why we can do these things without holding
the locks).

Thanks to Miklos for all the testing and tracking it down.

Tested-by: Miklos Szeredi <miklos@szeredi.hu>
Acked-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>