* BUG_ON(rq->nr_running != 0) in CPU_DEAD
@ 2009-11-12 4:53 Eric W. Biederman
2009-11-12 5:04 ` kernel BUG at /home/eric/projects/linux/linux-2.6.32-rc6-sysctl-simplification/kernel/sched.c:7321! Eric W. Biederman
0 siblings, 1 reply; 2+ messages in thread
From: Eric W. Biederman @ 2009-11-12 4:53 UTC (permalink / raw)
To: linux-kernel; +Cc: Ingo Molnar, Thomas Gleixner
When calling halt on one of my test machines I got this.
I have seen this a few times (but it is rare).
This appears to be BUG_ON(rq->nr_running != 0) from migration_call() CPU_DEAD.
I don't know what to make of it other than disable_nonboot_cpus is broken,
in a way I'm not familiar with.
Eric
[ 624.034007] ------------[ cut here ]------------
[ 624.035066] Kernel BUG at ffffffff813d0d82 [verbose debug info unavailable]
[ 624.035066] invalid opcode: 0000 [#1] SMP
[ 624.035066] last sysfs file: /sys/devices/pci0000:00/0000:00:1d.3/usb5/5-0:1.0/bInterfaceProtocol
[ 624.035066] CPU 0
[ 624.035066] Modules linked in:
[ 624.035066] Pid: 7102, comm: halt Not tainted 2.6.32-rc6x86_64 #20
[ 624.035066] RIP: 0010:[<ffffffff813d0d82>] [<ffffffff813d0d82>] migration_call+0x358/0x4cd
[ 624.035066] RSP: 0018:ffff88007d967d48 EFLAGS: 00010046
[ 624.035066] RAX: 0000000000000003 RBX: 0000000000010000 RCX: ffff880001980000
[ 624.035066] RDX: 0000000000010000 RSI: ffff880001990088 RDI: ffff88007ea68e38
[ 624.035066] RBP: ffff880001990000 R08: ffff88007ea68898 R09: ffff880001990060
[ 624.035066] R10: ffff88007ea68888 R11: 0000000000000001 R12: ffff88007ea68df0
[ 624.035066] R13: ffff880001990000 R14: 0000000000000003 R15: 0000000000000000
[ 624.035066] FS: 00007f43ea5bb6f0(0000) GS:ffff880001800000(0000) knlGS:0000000000000000
[ 624.035066] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 624.035066] CR2: 00000000005aae48 CR3: 000000007da9c000 CR4: 00000000000006f0
[ 624.035066] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 624.035066] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 624.035066] Process halt (pid: 7102, threadinfo ffff88007d966000, task ffff88007da686c0)
[ 624.035066] Stack:
[ 624.035066] 0000000000000003 0000000000000000 0000000000000000 00000000ffffffff
[ 624.035066] <0> ffffffff8161eec0 0000000000000003 0000000000000017 ffffffff81048ec4
[ 624.035066] <0> ffff88007da686c0 0000000000000003 0000000000000010 ffffffff816200e8
[ 624.035066] Call Trace:
[ 624.035066] [<ffffffff81048ec4>] ? notifier_call_chain+0x29/0x56
[ 624.035066] [<ffffffff813b4687>] ? _cpu_down+0x1a6/0x284
[ 624.035066] [<ffffffff81036039>] ? disable_nonboot_cpus+0x63/0xfa
[ 624.035066] [<ffffffff81040f3b>] ? kernel_power_off+0x21/0x3a
[ 624.035066] [<ffffffff810411e3>] ? sys_reboot+0x13b/0x158
[ 624.035066] [<ffffffff8103f145>] ? kill_pid_info+0x35/0x46
[ 624.035066] [<ffffffff8103f9d9>] ? sys_kill+0x72/0x143
[ 624.035066] [<ffffffff8109a316>] ? dput+0x2c/0x13a
[ 624.035066] [<ffffffff8108c540>] ? __fput+0x198/0x1c8
[ 624.035066] [<ffffffff8109ebbd>] ? mntput_no_expire+0x1c/0x8c
[ 624.035066] [<ffffffff81089e60>] ? filp_close+0x5e/0x66
[ 624.035066] [<ffffffff8100aceb>] ? system_call_fastpath+0x16/0x1b
[ 624.035066] Code: 49 8b 44 24 30 4c 89 e6 4c 89 ef ff 50 30 48 c7 c2 00 00 01 00 44 89 f0 41 83 bc 24 48 01 00 00 00 48 8b 0c c5 20 fc 61 81 75 04 <0
[ 624.035066] RIP [<ffffffff813d0d82>] migration_call+0x358/0x4cd
[ 624.035066] RSP <ffff88007d967d48>
[ 624.035066] ---[ end trace b69b4ad3bcd0ae93 ]---
^ permalink raw reply [flat|nested] 2+ messages in thread
* kernel BUG at /home/eric/projects/linux/linux-2.6.32-rc6-sysctl-simplification/kernel/sched.c:7321!
2009-11-12 4:53 BUG_ON(rq->nr_running != 0) in CPU_DEAD Eric W. Biederman
@ 2009-11-12 5:04 ` Eric W. Biederman
0 siblings, 0 replies; 2+ messages in thread
From: Eric W. Biederman @ 2009-11-12 5:04 UTC (permalink / raw)
To: linux-kernel; +Cc: Ingo Molnar, Thomas Gleixner
Bah. I have finally gotten the problem to reproduce with verbose
debugging enabled. It was migrate_dead: BUG_ON(!p->exit_state)
that triggered.
Eric
[ 1591.705444] sd 6:0:0:0: [sdb] Synchronizing SCSI cache
[ 1591.710766] sd 6:0:0:0: [sdb] Stopping disk
[ 1592.062511] sd 5:0:0:0: [sda] Synchronizing SCSI cache
[ 1592.067814] sd 5:0:0:0: [sda] Stopping disk
[ 1592.545072] e1000e 0000:04:00.0: PCI INT A disabled
[ 1592.550008] e1000e 0000:04:00.0: PME# enabled
[ 1592.569415] ACPI: Preparing to enter system sleep state S5
[ 1592.575508] Disabling non-boot CPUs ...
[ 1592.581800] CPU 1 is now offline
[ 1592.602917] CPU1 is down
[ 1592.608352] CPU 2 is now offline
[ 1592.633354] CPU2 is down
[ 1592.641196] CPU 3 is now offline
[ 1592.644483] SMP alternatives: switching to UP code
[ 1592.655745] ------------[ cut here ]------------
[ 1592.656859] kernel BUG at /home/eric/projects/linux/linux-2.6.32-rc6-sysctl-simplification/kernel/sched.c:7321!
[ 1592.656859] invalid opcode: 0000 [#1] SMP
[ 1592.656859] last sysfs file: /sys/devices/pci0000:00/0000:00:1d.3/usb5/5-0:1.0/bInterfaceProtocol
[ 1592.656859] CPU 0
[ 1592.656859] Modules linked in:
[ 1592.656859] Pid: 7103, comm: halt Not tainted 2.6.32-rc6x86_64 #22
[ 1592.656859] RIP: 0010:[<ffffffff813d0ef3>] [<ffffffff813d0ef3>] migration_call+0x358/0x4cd
[ 1592.656859] RSP: 0018:ffff880037771d48 EFLAGS: 00010046
[ 1592.656859] RAX: 0000000000000003 RBX: 0000000000010000 RCX: ffff880001980000
[ 1592.656859] RDX: 0000000000010000 RSI: ffff880001990088 RDI: ffff88007f17f3e8
[ 1592.656859] RBP: ffff880001990000 R08: ffff88007fbd8698 R09: ffff880001990060
[ 1592.656859] R10: ffff88007fbd8688 R11: 0000000000000001 R12: ffff88007f17f3a0
[ 1592.656859] R13: ffff880001990000 R14: 0000000000000003 R15: 0000000000000000
[ 1592.656859] FS: 00007f12843ea6f0(0000) GS:ffff880001800000(0000) knlGS:0000000000000000
[ 1592.656859] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 1592.656859] CR2: 00000000005aae48 CR3: 00000000379c5000 CR4: 00000000000006f0
[ 1592.656859] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 1592.656859] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 1592.656859] Process halt (pid: 7103, threadinfo ffff880037770000, task ffff88007f18b3e0)
[ 1592.656859] Stack:
[ 1592.656859] 0000000000000003 0000000000000000 0000000000000000 00000000ffffffff
[ 1592.656859] <0> ffffffff81628ec0 0000000000000003 0000000000000017 ffffffff81049014
[ 1592.656859] <0> ffff88007f18b3e0 0000000000000003 0000000000000010 ffffffff8162a0e8
[ 1592.656859] Call Trace:
[ 1592.656859] [<ffffffff81049014>] ? notifier_call_chain+0x29/0x56
[ 1592.656859] [<ffffffff813b47f8>] ? _cpu_down+0x1a6/0x284
[ 1592.656859] [<ffffffff81036189>] ? disable_nonboot_cpus+0x63/0xfa
[ 1592.656859] [<ffffffff8104108b>] ? kernel_power_off+0x21/0x3a
[ 1592.656859] [<ffffffff81041333>] ? sys_reboot+0x13b/0x158
[ 1592.656859] [<ffffffff8103f295>] ? kill_pid_info+0x35/0x46
[ 1592.656859] [<ffffffff8103fb29>] ? sys_kill+0x72/0x143
[ 1592.656859] [<ffffffff8109a466>] ? dput+0x2c/0x13a
[ 1592.656859] [<ffffffff8108c690>] ? __fput+0x198/0x1c8
[ 1592.656859] [<ffffffff8109ed0d>] ? mntput_no_expire+0x1c/0x8c
[ 1592.656859] [<ffffffff81089fb0>] ? filp_close+0x5e/0x66
[ 1592.656859] [<ffffffff8100aceb>] ? system_call_fastpath+0x16/0x1b
[ 1592.656859] Code: 49 8b 44 24 30 4c 89 e6 4c 89 ef ff 50 30 48 c7 c2 00 00 01 00 44 89 f0 41 83 bc 24 48 01 00 00 00 48 8b 0c c5 20 9c 62 81 75 04 <0
[ 1592.656859] RIP [<ffffffff813d0ef3>] migration_call+0x358/0x4cd
[ 1592.656859] RSP <ffff880037771d48>
[ 1592.656859] ---[ end trace bb0b71ca768c73d8 ]---
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2009-11-12 5:04 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-11-12 4:53 BUG_ON(rq->nr_running != 0) in CPU_DEAD Eric W. Biederman
2009-11-12 5:04 ` kernel BUG at /home/eric/projects/linux/linux-2.6.32-rc6-sysctl-simplification/kernel/sched.c:7321! Eric W. Biederman
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox