* Re: soft lock failure on FC8 [not found] <216e58580802040422u27c1efa0vf1d02b194514888e@mail.gmail.com> @ 2008-02-04 12:33 ` Andrew Morton 2008-02-04 12:43 ` Jens Axboe 0 siblings, 1 reply; 2+ messages in thread From: Andrew Morton @ 2008-02-04 12:33 UTC (permalink / raw) To: Dragon kumar; +Cc: linux-kernel, Jens Axboe On Mon, 4 Feb 2008 17:52:16 +0530 "Dragon kumar" <disaster2008@gmail.com> wrote: > Hi Andrew, > > I am not able to boot 2.6.24-mm1 kernel on x86_64 machine with FC8. I > am attaching config file and call trace also with this mail. > > > [ 921.273592] BUG: soft lockup - CPU#0 stuck for 61s! [scsi_scan_0:473] > [ 921.273601] CPU 0: > [ 921.273601] Modules linked in: scsi_wait_scan tg3 shpchp > pci_hotplug aacraid sd_mod scsi_mod ext3 jbd mbcache uhci_hcd ohci_hcd > ssb ehci_hcd usbcore > [ 921.273601] Pid: 473, comm: scsi_scan_0 Not tainted 2.6.24-mm1-autokern1 #1 > [ 921.273601] RIP: 0010:[<ffffffff8031fa1b>] [<ffffffff8031fa1b>] > radix_tree_gang_lookup+0xe1/0x139 > [ 921.273601] RSP: 0018:ffff81007d99bda8 EFLAGS: 00000293 > [ 921.273601] RAX: 0000000000000000 RBX: ffff81007d99bde0 RCX: 0000000000000000 > [ 921.273601] RDX: 0000000000000000 RSI: ffff81007d8f2690 RDI: 000000000000000c > [ 921.273601] RBP: 0000000000000001 R08: ffff81007d8dce0c R09: 0000000000000001 > [ 921.273601] R10: ffff81007d90caf0 R11: 000000000000000d R12: ffff81000106ac00 > [ 921.273601] R13: ffff8100808ed000 R14: ffff81007d99a000 R15: ffffffff8077eef0 > [ 921.273601] FS: 0000000000000000(0000) GS:ffffffff8057e000(0000) > knlGS:0000000000000000 > [ 921.273601] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b > [ 921.273601] CR2: 00000000006167d0 CR3: 0000000000201000 CR4: 00000000000006e0 > [ 921.273601] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > [ 921.273601] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > [ 921.273601] > [ 921.273601] Call Trace: > [ 921.273601] [<ffffffff8031870d>] ? call_for_each_cic+0x72/0x104 > [ 921.273601] [<ffffffff80318e84>] ? cfq_exit_single_io_context+0x0/0x4e > [ 921.273601] [<ffffffff803187b7>] ? cfq_exit_io_context+0x18/0x1a > [ 921.273601] [<ffffffff80311844>] ? exit_io_context+0x101/0x111 > [ 921.273601] [<ffffffff80239e53>] ? do_exit+0x794/0x7c1 > [ 921.273601] [<ffffffff8020d03f>] ? child_rip+0x11/0x12 > [ 921.273601] [<ffffffff8020c703>] ? restore_args+0x0/0x30 > [ 921.273601] [<ffffffff80249c5d>] ? kthreadd+0x17d/0x1a2 > [ 921.273601] [<ffffffff80249da9>] ? kthread+0x0/0x77 > [ 921.273601] [<ffffffff8020d02e>] ? child_rip+0x0/0x12 > [ 921.273601] > At a guess I'd say that call_for_each_cic() is failing to advance across the radix-tree and got stuck. Could you please apply this debug patch and retest? Thanks. diff -puN block/cfq-iosched.c~a block/cfq-iosched.c --- a/block/cfq-iosched.c~a +++ a/block/cfq-iosched.c @@ -1159,6 +1159,7 @@ call_for_each_cic(struct io_context *ioc do { int i; + unsigned long next_index; /* * Perhaps there's a better way - this just gang lookups from @@ -1171,8 +1172,13 @@ call_for_each_cic(struct io_context *ioc break; called += nr; - index = 1 + (unsigned long) cics[nr - 1]->key; - + next_index = 1 + (unsigned long) cics[nr - 1]->key; + if (next_index <= index) { + printk("next_index=%lu, index=%lu\n", + next_index, index); + dump_stack(); + } + index = next_index; for (i = 0; i < nr; i++) func(ioc, cics[i]); } while (nr == CIC_GANG_NR); _ ^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: soft lock failure on FC8 2008-02-04 12:33 ` soft lock failure on FC8 Andrew Morton @ 2008-02-04 12:43 ` Jens Axboe 0 siblings, 0 replies; 2+ messages in thread From: Jens Axboe @ 2008-02-04 12:43 UTC (permalink / raw) To: Andrew Morton; +Cc: Dragon kumar, linux-kernel On Mon, Feb 04 2008, Andrew Morton wrote: > On Mon, 4 Feb 2008 17:52:16 +0530 "Dragon kumar" <disaster2008@gmail.com> wrote: > > > Hi Andrew, > > > > I am not able to boot 2.6.24-mm1 kernel on x86_64 machine with FC8. I > > am attaching config file and call trace also with this mail. > > > > > > [ 921.273592] BUG: soft lockup - CPU#0 stuck for 61s! [scsi_scan_0:473] > > [ 921.273601] CPU 0: > > [ 921.273601] Modules linked in: scsi_wait_scan tg3 shpchp > > pci_hotplug aacraid sd_mod scsi_mod ext3 jbd mbcache uhci_hcd ohci_hcd > > ssb ehci_hcd usbcore > > [ 921.273601] Pid: 473, comm: scsi_scan_0 Not tainted 2.6.24-mm1-autokern1 #1 > > [ 921.273601] RIP: 0010:[<ffffffff8031fa1b>] [<ffffffff8031fa1b>] > > radix_tree_gang_lookup+0xe1/0x139 > > [ 921.273601] RSP: 0018:ffff81007d99bda8 EFLAGS: 00000293 > > [ 921.273601] RAX: 0000000000000000 RBX: ffff81007d99bde0 RCX: 0000000000000000 > > [ 921.273601] RDX: 0000000000000000 RSI: ffff81007d8f2690 RDI: 000000000000000c > > [ 921.273601] RBP: 0000000000000001 R08: ffff81007d8dce0c R09: 0000000000000001 > > [ 921.273601] R10: ffff81007d90caf0 R11: 000000000000000d R12: ffff81000106ac00 > > [ 921.273601] R13: ffff8100808ed000 R14: ffff81007d99a000 R15: ffffffff8077eef0 > > [ 921.273601] FS: 0000000000000000(0000) GS:ffffffff8057e000(0000) > > knlGS:0000000000000000 > > [ 921.273601] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b > > [ 921.273601] CR2: 00000000006167d0 CR3: 0000000000201000 CR4: 00000000000006e0 > > [ 921.273601] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > > [ 921.273601] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > > [ 921.273601] > > [ 921.273601] Call Trace: > > [ 921.273601] [<ffffffff8031870d>] ? call_for_each_cic+0x72/0x104 > > [ 921.273601] [<ffffffff80318e84>] ? cfq_exit_single_io_context+0x0/0x4e > > [ 921.273601] [<ffffffff803187b7>] ? cfq_exit_io_context+0x18/0x1a > > [ 921.273601] [<ffffffff80311844>] ? exit_io_context+0x101/0x111 > > [ 921.273601] [<ffffffff80239e53>] ? do_exit+0x794/0x7c1 > > [ 921.273601] [<ffffffff8020d03f>] ? child_rip+0x11/0x12 > > [ 921.273601] [<ffffffff8020c703>] ? restore_args+0x0/0x30 > > [ 921.273601] [<ffffffff80249c5d>] ? kthreadd+0x17d/0x1a2 > > [ 921.273601] [<ffffffff80249da9>] ? kthread+0x0/0x77 > > [ 921.273601] [<ffffffff8020d02e>] ? child_rip+0x0/0x12 > > [ 921.273601] > > > > At a guess I'd say that call_for_each_cic() is failing to advance across > the radix-tree and got stuck. I'd say that's a good guess, I don't see how else it could get stuck looping forever. > > Could you please apply this debug patch and retest? > > Thanks. > > diff -puN block/cfq-iosched.c~a block/cfq-iosched.c > --- a/block/cfq-iosched.c~a > +++ a/block/cfq-iosched.c > @@ -1159,6 +1159,7 @@ call_for_each_cic(struct io_context *ioc > > do { > int i; > + unsigned long next_index; > > /* > * Perhaps there's a better way - this just gang lookups from > @@ -1171,8 +1172,13 @@ call_for_each_cic(struct io_context *ioc > break; > > called += nr; > - index = 1 + (unsigned long) cics[nr - 1]->key; > - > + next_index = 1 + (unsigned long) cics[nr - 1]->key; > + if (next_index <= index) { > + printk("next_index=%lu, index=%lu\n", > + next_index, index); > + dump_stack(); > + } > + index = next_index; > for (i = 0; i < nr; i++) > func(ioc, cics[i]); > } while (nr == CIC_GANG_NR); > _ > -- Jens Axboe ^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2008-02-04 12:43 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <216e58580802040422u27c1efa0vf1d02b194514888e@mail.gmail.com>
2008-02-04 12:33 ` soft lock failure on FC8 Andrew Morton
2008-02-04 12:43 ` Jens Axboe
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.