* Re: system hang on latest git [not found] <1FE6DD409037234FAB833C420AA843EC75736A@orsmsx424.amr.corp.intel.com> @ 2008-01-29 20:11 ` Jens Axboe 2008-01-29 20:16 ` Jens Axboe 0 siblings, 1 reply; 5+ messages in thread From: Jens Axboe @ 2008-01-29 20:11 UTC (permalink / raw) To: Luck, Tony; +Cc: LKML, linux-ia64 On Tue, Jan 29 2008, Luck, Tony wrote: > I pulled Linus' tree this morning (git head = 0ba6c33bcddc64a54b5f1c25a696c4767dc76292) > and built for ia64 (using arch/ia64/configs/tiger_defconfig). System booted > OK, but when I stressed it a little (building another kernel with "make -j32") > it hung. > > The console has a bunch (98) of warnings about tasks blocked for more than 120 > seconds like this: > INFO: task grep:9168 blocked for more than 120 seconds. > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > > Call Trace: > [<a000000100704120>] schedule+0x11c0/0x1340 > spà000001ed8afbf0 bspà000001ed8a1280 > [<a00000010024e720>] do_get_write_access+0x660/0xbe0 > spà000001ed8afc20 bspà000001ed8a1208 > [<a00000010024f060>] journal_get_write_access+0x40/0x80 > spà000001ed8afca0 bspà000001ed8a11c8 > [<a000000100245db0>] __ext3_journal_get_write_access+0x30/0xa0 > spà000001ed8afca0 bspà000001ed8a1190 > [<a00000010022dea0>] ext3_reserve_inode_write+0x80/0x120 > spà000001ed8afca0 bspà000001ed8a1158 > [<a00000010022df70>] ext3_mark_inode_dirty+0x30/0x80 > spà000001ed8afca0 bspà000001ed8a1130 > [<a000000100232530>] ext3_dirty_inode+0xd0/0x120 > spà000001ed8afcc0 bspà000001ed8a1100 > [<a000000100170e20>] __mark_inode_dirty+0xa0/0x3e0 > spà000001ed8afcc0 bspà000001ed8a10b0 > [<a00000010015b570>] touch_atime+0x310/0x340 > spà000001ed8afcc0 bspà000001ed8a1088 > [<a0000001000d6c20>] do_generic_mapping_read+0x780/0x7a0 > spà000001ed8afce0 bspà000001ed8a0fe0 > [<a0000001000db250>] generic_file_aio_read+0x290/0x340 > spà000001ed8afce0 bspà000001ed8a0f80 > [<a00000010012c990>] do_sync_read+0x170/0x200 > spà000001ed8afd10 bspà000001ed8a0f40 > [<a00000010012cbd0>] vfs_read+0x1b0/0x2e0 > spà000001ed8afe20 bspà000001ed8a0ef0 > [<a00000010012d250>] sys_read+0x70/0xe0 > spà000001ed8afe20 bspà000001ed8a0e78 > [<a00000010000a4a0>] ia64_ret_from_syscall+0x0/0x20 > spà000001ed8afe30 bspà000001ed8a0e78 > > > [The stack trace has several variations ... some from sys_read(), some from > sys_open(), some from sys_execve(), some from sys_mmap() etc. 84/98 stack > traces pass through the touch_atime->__mark_inode_dirty path ... all 98 > are attached] > > A quick dig into processor state shows 8 cpus are idle. 7 are spinning > in __spin_lock_irq() from __make_request() and one is in spin_lock() from > as_merged_requests(). Looks like a deadlock on queue lock and ioc lock, but I don't see immediately what the problem is. I can't stick around for longer tonight, but I'll get to the bottom of this tomorrow. -- Jens Axboe ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: system hang on latest git 2008-01-29 20:11 ` system hang on latest git Jens Axboe @ 2008-01-29 20:16 ` Jens Axboe 2008-01-29 21:46 ` Olof Johansson 0 siblings, 1 reply; 5+ messages in thread From: Jens Axboe @ 2008-01-29 20:16 UTC (permalink / raw) To: Luck, Tony; +Cc: LKML, linux-ia64 On Tue, Jan 29 2008, Jens Axboe wrote: > On Tue, Jan 29 2008, Luck, Tony wrote: > > I pulled Linus' tree this morning (git head = 0ba6c33bcddc64a54b5f1c25a696c4767dc76292) > > and built for ia64 (using arch/ia64/configs/tiger_defconfig). System booted > > OK, but when I stressed it a little (building another kernel with "make -j32") > > it hung. > > > > The console has a bunch (98) of warnings about tasks blocked for more than 120 > > seconds like this: > > INFO: task grep:9168 blocked for more than 120 seconds. > > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > > > > Call Trace: > > [<a000000100704120>] schedule+0x11c0/0x1340 > > spà000001ed8afbf0 bspà000001ed8a1280 > > [<a00000010024e720>] do_get_write_access+0x660/0xbe0 > > spà000001ed8afc20 bspà000001ed8a1208 > > [<a00000010024f060>] journal_get_write_access+0x40/0x80 > > spà000001ed8afca0 bspà000001ed8a11c8 > > [<a000000100245db0>] __ext3_journal_get_write_access+0x30/0xa0 > > spà000001ed8afca0 bspà000001ed8a1190 > > [<a00000010022dea0>] ext3_reserve_inode_write+0x80/0x120 > > spà000001ed8afca0 bspà000001ed8a1158 > > [<a00000010022df70>] ext3_mark_inode_dirty+0x30/0x80 > > spà000001ed8afca0 bspà000001ed8a1130 > > [<a000000100232530>] ext3_dirty_inode+0xd0/0x120 > > spà000001ed8afcc0 bspà000001ed8a1100 > > [<a000000100170e20>] __mark_inode_dirty+0xa0/0x3e0 > > spà000001ed8afcc0 bspà000001ed8a10b0 > > [<a00000010015b570>] touch_atime+0x310/0x340 > > spà000001ed8afcc0 bspà000001ed8a1088 > > [<a0000001000d6c20>] do_generic_mapping_read+0x780/0x7a0 > > spà000001ed8afce0 bspà000001ed8a0fe0 > > [<a0000001000db250>] generic_file_aio_read+0x290/0x340 > > spà000001ed8afce0 bspà000001ed8a0f80 > > [<a00000010012c990>] do_sync_read+0x170/0x200 > > spà000001ed8afd10 bspà000001ed8a0f40 > > [<a00000010012cbd0>] vfs_read+0x1b0/0x2e0 > > spà000001ed8afe20 bspà000001ed8a0ef0 > > [<a00000010012d250>] sys_read+0x70/0xe0 > > spà000001ed8afe20 bspà000001ed8a0e78 > > [<a00000010000a4a0>] ia64_ret_from_syscall+0x0/0x20 > > spà000001ed8afe30 bspà000001ed8a0e78 > > > > > > [The stack trace has several variations ... some from sys_read(), some from > > sys_open(), some from sys_execve(), some from sys_mmap() etc. 84/98 stack > > traces pass through the touch_atime->__mark_inode_dirty path ... all 98 > > are attached] > > > > A quick dig into processor state shows 8 cpus are idle. 7 are spinning > > in __spin_lock_irq() from __make_request() and one is in spin_lock() from > > as_merged_requests(). > > Looks like a deadlock on queue lock and ioc lock, but I don't see > immediately what the problem is. I can't stick around for longer > tonight, but I'll get to the bottom of this tomorrow. Actually, can you try this? It has a known race but nothing to worry about, and it removes ioc->lock from irq context. diff --git a/block/as-iosched.c b/block/as-iosched.c index b201d16..585aad2 100644 --- a/block/as-iosched.c +++ b/block/as-iosched.c @@ -235,10 +235,8 @@ static void as_put_io_context(struct request *rq) aic = RQ_IOC(rq)->aic; if (rq_is_sync(rq) && aic) { - spin_lock(&aic->lock); set_bit(AS_TASK_IORUNNING, &aic->state); aic->last_end_request = jiffies; - spin_unlock(&aic->lock); } put_io_context(RQ_IOC(rq)); -- Jens Axboe ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: system hang on latest git 2008-01-29 20:16 ` Jens Axboe @ 2008-01-29 21:46 ` Olof Johansson 2008-01-29 21:38 ` Jens Axboe 0 siblings, 1 reply; 5+ messages in thread From: Olof Johansson @ 2008-01-29 21:46 UTC (permalink / raw) To: Jens Axboe; +Cc: Luck, Tony, LKML, linux-ia64 On Tue, Jan 29, 2008 at 09:16:48PM +0100, Jens Axboe wrote: > Actually, can you try this? It has a known race but nothing to worry > about, and it removes ioc->lock from irq context. I just tried this myself, since I saw hangs within seconds of running 'aiostress' from autotest on this morning's kernel as well (g0ba6c33). It didn't help. My config is 2 cores (powerpc), built with arch/powerpc/configs/pasemi_defconfig. Hardware is sata disk on marvell 7042. Stacktraces: 0:mon> t [link register ] c000000000221984 .as_merged_requests+0x164/0x1e0 [c00000007e8ff750] c00000007e8ff800 (unreliable) [c00000007e8ff800] c000000000212f20 .elv_merge_requests+0x50/0xb0 [c00000007e8ff890] c000000000218c98 .attempt_merge+0x318/0x3d0 [c00000007e8ff940] c00000000021af98 .__make_request+0x2f8/0x720 [c00000007e8ffa20] c000000000216e6c .generic_make_request+0x22c/0x2f0 [c00000007e8ffad0] c000000000216fd8 .submit_bio+0xa8/0x150 [c00000007e8ffb90] c0000000000efa7c .submit_bh+0x15c/0x1e0 [c00000007e8ffc20] c000000000145d5c .journal_do_submit_data+0x6c/0xb0 [c00000007e8ffcc0] c000000000147460 .journal_commit_transaction+0x1640/0x16e0 [c00000007e8ffe10] c00000000014c238 .kjournald+0x108/0x2f0 [c00000007e8fff00] c000000000067ca8 .kthread+0xc8/0xe0 [c00000007e8fff90] c0000000000237bc .kernel_thread+0x4c/0x68 0:mon> c1 1:mon> t [link register ] c00000000033c20c .scsi_device_unbusy+0x8c/0x120 [c00000007e10b6f0] c00000000033c1a8 .scsi_device_unbusy+0x28/0x120 (unreliable) [c00000007e10b780] c000000000333078 .scsi_finish_command+0x38/0x130 [c00000007e10b810] c00000000033c478 .scsi_softirq_done+0xd8/0x1a0 [c00000007e10b8b0] c00000000021a780 .blk_done_softirq+0xb0/0xe0 [c00000007e10b940] c000000000051e58 .__do_softirq+0xe8/0x1d0 [c00000007e10ba00] c00000000000ac64 .do_softirq+0x64/0xa0 [c00000007e10ba80] c000000000051b84 .irq_exit+0xc4/0xd0 [c00000007e10bb00] c00000000000b3c0 .do_IRQ+0xe0/0x120 [c00000007e10bb80] c00000000000405c hardware_interrupt_entry+0x18/0x3c --- Exception: 501 (Hardware Interrupt) at c000000000010564 .cpu_idle+0xb4/0x160 [c00000007e10be70] c000000000010524 .cpu_idle+0x74/0x160 (unreliable) [c00000007e10bf00] c000000000540e6c [c00000007e10bf90] c000000000007364 .start_secondary_prolog+0xc/0x10 scsi_device_busy is sitting on: spin_lock(sdev->request_queue->queue_lock); and as_merged_requests on the second lock at: double_spin_lock(&rioc->lock, &nioc->lock, rioc < nioc); -Olof ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: system hang on latest git 2008-01-29 21:46 ` Olof Johansson @ 2008-01-29 21:38 ` Jens Axboe 2008-01-29 21:56 ` Olof Johansson 0 siblings, 1 reply; 5+ messages in thread From: Jens Axboe @ 2008-01-29 21:38 UTC (permalink / raw) To: Olof Johansson; +Cc: Luck, Tony, LKML, linux-ia64 On Tue, Jan 29 2008, Olof Johansson wrote: > On Tue, Jan 29, 2008 at 09:16:48PM +0100, Jens Axboe wrote: > > > Actually, can you try this? It has a known race but nothing to worry > > about, and it removes ioc->lock from irq context. > > I just tried this myself, since I saw hangs within seconds of running > 'aiostress' from autotest on this morning's kernel as well (g0ba6c33). > > It didn't help. My config is 2 cores (powerpc), built with > arch/powerpc/configs/pasemi_defconfig. Hardware is sata disk on marvell > 7042. Please try this. diff --git a/block/as-iosched.c b/block/as-iosched.c index b201d16..9603684 100644 --- a/block/as-iosched.c +++ b/block/as-iosched.c @@ -1275,9 +1275,13 @@ static void as_merged_requests(struct request_queue *q, struct request *req, * Don't copy here but swap, because when anext is * removed below, it must contain the unused context */ - double_spin_lock(&rioc->lock, &nioc->lock, rioc < nioc); - swap_io_context(&rioc, &nioc); - double_spin_unlock(&rioc->lock, &nioc->lock, rioc < nioc); + if (rioc != nioc) { + double_spin_lock(&rioc->lock, &nioc->lock, + rioc < nioc); + swap_io_context(&rioc, &nioc); + double_spin_unlock(&rioc->lock, &nioc->lock, + rioc < nioc); + } } } > Stacktraces: > > 0:mon> t > [link register ] c000000000221984 .as_merged_requests+0x164/0x1e0 > [c00000007e8ff750] c00000007e8ff800 (unreliable) > [c00000007e8ff800] c000000000212f20 .elv_merge_requests+0x50/0xb0 > [c00000007e8ff890] c000000000218c98 .attempt_merge+0x318/0x3d0 > [c00000007e8ff940] c00000000021af98 .__make_request+0x2f8/0x720 > [c00000007e8ffa20] c000000000216e6c .generic_make_request+0x22c/0x2f0 > [c00000007e8ffad0] c000000000216fd8 .submit_bio+0xa8/0x150 > [c00000007e8ffb90] c0000000000efa7c .submit_bh+0x15c/0x1e0 > [c00000007e8ffc20] c000000000145d5c .journal_do_submit_data+0x6c/0xb0 > [c00000007e8ffcc0] c000000000147460 .journal_commit_transaction+0x1640/0x16e0 > [c00000007e8ffe10] c00000000014c238 .kjournald+0x108/0x2f0 > [c00000007e8fff00] c000000000067ca8 .kthread+0xc8/0xe0 > [c00000007e8fff90] c0000000000237bc .kernel_thread+0x4c/0x68 > 0:mon> c1 > 1:mon> t > [link register ] c00000000033c20c .scsi_device_unbusy+0x8c/0x120 > [c00000007e10b6f0] c00000000033c1a8 .scsi_device_unbusy+0x28/0x120 (unreliable) > [c00000007e10b780] c000000000333078 .scsi_finish_command+0x38/0x130 > [c00000007e10b810] c00000000033c478 .scsi_softirq_done+0xd8/0x1a0 > [c00000007e10b8b0] c00000000021a780 .blk_done_softirq+0xb0/0xe0 > [c00000007e10b940] c000000000051e58 .__do_softirq+0xe8/0x1d0 > [c00000007e10ba00] c00000000000ac64 .do_softirq+0x64/0xa0 > [c00000007e10ba80] c000000000051b84 .irq_exit+0xc4/0xd0 > [c00000007e10bb00] c00000000000b3c0 .do_IRQ+0xe0/0x120 > [c00000007e10bb80] c00000000000405c hardware_interrupt_entry+0x18/0x3c > --- Exception: 501 (Hardware Interrupt) at c000000000010564 .cpu_idle+0xb4/0x160 > [c00000007e10be70] c000000000010524 .cpu_idle+0x74/0x160 (unreliable) > [c00000007e10bf00] c000000000540e6c > [c00000007e10bf90] c000000000007364 .start_secondary_prolog+0xc/0x10 > > > scsi_device_busy is sitting on: > > spin_lock(sdev->request_queue->queue_lock); > > and as_merged_requests on the second lock at: > > double_spin_lock(&rioc->lock, &nioc->lock, rioc < nioc); > > The latter is the problem, but since the queue lock is also held before doing the double_spin_lock(), we get into even more trouble. The locking hierarchy is fine, it's always queue lock -> io context locks, so the above patch should be all that is needed to fix this. My initial analysis was wrong, that's all :/ -- Jens Axboe ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: system hang on latest git 2008-01-29 21:38 ` Jens Axboe @ 2008-01-29 21:56 ` Olof Johansson 0 siblings, 0 replies; 5+ messages in thread From: Olof Johansson @ 2008-01-29 21:56 UTC (permalink / raw) To: Jens Axboe; +Cc: Luck, Tony, LKML, linux-ia64 On Tue, Jan 29, 2008 at 10:38:52PM +0100, Jens Axboe wrote: > Please try this. [...] > The latter is the problem, but since the queue lock is also held before > doing the double_spin_lock(), we get into even more trouble. The locking > hierarchy is fine, it's always queue lock -> io context locks, so the > above patch should be all that is needed to fix this. > > My initial analysis was wrong, that's all :/ Yep, looks much better now. Tested-by: Olof Johansson <olof@lixom.net> -Olof ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2008-01-29 21:56 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <1FE6DD409037234FAB833C420AA843EC75736A@orsmsx424.amr.corp.intel.com>
2008-01-29 20:11 ` system hang on latest git Jens Axboe
2008-01-29 20:16 ` Jens Axboe
2008-01-29 21:46 ` Olof Johansson
2008-01-29 21:38 ` Jens Axboe
2008-01-29 21:56 ` Olof Johansson
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox