* [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
@ 2008-03-17 11:18 KOSAKI Motohiro
2008-03-17 11:51 ` KOSAKI Motohiro
2008-03-17 17:38 ` Luck, Tony
0 siblings, 2 replies; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-17 11:18 UTC (permalink / raw)
To: LKML, linux-ia64, Fenghua Yu, Hidetoshi Seto, Tony Luck,
Andrew Morton
Cc: kosaki.motohiro
Hi
I tested following.
% ./hackbench 130 process 500
(this parameter mean consume all physical memory and 1G swap space)
2.6.25-rc3-mm1: works well
2.6.25-rc5: works well
2.6.25-rc5-mm1: doesn't finish >12 hour
my bisect indicate the cause of bug is git-ia64.patch.
and, git-ia64.patch of 2.6.25-rc5-mm1 contain following patches.
[IA64] cleanup and improve fsys_gettimeofday
[IA64] Multiple outstanding ptc.g instruction support
[IA64] VIRT_CPU_ACCOUNTING (accurate cpu time accounting)
unfortunately, I don't know how investigate more ;-)
Please any suggestion.
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 11:18 [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour KOSAKI Motohiro
@ 2008-03-17 11:51 ` KOSAKI Motohiro
2008-03-17 16:09 ` Luck, Tony
2008-03-17 17:38 ` Luck, Tony
1 sibling, 1 reply; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-17 11:51 UTC (permalink / raw)
To: linux-ia64, Fenghua Yu, Tony Luck
Cc: kosaki.motohiro, LKML, Hidetoshi Seto, Andrew Morton
Hi Tony-san,
seto-san teach me method of more investigate.
and I found the root cause is contained by following patch.
[IA64] Multiple outstanding ptc.g instruction support
Could you please revert that patch from ia64?
I dislike its strong regression.
- kosaki
> [IA64] cleanup and improve fsys_gettimeofday
> [IA64] Multiple outstanding ptc.g instruction support
> [IA64] VIRT_CPU_ACCOUNTING (accurate cpu time accounting)
>
> unfortunately, I don't know how investigate more ;-)
> Please any suggestion.
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 11:51 ` KOSAKI Motohiro
@ 2008-03-17 16:09 ` Luck, Tony
2008-03-17 23:43 ` KOSAKI Motohiro
0 siblings, 1 reply; 18+ messages in thread
From: Luck, Tony @ 2008-03-17 16:09 UTC (permalink / raw)
To: KOSAKI Motohiro, linux-ia64, Yu, Fenghua
Cc: LKML, Hidetoshi Seto, Andrew Morton
> seto-san teach me method of more investigate.
> and I found the root cause is contained by following patch.
>
> [IA64] Multiple outstanding ptc.g instruction support
Thank you very much for your testing, and for isolating the
cause of this regression.
> 2.6.25-rc5: works well
> 2.6.25-rc5-mm1: doesn't finish >12 hour
How long does hackbench take to complete on 2.6.25-rc5?
Does -mm1 hang completely? Or is it making slow progress?
> Could you please revert that patch from ia64?
I'll revert it while I investigate the cause of this
problem and try to find a way to avoid this problem.
> I dislike its strong regression.
Me too!
-Tony
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 11:18 [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour KOSAKI Motohiro
2008-03-17 11:51 ` KOSAKI Motohiro
@ 2008-03-17 17:38 ` Luck, Tony
2008-03-17 23:44 ` KOSAKI Motohiro
1 sibling, 1 reply; 18+ messages in thread
From: Luck, Tony @ 2008-03-17 17:38 UTC (permalink / raw)
To: KOSAKI Motohiro, LKML, linux-ia64, Yu, Fenghua, Hidetoshi Seto,
Andrew Morton
> % ./hackbench 130 process 500
I'd like to reproduce your results so I can make a new
version of the ptc.g patch that doesn't have this regression.
I found this version (that claims to be the latest) of hackbench.c
http://people.redhat.com/mingo/cfs-scheduler/tools/hackbench.c
Is this the one you used?
What was the configuration of your machine (how many cpus, how
much memory)?
Thanks
-Tony
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 16:09 ` Luck, Tony
@ 2008-03-17 23:43 ` KOSAKI Motohiro
0 siblings, 0 replies; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-17 23:43 UTC (permalink / raw)
To: Luck, Tony
Cc: kosaki.motohiro, linux-ia64, Yu, Fenghua, LKML, Hidetoshi Seto,
Andrew Morton
Hi Tony-san,
Thank you for quick responce.
> > 2.6.25-rc5: works well
> > 2.6.25-rc5-mm1: doesn't finish >12 hour
>
> How long does hackbench take to complete on 2.6.25-rc5?
> Does -mm1 hang completely? Or is it making slow progress?
2.6.25-rc5: about 200sec
-mm1: super heavy soft lockup happend.
I cut & paste console message to tail of this mail.
> > Could you please revert that patch from ia64?
>
> I'll revert it while I investigate the cause of this
> problem and try to find a way to avoid this problem.
>
> > I dislike its strong regression.
>
> Me too!
Thank you very much!
----------------------------------------------------------
2.6.25-rc5-mm1 spinlock lockup
Starting Avahi daemon... [ OK ]
Starting HAL daemon: [ OK ]
Starting smartd: [ OK ]
Red Hat Enterprise Linux Server release 5.1 (Tikanga)
Kernel 2.6.25-rc5-mm1 on an ia64
PQ-muneda login: BUG: spinlock lockup on CPU#1, hackbench/12693, e0000040c45b1950
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà000160369ff950 bspà000160369f1560
[<a000000100015f70>] dump_stack+0x30/0x60
spà000160369ffb20 bspà000160369f1548
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà000160369ffb20 bspà000160369f14f0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000160369ffb20 bspà000160369f14d0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà000160369ffb20 bspà000160369f14a8
[<a00000010014da60>] page_referenced+0x180/0x340
spà000160369ffb20 bspà000160369f1460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000160369ffb30 bspà000160369f13b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000160369ffbf0 bspà000160369f1368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000160369ffbf0 bspà000160369f1290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000160369ffc30 bspà000160369f11d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000160369ffc40 bspà000160369f11a8
BUG: spinlock lockup on CPU#3, hackbench/12705, a04000000013abd0
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0001603727f940 bspà00016037271588
[<a000000100015f70>] dump_stack+0x30/0x60
spà0001603727fb10 bspà00016037271570
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà0001603727fb10 bspà00016037271518
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0001603727fb10 bspà000160372714f8
[<a00000010014b470>] page_check_address+0x170/0x200
spà0001603727fb10 bspà000160372714b8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà0001603727fb10 bspà00016037271478
[<a00000010014db10>] page_referenced+0x230/0x340
spà0001603727fb20 bspà00016037271430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0001603727fb30 bspà00016037271388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0001603727fbf0 bspà00016037271338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0001603727fbf0 bspà00016037271260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0001603727fc30 bspà000160372711a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0001603727fc40 bspà00016037271178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0001603727fc40 bspà00016037271130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0001603727fc40 bspà000160372710a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0001603727fc40 bspà00016037271028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0001603727fc40 bspà00016037270fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà0001603727fc40 bspà00016037270f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0001603727fc40 bspà00016037270f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0001603727fc70 bspà00016037270e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0001603727fca0 bspà00016037270e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0001603727fd20 bspà00016037270dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0001603727fe20 bspà00016037270d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0001603727fe20 bspà00016037270d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0001603727fe30 bspà00016037270d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00016037280000 bspà00016037270d08
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000160369ffc40 bspà000160369f1160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000160369ffc40 bspà000160369f10d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000160369ffc40 bspà000160369f1058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000160369ffc40 bspà000160369f1018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà000160369ffc40 bspà000160369f0fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà000160369ffc40 bspà000160369f0f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000160369ffc40 bspà000160369f0f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000160369ffc70 bspà000160369f0e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000160369ffca0 bspà000160369f0e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000160369ffd20 bspà000160369f0dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000160369ffe20 bspà000160369f0d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000160369ffe20 bspà000160369f0d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000160369ffe30 bspà000160369f0d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00016036a00000 bspà000160369f0d08
BUG: soft lockup - CPU#0 stuck for 61s! [hackbench:10060]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 10060, CPU 0, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628b0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x230/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45856a555966995
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a00000010012b2a0 b7 : a00000010000f190
f6 : 1003efffffffffffffc03 f7 : 1003e0000000000000b13
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff481
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000004
r8 : e0000040beab0c64 r9 : e0000040cee50000 r10 : ffffffffdead4ead
r11 : 00000000dead4ead r12 : e0000040beabfb10 r13 : e0000040beab0000
r14 : 0000000000000001 r15 : 0000000000000000 r16 : a040000058126354
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : 5fc0000000000000 r27 : a040000058246e30 r28 : 0000000000000000
r29 : e0000040beab0c64 r30 : 0000000000000000 r31 : a04000005824a1d8
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà000040beabf770 bspà000040beab1838
[<a000000100016820>] show_regs+0x880/0x8c0
spà000040beabf940 bspà000040beab17d8
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà000040beabf940 bspà000040beab1788
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà000040beabf940 bspà000040beab1770
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà000040beabf940 bspà000040beab1740
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà000040beabf940 bspà000040beab16e0
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà000040beabf940 bspà000040beab16a8
[<a000000100102600>] __do_IRQ+0x140/0x420
spà000040beabf940 bspà000040beab1648
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà000040beabf940 bspà000040beab15c8
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà000040beabf940 bspà000040beab15c8
[<a0000001004628b0>] _raw_spin_lock+0x230/0x300
spà000040beabfb10 bspà000040beab1548
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000040beabfb10 bspà000040beab1528
[<a00000010014b470>] page_check_address+0x170/0x200
spà000040beabfb10 bspà000040beab14e8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà000040beabfb10 bspà000040beab14a8
[<a00000010014db10>] page_referenced+0x230/0x340
spà000040beabfb20 bspà000040beab1460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000040beabfb30 bspà000040beab13b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000040beabfbf0 bspà000040beab1368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000040beabfbf0 bspà000040beab1290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000040beabfc30 bspà000040beab11d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000040beabfc40 bspà000040beab11a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000040beabfc40 bspà000040beab1160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000040beabfc40 bspà000040beab10d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000040beabfc40 bspà000040beab1058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000040beabfc40 bspà000040beab1018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà000040beabfc40 bspà000040beab0fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà000040beabfc40 bspà000040beab0f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000040beabfc40 bspà000040beab0f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000040beabfc70 bspà000040beab0e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000040beabfca0 bspà000040beab0e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000040beabfd20 bspà000040beab0dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000040beabfe20 bspà000040beab0d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000040beabfe20 bspà000040beab0d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000040beabfe30 bspà000040beab0d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà000040beac0000 bspà000040beab0d08
BUG: soft lockup - CPU#7 stuck for 61s! [hackbench:10576]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 10576, CPU 7, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628b0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x230/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: a45456a565965a55 bsps: a00000010014b680 pr : a45456a565966995
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70433f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a000000100088f20 b7 : a0000001000995a0
f6 : 1000f87ff800000000000 f7 : 1003e0000000000010fff
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff481
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000000
r8 : e000000045b20c64 r9 : e0000040beab0000 r10 : ffffffffdead4ead
r11 : 00000000dead4ead r12 : e000000045b2fb20 r13 : e000000045b20000
r14 : 0000000000000001 r15 : 0000000000000007 r16 : e0000040c45b48fc
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : e000000045b2fb50 r27 : a040000058271bb0 r28 : e000000045b2fb48
r29 : 0000000000007bfe r30 : a000000100d764b0 r31 : a040000058271b58
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà00000045b2f780 bspà00000045b217d8
[<a000000100016820>] show_regs+0x880/0x8c0
spà00000045b2f950 bspà00000045b21780
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà00000045b2f950 bspà00000045b21730
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà00000045b2f950 bspà00000045b21718
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà00000045b2f950 bspà00000045b216e8
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà00000045b2f950 bspà00000045b21688
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà00000045b2f950 bspà00000045b21650
[<a000000100102600>] __do_IRQ+0x140/0x420
spà00000045b2f950 bspà00000045b215e8
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà00000045b2f950 bspà00000045b21570
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà00000045b2f950 bspà00000045b21570
[<a0000001004628b0>] _raw_spin_lock+0x230/0x300
spà00000045b2fb20 bspà00000045b214f0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà00000045b2fb20 bspà00000045b214d0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà00000045b2fb20 bspà00000045b214a8
[<a00000010014da60>] page_referenced+0x180/0x340
spà00000045b2fb20 bspà00000045b21460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà00000045b2fb30 bspà00000045b213b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà00000045b2fbf0 bspà00000045b21368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà00000045b2fbf0 bspà00000045b21290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà00000045b2fc30 bspà00000045b211d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà00000045b2fc40 bspà00000045b211a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà00000045b2fc40 bspà00000045b21160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà00000045b2fc40 bspà00000045b210d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà00000045b2fc40 bspà00000045b21058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà00000045b2fc40 bspà00000045b21018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà00000045b2fc40 bspà00000045b20fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà00000045b2fc40 bspà00000045b20f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà00000045b2fc40 bspà00000045b20f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà00000045b2fc70 bspà00000045b20e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà00000045b2fca0 bspà00000045b20e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà00000045b2fd20 bspà00000045b20dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà00000045b2fe20 bspà00000045b20d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà00000045b2fe20 bspà00000045b20d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà00000045b2fe30 bspà00000045b20d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00000045b30000 bspà00000045b20d08
BUG: soft lockup - CPU#5 stuck for 61s! [hackbench:12811]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 12811, CPU 5, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000005 ip : [<a000000100008d30>] Not tainted (2.6.25-rc5-mm1)
ip is at ia64_delay_loop+0x30/0x40
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45456a555966995
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a00000010012b2a0 b7 : a00000010000f190
f6 : 1003efffffffffffffc03 f7 : 1003e0000000000000b13
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff481
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000003
r8 : e00001603bd10c64 r9 : e0000040cda10000 r10 : ffffffffdead4ead
r11 : 00000000dead4ead r12 : e00001603bd1fb20 r13 : e00001603bd10000
r14 : 0000000000000001 r15 : 0000000000000005 r16 : e0000040c45b48fc
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : e00001603bd1fb50 r27 : a040000058012d30 r28 : e00001603bd1fb48
r29 : 0000000000007bfe r30 : 0000000000000000 r31 : a04000005829e4d8
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0001603bd1f780 bspà0001603bd117d8
[<a000000100016820>] show_regs+0x880/0x8c0
spà0001603bd1f950 bspà0001603bd11780
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà0001603bd1f950 bspà0001603bd11730
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà0001603bd1f950 bspà0001603bd11718
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà0001603bd1f950 bspà0001603bd116e8
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà0001603bd1f950 bspà0001603bd11688
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà0001603bd1f950 bspà0001603bd11650
[<a000000100102600>] __do_IRQ+0x140/0x420
spà0001603bd1f950 bspà0001603bd115e8
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà0001603bd1f950 bspà0001603bd11570
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà0001603bd1f950 bspà0001603bd11570
[<a000000100008d30>] ia64_delay_loop+0x30/0x40
spà0001603bd1fb20 bspà0001603bd11548
[<a000000100462890>] _raw_spin_lock+0x210/0x300
spà0001603bd1fb20 bspà0001603bd114f0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0001603bd1fb20 bspà0001603bd114d0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà0001603bd1fb20 bspà0001603bd114a8
[<a00000010014da60>] page_referenced+0x180/0x340
spà0001603bd1fb20 bspà0001603bd11460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0001603bd1fb30 bspà0001603bd113b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0001603bd1fbf0 bspà0001603bd11368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0001603bd1fbf0 bspà0001603bd11290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0001603bd1fc30 bspà0001603bd111d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0001603bd1fc40 bspà0001603bd111a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0001603bd1fc40 bspà0001603bd11160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0001603bd1fc40 bspà0001603bd110d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0001603bd1fc40 bspà0001603bd11058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0001603bd1fc40 bspà0001603bd11018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà0001603bd1fc40 bspà0001603bd10fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà0001603bd1fc40 bspà0001603bd10f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0001603bd1fc40 bspà0001603bd10f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0001603bd1fc70 bspà0001603bd10e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0001603bd1fca0 bspà0001603bd10e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0001603bd1fd20 bspà0001603bd10dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0001603bd1fe20 bspà0001603bd10d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0001603bd1fe20 bspà0001603bd10d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0001603bd1fe30 bspà0001603bd10d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà0001603bd20000 bspà0001603bd10d08
BUG: soft lockup - CPU#4 stuck for 61s! [hackbench:9582]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 9582, CPU 4, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628b0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x230/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45456a555966995
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a00000010012b2a0 b7 : a00000010000f190
f6 : 1003efffffffffffffc03 f7 : 1003e0000000000000b13
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff497
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000002
r8 : e0000040fda10c64 r9 : e00001603e090000 r10 : ffffffffdead4ead
r11 : 00000000dead4ead r12 : e0000040fda1fb20 r13 : e0000040fda10000
r14 : 0000000000000001 r15 : 0000000000000004 r16 : e0000040c45b854c
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : e0000040fda1fb50 r27 : a0400000582853b0 r28 : e0000040fda1fb48
r29 : 0000000000007bfe r30 : 0000000000000000 r31 : a040000058285358
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà000040fda1f780 bspà000040fda117a8
[<a000000100016820>] show_regs+0x880/0x8c0
spà000040fda1f950 bspà000040fda11750
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà000040fda1f950 bspà000040fda11700
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà000040fda1f950 bspà000040fda116e8
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà000040fda1f950 bspà000040fda116b8
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà000040fda1f950 bspà000040fda11658
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà000040fda1f950 bspà000040fda11620
[<a000000100102600>] __do_IRQ+0x140/0x420
spà000040fda1f950 bspà000040fda115b8
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà000040fda1f950 bspà000040fda11540
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà000040fda1f950 bspà000040fda11540
[<a0000001004628b0>] _raw_spin_lock+0x230/0x300
spà000040fda1fb20 bspà000040fda114c0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000040fda1fb20 bspà000040fda114a0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà000040fda1fb20 bspà000040fda11478
[<a00000010014da60>] page_referenced+0x180/0x340
spà000040fda1fb20 bspà000040fda11430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000040fda1fb30 bspà000040fda11388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000040fda1fbf0 bspà000040fda11338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000040fda1fbf0 bspà000040fda11260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000040fda1fc30 bspà000040fda111a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000040fda1fc40 bspà000040fda11178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000040fda1fc40 bspà000040fda11130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000040fda1fc40 bspà000040fda110a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000040fda1fc40 bspà000040fda11028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000040fda1fc40 bspà000040fda10fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà000040fda1fc40 bspà000040fda10f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000040fda1fc40 bspà000040fda10f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000040fda1fc70 bspà000040fda10e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000040fda1fca0 bspà000040fda10e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000040fda1fd20 bspà000040fda10dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000040fda1fe20 bspà000040fda10d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000040fda1fe20 bspà000040fda10d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000040fda1fe30 bspà000040fda10d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà000040fda20000 bspà000040fda10d08
BUG: soft lockup - CPU#6 stuck for 61s! [hackbench:13609]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 13609, CPU 6, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628b0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x230/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45456a555966995
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a00000010012b2a0 b7 : a0000001001ac880
f6 : 1003efffffffffffffc03 f7 : 1003e0000000000000b13
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff497
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000004
r8 : e00001601a310c64 r9 : e0000040cee50000 r10 : ffffffffdead4ead
r11 : 00000000dead4ead r12 : e00001601a31fb10 r13 : e00001601a310000
r14 : 0000000000000001 r15 : 0000000000000006 r16 : a040000058126354
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : 5fc0000000000000 r27 : a040000058246270 r28 : 0000000000000006
r29 : e00001601a310c64 r30 : 0000000000000000 r31 : a04000005824a8d8
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0001601a31f770 bspà0001601a311838
[<a000000100016820>] show_regs+0x880/0x8c0
spà0001601a31f940 bspà0001601a3117d8
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà0001601a31f940 bspà0001601a311788
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà0001601a31f940 bspà0001601a311770
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà0001601a31f940 bspà0001601a311740
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà0001601a31f940 bspà0001601a3116e0
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà0001601a31f940 bspà0001601a3116a8
[<a000000100102600>] __do_IRQ+0x140/0x420
spà0001601a31f940 bspà0001601a311648
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà0001601a31f940 bspà0001601a3115c8
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà0001601a31f940 bspà0001601a3115c8
[<a0000001004628b0>] _raw_spin_lock+0x230/0x300
spà0001601a31fb10 bspà0001601a311548
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0001601a31fb10 bspà0001601a311528
[<a00000010014b470>] page_check_address+0x170/0x200
spà0001601a31fb10 bspà0001601a3114e8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà0001601a31fb10 bspà0001601a3114a8
[<a00000010014db10>] page_referenced+0x230/0x340
spà0001601a31fb20 bspà0001601a311460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0001601a31fb30 bspà0001601a3113b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0001601a31fbf0 bspà0001601a311368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0001601a31fbf0 bspà0001601a311290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0001601a31fc30 bspà0001601a3111d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0001601a31fc40 bspà0001601a3111a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0001601a31fc40 bspà0001601a311160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0001601a31fc40 bspà0001601a3110d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0001601a31fc40 bspà0001601a311058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0001601a31fc40 bspà0001601a311018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà0001601a31fc40 bspà0001601a310fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà0001601a31fc40 bspà0001601a310f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0001601a31fc40 bspà0001601a310f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0001601a31fc70 bspà0001601a310e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0001601a31fca0 bspà0001601a310e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0001601a31fd20 bspà0001601a310dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0001601a31fe20 bspà0001601a310d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0001601a31fe20 bspà0001601a310d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0001601a31fe30 bspà0001601a310d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà0001601a320000 bspà0001601a310d08
BUG: soft lockup - CPU#2 stuck for 61s! [hackbench:11407]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 11407, CPU 2, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628b0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x230/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45856a555966999
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a00000010012b2a0 b7 : a00000010000f190
f6 : 1003efffffffffffffc03 f7 : 1003e0000000000000b13
f8 : 1003e0000000000000023 f9 : 1003ea3d70a3d70a3d70b
f10 : 1003e0000000000000051 f11 : 1003efffffffffffff497
r1 : a000000100f80990 r2 : 0000000000000000 r3 : 0000000000000002
r8 : 0000000000000000 r9 : a000000100d9b9d8 r10 : a000000100d81e28
r11 : 00000000dead4ead r12 : e00000003241fb20 r13 : e000000032410000
r14 : 0000000000000001 r15 : 0000000000000000 r16 : e000000032410c64
r17 : 000000002f7c4000 r18 : 000000000bc6c000 r19 : 0000000000614000
r20 : 000000000c280000 r21 : a000000100d98da8 r22 : a000000100d98da8
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : e00000003241fb50 r27 : a0400000582a3730 r28 : e00000003241fb48
r29 : 0000000000007bfe r30 : 0000000000000000 r31 : a0400000582a4ed8
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0000003241f780 bspà000000324117a8
[<a000000100016820>] show_regs+0x880/0x8c0
spà0000003241f950 bspà00000032411750
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà0000003241f950 bspà00000032411700
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà0000003241f950 bspà000000324116e8
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà0000003241f950 bspà000000324116b8
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà0000003241f950 bspà00000032411658
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà0000003241f950 bspà00000032411620
[<a000000100102600>] __do_IRQ+0x140/0x420
spà0000003241f950 bspà000000324115b8
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà0000003241f950 bspà00000032411540
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà0000003241f950 bspà00000032411540
[<a0000001004628b0>] _raw_spin_lock+0x230/0x300
spà0000003241fb20 bspà000000324114c0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0000003241fb20 bspà000000324114a0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà0000003241fb20 bspà00000032411478
[<a00000010014da60>] page_referenced+0x180/0x340
spà0000003241fb20 bspà00000032411430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0000003241fb30 bspà00000032411388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0000003241fbf0 bspà00000032411338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0000003241fbf0 bspà00000032411260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0000003241fc30 bspà000000324111a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0000003241fc40 bspà00000032411178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0000003241fc40 bspà00000032411130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0000003241fc40 bspà000000324110a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0000003241fc40 bspà00000032411028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0000003241fc40 bspà00000032410fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà0000003241fc40 bspà00000032410f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0000003241fc40 bspà00000032410f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0000003241fc70 bspà00000032410e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0000003241fca0 bspà00000032410e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0000003241fd20 bspà00000032410dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0000003241fe20 bspà00000032410d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0000003241fe20 bspà00000032410d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0000003241fe30 bspà00000032410d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00000032420000 bspà00000032410d08
BUG: soft lockup - CPU#3 stuck for 61s! [hackbench:12705]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore
Pid: 12705, CPU 3, comm: hackbench
psr : 00001010085a6010 ifs : 8000000000000590 ip : [<a0000001004628a0>] Not tainted (2.6.25-rc5-mm1)
ip is at _raw_spin_lock+0x220/0x300
unat: 0000000000000000 pfs : 0000000000000590 rsc : 0000000000000003
rnat: 0000000000000000 bsps: 0000000000000000 pr : a45866a555966a99
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70033f
csd : 0000000000000000 ssd : 0000000000000000
b0 : a000000100462890 b6 : a0000001004573c0 b7 : a00000010000b1f0
f6 : 000000000000000000000 f7 : 1003e9e3779b97f4a7c16
f8 : 1003e0a00000000001072 f9 : 1003effffffffffffffe9
f10 : 1003e000000000000002d f11 : 1003e8208208208208209
r1 : a000000100f80990 r2 : 0000000000000000 r3 : e00001603727f940
r8 : ffffffffffffffff r9 : a000000100d98bd0 r10 : 0000000000000001
r11 : a000000100d98bf8 r12 : e00001603727fb10 r13 : e000016037270000
r14 : 0000000000000001 r15 : 9fffffffffffffff r16 : e00001603727f588
r17 : 00000000dead4ead r18 : 0000000000004000 r19 : a000000100d98bc8
r20 : 000000000000000a r21 : 0000000000000000 r22 : a000000100d9b2b0
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 0000000000000001
r26 : 00000000000fffff r27 : 0000000000007c3b r28 : 0000000000000003
r29 : e000016037270c64 r30 : ffffffffffffffdb r31 : 0000000000000022
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0001603727f770 bspà00016037271808
[<a000000100016820>] show_regs+0x880/0x8c0
spà0001603727f940 bspà000160372717a8
[<a000000100101b60>] softlockup_tick+0x2e0/0x360
spà0001603727f940 bspà00016037271758
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà0001603727f940 bspà00016037271740
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà0001603727f940 bspà00016037271710
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà0001603727f940 bspà000160372716b0
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà0001603727f940 bspà00016037271678
[<a000000100102600>] __do_IRQ+0x140/0x420
spà0001603727f940 bspà00016037271618
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà0001603727f940 bspà00016037271598
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà0001603727f940 bspà00016037271598
[<a0000001004628a0>] _raw_spin_lock+0x220/0x300
spà0001603727fb10 bspà00016037271518
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0001603727fb10 bspà000160372714f8
[<a00000010014b470>] page_check_address+0x170/0x200
spà0001603727fb10 bspà000160372714b8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà0001603727fb10 bspà00016037271478
[<a00000010014db10>] page_referenced+0x230/0x340
spà0001603727fb20 bspà00016037271430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0001603727fb30 bspà00016037271388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0001603727fbf0 bspà00016037271338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0001603727fbf0 bspà00016037271260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0001603727fc30 bspà000160372711a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0001603727fc40 bspà00016037271178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0001603727fc40 bspà00016037271130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0001603727fc40 bspà000160372710a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0001603727fc40 bspà00016037271028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0001603727fc40 bspà00016037270fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà0001603727fc40 bspà00016037270f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0001603727fc40 bspà00016037270f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0001603727fc70 bspà00016037270e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0001603727fca0 bspà00016037270e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0001603727fd20 bspà00016037270dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0001603727fe20 bspà00016037270d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0001603727fe20 bspà00016037270d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0001603727fe30 bspà00016037270d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00016037280000 bspà00016037270d08
BUG: spinlock lockup on CPU#4, hackbench/9582, e0000040c45b8548
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà000040fda1f950 bspà000040fda11530
[<a000000100015f70>] dump_stack+0x30/0x60
spà000040fda1fb20 bspà000040fda11518
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà000040fda1fb20 bspà000040fda114c0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000040fda1fb20 bspà000040fda114a0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà000040fda1fb20 bspà000040fda11478
[<a00000010014da60>] page_referenced+0x180/0x340
spà000040fda1fb20 bspà000040fda11430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000040fda1fb30 bspà000040fda11388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000040fda1fbf0 bspà000040fda11338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000040fda1fbf0 bspà000040fda11260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000040fda1fc30 bspà000040fda111a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000040fda1fc40 bspà000040fda11178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000040fda1fc40 bspà000040fda11130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000040fda1fc40 bspà000040fda110a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000040fda1fc40 bspà000040fda11028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000040fda1fc40 bspà000040fda10fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà000040fda1fc40 bspà000040fda10f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000040fda1fc40 bspà000040fda10f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000040fda1fc70 bspà000040fda10e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000040fda1fca0 bspà000040fda10e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000040fda1fd20 bspà000040fda10dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000040fda1fe20 bspà000040fda10d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000040fda1fe20 bspà000040fda10d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000040fda1fe30 bspà000040fda10d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà000040fda20000 bspà000040fda10d08
BUG: spinlock lockup on CPU#6, hackbench/13609, a040000058126350
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0001601a31f940 bspà0001601a3115b8
[<a000000100015f70>] dump_stack+0x30/0x60
spà0001601a31fb10 bspà0001601a3115a0
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà0001601a31fb10 bspà0001601a311548
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0001601a31fb10 bspà0001601a311528
[<a00000010014b470>] page_check_address+0x170/0x200
spà0001601a31fb10 bspà0001601a3114e8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà0001601a31fb10 bspà0001601a3114a8
[<a00000010014db10>] page_referenced+0x230/0x340
spà0001601a31fb20 bspà0001601a311460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0001601a31fb30 bspà0001601a3113b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0001601a31fbf0 bspà0001601a311368
BUG: soft lockup - CPU#1 stuck for 61s! [hackbench:12693]
Modules linked in: sunrpc binfmt_misc dm_multipath fan sg thermal processor button container e100 eepro100 mii dm_snapshot dm_zero dm_mirror dm_log dm_mod lpfc mptspi mptscsih mptbase ehci_hcd ohci_hcd uhci_hcd usbcore001601a31fc30 bspà0001601a3111d8
[<a000000100120950>] __alloc_pages+0x30/0x60
Pid: 12693, CPU 1, comm: hackbench31fc40 bspà0001601a3111a8
psr : 00001410085a6010 ifs : 8000000000000005 ip : [<a000000100008d02>] Not tainted (2.6.25-rc5-mm1) spà0001601a31fc40 bspà0001601a311160
ip is at ia64_delay_loop+0x2/0x40loc+0x320/0x440
unat: 0000000000000000 pfs : 0000000000000590 rsc : 000000000000000310d8
rnat: e0000160369f0c60 bsps: 0000000000000000 pr : a45856a565966a95
ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c8a70433f1058
csd : 0000000000000000 ssd : 0000000000000000x110/0x3c0
b0 : a000000100462890 b6 : a0000001005db080 b7 : a000000100537b801018
f6 : 1003e00000016fe73fb4d f7 : 1003e0000000000000190
f8 : 1003e00000016fe73f9bd f9 : 1003e0000000000000001à0001601a310fe0
f10 : 1003e5eca18d532710f16 f11 : 1003e000000000000000e
r1 : a000000100f80990 r2 : 0000000000000000 r3 : e0000160369ff9500f98
r8 : ffffffffffffffff r9 : a000000100d98bd0 r10 : 0000000000000001
r11 : a000000100d98bf8 r12 : e0000160369ffb20 r13 : e0000160369f00000f28
r14 : 0000000000000001 r15 : 9fffffffffffffff r16 : e0000160369ff598
r17 : 00000000dead4ead r18 : a000000100cf122c r19 : a000000100d98bc80e68
r20 : 0000000000008303 r21 : 00000000000fffff r22 : 0000000000100000
r23 : 0000000000000001 r24 : 0000000000000001 r25 : 00000000000000010e28
r26 : 0000000000000000 r27 : 0000000000000060 r28 : 0000000000000001
r29 : e0000160369f0c64 r30 : 0000000000008301 r31 : 00000000000083010dd0
[<a000000100182f70>] vfs_write+0x310/0x320
Call Trace: spà0001601a31fe20 bspà0001601a310d80
[<a000000100015f20>] show_stack+0x80/0xa0
spà000160369ff780 bspà000160369f17d8
[<a000000100016820>] show_regs+0x880/0x8c0+0x0/0x20
spà000160369ff950 bspà000160369f1780
[<a000000100101b60>] softlockup_tick+0x2e0/0x3600x0/0x20
spà000160369ff950 bspà000160369f1730
[<a0000001000ba320>] run_local_timers+0x40/0x60
spà000160369ff950 bspà000160369f1718
[<a0000001000ba420>] update_process_times+0x40/0xc0
spà000160369ff950 bspà000160369f16e8
[<a00000010003b150>] timer_interrupt+0x1b0/0x4a0
spà000160369ff950 bspà000160369f1688
[<a000000100102420>] handle_IRQ_event+0x80/0x120
spà000160369ff950 bspà000160369f1650
[<a000000100102600>] __do_IRQ+0x140/0x420
spà000160369ff950 bspà000160369f15e8
[<a000000100012df0>] ia64_handle_irq+0x3f0/0x420
spà000160369ff950 bspà000160369f1570
[<a00000010000abe0>] ia64_leave_kernel+0x0/0x270
spà000160369ff950 bspà000160369f1570
[<a000000100008d00>] ia64_delay_loop+0x0/0x40
spà000160369ffb20 bspà000160369f1548
[<a000000100462890>] _raw_spin_lock+0x210/0x300
spà000160369ffb20 bspà000160369f14f0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000160369ffb20 bspà000160369f14d0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà000160369ffb20 bspà000160369f14a8
[<a00000010014da60>] page_referenced+0x180/0x340
spà000160369ffb20 bspà000160369f1460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000160369ffb30 bspà000160369f13b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000160369ffbf0 bspà000160369f1368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000160369ffbf0 bspà000160369f1290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000160369ffc30 bspà000160369f11d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000160369ffc40 bspà000160369f11a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000160369ffc40 bspà000160369f1160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000160369ffc40 bspà000160369f10d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000160369ffc40 bspà000160369f1058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000160369ffc40 bspà000160369f1018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà000160369ffc40 bspà000160369f0fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà000160369ffc40 bspà000160369f0f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000160369ffc40 bspà000160369f0f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000160369ffc70 bspà000160369f0e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000160369ffca0 bspà000160369f0e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000160369ffd20 bspà000160369f0dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000160369ffe20 bspà000160369f0d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000160369ffe20 bspà000160369f0d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000160369ffe30 bspà000160369f0d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00016036a00000 bspà000160369f0d08
BUG: spinlock lockup on CPU#0, hackbench/10060, a040000058126350
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà000040beabf940 bspà000040beab15b8
[<a000000100015f70>] dump_stack+0x30/0x60
spà000040beabfb10 bspà000040beab15a0
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà000040beabfb10 bspà000040beab1548
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà000040beabfb10 bspà000040beab1528
[<a00000010014b470>] page_check_address+0x170/0x200
spà000040beabfb10 bspà000040beab14e8
[<a00000010014b680>] page_referenced_one+0xa0/0x2c0
spà000040beabfb10 bspà000040beab14a8
[<a00000010014db10>] page_referenced+0x230/0x340
spà000040beabfb20 bspà000040beab1460
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà000040beabfb30 bspà000040beab13b8
[<a00000010012e460>] shrink_zone+0x220/0x260
spà000040beabfbf0 bspà000040beab1368
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà000040beabfbf0 bspà000040beab1290
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà000040beabfc30 bspà000040beab11d8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà000040beabfc40 bspà000040beab11a8
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà000040beabfc40 bspà000040beab1160
[<a000000100172480>] fallback_alloc+0x320/0x440
spà000040beabfc40 bspà000040beab10d8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà000040beabfc40 bspà000040beab1058
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà000040beabfc40 bspà000040beab1018
[<a000000100171a80>] __kmalloc_node+0x60/0xa0
spà000040beabfc40 bspà000040beab0fe0
[<a0000001006ea250>] __alloc_skb+0xb0/0x280
spà000040beabfc40 bspà000040beab0f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà000040beabfc40 bspà000040beab0f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà000040beabfc70 bspà000040beab0e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà000040beabfca0 bspà000040beab0e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà000040beabfd20 bspà000040beab0dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà000040beabfe20 bspà000040beab0d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà000040beabfe20 bspà000040beab0d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà000040beabfe30 bspà000040beab0d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà000040beac0000 bspà000040beab0d08
BUG: spinlock lockup on CPU#2, hackbench/11407, e0000040c45b8548
Call Trace:
[<a000000100015f20>] show_stack+0x80/0xa0
spà0000003241f950 bspà00000032411530
[<a000000100015f70>] dump_stack+0x30/0x60
spà0000003241fb20 bspà00000032411518
[<a000000100462960>] _raw_spin_lock+0x2e0/0x300
spà0000003241fb20 bspà000000324114c0
[<a0000001008290e0>] _spin_lock+0x20/0x40
spà0000003241fb20 bspà000000324114a0
[<a00000010014b2e0>] page_lock_anon_vma+0x80/0xa0
spà0000003241fb20 bspà00000032411478
[<a00000010014da60>] page_referenced+0x180/0x340
spà0000003241fb20 bspà00000032411430
[<a00000010012bd00>] shrink_active_list+0x9e0/0xe80
spà0000003241fb30 bspà00000032411388
[<a00000010012e460>] shrink_zone+0x220/0x260
spà0000003241fbf0 bspà00000032411338
[<a0000001001300f0>] try_to_free_pages+0x590/0x900
spà0000003241fbf0 bspà00000032411260
[<a000000100120500>] __alloc_pages_internal+0x4a0/0x860
spà0000003241fc30 bspà000000324111a8
[<a000000100120950>] __alloc_pages+0x30/0x60
spà0000003241fc40 bspà00000032411178
[<a0000001001714e0>] kmem_getpages+0x120/0x2a0
spà0000003241fc40 bspà00000032411130
[<a000000100172480>] fallback_alloc+0x320/0x440
spà0000003241fc40 bspà000000324110a8
[<a0000001001726b0>] ____cache_alloc_node+0x110/0x300
spà0000003241fc40 bspà00000032411028
[<a000000100171770>] kmem_cache_alloc_node+0x110/0x3c0
spà0000003241fc40 bspà00000032410fe0
[<a0000001006ea210>] __alloc_skb+0x70/0x280
spà0000003241fc40 bspà00000032410f98
[<a0000001006df480>] sock_alloc_send_skb+0x400/0x560
spà0000003241fc40 bspà00000032410f28
[<a0000001007e7930>] unix_stream_sendmsg+0x3b0/0x780
spà0000003241fc70 bspà00000032410e68
[<a0000001006d7e80>] sock_aio_write+0x260/0x2a0
spà0000003241fca0 bspà00000032410e28
[<a0000001001817b0>] do_sync_write+0x170/0x260
spà0000003241fd20 bspà00000032410dd0
[<a000000100182f70>] vfs_write+0x310/0x320
spà0000003241fe20 bspà00000032410d80
[<a000000100183ad0>] sys_write+0x70/0xe0
spà0000003241fe20 bspà00000032410d08
[<a00000010000aa40>] ia64_ret_from_syscall+0x0/0x20
spà0000003241fe30 bspà00000032410d08
[<a000000000010720>] __kernel_syscall_via_break+0x0/0x20
spà00000032420000 bspà00000032410d08
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 17:38 ` Luck, Tony
@ 2008-03-17 23:44 ` KOSAKI Motohiro
2008-03-18 0:36 ` Yu, Fenghua
0 siblings, 1 reply; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-17 23:44 UTC (permalink / raw)
To: Luck, Tony
Cc: kosaki.motohiro, LKML, linux-ia64, Yu, Fenghua, Hidetoshi Seto,
Andrew Morton
Hi
> > % ./hackbench 130 process 500
>
> I'd like to reproduce your results so I can make a new
> version of the ptc.g patch that doesn't have this regression.
Great.
> I found this version (that claims to be the latest) of hackbench.c
>
> http://people.redhat.com/mingo/cfs-scheduler/tools/hackbench.c
>
> Is this the one you used?
Yes, I used it.
> What was the configuration of your machine (how many cpus, how
> much memory)?
cpu: Itanium2 x8
mem: 8GB(4GB x2 node)
Thanks.
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-17 23:44 ` KOSAKI Motohiro
@ 2008-03-18 0:36 ` Yu, Fenghua
2008-03-18 0:47 ` KOSAKI Motohiro
0 siblings, 1 reply; 18+ messages in thread
From: Yu, Fenghua @ 2008-03-18 0:36 UTC (permalink / raw)
To: KOSAKI Motohiro, Luck, Tony
Cc: LKML, linux-ia64, Hidetoshi Seto, Andrew Morton
> > % ./hackbench 130 process 500
> > 2.6.25-rc5: works well
> > 2.6.25-rc5-mm1: doesn't finish >12 hour
>cpu: Itanium2 x8
>mem: 8GB(4GB x2 node)
On Itanium2x8 w/ 8GB on 1 node, All of 2.6.24, 2.6.25-rc5 and
2.6.25-rc5-mm1 shows same "out of memory" within one minute. I haven't
seen spinlock lockup issue yet.
Thanks.
-Fenghua
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-18 0:36 ` Yu, Fenghua
@ 2008-03-18 0:47 ` KOSAKI Motohiro
2008-03-19 0:14 ` Yu, Fenghua
0 siblings, 1 reply; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-18 0:47 UTC (permalink / raw)
To: Yu, Fenghua
Cc: kosaki.motohiro, Luck, Tony, LKML, linux-ia64, Hidetoshi Seto,
Andrew Morton
Hi Yu-san,
> > > % ./hackbench 130 process 500
> > > 2.6.25-rc5: works well
> > > 2.6.25-rc5-mm1: doesn't finish >12 hour
> >cpu: Itanium2 x8
> >mem: 8GB(4GB x2 node)
>
> On Itanium2x8 w/ 8GB on 1 node, All of 2.6.24, 2.6.25-rc5 and
> 2.6.25-rc5-mm1 shows same "out of memory" within one minute. I haven't
> seen spinlock lockup issue yet.
this paramter mean use all physical memory and about 1GB swap space.
Could you expand swap space?
- kosaki
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-18 0:47 ` KOSAKI Motohiro
@ 2008-03-19 0:14 ` Yu, Fenghua
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
2008-04-04 23:52 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luck, Tony
0 siblings, 2 replies; 18+ messages in thread
From: Yu, Fenghua @ 2008-03-19 0:14 UTC (permalink / raw)
To: KOSAKI Motohiro
Cc: Luck, Tony, LKML, linux-ia64, Hidetoshi Seto, Andrew Morton
>this paramter mean use all physical memory and about 1GB swap space.
>Could you expand swap space?
We can reproduce the soft lockup issue now and root cause the issue as
well.
Since the ptc.g patch uses semaphore ptcg_sem to serialize multiple
ptc.g instructions in ia64_global_tlb_purge(). This requires the code
path should be safe to sleep in down(). But the code path can not sleep
during swap because it holds some spin locks (e.g. anon_vma_lock). Going
to sleep finally causes soft lockup.
Actually we though of this issue before releasing the ptcg patch and
wrote some non-sleeping versions of ptcg patches. But since we couldn't
see the sleeping issue during our testing, we didn't release a
non-sleeping ptcg patch. If replacing the ptcg patch in -mm1 tree with
one of our non-sleeping ptcg patches, the issue goes away.
Tony and I are working on releasing a final ptcg patch to solve the
issue.
Thanks.
-Fenghua
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish
2008-03-19 0:14 ` Yu, Fenghua
@ 2008-03-19 23:23 ` Peter Zijlstra
2008-03-19 23:50 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Yu, Fenghua
` (2 more replies)
2008-04-04 23:52 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luck, Tony
1 sibling, 3 replies; 18+ messages in thread
From: Peter Zijlstra @ 2008-03-19 23:23 UTC (permalink / raw)
To: Yu, Fenghua
Cc: KOSAKI Motohiro, Luck, Tony, LKML, linux-ia64, Hidetoshi Seto,
Andrew Morton
On Tue, 2008-03-18 at 17:14 -0700, Yu, Fenghua wrote:
> >this paramter mean use all physical memory and about 1GB swap space.
> >Could you expand swap space?
>
> We can reproduce the soft lockup issue now and root cause the issue as
> well.
>
> Since the ptc.g patch uses semaphore ptcg_sem to serialize multiple
> ptc.g instructions in ia64_global_tlb_purge(). This requires the code
> path should be safe to sleep in down(). But the code path can not sleep
> during swap because it holds some spin locks (e.g. anon_vma_lock). Going
> to sleep finally causes soft lockup.
>
> Actually we though of this issue before releasing the ptcg patch and
> wrote some non-sleeping versions of ptcg patches. But since we couldn't
> see the sleeping issue during our testing, we didn't release a
> non-sleeping ptcg patch. If replacing the ptcg patch in -mm1 tree with
> one of our non-sleeping ptcg patches, the issue goes away.
>
> Tony and I are working on releasing a final ptcg patch to solve the
> issue.
Which makes me wonder, why did you ever use a semaphore here? Looking at
the code its a straight forward mutex. And when you would have used a
mutex lockdep would have warned about this.
There is hardly ever a good reason to use semaphores in new code, we're
trying very hard to get rid of them.
Hmm, then again, does ia64 have lockdep?
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
@ 2008-03-19 23:50 ` Yu, Fenghua
2008-03-20 3:39 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luming Yu
2008-03-20 16:04 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Luck, Tony
2 siblings, 0 replies; 18+ messages in thread
From: Yu, Fenghua @ 2008-03-19 23:50 UTC (permalink / raw)
To: Peter Zijlstra
Cc: KOSAKI Motohiro, Luck, Tony, LKML, linux-ia64, Hidetoshi Seto,
Andrew Morton
>Which makes me wonder, why did you ever use a semaphore here? Looking
at
>the code its a straight forward mutex. And when you would have used a
>mutex lockdep would have warned about this.
>There is hardly ever a good reason to use semaphores in new code, we're
>trying very hard to get rid of them.
The real issue here is the code path can not go to sleep. If simply
replacing semaphore with mutex, the issue still happens. First of all we
need to have a lock system which doesn't allow the code go to sleep. We
are
working on a new patch now.
But one step back, if without considering this sleeping issue, I agree
with
you that mutex would be a better approach than semaphore.
>Hmm, then again, does ia64 have lockdep?
IA64 doesn't support lockdep yet.
Thanks.
-Fenghua
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
2008-03-19 23:50 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Yu, Fenghua
@ 2008-03-20 3:39 ` Luming Yu
2008-03-20 4:51 ` KOSAKI Motohiro
2008-03-20 16:04 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Luck, Tony
2 siblings, 1 reply; 18+ messages in thread
From: Luming Yu @ 2008-03-20 3:39 UTC (permalink / raw)
To: Peter Zijlstra
Cc: Yu, Fenghua, KOSAKI Motohiro, Luck, Tony, LKML, linux-ia64,
Hidetoshi Seto, Andrew Morton
> Hmm, then again, does ia64 have lockdep?
I posted an initial version of lockdep support for ia64 here.
http://www.gelato.unsw.edu.au/archives/linux-ia64/0712/21510.html
Time is still needed to make it upstream though..
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-20 3:39 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luming Yu
@ 2008-03-20 4:51 ` KOSAKI Motohiro
0 siblings, 0 replies; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-03-20 4:51 UTC (permalink / raw)
To: Luming Yu
Cc: Peter Zijlstra, Yu, Fenghua, Luck, Tony, LKML, linux-ia64,
Hidetoshi Seto, Andrew Morton
Hi
> > Hmm, then again, does ia64 have lockdep?
> I posted an initial version of lockdep support for ia64 here.
> http://www.gelato.unsw.edu.au/archives/linux-ia64/0712/21510.html
> Time is still needed to make it upstream though..
That is very nice work.
please rebase to latest kernel.
I advocate your activity :)
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
2008-03-19 23:50 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Yu, Fenghua
2008-03-20 3:39 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luming Yu
@ 2008-03-20 16:04 ` Luck, Tony
2008-03-20 16:16 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 Peter Zijlstra
2 siblings, 1 reply; 18+ messages in thread
From: Luck, Tony @ 2008-03-20 16:04 UTC (permalink / raw)
To: Peter Zijlstra, Yu, Fenghua
Cc: KOSAKI Motohiro, LKML, linux-ia64, Hidetoshi Seto, Andrew Morton
> Which makes me wonder, why did you ever use a semaphore here? Looking at
> the code its a straight forward mutex. And when you would have used a
> mutex lockdep would have warned about this.
The functionality that we are trying to add is to allow up to N
simultaneous processors to execute the critical region. On current
processors/platforms N=1 so a spinlock or mutex would be fine, but
there will be platforms for which N is a small integer greater than
one. Semaphore initialized to N looked to be the ideal primitive
for this (until Motohiro-san ran the test case that showed the path
where we call this code with a spinlock held).
Next question is whether it is reasonable to get to this code
while holding a spinlock. Isn't this a problem for architectures
that need to use cross-processor interrupts to do a global TLB
shootdown?
-Tony
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12
2008-03-20 16:04 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Luck, Tony
@ 2008-03-20 16:16 ` Peter Zijlstra
2008-03-20 17:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12hour Luck, Tony
0 siblings, 1 reply; 18+ messages in thread
From: Peter Zijlstra @ 2008-03-20 16:16 UTC (permalink / raw)
To: Luck, Tony
Cc: Yu, Fenghua, KOSAKI Motohiro, LKML, linux-ia64, Hidetoshi Seto,
Andrew Morton
On Thu, 2008-03-20 at 09:04 -0700, Luck, Tony wrote:
> > Which makes me wonder, why did you ever use a semaphore here? Looking at
> > the code its a straight forward mutex. And when you would have used a
> > mutex lockdep would have warned about this.
>
> The functionality that we are trying to add is to allow up to N
> simultaneous processors to execute the critical region. On current
> processors/platforms N=1 so a spinlock or mutex would be fine, but
> there will be platforms for which N is a small integer greater than
> one. Semaphore initialized to N looked to be the ideal primitive
> for this (until Motohiro-san ran the test case that showed the path
> where we call this code with a spinlock held).
Right, no alternative there.
> Next question is whether it is reasonable to get to this code
> while holding a spinlock. Isn't this a problem for architectures
> that need to use cross-processor interrupts to do a global TLB
> shootdown?
Yeah, semaphores can't be used from hardirq contexts for much the same
reasons. But its all ia64 code, right? So I'm not directly seeing how
other archs are affected here.
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12hour
2008-03-20 16:16 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 Peter Zijlstra
@ 2008-03-20 17:23 ` Luck, Tony
0 siblings, 0 replies; 18+ messages in thread
From: Luck, Tony @ 2008-03-20 17:23 UTC (permalink / raw)
To: Peter Zijlstra
Cc: Yu, Fenghua, KOSAKI Motohiro, LKML, linux-ia64, Hidetoshi Seto,
Andrew Morton
> Yeah, semaphores can't be used from hardirq contexts for much the same
> reasons. But its all ia64 code, right? So I'm not directly seeing how
> other archs are affected here.
The root of the problem is a call chain like this:
[<a00000010005c200>] ia64_global_tlb_purge+0x540/0xa40
spà000001c28afca0 bspà000001c28a0ea0
[<a00000010005c8a0>] flush_tlb_range+0x1a0/0x240
spà000001c28afca0 bspà000001c28a0e70
[<a000000100105a90>] page_referenced_one+0x170/0x260
spà000001c28afca0 bspà000001c28a0e20
[<a000000100105f20>] page_referenced+0x180/0x320
spà000001c28afcb0 bspà000001c28a0dd0
[<a0000001000ec640>] shrink_active_list+0x520/0xbc0
spà000001c28afcc0 bspà000001c28a0d48
[<a0000001000ece30>] shrink_zone+0x150/0x200
spà000001c28afd80 bspà000001c28a0d00
[<a0000001000edaf0>] kswapd+0x690/0x940
spà000001c28afd80 bspà000001c28a0c40
[<a0000001000a6860>] kthread+0xa0/0x120
spà000001c28afe30 bspà000001c28a0c10
[<a000000100013b70>] kernel_thread_helper+0x30/0x60
spà000001c28afe30 bspà000001c28a0be0
[<a0000001000089c0>] start_kernel_thread+0x20/0x40
spà000001c28afe30 bspà000001c28a0be0
Some place in the upper (arch independent) parts of this trace the
code acquires an anon_vma_lock and holds it while calling into arch
specific code (there is presumably some inlining going on here because
the source doesn't have an obvious call from page_referenced_one() to
flush_tlb_range()).
My concern for other architectures (especially ones that require IPI to
complete the TLB shootdown) is that holding anon_vma_lock while doing
the shootdown leaves them open for some deadlock scenarios in this
code path. But perhaps I completley misunderstand how IPI shootdown
happens (very possible).
-Tony
^ permalink raw reply [flat|nested] 18+ messages in thread
* RE: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-03-19 0:14 ` Yu, Fenghua
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
@ 2008-04-04 23:52 ` Luck, Tony
2008-04-05 5:03 ` KOSAKI Motohiro
1 sibling, 1 reply; 18+ messages in thread
From: Luck, Tony @ 2008-04-04 23:52 UTC (permalink / raw)
To: Yu, Fenghua, KOSAKI Motohiro
Cc: LKML, linux-ia64, Hidetoshi Seto, Andrew Morton
> We can reproduce the soft lockup issue now and root cause the issue as
well.
>
> Since the ptc.g patch uses semaphore ptcg_sem to serialize multiple
> ptc.g instructions in ia64_global_tlb_purge(). This requires the code
> path should be safe to sleep in down(). But the code path can not
> sleep during swap because it holds some spin locks (e.g.
anon_vma_lock).
> Going to sleep finally causes soft lockup.
I've just pushed a new version of this patch into my test tree (so
it will show up in -mm and linux-next when they pull and rebuild).
It now uses the Piggin/Wilcox "spinaphore" primitives. It passes
all my tests, and some runs of "hackbench" that tried to match the
tests that failed before at Fujitsu by forcing moderate amounts of
swapping.
-Tony
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour
2008-04-04 23:52 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luck, Tony
@ 2008-04-05 5:03 ` KOSAKI Motohiro
0 siblings, 0 replies; 18+ messages in thread
From: KOSAKI Motohiro @ 2008-04-05 5:03 UTC (permalink / raw)
To: Luck, Tony; +Cc: Yu, Fenghua, LKML, linux-ia64, Hidetoshi Seto, Andrew Morton
> I've just pushed a new version of this patch into my test tree (so
> it will show up in -mm and linux-next when they pull and rebuild).
> It now uses the Piggin/Wilcox "spinaphore" primitives. It passes
> all my tests, and some runs of "hackbench" that tried to match the
> tests that failed before at Fujitsu by forcing moderate amounts of
> swapping.
Great!
^ permalink raw reply [flat|nested] 18+ messages in thread
end of thread, other threads:[~2008-04-05 5:03 UTC | newest]
Thread overview: 18+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-03-17 11:18 [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour KOSAKI Motohiro
2008-03-17 11:51 ` KOSAKI Motohiro
2008-03-17 16:09 ` Luck, Tony
2008-03-17 23:43 ` KOSAKI Motohiro
2008-03-17 17:38 ` Luck, Tony
2008-03-17 23:44 ` KOSAKI Motohiro
2008-03-18 0:36 ` Yu, Fenghua
2008-03-18 0:47 ` KOSAKI Motohiro
2008-03-19 0:14 ` Yu, Fenghua
2008-03-19 23:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish Peter Zijlstra
2008-03-19 23:50 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Yu, Fenghua
2008-03-20 3:39 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luming Yu
2008-03-20 4:51 ` KOSAKI Motohiro
2008-03-20 16:04 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 hour Luck, Tony
2008-03-20 16:16 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12 Peter Zijlstra
2008-03-20 17:23 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish>12hour Luck, Tony
2008-04-04 23:52 ` [2.6.25-rc5-mm1][regression] ia64: hackbench doesn't finish >12 hour Luck, Tony
2008-04-05 5:03 ` KOSAKI Motohiro
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox