All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Hans-Peter Jansen" <hpj@urpla.net>
To: linux-kernel@vger.kernel.org
Cc: opensuse-kernel@opensuse.org, xfs@oss.sgi.com
Subject: 2.6.34-rc3: simple du (on a big xfs tree) triggers oom killer
Date: Mon, 5 Apr 2010 00:49:17 +0200	[thread overview]
Message-ID: <201004050049.17952.hpj@urpla.net> (raw)

[Sorry for the cross post, but I don't know where to start to tackle this 
 issue]

Hi,

on an attempt to get to a current kernel, I suffer from an issue, where a 
simple du on a reasonably big xfs tree leads to invoking the oom killer: 

Apr  4 23:24:53 tyrex kernel: [  418.913223] XFS mounting filesystem sdd1
Apr  4 23:24:54 tyrex kernel: [  419.774606] Ending clean XFS mount for filesystem: sdd1
Apr  4 23:26:02 tyrex kernel: [  488.160795] du invoked oom-killer: gfp_mask=0x802d0, order=0, oom_adj=0
Apr  4 23:26:02 tyrex kernel: [  488.160798] du cpuset=/ mems_allowed=0
Apr  4 23:26:02 tyrex kernel: [  488.160800] Pid: 6397, comm: du Tainted: G        W  2.6.34-rc3-13-vanilla #1
Apr  4 23:26:02 tyrex kernel: [  488.160802] Call Trace:
Apr  4 23:26:02 tyrex kernel: [  488.160808]  [<c02becc7>] dump_header+0x67/0x1a0
Apr  4 23:26:02 tyrex kernel: [  488.160811]  [<c03cf1a7>] ? ___ratelimit+0x77/0xe0
Apr  4 23:26:02 tyrex kernel: [  488.160813]  [<c02bee59>] oom_kill_process+0x59/0x160
Apr  4 23:26:02 tyrex kernel: [  488.160815]  [<c02bf43e>] __out_of_memory+0x4e/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160817]  [<c02bf502>] out_of_memory+0x52/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160819]  [<c02c20f4>] __alloc_pages_slowpath+0x444/0x4c0
Apr  4 23:26:02 tyrex kernel: [  488.160822]  [<c02c22c2>] __alloc_pages_nodemask+0x152/0x160
Apr  4 23:26:02 tyrex kernel: [  488.160825]  [<c02ea4a9>] cache_grow+0x249/0x2e0
Apr  4 23:26:02 tyrex kernel: [  488.160838]  [<c02ea748>] cache_alloc_refill+0x208/0x240
Apr  4 23:26:02 tyrex kernel: [  488.160840]  [<c02eab19>] kmem_cache_alloc+0xb9/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160868]  [<f86375dd>] ? xfs_trans_brelse+0xfd/0x150 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160888]  [<f863d547>] kmem_zone_alloc+0x77/0xb0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160905]  [<f860a043>] ? xfs_da_state_free+0x53/0x60 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160923]  [<f861c796>] xfs_inode_alloc+0x26/0x110 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160941]  [<f861c8b2>] xfs_iget_cache_miss+0x32/0x200 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160960]  [<f8630fd9>] ? xfs_perag_get+0x29/0xa0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160977]  [<f861cbac>] xfs_iget+0x12c/0x1b0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160997]  [<f863c9c5>] xfs_lookup+0xc5/0x110 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.161017]  [<f86467d1>] xfs_vn_lookup+0x51/0xa0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.161021]  [<c02ffef7>] do_lookup+0xa7/0x1c0
Apr  4 23:26:02 tyrex kernel: [  488.161023]  [<c030089b>] link_path_walk+0x1db/0x850
Apr  4 23:26:02 tyrex kernel: [  488.161026]  [<c024a870>] ? capable+0x10/0x40
Apr  4 23:26:02 tyrex kernel: [  488.161028]  [<c038969e>] ? security_inode_permission+0x1e/0x20
Apr  4 23:26:02 tyrex kernel: [  488.161030]  [<c0301294>] path_walk+0x54/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.161032]  [<c0301431>] do_path_lookup+0x51/0x90
Apr  4 23:26:02 tyrex kernel: [  488.161034]  [<c0301f8a>] user_path_at+0x4a/0x80
Apr  4 23:26:02 tyrex kernel: [  488.161037]  [<c02f9748>] ? cp_new_stat64+0xe8/0x100
Apr  4 23:26:02 tyrex kernel: [  488.161039]  [<c02f9bb5>] vfs_fstatat+0x35/0x70
Apr  4 23:26:02 tyrex kernel: [  488.161041]  [<c02f9c0d>] sys_fstatat64+0x1d/0x40
Apr  4 23:26:02 tyrex kernel: [  488.161044]  [<c0202d8c>] sysenter_do_call+0x12/0x22
Apr  4 23:26:02 tyrex kernel: [  488.161045] Mem-Info:
Apr  4 23:26:02 tyrex kernel: [  488.161046] DMA per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161047] CPU    0: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161048] CPU    1: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161050] CPU    2: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161051] CPU    3: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161052] CPU    4: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161053] CPU    5: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161054] CPU    6: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161056] CPU    7: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161057] Normal per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161058] CPU    0: hi:  186, btch:  31 usd: 173
Apr  4 23:26:02 tyrex kernel: [  488.161059] CPU    1: hi:  186, btch:  31 usd:  18
Apr  4 23:26:02 tyrex kernel: [  488.161060] CPU    2: hi:  186, btch:  31 usd:  41
Apr  4 23:26:02 tyrex kernel: [  488.161061] CPU    3: hi:  186, btch:  31 usd: 197
Apr  4 23:26:02 tyrex kernel: [  488.161062] CPU    4: hi:  186, btch:  31 usd: 175
Apr  4 23:26:02 tyrex kernel: [  488.161063] CPU    5: hi:  186, btch:  31 usd: 157
Apr  4 23:26:02 tyrex kernel: [  488.161065] CPU    6: hi:  186, btch:  31 usd: 182
Apr  4 23:26:02 tyrex kernel: [  488.161066] CPU    7: hi:  186, btch:  31 usd: 173
Apr  4 23:26:02 tyrex kernel: [  488.161067] HighMem per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161068] CPU    0: hi:  186, btch:  31 usd:  24
Apr  4 23:26:02 tyrex kernel: [  488.161069] CPU    1: hi:  186, btch:  31 usd:  33
Apr  4 23:26:02 tyrex kernel: [  488.161070] CPU    2: hi:  186, btch:  31 usd:  55
Apr  4 23:26:02 tyrex kernel: [  488.161071] CPU    3: hi:  186, btch:  31 usd: 172
Apr  4 23:26:02 tyrex kernel: [  488.161072] CPU    4: hi:  186, btch:  31 usd: 177
Apr  4 23:26:02 tyrex kernel: [  488.161073] CPU    5: hi:  186, btch:  31 usd: 100
Apr  4 23:26:02 tyrex kernel: [  488.161075] CPU    6: hi:  186, btch:  31 usd:  38
Apr  4 23:26:02 tyrex kernel: [  488.161076] CPU    7: hi:  186, btch:  31 usd: 115
Apr  4 23:26:02 tyrex kernel: [  488.161078] active_anon:98428 inactive_anon:24627 isolated_anon:0
Apr  4 23:26:02 tyrex kernel: [  488.161079]  active_file:4141 inactive_file:25571 isolated_file:0
Apr  4 23:26:02 tyrex kernel: [  488.161080]  unevictable:0 dirty:2 writeback:0 unstable:0
Apr  4 23:26:02 tyrex kernel: [  488.161080]  free:403317 slab_reclaimable:203863 slab_unreclaimable:4837
Apr  4 23:26:02 tyrex kernel: [  488.161081]  mapped:7418 shmem:137 pagetables:625 bounce:0
Apr  4 23:26:02 tyrex kernel: [  488.161085] 
DMA free:3520kB min:64kB low:80kB high:96kB active_anon:0kB inactive_anon:0kB 
active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:15804kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:6032kB 
slab_unreclaimable:36kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:0 all_unreclaimable? yes
Apr  4 23:26:02 tyrex kernel: [  488.161089] lowmem_reserve[]: 0 865 3023 3023
Apr  4 23:26:02 tyrex kernel: [  488.161093] 
Normal free:3580kB min:3728kB low:4660kB high:5592kB active_anon:0kB inactive_anon:0kB 
active_file:360kB inactive_file:264kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:885944kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:809420kB 
slab_unreclaimable:19312kB kernel_stack:3304kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:24 all_unreclaimable? no
Apr  4 23:26:02 tyrex kernel: [  488.161097] lowmem_reserve[]: 0 0 17267 17267
Apr  4 23:26:02 tyrex kernel: [  488.161101] 
HighMem free:1606168kB min:512kB low:2836kB high:5164kB active_anon:393712kB inactive_anon:98508kB 
active_file:16228kB inactive_file:102020kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:2210252kB mlocked:0kB dirty:8kB writeback:0kB mapped:29672kB shmem:548kB slab_reclaimable:0kB 
slab_unreclaimable:0kB kernel_stack:0kB pagetables:2500kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:0 all_unreclaimable? no
Apr  4 23:26:02 tyrex kernel: [  488.161105] lowmem_reserve[]: 0 0 0 0
Apr  4 23:26:02 tyrex kernel: [  488.161107] DMA: 18*4kB 53*8kB 31*16kB 20*32kB 14*64kB 8*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 3552kB
Apr  4 23:26:02 tyrex kernel: [  488.161112] Normal: 32*4kB 1*8kB 1*16kB 1*32kB 1*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 1*2048kB 0*4096kB = 3704kB
Apr  4 23:26:02 tyrex kernel: [  488.161117] HighMem: 17*4kB 29*8kB 47*16kB 16*32kB 6*64kB 30*128kB 53*256kB 27*512kB 14*1024kB 7*2048kB 377*4096kB = 1606044kB
Apr  4 23:26:02 tyrex kernel: [  488.161122] 29947 total pagecache pages
Apr  4 23:26:02 tyrex kernel: [  488.161123] 0 pages in swap cache
Apr  4 23:26:02 tyrex kernel: [  488.161124] Swap cache stats: add 0, delete 0, find 0/0
Apr  4 23:26:02 tyrex kernel: [  488.161125] Free swap  = 2104476kB
Apr  4 23:26:02 tyrex kernel: [  488.161126] Total swap = 2104476kB
Apr  4 23:26:02 tyrex kernel: [  488.165523] 784224 pages RAM
Apr  4 23:26:02 tyrex kernel: [  488.165524] 556914 pages HighMem
Apr  4 23:26:02 tyrex kernel: [  488.165525] 12060 pages reserved
Apr  4 23:26:02 tyrex kernel: [  488.165526] 82604 pages shared
Apr  4 23:26:02 tyrex kernel: [  488.165527] 328045 pages non-shared
Apr  4 23:26:02 tyrex kernel: [  488.165529] Out of memory: kill process 4788 (mysqld-max) score 326208 or a child
Apr  4 23:26:02 tyrex kernel: [  488.165531] Killed process 4788 (mysqld-max) vsz:1304832kB, anon-rss:121428kB, file-rss:4336kB
[...]

vmstat 1, when it occurs:
 1  0      0 1722516      4 926532    0    0 13800     0 2598 6479  2 14 84  0
 3  0      0 1735540      4 927392    0    0 10128     0 2144 5434  0  8 92  0
 1  0      0 1733944      4 927868    0    0 10416    92 4465 8264  1  7 93  0
 0  0      0 1733448      4 928024    0    0  9220     0 2081 3570  1  8 91  0
 0  0      0 1734160      4 927412    0    0  7404     0 1912 3014  0  6 90  4
 1  0      0 1733072      4 927208    0    0  9512     0 2227 3591  1  8 89  2
 1  1      0 1728356      4 928256    0    0  4620     0 1334 2087  0  2 92  5
 2  1      0 1926208      4 928832    0    0  3848   712 1869 3515  0  5 84 11
 1  0      0 2181520      4 849064    0    0  2736  3983 3154 308011  2 14 71 13
 0  0      0 2181028      4 847560    0    0   140   109  300  484  2  0 98  0
 4  0      0 2561212      4 467272    0    0     4     0  383  233  0  6 94  0
 0  0      0 2995016      4  33560    0    0    44     0  421  238  0  9 91  0
 0  0      0 2995024      4  33584    0    0   120     0  187  251  0  0 100  0
 0  0      0 2994932      4  33636    0    0     0     0   34   63  0  0 100  0
 1  0      0 2995212      4  33548    0    0     8     0   72   94  0  0 100  0
 0  0      0 2995368      4  33528    0    0     0     0   39   73  0  0 100  0
 0  0      0 2995384      4  33348    0    0     0     0   96  116  0  0 100  0
 0  0      0 2995556      4  33256    0    0    40     0  139  184  0  0 100  0
 0  0      0 2995800      4  33248    0    0    68     0  178  193  0  0 100  0

The oom-killer got triggered around the line with 308011 context switches.
Needless to say, this event renders this system dysfunctional.

Sorry for the insanely long lines, but wrapping them to a sane width results 
in an unparseable mess. This kernel is closest to upstream, that I was able
to produce, built in Novell build service, available here:
http://download.opensuse.org/repositories/home:/frispete:/kernel-head/openSUSE_11.1/
As you can see, userspace is openSUSE 11.1, arch is i586, based on vanilla 
built of rc3 with Linus' git tree applied up to 5e11611a5d22252f3f9c169a3c9377eac0c32033.
I have reproduced this issue also with 2.6.33.1. Usually, I run the openSUSE 
customized kernels, e.g. kernel-pae-2.6.27.45-0.1.1 doesn't show this issue.

boot.msg: ftp://urpla.net/boot.msg
config: ftp://urpla.net/vanilla

I do use some unusual vm settings, as documented in boot.msg, but that should 
not result in this issue, does it?

Hardware: Xeon X3460, Asus P7F-E motherboard, Intel 3420 PCH chipset, 12 GB Mem

In case, you wonder, why I drove that test with a 4GB memory setup: that was
the easiest way to demonstrate the problem without resorting to tainted and 
heavy customized kernels. And for compatibility reasons (customer support), 
I have to stay at i586 for the moment. 

Of course, the du command does succeed with the distributed kernel with exactly
the same (userspace and sysctl) settings: 
1,1T    /backup/daily.0

Anybody with an idea, what's going on here?

TIA,
Pete

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

WARNING: multiple messages have this Message-ID (diff)
From: "Hans-Peter Jansen" <hpj@urpla.net>
To: linux-kernel@vger.kernel.org
Cc: opensuse-kernel@opensuse.org, xfs@oss.sgi.com
Subject: 2.6.34-rc3: simple du (on a big xfs tree) triggers oom killer
Date: Mon, 5 Apr 2010 00:49:17 +0200	[thread overview]
Message-ID: <201004050049.17952.hpj@urpla.net> (raw)

[Sorry for the cross post, but I don't know where to start to tackle this 
 issue]

Hi,

on an attempt to get to a current kernel, I suffer from an issue, where a 
simple du on a reasonably big xfs tree leads to invoking the oom killer: 

Apr  4 23:24:53 tyrex kernel: [  418.913223] XFS mounting filesystem sdd1
Apr  4 23:24:54 tyrex kernel: [  419.774606] Ending clean XFS mount for filesystem: sdd1
Apr  4 23:26:02 tyrex kernel: [  488.160795] du invoked oom-killer: gfp_mask=0x802d0, order=0, oom_adj=0
Apr  4 23:26:02 tyrex kernel: [  488.160798] du cpuset=/ mems_allowed=0
Apr  4 23:26:02 tyrex kernel: [  488.160800] Pid: 6397, comm: du Tainted: G        W  2.6.34-rc3-13-vanilla #1
Apr  4 23:26:02 tyrex kernel: [  488.160802] Call Trace:
Apr  4 23:26:02 tyrex kernel: [  488.160808]  [<c02becc7>] dump_header+0x67/0x1a0
Apr  4 23:26:02 tyrex kernel: [  488.160811]  [<c03cf1a7>] ? ___ratelimit+0x77/0xe0
Apr  4 23:26:02 tyrex kernel: [  488.160813]  [<c02bee59>] oom_kill_process+0x59/0x160
Apr  4 23:26:02 tyrex kernel: [  488.160815]  [<c02bf43e>] __out_of_memory+0x4e/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160817]  [<c02bf502>] out_of_memory+0x52/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160819]  [<c02c20f4>] __alloc_pages_slowpath+0x444/0x4c0
Apr  4 23:26:02 tyrex kernel: [  488.160822]  [<c02c22c2>] __alloc_pages_nodemask+0x152/0x160
Apr  4 23:26:02 tyrex kernel: [  488.160825]  [<c02ea4a9>] cache_grow+0x249/0x2e0
Apr  4 23:26:02 tyrex kernel: [  488.160838]  [<c02ea748>] cache_alloc_refill+0x208/0x240
Apr  4 23:26:02 tyrex kernel: [  488.160840]  [<c02eab19>] kmem_cache_alloc+0xb9/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.160868]  [<f86375dd>] ? xfs_trans_brelse+0xfd/0x150 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160888]  [<f863d547>] kmem_zone_alloc+0x77/0xb0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160905]  [<f860a043>] ? xfs_da_state_free+0x53/0x60 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160923]  [<f861c796>] xfs_inode_alloc+0x26/0x110 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160941]  [<f861c8b2>] xfs_iget_cache_miss+0x32/0x200 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160960]  [<f8630fd9>] ? xfs_perag_get+0x29/0xa0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160977]  [<f861cbac>] xfs_iget+0x12c/0x1b0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.160997]  [<f863c9c5>] xfs_lookup+0xc5/0x110 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.161017]  [<f86467d1>] xfs_vn_lookup+0x51/0xa0 [xfs]
Apr  4 23:26:02 tyrex kernel: [  488.161021]  [<c02ffef7>] do_lookup+0xa7/0x1c0
Apr  4 23:26:02 tyrex kernel: [  488.161023]  [<c030089b>] link_path_walk+0x1db/0x850
Apr  4 23:26:02 tyrex kernel: [  488.161026]  [<c024a870>] ? capable+0x10/0x40
Apr  4 23:26:02 tyrex kernel: [  488.161028]  [<c038969e>] ? security_inode_permission+0x1e/0x20
Apr  4 23:26:02 tyrex kernel: [  488.161030]  [<c0301294>] path_walk+0x54/0xc0
Apr  4 23:26:02 tyrex kernel: [  488.161032]  [<c0301431>] do_path_lookup+0x51/0x90
Apr  4 23:26:02 tyrex kernel: [  488.161034]  [<c0301f8a>] user_path_at+0x4a/0x80
Apr  4 23:26:02 tyrex kernel: [  488.161037]  [<c02f9748>] ? cp_new_stat64+0xe8/0x100
Apr  4 23:26:02 tyrex kernel: [  488.161039]  [<c02f9bb5>] vfs_fstatat+0x35/0x70
Apr  4 23:26:02 tyrex kernel: [  488.161041]  [<c02f9c0d>] sys_fstatat64+0x1d/0x40
Apr  4 23:26:02 tyrex kernel: [  488.161044]  [<c0202d8c>] sysenter_do_call+0x12/0x22
Apr  4 23:26:02 tyrex kernel: [  488.161045] Mem-Info:
Apr  4 23:26:02 tyrex kernel: [  488.161046] DMA per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161047] CPU    0: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161048] CPU    1: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161050] CPU    2: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161051] CPU    3: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161052] CPU    4: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161053] CPU    5: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161054] CPU    6: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161056] CPU    7: hi:    0, btch:   1 usd:   0
Apr  4 23:26:02 tyrex kernel: [  488.161057] Normal per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161058] CPU    0: hi:  186, btch:  31 usd: 173
Apr  4 23:26:02 tyrex kernel: [  488.161059] CPU    1: hi:  186, btch:  31 usd:  18
Apr  4 23:26:02 tyrex kernel: [  488.161060] CPU    2: hi:  186, btch:  31 usd:  41
Apr  4 23:26:02 tyrex kernel: [  488.161061] CPU    3: hi:  186, btch:  31 usd: 197
Apr  4 23:26:02 tyrex kernel: [  488.161062] CPU    4: hi:  186, btch:  31 usd: 175
Apr  4 23:26:02 tyrex kernel: [  488.161063] CPU    5: hi:  186, btch:  31 usd: 157
Apr  4 23:26:02 tyrex kernel: [  488.161065] CPU    6: hi:  186, btch:  31 usd: 182
Apr  4 23:26:02 tyrex kernel: [  488.161066] CPU    7: hi:  186, btch:  31 usd: 173
Apr  4 23:26:02 tyrex kernel: [  488.161067] HighMem per-cpu:
Apr  4 23:26:02 tyrex kernel: [  488.161068] CPU    0: hi:  186, btch:  31 usd:  24
Apr  4 23:26:02 tyrex kernel: [  488.161069] CPU    1: hi:  186, btch:  31 usd:  33
Apr  4 23:26:02 tyrex kernel: [  488.161070] CPU    2: hi:  186, btch:  31 usd:  55
Apr  4 23:26:02 tyrex kernel: [  488.161071] CPU    3: hi:  186, btch:  31 usd: 172
Apr  4 23:26:02 tyrex kernel: [  488.161072] CPU    4: hi:  186, btch:  31 usd: 177
Apr  4 23:26:02 tyrex kernel: [  488.161073] CPU    5: hi:  186, btch:  31 usd: 100
Apr  4 23:26:02 tyrex kernel: [  488.161075] CPU    6: hi:  186, btch:  31 usd:  38
Apr  4 23:26:02 tyrex kernel: [  488.161076] CPU    7: hi:  186, btch:  31 usd: 115
Apr  4 23:26:02 tyrex kernel: [  488.161078] active_anon:98428 inactive_anon:24627 isolated_anon:0
Apr  4 23:26:02 tyrex kernel: [  488.161079]  active_file:4141 inactive_file:25571 isolated_file:0
Apr  4 23:26:02 tyrex kernel: [  488.161080]  unevictable:0 dirty:2 writeback:0 unstable:0
Apr  4 23:26:02 tyrex kernel: [  488.161080]  free:403317 slab_reclaimable:203863 slab_unreclaimable:4837
Apr  4 23:26:02 tyrex kernel: [  488.161081]  mapped:7418 shmem:137 pagetables:625 bounce:0
Apr  4 23:26:02 tyrex kernel: [  488.161085] 
DMA free:3520kB min:64kB low:80kB high:96kB active_anon:0kB inactive_anon:0kB 
active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:15804kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:6032kB 
slab_unreclaimable:36kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:0 all_unreclaimable? yes
Apr  4 23:26:02 tyrex kernel: [  488.161089] lowmem_reserve[]: 0 865 3023 3023
Apr  4 23:26:02 tyrex kernel: [  488.161093] 
Normal free:3580kB min:3728kB low:4660kB high:5592kB active_anon:0kB inactive_anon:0kB 
active_file:360kB inactive_file:264kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:885944kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:809420kB 
slab_unreclaimable:19312kB kernel_stack:3304kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:24 all_unreclaimable? no
Apr  4 23:26:02 tyrex kernel: [  488.161097] lowmem_reserve[]: 0 0 17267 17267
Apr  4 23:26:02 tyrex kernel: [  488.161101] 
HighMem free:1606168kB min:512kB low:2836kB high:5164kB active_anon:393712kB inactive_anon:98508kB 
active_file:16228kB inactive_file:102020kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:2210252kB mlocked:0kB dirty:8kB writeback:0kB mapped:29672kB shmem:548kB slab_reclaimable:0kB 
slab_unreclaimable:0kB kernel_stack:0kB pagetables:2500kB unstable:0kB bounce:0kB writeback_tmp:0kB 
pages_scanned:0 all_unreclaimable? no
Apr  4 23:26:02 tyrex kernel: [  488.161105] lowmem_reserve[]: 0 0 0 0
Apr  4 23:26:02 tyrex kernel: [  488.161107] DMA: 18*4kB 53*8kB 31*16kB 20*32kB 14*64kB 8*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 3552kB
Apr  4 23:26:02 tyrex kernel: [  488.161112] Normal: 32*4kB 1*8kB 1*16kB 1*32kB 1*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 1*2048kB 0*4096kB = 3704kB
Apr  4 23:26:02 tyrex kernel: [  488.161117] HighMem: 17*4kB 29*8kB 47*16kB 16*32kB 6*64kB 30*128kB 53*256kB 27*512kB 14*1024kB 7*2048kB 377*4096kB = 1606044kB
Apr  4 23:26:02 tyrex kernel: [  488.161122] 29947 total pagecache pages
Apr  4 23:26:02 tyrex kernel: [  488.161123] 0 pages in swap cache
Apr  4 23:26:02 tyrex kernel: [  488.161124] Swap cache stats: add 0, delete 0, find 0/0
Apr  4 23:26:02 tyrex kernel: [  488.161125] Free swap  = 2104476kB
Apr  4 23:26:02 tyrex kernel: [  488.161126] Total swap = 2104476kB
Apr  4 23:26:02 tyrex kernel: [  488.165523] 784224 pages RAM
Apr  4 23:26:02 tyrex kernel: [  488.165524] 556914 pages HighMem
Apr  4 23:26:02 tyrex kernel: [  488.165525] 12060 pages reserved
Apr  4 23:26:02 tyrex kernel: [  488.165526] 82604 pages shared
Apr  4 23:26:02 tyrex kernel: [  488.165527] 328045 pages non-shared
Apr  4 23:26:02 tyrex kernel: [  488.165529] Out of memory: kill process 4788 (mysqld-max) score 326208 or a child
Apr  4 23:26:02 tyrex kernel: [  488.165531] Killed process 4788 (mysqld-max) vsz:1304832kB, anon-rss:121428kB, file-rss:4336kB
[...]

vmstat 1, when it occurs:
 1  0      0 1722516      4 926532    0    0 13800     0 2598 6479  2 14 84  0
 3  0      0 1735540      4 927392    0    0 10128     0 2144 5434  0  8 92  0
 1  0      0 1733944      4 927868    0    0 10416    92 4465 8264  1  7 93  0
 0  0      0 1733448      4 928024    0    0  9220     0 2081 3570  1  8 91  0
 0  0      0 1734160      4 927412    0    0  7404     0 1912 3014  0  6 90  4
 1  0      0 1733072      4 927208    0    0  9512     0 2227 3591  1  8 89  2
 1  1      0 1728356      4 928256    0    0  4620     0 1334 2087  0  2 92  5
 2  1      0 1926208      4 928832    0    0  3848   712 1869 3515  0  5 84 11
 1  0      0 2181520      4 849064    0    0  2736  3983 3154 308011  2 14 71 13
 0  0      0 2181028      4 847560    0    0   140   109  300  484  2  0 98  0
 4  0      0 2561212      4 467272    0    0     4     0  383  233  0  6 94  0
 0  0      0 2995016      4  33560    0    0    44     0  421  238  0  9 91  0
 0  0      0 2995024      4  33584    0    0   120     0  187  251  0  0 100  0
 0  0      0 2994932      4  33636    0    0     0     0   34   63  0  0 100  0
 1  0      0 2995212      4  33548    0    0     8     0   72   94  0  0 100  0
 0  0      0 2995368      4  33528    0    0     0     0   39   73  0  0 100  0
 0  0      0 2995384      4  33348    0    0     0     0   96  116  0  0 100  0
 0  0      0 2995556      4  33256    0    0    40     0  139  184  0  0 100  0
 0  0      0 2995800      4  33248    0    0    68     0  178  193  0  0 100  0

The oom-killer got triggered around the line with 308011 context switches.
Needless to say, this event renders this system dysfunctional.

Sorry for the insanely long lines, but wrapping them to a sane width results 
in an unparseable mess. This kernel is closest to upstream, that I was able
to produce, built in Novell build service, available here:
http://download.opensuse.org/repositories/home:/frispete:/kernel-head/openSUSE_11.1/
As you can see, userspace is openSUSE 11.1, arch is i586, based on vanilla 
built of rc3 with Linus' git tree applied up to 5e11611a5d22252f3f9c169a3c9377eac0c32033.
I have reproduced this issue also with 2.6.33.1. Usually, I run the openSUSE 
customized kernels, e.g. kernel-pae-2.6.27.45-0.1.1 doesn't show this issue.

boot.msg: ftp://urpla.net/boot.msg
config: ftp://urpla.net/vanilla

I do use some unusual vm settings, as documented in boot.msg, but that should 
not result in this issue, does it?

Hardware: Xeon X3460, Asus P7F-E motherboard, Intel 3420 PCH chipset, 12 GB Mem

In case, you wonder, why I drove that test with a 4GB memory setup: that was
the easiest way to demonstrate the problem without resorting to tainted and 
heavy customized kernels. And for compatibility reasons (customer support), 
I have to stay at i586 for the moment. 

Of course, the du command does succeed with the distributed kernel with exactly
the same (userspace and sysctl) settings: 
1,1T    /backup/daily.0

Anybody with an idea, what's going on here?

TIA,
Pete

             reply	other threads:[~2010-04-04 22:47 UTC|newest]

Thread overview: 48+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-04-04 22:49 Hans-Peter Jansen [this message]
2010-04-04 22:49 ` 2.6.34-rc3: simple du (on a big xfs tree) triggers oom killer Hans-Peter Jansen
2010-04-05  0:49 ` Dave Chinner
2010-04-05  0:49   ` Dave Chinner
2010-04-05 11:35   ` Hans-Peter Jansen
2010-04-05 11:35     ` Hans-Peter Jansen
2010-04-05 23:06     ` Dave Chinner
2010-04-05 23:06       ` Dave Chinner
2010-04-06 14:52       ` 2.6.34-rc3: simple du (on a big xfs tree) triggers oom killer [bisected: 57817c68229984818fea9e614d6f95249c3fb098] Hans-Peter Jansen
2010-04-06 14:52         ` Hans-Peter Jansen
2010-04-06 23:11         ` Dave Chinner
2010-04-06 23:11           ` Dave Chinner
2010-04-07  1:45           ` Dave Chinner
2010-04-07  1:45             ` Dave Chinner
2010-04-07 22:02             ` Hans-Peter Jansen
2010-04-07 22:02               ` Hans-Peter Jansen
2010-04-12 22:32               ` Dave Chinner
2010-04-12 22:32                 ` Dave Chinner
2010-04-13  8:50                 ` Hans-Peter Jansen
2010-04-13  8:50                   ` Hans-Peter Jansen
2010-04-13  9:18                   ` Dave Chinner
2010-04-13  9:18                     ` Dave Chinner
2010-04-13  9:42                     ` Hans-Peter Jansen
2010-04-13  9:42                       ` Hans-Peter Jansen
2010-04-24 16:44                       ` Hans-Peter Jansen
2010-04-24 16:44                         ` Hans-Peter Jansen
2010-04-24 21:23                         ` Emmanuel Florac
2010-04-24 21:23                           ` Emmanuel Florac
2010-04-24 22:30                           ` Hans-Peter Jansen
2010-04-24 22:30                             ` Hans-Peter Jansen
2010-04-24 22:40                             ` [opensuse-kernel] " Justin P. Mattock
2010-04-24 22:40                               ` Justin P. Mattock
2010-04-24 22:41                             ` Justin P. Mattock
2010-04-24 22:41                               ` Justin P. Mattock
2010-04-25 13:04                             ` Emmanuel Florac
2010-04-25 13:04                               ` Emmanuel Florac
2010-04-25 16:27                         ` Greg KH
2010-04-25 16:27                           ` Greg KH
2010-04-25 16:57                           ` Christoph Hellwig
2010-04-25 16:57                             ` Christoph Hellwig
2010-04-25 18:18                             ` Greg KH
2010-04-25 18:18                               ` Greg KH
2010-04-26  0:36                           ` Dave Chinner
2010-04-26  0:36                             ` Dave Chinner
2010-04-27  0:02                             ` Greg KH
2010-04-27  0:02                               ` Greg KH
2010-04-26  0:32                         ` Dave Chinner
2010-04-26  0:32                           ` Dave Chinner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=201004050049.17952.hpj@urpla.net \
    --to=hpj@urpla.net \
    --cc=linux-kernel@vger.kernel.org \
    --cc=opensuse-kernel@opensuse.org \
    --cc=xfs@oss.sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.