From: Dave Chinner <david@fromorbit.com>
To: Christian Kujau <lists@nerdbynature.de>
Cc: minchan.kim@gmail.com, LKML <linux-kernel@vger.kernel.org>,
Markus Trippelsdorf <markus@trippelsdorf.de>,
xfs@oss.sgi.com
Subject: Re: 2.6.39-rc4+: oom-killer busy killing tasks
Date: Mon, 2 May 2011 22:38:24 +1000 [thread overview]
Message-ID: <20110502123824.GB2978@dastard> (raw)
In-Reply-To: <alpine.DEB.2.01.1105020210160.18728@trent.utfs.org>
On Mon, May 02, 2011 at 02:26:17AM -0700, Christian Kujau wrote:
> On Sun, 1 May 2011 at 18:01, Dave Chinner wrote:
> > I really don't know why the xfs inode cache is not being trimmed. I
> > really, really need to know if the XFS inode cache shrinker is
> > getting blocked or not running - do you have those sysrq-w traces
> > when near OOM I asked for a while back?
>
> Here's another attempt at getting those:
>
> http://nerdbynature.de/bits/2.6.39-rc4/oom/
> * messages-11.txt.gz & slabinfo-11.txt.bz2
> - oom-killer at 00:05:04
> - last sysrq-w to succeed at 00:05:03
>
> * messages-12.txt.gz & slabinfo-12.txt.bz2, along
> with meminfo-post-oom-12.txt & sysrq-w_post-oom-12.jpg could
> be more interesting:
> - last sysrq-w to succeed at 01:27:08
> - oom-killer at 01:27:11
>
> ...but after the OOM-killer was killing quite a few processes, MemFree
> showed 511236 kB free memory, yet ssh logins were still being killed.
> Finally I got a root shell on the box, issued sysrq-w again and even
> executed /bin/sync, which came back. But looking at the logs now
> nothing went to the disk (/var/log resides on / which is a ext4 fs).
> See sysrq-w_post-oom-12.jpg for a sysrq-w I took 2381s after boot time,
> or 01:32 - syslog stopped on 01:27.
Same problem:
MemFree: 511236 kB
....
LowTotal: 759904 kB
LowFree: 3804 kB
i.e. that low memory is being exhausted by the slab cache, while
there is lots of free high memory, and the low memory zone is marked
as all unreclaimable....
The sysrq trace less than 1s before the first OOM shows this:
[c00770ec] __lock_acquire+0x43c/0x1818 (unreliable)
[c000a924] __switch_to+0x9c/0x128
[c0417580] schedule+0x274/0x8bc
[c0418128] schedule_timeout+0x16c/0x214
[c04172a0] io_schedule_timeout+0xb0/0x11c
[c00b153c] congestion_wait+0x8c/0xdc
[c00aa43c] kswapd+0x6d0/0x884
[c005e3d0] kthread+0x84/0x88
[c0010908] kernel_thread+0x4c/0x68
Background memory reclaim appears to be blocked by IO congestion....
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
next prev parent reply other threads:[~2011-05-02 12:34 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-04-22 1:57 2.6.39-rc4+: oom-killer busy killing tasks Christian Kujau
2011-04-22 2:58 ` Minchan Kim
2011-04-22 3:03 ` Christian Kujau
2011-04-22 17:41 ` Christian Kujau
2011-04-22 18:46 ` Christian Kujau
2011-04-22 22:47 ` Minchan Kim
2011-04-24 23:46 ` Dave Chinner
2011-04-25 5:51 ` Christian Kujau
2011-04-25 7:19 ` Christian Kujau
2011-04-26 15:14 ` Christian Kujau
2011-04-27 2:26 ` Dave Chinner
2011-04-27 7:46 ` Christian Kujau
2011-04-27 10:28 ` Dave Chinner
2011-04-27 23:16 ` Minchan Kim
2011-04-27 23:56 ` Dave Chinner
2011-04-28 17:30 ` Christian Kujau
2011-04-28 23:37 ` Dave Chinner
2011-04-29 17:32 ` Christian Kujau
2011-04-29 19:58 ` Christian Kujau
2011-04-29 20:17 ` Markus Trippelsdorf
2011-04-29 20:20 ` Christian Kujau
2011-04-29 20:21 ` Markus Trippelsdorf
2011-04-30 0:17 ` Christian Kujau
2011-05-01 8:01 ` Dave Chinner
2011-05-02 4:59 ` Christian Kujau
2011-05-02 12:19 ` Dave Chinner
2011-05-02 19:59 ` Christian Kujau
2011-05-03 0:51 ` Dave Chinner
2011-05-03 4:04 ` Christian Kujau
2011-05-03 6:36 ` Dave Chinner
2011-05-03 20:53 ` Christian Kujau
2011-05-04 0:46 ` Christian Kujau
2011-05-04 1:51 ` Christian Kujau
2011-05-04 7:36 ` Dave Chinner
2011-05-04 11:12 ` Dave Chinner
2011-05-04 19:10 ` Christian Kujau
2011-05-04 23:15 ` Dave Chinner
2011-05-05 2:07 ` Christian Kujau
2011-05-02 9:26 ` Christian Kujau
2011-05-02 12:38 ` Dave Chinner [this message]
2011-04-25 8:02 ` Christian Kujau
2011-04-25 9:50 ` Christian Kujau
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110502123824.GB2978@dastard \
--to=david@fromorbit.com \
--cc=linux-kernel@vger.kernel.org \
--cc=lists@nerdbynature.de \
--cc=markus@trippelsdorf.de \
--cc=minchan.kim@gmail.com \
--cc=xfs@oss.sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox