From: Theodore Tso <tytso@mit.edu>
To: Nick Piggin <nickpiggin@yahoo.com.au>
Cc: Rob Landley <rob@landley.net>,
James Bottomley <James.Bottomley@steeleye.com>,
Matthew Wilcox <matthew@wil.cx>,
linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org,
Jens Axboe <axboe@suse.de>,
Suparna Bhattacharya <suparna@in.ibm.com>,
Nick Piggin <piggin@cyberone.com.au>
Subject: Re: OOM killer gripe (was Re: What still uses the block layer?)
Date: Mon, 15 Oct 2007 07:40:03 -0400
Message-ID: <20071015114003.GB21216@thunk.org>
In-Reply-To: <200710152337.45252.nickpiggin@yahoo.com.au>
On Mon, Oct 15, 2007 at 11:37:44PM +1000, Nick Piggin wrote:
> I hate to go completely offtopic here, but disks are so incredibly
> slow when compared to RAM that there is really nothing the kernel
> can do about this. Presumably the job will finish, given infinite
> time.
About 6 weeks ago, on a 2.6.23-rc kernel, I accidentally typed "make
-j", leaving off the 4 before I hit the return key. About 2-3
minutes later, the box locked up pretty tight. I managed to switch to a
VT console before I lost total control of X (the switch itself took
many, many minutes) and eventually got logged in on the console, but
I wasn't able to get a ps command to complete so that I could start
killing processes. (I probably should have just done a
"killall make" right away, but hindsight is 20/20.)
The console was showing that the OOM killer was attempting to kill
processes, but apparently not fast enough to stem the tide of all of
the new processes getting generated by the make -j. (I'm guessing
that it was killing the gcc processes and not the make processes.)
> Would an oom-kill-someone-now sysrq be of help, I wonder?
I tried sysrq-f (oom_kill), but no dice. Given that the oom killer
was active and apparently triggering on its own, this wasn't all that
surprising.
The interesting thing is I tried to do a sysrq-e (send SIGTERM to all
processes except init), waited 5 minutes or so, then tried an alt-sysrq-i
(send SIGKILL to all processes except init), and the system was still
thrashing itself to death, even after giving it plenty of time to try
to recover.
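For reference, the three sysrq commands mentioned above can also be
issued without the keyboard by writing the command character to
/proc/sysrq-trigger as root. The sketch below only illustrates which
key does what; the names SYSRQ_KEYS and trigger_sysrq are made up for
this example, and on a box that is already thrashing there is no
reason to expect an interpreter to start any faster than ps did.

```python
# Hypothetical helper: SYSRQ_KEYS and trigger_sysrq are illustrative names,
# not part of any existing tool.
SYSRQ_KEYS = {
    "f": "invoke the OOM killer once",
    "e": "send SIGTERM to all processes except init",
    "i": "send SIGKILL to all processes except init",
}


def trigger_sysrq(key, path="/proc/sysrq-trigger"):
    """Write one sysrq command character to the trigger file.

    Writing to the real /proc/sysrq-trigger requires root. The path is a
    parameter so the function can be exercised against an ordinary file.
    """
    if key not in SYSRQ_KEYS:
        raise ValueError(f"unsupported sysrq key: {key!r}")
    with open(path, "w") as trigger:
        trigger.write(key)
    return SYSRQ_KEYS[key]
```

Calling trigger_sysrq("e") and then trigger_sysrq("i") would mirror the
escalation described above (SIGTERM first, SIGKILL afterwards).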
I finally gave up and held down the power button. This was on a box
with 4 gigs of memory (but only 3 gigs visible thanks to a cheap
BIOS/chipset) and 4 gigs of swap (mainly intended for suspend/resume).
I chalked it up to me being stupid (I should have noticed and
Ctrl-C'ed the make -j much more quickly, or if I were a sysadmin on a
time-sharing system with users I didn't trust, configured RLIMIT_NPROC
and/or per-user container resource limits) and the OOM killer not
being aggressive enough in such a situation. But having better things
to do, I didn't go whining on LKML about it, although I have to say
that the kernel behavior isn't exactly ideal. One of these days when
I have time, I'll try investigating it with a few memlocked processes
running at real-time priorities and Systemtap and figure out what the
heck was going on....
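As a rough illustration of the RLIMIT_NPROC idea, here is a minimal
sketch that lowers the soft process limit in a child before it execs
the build, so a runaway "make -j" starts getting EAGAIN from fork()
instead of taking the box down. The helper names (clamped_nproc_limit,
run_limited) and the value 200 are assumptions made up for this
example. Note that RLIMIT_NPROC counts every process owned by the same
real user ID, not just the children of this command, and it is not
enforced for processes with CAP_SYS_ADMIN or CAP_SYS_RESOURCE.

```python
import resource
import subprocess


def clamped_nproc_limit(wanted, current):
    """Return a (soft, hard) pair with the soft limit set to `wanted`.

    The hard limit is left untouched. If it is finite, the soft limit is
    clamped so that it never exceeds the hard limit.
    """
    _soft, hard = current
    if hard != resource.RLIM_INFINITY:
        wanted = min(wanted, hard)
    return (wanted, hard)


def run_limited(argv, max_procs):
    """Run argv with RLIMIT_NPROC lowered in the child only."""

    def apply_limit():
        # Runs in the child after fork() and before exec().
        current = resource.getrlimit(resource.RLIMIT_NPROC)
        resource.setrlimit(
            resource.RLIMIT_NPROC, clamped_nproc_limit(max_procs, current)
        )

    return subprocess.run(argv, preexec_fn=apply_limit).returncode


if __name__ == "__main__":
    # Example only: the limit of 200 is an arbitrary illustrative value.
    raise SystemExit(run_limited(["make", "-j"], max_procs=200))
```

A system-wide setup would more likely put the limit in the login
configuration than wrap individual commands; the wrapper form is used
here only because it is self-contained.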
I suppose I should just configure suspending to a file instead of a
swap partition, but I've just historically trusted suspend/resume to a
swap partition much more than to a file. Or maybe I should hack in a
sysctl to prevent any swapping even though the swap partition is
configured (so only suspend/resume will use it).
- Ted