From: Rogier Wolff <R.E.Wolff@BitWizard.nl>
To: Nick Piggin <nickpiggin@yahoo.com.au>
Cc: "Eric W. Biederman" <ebiederm@xmission.com>,
Rob Landley <rob@landley.net>, Theodore Tso <tytso@mit.edu>,
James Bottomley <James.Bottomley@steeleye.com>,
Matthew Wilcox <matthew@wil.cx>,
linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org,
Jens Axboe <axboe@suse.de>,
Suparna Bhattacharya <suparna@in.ibm.com>,
Nick Piggin <piggin@cyberone.com.au>
Subject: Re: OOM killer gripe (was Re: What still uses the block layer?)
Date: Thu, 18 Oct 2007 15:00:49 +0200 [thread overview]
Message-ID: <20071018130048.GA25084@bitwizard.nl> (raw)
In-Reply-To: <200710161734.15880.nickpiggin@yahoo.com.au>
On Tue, Oct 16, 2007 at 05:34:15PM +1000, Nick Piggin wrote:
> > It's a hard call. The I/O time for 1MB of contiguous disk data
> > is about the I/O time of 512 bytes of contiguous disk data.
>
> And if you're thrashing, then by definition you need to throw
> out 1MB of your working set in order to read it in.
Right. But you need a differential hit rate of only a few percent on
that 1020 extra kb of data you swapped in versus the 1Mb of data you
swapped out for this to be advantageous.
With "differential hit rate" I mean the chances of getting a hit on
the 1Mb of data just paged in, minus the chances of getting a hit on
the 1Mb of data just paged out.
With a little luck that 1Mb that is paged out didn't get used for
quite a while, while there is a hint that the 1Mb you're paging in
is active, as one of its sub-pages just got a hit.
So... IMHO, it would be useful to implement something that pages out
chunks of memory larger than a single hardware page. This would reduce
the size of the memory management tables (*), as well as improve disk
throughput if things DO come to paging....
This should of course be configurable. Some workloads are better off
with a virtual page size of 8k, some with 128k. some with 1M.
As far as I can see, the "page-cluster" parameter defines how many
pages at a time are selected for page-out at a time. This increases
the page-out efficiency. Improving the page-in efficiency is also
useful: It is the other half of hte equation.
Roger.
(*) If the kernel starts working with a 1Mb virtual page size, you
need a 256 times smaller mapping table between processes and memory or
swap. Of course, the hardware doesn't support this (actually, it does
for 1Mb virtual pages), so you'll have to create 256 page table
entries for the hardware instead of just one.
--
** R.E.Wolff@BitWizard.nl ** http://www.BitWizard.nl/ ** +31-15-2600998 **
** Delftechpark 26 2628 XH Delft, The Netherlands. KVK: 27239233 **
*-- BitWizard writes Linux device drivers for any device you may have! --*
Q: It doesn't work. A: Look buddy, doesn't work is an ambiguous statement.
Does it sit on the couch all day? Is it unemployed? Please be specific!
Define 'it' and what it isn't doing. --------- Adapted from lxrbot FAQ
next prev parent reply other threads:[~2007-10-18 13:07 UTC|newest]
Thread overview: 93+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-10-12 1:11 What still uses the block layer? Rob Landley
2007-10-13 22:05 ` Matthew Wilcox
2007-10-14 5:54 ` David Newall
2007-10-14 17:46 ` Stefan Richter
2007-10-14 22:35 ` Tilman Schmidt
2007-10-14 23:36 ` Rob Landley
2007-10-15 1:23 ` Neil Brown
2007-10-15 5:44 ` Stefan Richter
2007-10-15 9:26 ` Rob Landley
2007-10-15 16:08 ` Matthew Wilcox
2007-10-15 17:10 ` Stefan Richter
2007-10-16 3:06 ` david
2007-10-16 5:56 ` Stefan Richter
2007-10-16 10:19 ` Alan Cox
2007-10-16 19:54 ` david
2007-10-16 19:54 ` Matthew Wilcox
2007-10-16 20:18 ` Stefan Richter
2007-10-16 20:34 ` Theodore Tso
2007-10-16 20:56 ` Stefan Richter
2007-10-16 20:55 ` david
2007-10-16 21:49 ` Alan Cox
2007-10-17 9:48 ` Gabor Gombas
2007-10-17 17:23 ` Stefan Richter
2007-10-17 21:04 ` david
2007-10-15 20:29 ` Wilfried Klaebe
2007-10-14 22:24 ` James Bottomley
2007-10-14 23:45 ` Rob Landley
2007-10-15 1:45 ` Theodore Tso
2007-10-15 8:04 ` Rob Landley
2007-10-15 9:06 ` Julian Calaby
2007-10-15 10:08 ` Rob Landley
2007-10-15 17:33 ` Greg KH
2007-10-16 2:54 ` david
2007-10-16 4:04 ` Matthew Wilcox
2007-10-16 4:11 ` Arjan van de Ven
2007-10-16 4:15 ` david
2007-10-16 4:21 ` Greg KH
2007-10-16 5:00 ` david
[not found] ` <646765f40710150327i78519a0fvaea7a83d5975b180@mail.gmail.com>
[not found] ` <200710151511.29748.rob@landley.net>
2007-10-15 23:49 ` Julian Calaby
2007-10-15 10:32 ` Loïc Grenié
2007-10-15 21:09 ` Rob Landley
2007-10-15 11:19 ` Neil Brown
2007-10-15 21:34 ` Rob Landley
2007-10-15 21:46 ` Jeff Garzik
2007-10-15 22:01 ` Alan Cox
2007-10-15 23:41 ` Neil Brown
2007-10-16 2:12 ` david
2007-10-15 13:21 ` Theodore Tso
2007-10-15 13:29 ` Alan Cox
2007-10-15 13:35 ` Theodore Tso
2007-10-15 17:44 ` Jeff Garzik
2007-10-15 14:46 ` Douglas Gilbert
2007-10-16 2:51 ` david
2007-10-15 13:37 ` OOM killer gripe (was Re: What still uses the block layer?) Nick Piggin
2007-10-15 9:52 ` Rob Landley
2007-10-15 15:08 ` Nick Piggin
2007-10-16 6:22 ` David Newall
2007-10-20 9:48 ` Pavel Machek
2007-10-15 11:40 ` Theodore Tso
2007-10-20 9:50 ` Pavel Machek
2007-10-16 3:55 ` Eric W. Biederman
2007-10-16 4:10 ` david
2007-10-16 4:45 ` Eric W. Biederman
2007-10-16 6:59 ` Nick Piggin
2007-10-16 4:38 ` Eric W. Biederman
2007-10-16 6:38 ` Rob Landley
2007-10-16 9:31 ` Eric W. Biederman
2007-10-16 10:28 ` Alan Cox
2007-10-16 23:59 ` Rob Landley
2007-10-16 7:34 ` Nick Piggin
2007-10-18 13:00 ` Rogier Wolff [this message]
2007-10-19 6:49 ` Rob Landley
2007-10-19 7:21 ` Rogier Wolff
2007-10-16 20:37 ` Andrew Morton
2007-10-17 5:34 ` What still uses the block layer? Valdis.Kletnieks
2007-10-17 6:07 ` david
2007-10-15 6:00 ` Greg KH
2007-10-15 8:36 ` Rob Landley
2007-10-15 13:08 ` Alan Cox
2007-10-15 14:00 ` Arjan van de Ven
2007-10-15 18:56 ` Matthew Garrett
2007-10-15 17:25 ` Greg KH
2007-10-15 18:00 ` Matthew Wilcox
2007-10-15 18:46 ` Jeff Garzik
2007-10-16 6:33 ` Stefan Richter
2007-10-17 23:43 ` Bill Davidsen
2007-10-15 22:54 ` Rob Landley
2007-10-15 8:52 ` Christoph Hellwig
2007-10-15 13:10 ` James Bottomley
2007-10-15 21:51 ` Rob Landley
2007-10-15 0:45 ` Luben Tuikov
2007-10-15 6:51 ` Rob Landley
2007-10-15 8:37 ` Luben Tuikov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20071018130048.GA25084@bitwizard.nl \
--to=r.e.wolff@bitwizard.nl \
--cc=James.Bottomley@steeleye.com \
--cc=axboe@suse.de \
--cc=ebiederm@xmission.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=matthew@wil.cx \
--cc=nickpiggin@yahoo.com.au \
--cc=piggin@cyberone.com.au \
--cc=rob@landley.net \
--cc=suparna@in.ibm.com \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox