From: Mel Gorman <mgorman@suse.de>
To: Seth Jennings <sjenning@linux.vnet.ibm.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
Nitin Gupta <ngupta@vflare.org>, Minchan Kim <minchan@kernel.org>,
Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>,
Dan Magenheimer <dan.magenheimer@oracle.com>,
Robert Jennings <rcj@linux.vnet.ibm.com>,
Jenifer Hopper <jhopper@us.ibm.com>,
Johannes Weiner <jweiner@redhat.com>,
Rik van Riel <riel@redhat.com>,
Larry Woodman <lwoodman@redhat.com>,
Benjamin Herrenschmidt <benh@kernel.crashing.org>,
Dave Hansen <dave@sr71.net>, Joe Perches <joe@perches.com>,
Joonsoo Kim <iamjoonsoo.kim@lge.com>,
Cody P Schafer <cody@linux.vnet.ibm.com>,
Hugh Dickens <hughd@google.com>,
Paul Mackerras <paulus@samba.org>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org,
devel@driverdev.osuosl.org
Subject: Re: [PATCHv11 2/4] zbud: add to mm/
Date: Tue, 21 May 2013 09:10:20 +0100 [thread overview]
Message-ID: <20130521081020.GT11497@suse.de> (raw)
In-Reply-To: <20130520154225.GA25536@cerebellum>
On Mon, May 20, 2013 at 10:42:25AM -0500, Seth Jennings wrote:
> On Mon, May 20, 2013 at 02:54:39PM +0100, Mel Gorman wrote:
> > On Sun, May 19, 2013 at 03:52:19PM -0500, Seth Jennings wrote:
> > > My first guess is that the external fragmentation situation you are referring to
> > > is a workload in which all pages compress to greater than half a page. If so,
> > > then it doesn't matter what NCHUCNKS_ORDER is, there won't be any pages the
> > > compress enough to fit in the < PAGE_SIZE/2 free space that remains in the
> > > unbuddied zbud pages.
> > >
> >
> > There are numerous aspects to this, too many to write them all down.
> > Modelling the external fragmentation one and how it affects swap IO
> > would be a complete pain in the ass so lets consider the following
> > example instead as it's a bit clearer.
> >
> > Three processes. Process A compresses by 75%, Process B compresses to 15%,
> > Process C pages compress to 15%. They are all adding to zswap in lockstep.
> > Lets say that zswap can hold 100 physical pages.
> >
> > NCHUNKS == 2
> > All Process A pages get rejected.
>
> Ah, I think this is our disconnect. Process A pages will not be rejected.
> They will be stored in a zbud page, and that zbud page will be added
> to the 0th unbuddied list. This list maintains a list of zbud pages
> that will never be buddied because there are no free chunks.
>
D'oh, good point. Unfortunately, the problem then still exists at the
writeback end which I didn't bring up in the previous mail. Take three
processes writing in lockstep. Process A pages compress to 15%, B compresses
to 15%, C compresses to 60%. Each physical page packing will look like this
nchunks=6 nchunks = 2
Page 0 A B A B
Page 1 C A C
Page 2 B C A B
Page 3 A B C
Pattern repeats..........
This continues until zswap is full. Now all three process stop and process
D starts writing to zswap, each of its pages compresses to 80%. These will
be freed in LRU order which is effectively FIFO.
With nchunks=6, to store 2 process D pages, it must write out 4 pages
to swap. With nchunks=2, to store two process D pages, it must write 3
pages so process D stalls for less time with nchunks==2.
This is a variation of the zsmalloc packing problem where greater packing
leads to worse performance when zswap is full. The user bug report will
look something like "performance goes to hell when zswap is full although
swap IO rates look normal". If it was a kernel parameter, setting nchunk=2
as a kernel boot parameter will at least be a workaround.
Of course, this is all hypothetical. It is certainly possible to create
a reference string where nchunks=2 generates more IO but it tends to
generate the IO sooner and more closely resemble existing swap behaviour
that users might be willing to accept as a workaround until their problem
can be resolved. This is why I think the parameter should be available at
boot-time as a debugging/workaround option for developers to recommend to
a user.
--
Mel Gorman
SUSE Labs
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2013-05-21 8:10 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-05-13 12:39 [PATCHv11 0/4] zswap: compressed swap caching Seth Jennings
2013-05-13 12:40 ` [PATCHv11 1/4] debugfs: add get/set for atomic types Seth Jennings
2013-05-16 14:58 ` Rik van Riel
2013-05-13 12:40 ` [PATCHv11 2/4] zbud: add to mm/ Seth Jennings
2013-05-14 8:47 ` Bob Liu
2013-05-14 17:03 ` Seth Jennings
2013-05-16 15:30 ` Rik van Riel
2013-05-17 15:48 ` Mel Gorman
2013-05-19 20:52 ` Seth Jennings
2013-05-20 13:54 ` Mel Gorman
2013-05-20 15:42 ` Seth Jennings
2013-05-21 8:10 ` Mel Gorman [this message]
2013-05-23 2:00 ` Bob Liu
2013-05-23 9:52 ` Mel Gorman
2013-05-13 12:40 ` [PATCHv11 3/4] zswap: " Seth Jennings
2013-05-14 9:19 ` Bob Liu
2013-05-14 16:00 ` Seth Jennings
2013-05-14 16:37 ` Dan Magenheimer
2013-05-14 17:28 ` Seth Jennings
2013-05-14 20:54 ` Dan Magenheimer
2013-05-17 17:00 ` Mel Gorman
2013-05-16 17:16 ` Rik van Riel
2013-05-17 16:54 ` Mel Gorman
2013-05-19 23:33 ` Seth Jennings
2013-05-13 12:40 ` [PATCHv11 4/4] zswap: add documentation Seth Jennings
2013-05-16 17:06 ` Rik van Riel
2013-05-17 16:04 ` Mel Gorman
[not found] <<1368448803-2089-1-git-send-email-sjenning@linux.vnet.ibm.com>
[not found] ` <<1368448803-2089-3-git-send-email-sjenning@linux.vnet.ibm.com>
2013-05-13 15:43 ` [PATCHv11 2/4] zbud: add to mm/ Dan Magenheimer
2013-05-13 20:59 ` Seth Jennings
2013-05-16 15:30 ` Rik van Riel
[not found] ` <<1368448803-2089-4-git-send-email-sjenning@linux.vnet.ibm.com>
2013-05-13 22:31 ` [PATCHv11 3/4] zswap: " Dan Magenheimer
2013-05-14 16:35 ` Seth Jennings
2013-05-14 20:18 ` Dan Magenheimer
2013-05-14 22:55 ` Seth Jennings
2013-05-15 17:09 ` Dan Magenheimer
2013-05-15 18:55 ` Konrad Rzeszutek Wilk
2013-05-15 19:35 ` Dan Magenheimer
2013-05-15 20:45 ` Rik van Riel
2013-05-15 21:36 ` Dan Magenheimer
2013-05-15 22:01 ` Rik van Riel
2013-05-15 20:09 ` Seth Jennings
2013-05-15 20:24 ` Dave Hansen
2013-05-15 20:55 ` Dan Magenheimer
2013-05-15 20:45 ` Konrad Rzeszutek Wilk
2013-05-15 20:52 ` Dan Magenheimer
2013-05-15 22:14 ` Rik van Riel
2013-05-16 16:45 ` Dan Magenheimer
2013-05-16 17:06 ` Rik van Riel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130521081020.GT11497@suse.de \
--to=mgorman@suse.de \
--cc=akpm@linux-foundation.org \
--cc=benh@kernel.crashing.org \
--cc=cody@linux.vnet.ibm.com \
--cc=dan.magenheimer@oracle.com \
--cc=dave@sr71.net \
--cc=devel@driverdev.osuosl.org \
--cc=gregkh@linuxfoundation.org \
--cc=hughd@google.com \
--cc=iamjoonsoo.kim@lge.com \
--cc=jhopper@us.ibm.com \
--cc=joe@perches.com \
--cc=jweiner@redhat.com \
--cc=konrad.wilk@oracle.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lwoodman@redhat.com \
--cc=minchan@kernel.org \
--cc=ngupta@vflare.org \
--cc=paulus@samba.org \
--cc=rcj@linux.vnet.ibm.com \
--cc=riel@redhat.com \
--cc=sjenning@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).