kernel-testers.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Chris Mason <chris.mason@oracle.com>
To: Pekka Enberg <penberg@cs.helsinki.fi>
Cc: Jens Axboe <jens.axboe@oracle.com>, Mel Gorman <mel@csn.ul.ie>,
	KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	Frans Pop <elendil@planet.nl>, Jiri Kosina <jkosina@suse.cz>,
	Sven Geggus <lists@fuchsschwanzdomain.de>,
	Karol Lewandowski <karol.k.lewandowski@gmail.com>,
	Tobias Oetiker <tobi@oetiker.ch>,
	linux-kernel@vger.kernel.org,
	"linux-mm@kvack.org" <linux-mm@kvack.org>,
	Rik van Riel <riel@redhat.com>,
	Christoph Lameter <cl@linux-foundation.org>,
	Stephan von Krawczynski <skraw@ithnet.com>,
	"Rafael J. Wysocki" <rjw@sisk.pl>,
	Kernel Testers List <kernel-testers@vger.kernel.org>,
	Linus Torvalds <torvalds@linux-foundation.org>
Subject: Re: [PATCH 3/5] page allocator: Wait on both sync and async congestion after direct reclaim
Date: Fri, 13 Nov 2009 10:22:54 -0500	[thread overview]
Message-ID: <20091113152254.GC7891@think> (raw)
In-Reply-To: <84144f020911130541p42c0b3d5lc307f97f22e2d356@mail.gmail.com>

On Fri, Nov 13, 2009 at 03:41:46PM +0200, Pekka Enberg wrote:
> Hi Jens,
> 
> On Fri, Nov 13, 2009 at 3:32 PM, Jens Axboe <jens.axboe@oracle.com> wrote:
> >> Suggest an alternative that brings congestion_wait() more in line with
> >> 2.6.30 behaviour then.
> >
> > I don't have a good explanation as to why the delays have changed,
> > unfortunately. Are we sure that they have between .30 and .31? The
> > dm-crypt case is overly complex and lots of changes could have broken
> > that house of cards.
> 
> Hand-waving or not, we have end user reports stating that reverting
> commit 8aa7e847d834ed937a9ad37a0f2ad5b8584c1ab0 ("Fix
> congestion_wait() sync/async vs read/write confusion") fixes their
> (rather serious) OOM regression. The commit in question _does_
> introduce a functional change and if this was your average regression,
> people would be kicking and screaming to get it reverted.
> 
> So is there a reason we shouldn't send a partial revert of the commit
> (switching to BLK_RW_SYNC) to Linus until the "real" issue gets
> resolved? Yes, I realize it's ugly voodoo magic but dammit, it used to
> work!

If the workload didn't involve dm-crypt everything would make more sense.
With dm-crypt, things look like this:

VM:
Someone submits a write (sync or not) to clean a page
out of pages, I'll wait for the disk

dm-crypt:
Async threads wakeup and start encrypting submitted writes

Async threads finish and hand off to other async threads to submit new writes
or
dm encryption thread submits the write directly (without my patch)

The problem with all of this is that waiting for congestion doesn't
actually have very much to do with waiting for dm-crypt IO.  We might
be checking congestion long before dm-cryptd actually gets around to
sending the IO to the disk (so we'll just wait for the full timeout),
or the congestion check could be waiting for entirely unrelated IO.

Because dm-crypt uses the same queue for encrypting and decrypting reads
and writes, if writepage needs to read a block (say a metadata mapping
block), waiting for that block to become unencrypted will have to wait
for the entire queue of both encrypted and decrypted blocks to finish.

[ Unless I completely misread dm-crypt, which is always possible ]

So, cleaning pages can take an arbitrary amount of time, based on how
much CPU time is available, how many other pages are in the worker
threads and how fast the disk is once we finally hand the pages over to
it.

Tuning the VM timeouts based on this seems like a bad idea to me.  If
the VM is waiting for pages to leave writeback, we should just add a
sequence counter and wait queue that gets bumped as pages leave
writeback (or something that wouldn't do horrible things to cachelines).

Otherwise we're going to end up waiting way too long for the sync
congestion on configs that don't intermix sync and async in such complex
ways.

-chris

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2009-11-13 15:22 UTC|newest]

Thread overview: 57+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-11-12 19:30 [PATCH 0/5] Reduce GFP_ATOMIC allocation failures, candidate fix V3 Mel Gorman
2009-11-12 19:30 ` [PATCH 1/5] page allocator: Always wake kswapd when restarting an allocation attempt after direct reclaim failed Mel Gorman
2009-11-13  5:23   ` KOSAKI Motohiro
2009-11-13 13:55     ` Mel Gorman
2009-11-12 19:30 ` [PATCH 2/5] page allocator: Do not allow interrupts to use ALLOC_HARDER Mel Gorman
     [not found]   ` <1258054235-3208-3-git-send-email-mel-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13  5:24     ` KOSAKI Motohiro
2009-11-13 13:56       ` Mel Gorman
2009-11-12 19:30 ` [PATCH 3/5] page allocator: Wait on both sync and async congestion after direct reclaim Mel Gorman
     [not found]   ` <1258054235-3208-4-git-send-email-mel-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13 11:20     ` KOSAKI Motohiro
     [not found]       ` <20091113142526.33B3.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-13 11:55         ` Jens Axboe
     [not found]           ` <20091113115558.GY8742-tSWWG44O7X1aa/9Udqfwiw@public.gmane.org>
2009-11-13 12:28             ` Mel Gorman
     [not found]               ` <20091113122821.GC29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13 13:32                 ` Jens Axboe
     [not found]                   ` <20091113133211.GA8742-tSWWG44O7X1aa/9Udqfwiw@public.gmane.org>
2009-11-13 13:41                     ` Pekka Enberg
2009-11-13 15:22                       ` Chris Mason [this message]
2009-11-13 14:16                     ` Mel Gorman
2009-11-20 14:56                   ` Mel Gorman
2009-11-12 19:30 ` [PATCH 4/5] vmscan: Have kswapd sleep for a short interval and double check it should be asleep Mel Gorman
2009-11-13 10:43   ` KOSAKI Motohiro
     [not found]     ` <20091113142558.33B6.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-13 14:13       ` Mel Gorman
2009-11-13 18:00         ` KOSAKI Motohiro
     [not found]           ` <20091114023901.3DA8.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-13 18:17             ` Mel Gorman
     [not found]               ` <20091113181740.GN29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-14  9:34                 ` KOSAKI Motohiro
     [not found]                   ` <2f11576a0911140134u21eafa83t9642bb25ccd953de-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-11-14 15:46                     ` Mel Gorman
     [not found]                       ` <20091114154636.GR29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-17 11:03                         ` KOSAKI Motohiro
     [not found]                           ` <20091117141638.3DCB.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-17 11:44                             ` Mel Gorman
2009-11-17 12:18                               ` KOSAKI Motohiro
2009-11-17 12:25                                 ` Mel Gorman
     [not found]                                   ` <20091117122555.GZ29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-18  5:20                                     ` KOSAKI Motohiro
2009-11-17 10:34                     ` [PATCH] vmscan: Have kswapd sleep for a short interval and double check it should be asleep fix 1 Mel Gorman
2009-11-18  5:27                       ` KOSAKI Motohiro
2009-11-12 19:30 ` [PATCH 5/5] vmscan: Take order into consideration when deciding if kswapd is in trouble Mel Gorman
2009-11-13  9:54   ` KOSAKI Motohiro
     [not found]     ` <20091113142608.33B9.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-13 13:54       ` Mel Gorman
     [not found]         ` <20091113135443.GF29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13 14:48           ` Minchan Kim
2009-11-13 18:00         ` KOSAKI Motohiro
     [not found]           ` <20091114023138.3DA5.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-11-13 18:15             ` [PATCH] vmscan: Stop kswapd waiting on congestion when the min watermark is not being met Mel Gorman
2009-11-13 18:26               ` Frans Pop
     [not found]               ` <20091113181557.GM29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13 18:33                 ` KOSAKI Motohiro
     [not found]                   ` <2f11576a0911131033w4a9e6042k3349f0be290a167e-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-11-13 20:03                     ` [PATCH] vmscan: Stop kswapd waiting on congestion when the min watermark is not being met V2 Mel Gorman
     [not found]                       ` <20091113200357.GO29804-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-26 14:45                         ` Tobias Oetiker
     [not found]                           ` <alpine.DEB.2.00.0911261542500.21450-EjsAmf5DE5zIvOfxy3zmAzgUDZmNtoG9@public.gmane.org>
2009-11-29  7:42                             ` still getting allocation failures (was Re: [PATCH] vmscan: Stop kswapd waiting on congestion when the min watermark is not being met V2) Tobi Oetiker
2009-12-02 11:32                               ` Mel Gorman
2009-12-02 21:30                                 ` Tobias Oetiker
     [not found]                                   ` <alpine.DEB.2.00.0912022210220.30023-EjsAmf5DE5zIvOfxy3zmAzgUDZmNtoG9@public.gmane.org>
2009-12-03 20:26                                     ` Corrado Zoccolo
     [not found]                                       ` <4e5e476b0912031226i5b0e6cf9hdfd5519182ccdefa-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-12-14  5:59                                         ` Tobias Oetiker
     [not found]                                           ` <alpine.DEB.2.00.0912140646550.12657-EjsAmf5DE5zIvOfxy3zmAzgUDZmNtoG9@public.gmane.org>
2009-12-14  8:49                                             ` Corrado Zoccolo
2009-11-13 18:36               ` [PATCH] vmscan: Stop kswapd waiting on congestion when the min watermark is not being met Rik van Riel
2009-11-13 14:38     ` [PATCH 5/5] vmscan: Take order into consideration when deciding if kswapd is in trouble Minchan Kim
     [not found]   ` <1258054235-3208-6-git-send-email-mel-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13 12:41     ` Minchan Kim
     [not found] ` <1258054235-3208-1-git-send-email-mel-wPRd99KPJ+uzQB+pC5nmwQ@public.gmane.org>
2009-11-13  9:04   ` [PATCH 0/5] Reduce GFP_ATOMIC allocation failures, candidate fix V3 Frans Pop
     [not found]     ` <200911131004.25293.elendil-EIBgga6/0yRmR6Xm/wNWPw@public.gmane.org>
2009-11-16 17:57       ` Mel Gorman
2009-11-13 12:47   ` Tobias Oetiker
     [not found]     ` <alpine.DEB.2.00.0911131346560.22447-ZxOFkyelvri73kmZFr1Urg@public.gmane.org>
2009-11-13 13:37       ` Mel Gorman
2009-11-15 12:07   ` Karol Lewandowski
     [not found]     ` <20091115120721.GA7557-nLtalAL5mPp2RxbNQum0x1nzlInOXLuq@public.gmane.org>
2009-11-16  9:52       ` Mel Gorman
2009-11-16 12:08         ` Karol Lewandowski
     [not found]           ` <20091116120845.GA10115-nLtalAL5mPp2RxbNQum0x1nzlInOXLuq@public.gmane.org>
2009-11-16 14:32             ` Karol Lewandowski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20091113152254.GC7891@think \
    --to=chris.mason@oracle.com \
    --cc=akpm@linux-foundation.org \
    --cc=cl@linux-foundation.org \
    --cc=elendil@planet.nl \
    --cc=jens.axboe@oracle.com \
    --cc=jkosina@suse.cz \
    --cc=karol.k.lewandowski@gmail.com \
    --cc=kernel-testers@vger.kernel.org \
    --cc=kosaki.motohiro@jp.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=lists@fuchsschwanzdomain.de \
    --cc=mel@csn.ul.ie \
    --cc=penberg@cs.helsinki.fi \
    --cc=riel@redhat.com \
    --cc=rjw@sisk.pl \
    --cc=skraw@ithnet.com \
    --cc=tobi@oetiker.ch \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).