All of lore.kernel.org
 help / color / mirror / Atom feed
From: Christian Ehrhardt <ehrhardt@linux.vnet.ibm.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: Mel Gorman <mel@csn.ul.ie>, Rik van Riel <riel@redhat.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	linux-mm@kvack.org, Nick Piggin <npiggin@suse.de>,
	Chris Mason <chris.mason@oracle.com>,
	Jens Axboe <jens.axboe@oracle.com>,
	linux-kernel@vger.kernel.org, gregkh@novell.com,
	Corrado Zoccolo <czoccolo@gmail.com>
Subject: Re: [RFC PATCH 0/3] Avoid the use of congestion_wait under zone pressure
Date: Tue, 20 Apr 2010 10:54:35 +0200	[thread overview]
Message-ID: <4BCD6BCB.4040403@linux.vnet.ibm.com> (raw)
In-Reply-To: <4BCD55DA.2020000@linux.vnet.ibm.com>



Christian Ehrhardt wrote:
> 
> 
> Johannes Weiner wrote:
[...]

>>>
>>> It stays at ~85M with more writes which is approx 50% of my free 160M 
>>> memory.
>>
>> Ok, so I am the idiot that got quoted on 'the active set is not too 
>> big, so
>> buffer heads are not a problem when avoiding to scan it' in eternal 
>> history.
>>
>> But the threshold inactive/active ratio for skipping active file pages is
>> actually 1:1.
>>
>> The easiest 'fix' is probably to change that ratio, 2:1 (or even 3:1?) 
>> appears
>> to be a bit more natural anyway?  Below is a patch that changes it to 
>> 2:1.
>> Christian, can you check if it fixes your regression?
> 
> I'll check it out.
> from the numbers I have up to now I know that the good->bad transition 
> for my case is somewhere between 30M/60M e.g. first and second write.
> The ratio 2:1 will eat max 53M of my ~160M that gets split up.
> 
> That means setting the ratio to 2:1 or whatever else might help or not, 
> but eventually there is just another setting of workload vs. memory 
> constraints that would still be affected. Still I guess 3:1 (and I'll 
> try that as well) should be enough to be a bit more towards the save side.

For "my case" 2:1 is not enough, 3:1 almost and 4:1 fixes the issue.
Still as I mentioned before I think any value carved in stone can and 
will be bad to some use case - as 1:1 is for mine.

If we end up being unable to fix it internally by allowing the system to 
"forget" and eventually free old unused buffers at least somewhen - then 
we should neither implement it as 2:1 nor 3:1 nor whatsoever, but as 
userspace configurable e.g. /proc/sys/vm/active_inactive_ratio.

I hope your suggestion below or an extension to it will allow the kernel 
to free the buffers somewhen. Depending on how good/fast this solution 
then will work we can still modify the ratio if needed.

>> Additionally, we can always scan active file pages but only deactivate 
>> them
>> when the ratio is off and otherwise strip buffers of clean pages.
> 
> In think we need something that allows the system to forget its history 
> somewhen - be it 1:1 or x:1 - if the workload changes "long enough"(tm) 
> it should eventually throw all old things out.
> Like I described before many systems have different usage patterns when 
> e.g. comparing day/night workload. So it is far from optimal if e.g. day 
> write loads eat so much cache and never give it back for nightly huge 
> reads tasks or something similar.
> 
> Would your suggestion achieve that already?
> If not what kind change could?
> 
[...]
-- 

Grüsse / regards, Christian Ehrhardt
IBM Linux Technology Center, System z Linux Performance

WARNING: multiple messages have this Message-ID (diff)
From: Christian Ehrhardt <ehrhardt@linux.vnet.ibm.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: Mel Gorman <mel@csn.ul.ie>, Rik van Riel <riel@redhat.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	linux-mm@kvack.org, Nick Piggin <npiggin@suse.de>,
	Chris Mason <chris.mason@oracle.com>,
	Jens Axboe <jens.axboe@oracle.com>,
	linux-kernel@vger.kernel.org, gregkh@novell.com,
	Corrado Zoccolo <czoccolo@gmail.com>
Subject: Re: [RFC PATCH 0/3] Avoid the use of congestion_wait under zone pressure
Date: Tue, 20 Apr 2010 10:54:35 +0200	[thread overview]
Message-ID: <4BCD6BCB.4040403@linux.vnet.ibm.com> (raw)
In-Reply-To: <4BCD55DA.2020000@linux.vnet.ibm.com>



Christian Ehrhardt wrote:
> 
> 
> Johannes Weiner wrote:
[...]

>>>
>>> It stays at ~85M with more writes which is approx 50% of my free 160M 
>>> memory.
>>
>> Ok, so I am the idiot that got quoted on 'the active set is not too 
>> big, so
>> buffer heads are not a problem when avoiding to scan it' in eternal 
>> history.
>>
>> But the threshold inactive/active ratio for skipping active file pages is
>> actually 1:1.
>>
>> The easiest 'fix' is probably to change that ratio, 2:1 (or even 3:1?) 
>> appears
>> to be a bit more natural anyway?  Below is a patch that changes it to 
>> 2:1.
>> Christian, can you check if it fixes your regression?
> 
> I'll check it out.
> from the numbers I have up to now I know that the good->bad transition 
> for my case is somewhere between 30M/60M e.g. first and second write.
> The ratio 2:1 will eat max 53M of my ~160M that gets split up.
> 
> That means setting the ratio to 2:1 or whatever else might help or not, 
> but eventually there is just another setting of workload vs. memory 
> constraints that would still be affected. Still I guess 3:1 (and I'll 
> try that as well) should be enough to be a bit more towards the save side.

For "my case" 2:1 is not enough, 3:1 almost and 4:1 fixes the issue.
Still as I mentioned before I think any value carved in stone can and 
will be bad to some use case - as 1:1 is for mine.

If we end up being unable to fix it internally by allowing the system to 
"forget" and eventually free old unused buffers at least somewhen - then 
we should neither implement it as 2:1 nor 3:1 nor whatsoever, but as 
userspace configurable e.g. /proc/sys/vm/active_inactive_ratio.

I hope your suggestion below or an extension to it will allow the kernel 
to free the buffers somewhen. Depending on how good/fast this solution 
then will work we can still modify the ratio if needed.

>> Additionally, we can always scan active file pages but only deactivate 
>> them
>> when the ratio is off and otherwise strip buffers of clean pages.
> 
> In think we need something that allows the system to forget its history 
> somewhen - be it 1:1 or x:1 - if the workload changes "long enough"(tm) 
> it should eventually throw all old things out.
> Like I described before many systems have different usage patterns when 
> e.g. comparing day/night workload. So it is far from optimal if e.g. day 
> write loads eat so much cache and never give it back for nightly huge 
> reads tasks or something similar.
> 
> Would your suggestion achieve that already?
> If not what kind change could?
> 
[...]
-- 

Grusse / regards, Christian Ehrhardt
IBM Linux Technology Center, System z Linux Performance

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2010-04-20  8:54 UTC|newest]

Thread overview: 136+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-03-08 11:48 [RFC PATCH 0/3] Avoid the use of congestion_wait under zone pressure Mel Gorman
2010-03-08 11:48 ` Mel Gorman
2010-03-08 11:48 ` [PATCH 1/3] page-allocator: Under memory pressure, wait on pressure to relieve instead of congestion Mel Gorman
2010-03-08 11:48   ` Mel Gorman
2010-03-09 13:35   ` Nick Piggin
2010-03-09 13:35     ` Nick Piggin
2010-03-09 14:17     ` Mel Gorman
2010-03-09 14:17       ` Mel Gorman
2010-03-09 15:03       ` Nick Piggin
2010-03-09 15:03         ` Nick Piggin
2010-03-09 15:42         ` Christian Ehrhardt
2010-03-09 15:42           ` Christian Ehrhardt
2010-03-09 18:22           ` Mel Gorman
2010-03-09 18:22             ` Mel Gorman
2010-03-10  2:38             ` Nick Piggin
2010-03-10  2:38               ` Nick Piggin
2010-03-09 17:35         ` Mel Gorman
2010-03-09 17:35           ` Mel Gorman
2010-03-10  2:35           ` Nick Piggin
2010-03-10  2:35             ` Nick Piggin
2010-03-09 15:50   ` Christoph Lameter
2010-03-09 15:50     ` Christoph Lameter
2010-03-09 15:56     ` Christian Ehrhardt
2010-03-09 15:56       ` Christian Ehrhardt
2010-03-09 16:09       ` Christoph Lameter
2010-03-09 16:09         ` Christoph Lameter
2010-03-09 17:01         ` Mel Gorman
2010-03-09 17:01           ` Mel Gorman
2010-03-09 17:11           ` Christoph Lameter
2010-03-09 17:11             ` Christoph Lameter
2010-03-09 17:30             ` Mel Gorman
2010-03-09 17:30               ` Mel Gorman
2010-03-08 11:48 ` [PATCH 2/3] page-allocator: Check zone pressure when batch of pages are freed Mel Gorman
2010-03-08 11:48   ` Mel Gorman
2010-03-09  9:53   ` Nick Piggin
2010-03-09  9:53     ` Nick Piggin
2010-03-09 10:08     ` Mel Gorman
2010-03-09 10:08       ` Mel Gorman
2010-03-09 10:23       ` Nick Piggin
2010-03-09 10:23         ` Nick Piggin
2010-03-09 10:36         ` Mel Gorman
2010-03-09 10:36           ` Mel Gorman
2010-03-09 11:11           ` Nick Piggin
2010-03-09 11:11             ` Nick Piggin
2010-03-09 11:29             ` Mel Gorman
2010-03-09 11:29               ` Mel Gorman
2010-03-08 11:48 ` [PATCH 3/3] vmscan: Put kswapd to sleep on its own waitqueue, not congestion Mel Gorman
2010-03-08 11:48   ` Mel Gorman
2010-03-09 10:00   ` Nick Piggin
2010-03-09 10:00     ` Nick Piggin
2010-03-09 10:21     ` Mel Gorman
2010-03-09 10:21       ` Mel Gorman
2010-03-09 10:32       ` Nick Piggin
2010-03-09 10:32         ` Nick Piggin
2010-03-11 23:41 ` [RFC PATCH 0/3] Avoid the use of congestion_wait under zone pressure Andrew Morton
2010-03-11 23:41   ` Andrew Morton
2010-03-12  6:39   ` Christian Ehrhardt
2010-03-12  6:39     ` Christian Ehrhardt
2010-03-12  7:05     ` Andrew Morton
2010-03-12  7:05       ` Andrew Morton
2010-03-12 10:47       ` Mel Gorman
2010-03-12 10:47         ` Mel Gorman
2010-03-12 12:15         ` Christian Ehrhardt
2010-03-12 12:15           ` Christian Ehrhardt
2010-03-12 14:37           ` Andrew Morton
2010-03-12 14:37             ` Andrew Morton
2010-03-15 12:29             ` Mel Gorman
2010-03-15 12:29               ` Mel Gorman
2010-03-15 14:45               ` Christian Ehrhardt
2010-03-15 14:45                 ` Christian Ehrhardt
2010-03-15 12:34             ` Christian Ehrhardt
2010-03-15 12:34               ` Christian Ehrhardt
2010-03-15 20:09               ` Andrew Morton
2010-03-15 20:09                 ` Andrew Morton
2010-03-16 10:11                 ` Mel Gorman
2010-03-16 10:11                   ` Mel Gorman
2010-03-18 17:42                 ` Mel Gorman
2010-03-18 17:42                   ` Mel Gorman
2010-03-22 23:50                 ` Mel Gorman
2010-03-22 23:50                   ` Mel Gorman
2010-03-23 14:35                   ` Christian Ehrhardt
2010-03-23 14:35                     ` Christian Ehrhardt
2010-03-23 21:35                   ` Corrado Zoccolo
2010-03-23 21:35                     ` Corrado Zoccolo
2010-03-24 11:48                     ` Mel Gorman
2010-03-24 11:48                       ` Mel Gorman
2010-03-24 12:56                       ` Corrado Zoccolo
2010-03-24 12:56                         ` Corrado Zoccolo
2010-03-23 22:29                   ` Rik van Riel
2010-03-23 22:29                     ` Rik van Riel
2010-03-24 14:50                     ` Mel Gorman
2010-03-24 14:50                       ` Mel Gorman
2010-04-19 12:22                       ` Christian Ehrhardt
2010-04-19 12:22                         ` Christian Ehrhardt
2010-04-19 21:44                         ` Johannes Weiner
2010-04-19 21:44                           ` Johannes Weiner
2010-04-20  7:20                           ` Christian Ehrhardt
2010-04-20  7:20                             ` Christian Ehrhardt
2010-04-20  8:54                             ` Christian Ehrhardt [this message]
2010-04-20  8:54                               ` Christian Ehrhardt
2010-04-20 15:32                             ` Johannes Weiner
2010-04-20 15:32                               ` Johannes Weiner
2010-04-20 17:22                               ` Rik van Riel
2010-04-20 17:22                                 ` Rik van Riel
2010-04-21  4:23                                 ` Christian Ehrhardt
2010-04-21  4:23                                   ` Christian Ehrhardt
2010-04-21  7:35                                   ` Christian Ehrhardt
2010-04-21  7:35                                     ` Christian Ehrhardt
2010-04-21 13:19                                     ` Rik van Riel
2010-04-21 13:19                                       ` Rik van Riel
2010-04-22  6:21                                       ` Christian Ehrhardt
2010-04-22  6:21                                         ` Christian Ehrhardt
2010-04-26 10:59                                         ` Subject: [PATCH][RFC] mm: make working set portion that is protected tunable v2 Christian Ehrhardt
2010-04-26 10:59                                           ` Christian Ehrhardt
2010-04-26 11:59                                           ` KOSAKI Motohiro
2010-04-26 11:59                                             ` KOSAKI Motohiro
2010-04-26 12:43                                             ` Christian Ehrhardt
2010-04-26 12:43                                               ` Christian Ehrhardt
2010-04-26 14:20                                               ` Rik van Riel
2010-04-26 14:20                                                 ` Rik van Riel
2010-04-27 14:00                                                 ` Christian Ehrhardt
2010-04-27 14:00                                                   ` Christian Ehrhardt
2010-04-21  9:03                                   ` [RFC PATCH 0/3] Avoid the use of congestion_wait under zone pressure Johannes Weiner
2010-04-21  9:03                                     ` Johannes Weiner
2010-04-21 13:20                                   ` Rik van Riel
2010-04-21 13:20                                     ` Rik van Riel
2010-04-20 14:40                           ` Rik van Riel
2010-04-20 14:40                             ` Rik van Riel
2010-03-24  2:38                   ` Greg KH
2010-03-24  2:38                     ` Greg KH
2010-03-24 11:49                     ` Mel Gorman
2010-03-24 11:49                       ` Mel Gorman
2010-03-24 13:13                   ` Johannes Weiner
2010-03-24 13:13                     ` Johannes Weiner
2010-03-12  9:09   ` Mel Gorman
2010-03-12  9:09     ` Mel Gorman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4BCD6BCB.4040403@linux.vnet.ibm.com \
    --to=ehrhardt@linux.vnet.ibm.com \
    --cc=akpm@linux-foundation.org \
    --cc=chris.mason@oracle.com \
    --cc=czoccolo@gmail.com \
    --cc=gregkh@novell.com \
    --cc=hannes@cmpxchg.org \
    --cc=jens.axboe@oracle.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mel@csn.ul.ie \
    --cc=npiggin@suse.de \
    --cc=riel@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.