All of lore.kernel.org
 help / color / mirror / Atom feed
From: Wu Fengguang <wfg@mail.ustc.edu.cn>
To: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Andrew Morton <akpm@osdl.org>, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 17/33] readahead: context based method
Date: Thu, 25 May 2006 09:25:56 +0800	[thread overview]
Message-ID: <348520355.07663@ustc.edu.cn> (raw)
Message-ID: <20060525012556.GA6111@mail.ustc.edu.cn> (raw)
In-Reply-To: <1148486016.10561.73.camel@lappy>

On Wed, May 24, 2006 at 05:53:36PM +0200, Peter Zijlstra wrote:
> On Wed, 2006-05-24 at 21:33 +0800, Wu Fengguang wrote:
> > On Wed, May 24, 2006 at 02:37:48PM +0200, Peter Zijlstra wrote:
> > > On Wed, 2006-05-24 at 19:13 +0800, Wu Fengguang wrote:
> > > 
> > > > +#define PAGE_REFCNT_0           0
> > > > +#define PAGE_REFCNT_1           (1 << PG_referenced)
> > > > +#define PAGE_REFCNT_2           (1 << PG_active)
> > > > +#define PAGE_REFCNT_3           ((1 << PG_active) | (1 << PG_referenced))
> > > > +#define PAGE_REFCNT_MASK        PAGE_REFCNT_3
> > > > +
> > > > +/*
> > > > + * STATUS   REFERENCE COUNT
> > > > + *  __                   0
> > > > + *  _R       PAGE_REFCNT_1
> > > > + *  A_       PAGE_REFCNT_2
> > > > + *  AR       PAGE_REFCNT_3
> > > > + *
> > > > + *  A/R: Active / Referenced
> > > > + */
> > > > +static inline unsigned long page_refcnt(struct page *page)
> > > > +{
> > > > +        return page->flags & PAGE_REFCNT_MASK;
> > > > +}
> > > > +
> > > > +/*
> > > > + * STATUS   REFERENCE COUNT      TYPE
> > > > + *  __                   0      fresh
> > > > + *  _R       PAGE_REFCNT_1      stale
> > > > + *  A_       PAGE_REFCNT_2      disturbed once
> > > > + *  AR       PAGE_REFCNT_3      disturbed twice
> > > > + *
> > > > + *  A/R: Active / Referenced
> > > > + */
> > > > +static inline unsigned long cold_page_refcnt(struct page *page)
> > > > +{
> > > > +	if (!page || PageActive(page))
> > > > +		return 0;
> > > > +
> > > > +	return page_refcnt(page);
> > > > +}
> > > > +
> > > 
> > > Why all of this if all you're ever going to use is cold_page_refcnt.
> > 
> > Well, the two functions have a long history...
> > 
> > There has been a PG_activate which makes the two functions quite
> > different. It was later removed for fear of the behavior changes it
> > introduced. However, there's still possibility that someone
> > reintroduce similar flags in the future :)
> > 
> > > What about something like this:
> > > 
> > > static inline int cold_page_referenced(struct page *page)
> > > {
> > > 	if (!page || PageActive(page))
> > > 		return 0;
> > > 	return !!PageReferenced(page);
> > > }
> > 
> > Ah, here's another theory: the algorithm uses reference count
> > conceptually, so it may be better to retain the current form.
> 
> Reference count of what exactly, if you were to say of the page, I'd
> have expected only the first function, page_refcnt().
> 
> What I don't exactly understand is why you specialise to the inactive
> list. Why do you need that?
> 
> The reason I'm asking is that when I merge this with my page replacement
> work, I need to find a generalised concept. cold_page_refcnt() would
> become to mean something like: number of references for those pages that
> are direct reclaim candidates. And honestly, that doesn't make a lot of
> sense.
> 
> If you could explain the concept behind this, I'd be grateful.

Good question, and sorry for mentioning this...

There are some background info here:

        [DISTURBS] section of
        http://marc.theaimsgroup.com/?l=linux-kernel&m=112678976802381&w=2

        [DELAYED ACTIVATION] section of
        http://marc.theaimsgroup.com/?l=linux-kernel&m=112679176611006&w=2

It involves a tricky situation where there are two sequential readers
that come close enough, so that the follower retouched the pages
visited by the leader:

          chunk 1         chunk 2               chunk 3
        ==========  =============-------  --------------------
                       follower ^                     leader ^

It is all ok if the revisited pages still stay in the inactive list,
these pages will act as measurement of len(inactive list)/speed(leader).
But if the revisited pages(marked by '=') are sent to active list
immediately, the measurement will no longer be as accurate. The trace
is 'disturbed'. In this case, using page_refcnt() can be aggressive
and unsafe from thrashing, while cold_page_refcnt() can be conservative.

So either one of page_refcnt()/cold_page_refcnt() should be ok, as
long as we know the consequence of this situation.  After all, it is
really uncommon to see much invocation of the context based method,
and even rare for this kind of situation to happen.

Regards,
Wu

  reply	other threads:[~2006-05-25  1:26 UTC|newest]

Thread overview: 108+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-05-24 11:12 [PATCH 00/33] Adaptive read-ahead V12 Wu Fengguang
2006-05-24 11:12 ` Wu Fengguang
2006-05-25 15:44   ` Andrew Morton
2006-05-25 19:26     ` Michael Stone
2006-05-25 19:40     ` David Lang
2006-05-25 22:01       ` Andrew Morton
2006-05-25 20:28         ` David Lang
2006-05-26  0:48         ` Michael Stone
2006-05-26  1:19     ` Wu Fengguang
2006-05-26  1:19       ` Wu Fengguang
2006-05-26  2:10     ` Jon Smirl
2006-05-26  3:14       ` Nick Piggin
2006-05-26 14:00     ` Andi Kleen
2006-05-26 16:25       ` Andrew Morton
2006-05-26 23:54       ` Folkert van Heusden
2006-05-27  0:00         ` Con Kolivas
2006-05-27  0:08           ` Con Kolivas
2006-05-28 22:20             ` Diego Calleja
2006-05-28 22:31               ` kernel
2006-05-29  3:04                 ` Wu Fengguang
2006-05-29  3:04                   ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 02/33] radixtree: look-aside cache Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 03/33] radixtree: hole scanning functions Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25 16:19     ` Andrew Morton
2006-05-26  7:04       ` Wu Fengguang
2006-05-26  7:04         ` Wu Fengguang
2006-05-26 11:05       ` Wu Fengguang
2006-05-26 11:05         ` Wu Fengguang
2006-05-26 16:19           ` Andrew Morton
2006-05-24 11:12 ` [PATCH 04/33] readahead: page flag PG_readahead Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25 16:23     ` Andrew Morton
2006-05-26  7:06       ` Wu Fengguang
2006-05-26  7:06         ` Wu Fengguang
2006-05-24 12:27   ` Peter Zijlstra
2006-05-24 12:37     ` Wu Fengguang
2006-05-24 12:37       ` Wu Fengguang
2006-05-24 12:48       ` Peter Zijlstra
2006-05-24 11:12 ` [PATCH 05/33] readahead: refactor do_generic_mapping_read() Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 06/33] readahead: refactor __do_page_cache_readahead() Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25 16:30     ` Andrew Morton
2006-05-25 22:33       ` Paul Mackerras
2006-05-25 22:40         ` Andrew Morton
2006-05-26  7:13       ` Wu Fengguang
2006-05-26  7:13         ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 07/33] readahead: insert cond_resched() calls Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 08/33] readahead: common macros Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25  5:56     ` Nick Piggin
2006-05-25 10:41       ` Wu Fengguang
2006-05-25 10:41         ` Wu Fengguang
2006-05-26  3:33           ` Nick Piggin
2006-05-26  6:59             ` Wu Fengguang
2006-05-26  6:59               ` Wu Fengguang
2006-05-25 13:42       ` Wu Fengguang
2006-05-25 13:42         ` Wu Fengguang
2006-05-25 14:38           ` Andrew Morton
2006-05-25 16:33     ` Andrew Morton
2006-05-24 11:12 ` [PATCH 09/33] readahead: events accounting Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25 16:36     ` Andrew Morton
2006-05-26  7:09       ` Wu Fengguang
2006-05-26  7:09         ` Wu Fengguang
2006-05-27 13:20       ` Wu Fengguang
2006-05-27 13:20         ` Wu Fengguang
2006-05-29  8:19           ` Martin Peschke
2006-05-24 11:12 ` [PATCH 10/33] readahead: support functions Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25  5:13     ` Nick Piggin
2006-05-25 11:13       ` Wu Fengguang
2006-05-25 11:13         ` Wu Fengguang
2006-05-25 16:48     ` Andrew Morton
2006-05-26  7:31       ` Wu Fengguang
2006-05-26  7:31         ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 11/33] readahead: sysctl parameters Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-25  4:50     ` [PATCH 12/33] readahead: min/max sizes Nick Piggin
2006-05-25 12:12       ` Wu Fengguang
2006-05-25 12:12         ` Wu Fengguang
2006-05-24 11:12 ` [PATCH 13/33] readahead: state based method - aging accounting Wu Fengguang
2006-05-24 11:12   ` Wu Fengguang
2006-05-26 17:04     ` Andrew Morton
2006-05-27  6:22       ` Wu Fengguang
2006-05-27  6:22         ` Wu Fengguang
2006-05-27  7:00           ` Andrew Morton
2006-05-27  7:22             ` Wu Fengguang
2006-05-27  7:22               ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 14/33] readahead: state based method - data structure Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-25  6:03     ` Nick Piggin
2006-05-25 10:43       ` Wu Fengguang
2006-05-25 10:43         ` Wu Fengguang
2006-05-26 17:05     ` Andrew Morton
2006-05-27  7:02       ` Wu Fengguang
2006-05-27  7:02         ` Wu Fengguang
2006-05-27  8:27       ` Wu Fengguang
2006-05-27  8:27         ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 15/33] readahead: state based method - routines Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-26 17:15     ` Andrew Morton
2006-05-27  2:06       ` Wu Fengguang
2006-05-27  2:06         ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 17/33] readahead: context based method Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-25  5:26     ` Nick Piggin
2006-05-25  8:03       ` Wu Fengguang
2006-05-25  8:03         ` Wu Fengguang
2006-05-26 17:23     ` Andrew Morton
2006-05-27  2:12       ` Wu Fengguang
2006-05-27  2:12         ` Wu Fengguang
2006-05-26 17:27     ` Andrew Morton
2006-05-27  8:04       ` Wu Fengguang
2006-05-27  8:04         ` Wu Fengguang
2006-05-24 12:37   ` Peter Zijlstra
2006-05-24 13:33     ` Wu Fengguang
2006-05-24 13:33       ` Wu Fengguang
2006-05-24 15:53       ` Peter Zijlstra
2006-05-25  1:25         ` Wu Fengguang [this message]
2006-05-25  1:25           ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 18/33] readahead: initial method - guiding sizes Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 19/33] readahead: initial method - thrashing guard size Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 20/33] readahead: initial method - expected read size Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-25  5:34     ` [PATCH 22/33] readahead: initial method Nick Piggin
2006-05-25  8:59       ` Wu Fengguang
2006-05-25  8:59         ` Wu Fengguang
2006-05-26 17:29     ` [PATCH 20/33] readahead: initial method - expected read size Andrew Morton
2006-05-27  6:38       ` Wu Fengguang
2006-05-27  6:38         ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 23/33] readahead: backward prefetching method Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-26 17:37     ` Nate Diller
2006-05-26 19:22       ` Nathan Scott
2006-05-28 12:30         ` Wu Fengguang
2006-05-28 12:30           ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 24/33] readahead: seeking reads method Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 25/33] readahead: thrashing recovery method Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 26/33] readahead: call scheme Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 27/33] readahead: laptop mode Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-26 17:38     ` Andrew Morton
2006-05-24 11:13 ` [PATCH 28/33] readahead: loop case Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 14:01   ` Limin Wang
2006-05-25 15:48     ` wfg
2006-05-25 15:48       ` wfg
2006-05-24 11:13 ` [PATCH 29/33] readahead: nfsd case Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 30/33] readahead: turn on by default Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 31/33] readahead: debug radix tree new functions Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 32/33] readahead: debug traces showing accessed file names Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
2006-05-24 11:13 ` [PATCH 33/33] readahead: debug traces showing read patterns Wu Fengguang
2006-05-24 11:13   ` Wu Fengguang
     [not found] <20060526113906.084341801@localhost.localdomain>
2006-05-26 11:39 ` [PATCH 17/33] readahead: context based method Wu Fengguang
2006-05-26 11:39   ` Wu Fengguang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=348520355.07663@ustc.edu.cn \
    --to=wfg@mail.ustc.edu.cn \
    --cc=a.p.zijlstra@chello.nl \
    --cc=akpm@osdl.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.