Re: [PATCH 17/17] mm: filemap: Avoid unnecessary barries and waitqueue lookup in unlock_page fastpath

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Mel Gorman <mgorman@suse.de>
To: Jan Kara <jack@suse.cz>
Cc: Linux-MM <linux-mm@kvack.org>,
	Linux-FSDevel <linux-fsdevel@vger.kernel.org>,
	Johannes Weiner <hannes@cmpxchg.org>,
	Vlastimil Babka <vbabka@suse.cz>, Michal Hocko <mhocko@suse.cz>,
	Hugh Dickins <hughd@google.com>,
	Linux Kernel <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH 17/17] mm: filemap: Avoid unnecessary barries and waitqueue lookup in unlock_page fastpath
Date: Wed, 7 May 2014 10:03:22 +0100	[thread overview]
Message-ID: <20140507090322.GD23991@suse.de> (raw)
In-Reply-To: <20140505105054.GC23927@quack.suse.cz>

On Mon, May 05, 2014 at 12:50:54PM +0200, Jan Kara wrote:
> On Thu 01-05-14 09:44:48, Mel Gorman wrote:
> > From: Nick Piggin <npiggin@suse.de>
> > 
> > This patch introduces a new page flag for 64-bit capable machines,
> > PG_waiters, to signal there are processes waiting on PG_lock and uses it to
> > avoid memory barriers and waitqueue hash lookup in the unlock_page fastpath.
> > 
> > This adds a few branches to the fast path but avoids bouncing a dirty
> > cache line between CPUs. 32-bit machines always take the slow path but the
> > primary motivation for this patch is large machines so I do not think that
> > is a concern.
> ...
> >  /* 
> > diff --git a/kernel/sched/wait.c b/kernel/sched/wait.c
> > index 7d50f79..fb83fe0 100644
> > --- a/kernel/sched/wait.c
> > +++ b/kernel/sched/wait.c
> > @@ -304,8 +304,7 @@ int wake_bit_function(wait_queue_t *wait, unsigned mode, int sync, void *arg)
> >  		= container_of(wait, struct wait_bit_queue, wait);
> >  
> >  	if (wait_bit->key.flags != key->flags ||
> > -			wait_bit->key.bit_nr != key->bit_nr ||
> > -			test_bit(key->bit_nr, key->flags))
> > +			wait_bit->key.bit_nr != key->bit_nr)
> >  		return 0;
> >  	else
> >  		return autoremove_wake_function(wait, mode, sync, key);
>   This change seems to be really unrelated? And it would deserve a comment
> on its own I'd think so maybe split that in a separate patch?
> 

Without it processes can sleep forever on the lock bit and hang due to
races between when the PG_waiters is set and cleared. I'll investigate
if this can be done a better way.

> > diff --git a/mm/filemap.c b/mm/filemap.c
> > index c60ed0f..93e4385 100644
> > --- a/mm/filemap.c
> > +++ b/mm/filemap.c
> > +int  __wait_on_page_locked_killable(struct page *page)
> > +{
> > +	int ret = 0;
> > +	wait_queue_head_t *wq = page_waitqueue(page);
> > +	DEFINE_WAIT_BIT(wait, &page->flags, PG_locked);
> > +
> > +	if (!test_bit(PG_locked, &page->flags))
> > +		return 0;
> > +	do {
> > +		prepare_to_wait(wq, &wait.wait, TASK_KILLABLE);
> > +		if (!PageWaiters(page))
> > +			SetPageWaiters(page);
> > +		if (likely(PageLocked(page)))
> > +			ret = sleep_on_page_killable(page);
> > +		finish_wait(wq, &wait.wait);
> > +	} while (PageLocked(page) && !ret);
>   So I'm somewhat wondering why this is the only page waiting variant that
> does finish_wait() inside the loop. Everyone else does it outside the while
> loop which seems sufficient to me even in this case...
> 

No reason. The finish_wait can always be outside. I'll fix it up.
Thanks.

-- 
Mel Gorman
SUSE Labs

WARNING: multiple messages have this Message-ID (diff)

From: Mel Gorman <mgorman@suse.de>
To: Jan Kara <jack@suse.cz>
Cc: Linux-MM <linux-mm@kvack.org>,
	Linux-FSDevel <linux-fsdevel@vger.kernel.org>,
	Johannes Weiner <hannes@cmpxchg.org>,
	Vlastimil Babka <vbabka@suse.cz>, Michal Hocko <mhocko@suse.cz>,
	Hugh Dickins <hughd@google.com>,
	Linux Kernel <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH 17/17] mm: filemap: Avoid unnecessary barries and waitqueue lookup in unlock_page fastpath
Date: Wed, 7 May 2014 10:03:22 +0100	[thread overview]
Message-ID: <20140507090322.GD23991@suse.de> (raw)
In-Reply-To: <20140505105054.GC23927@quack.suse.cz>

On Mon, May 05, 2014 at 12:50:54PM +0200, Jan Kara wrote:
> On Thu 01-05-14 09:44:48, Mel Gorman wrote:
> > From: Nick Piggin <npiggin@suse.de>
> > 
> > This patch introduces a new page flag for 64-bit capable machines,
> > PG_waiters, to signal there are processes waiting on PG_lock and uses it to
> > avoid memory barriers and waitqueue hash lookup in the unlock_page fastpath.
> > 
> > This adds a few branches to the fast path but avoids bouncing a dirty
> > cache line between CPUs. 32-bit machines always take the slow path but the
> > primary motivation for this patch is large machines so I do not think that
> > is a concern.
> ...
> >  /* 
> > diff --git a/kernel/sched/wait.c b/kernel/sched/wait.c
> > index 7d50f79..fb83fe0 100644
> > --- a/kernel/sched/wait.c
> > +++ b/kernel/sched/wait.c
> > @@ -304,8 +304,7 @@ int wake_bit_function(wait_queue_t *wait, unsigned mode, int sync, void *arg)
> >  		= container_of(wait, struct wait_bit_queue, wait);
> >  
> >  	if (wait_bit->key.flags != key->flags ||
> > -			wait_bit->key.bit_nr != key->bit_nr ||
> > -			test_bit(key->bit_nr, key->flags))
> > +			wait_bit->key.bit_nr != key->bit_nr)
> >  		return 0;
> >  	else
> >  		return autoremove_wake_function(wait, mode, sync, key);
>   This change seems to be really unrelated? And it would deserve a comment
> on its own I'd think so maybe split that in a separate patch?
> 

Without it processes can sleep forever on the lock bit and hang due to
races between when the PG_waiters is set and cleared. I'll investigate
if this can be done a better way.

> > diff --git a/mm/filemap.c b/mm/filemap.c
> > index c60ed0f..93e4385 100644
> > --- a/mm/filemap.c
> > +++ b/mm/filemap.c
> > +int  __wait_on_page_locked_killable(struct page *page)
> > +{
> > +	int ret = 0;
> > +	wait_queue_head_t *wq = page_waitqueue(page);
> > +	DEFINE_WAIT_BIT(wait, &page->flags, PG_locked);
> > +
> > +	if (!test_bit(PG_locked, &page->flags))
> > +		return 0;
> > +	do {
> > +		prepare_to_wait(wq, &wait.wait, TASK_KILLABLE);
> > +		if (!PageWaiters(page))
> > +			SetPageWaiters(page);
> > +		if (likely(PageLocked(page)))
> > +			ret = sleep_on_page_killable(page);
> > +		finish_wait(wq, &wait.wait);
> > +	} while (PageLocked(page) && !ret);
>   So I'm somewhat wondering why this is the only page waiting variant that
> does finish_wait() inside the loop. Everyone else does it outside the while
> loop which seems sufficient to me even in this case...
> 

No reason. The finish_wait can always be outside. I'll fix it up.
Thanks.

-- 
Mel Gorman
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

next prev parent reply	other threads:[~2014-05-07  9:03 UTC|newest]

Thread overview: 113+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-05-01  8:44 [PATCH 00/17] Misc page alloc, shmem, mark_page_accessed and page_waitqueue optimisations Mel Gorman
2014-05-01  8:44 ` Mel Gorman
2014-05-01  8:44 ` [PATCH 01/17] mm: page_alloc: Do not update zlc unless the zlc is active Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-01 13:25   ` Johannes Weiner
2014-05-01 13:25     ` Johannes Weiner
2014-05-06 15:04   ` Rik van Riel
2014-05-06 15:04     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 02/17] mm: page_alloc: Do not treat a zone that cannot be used for dirty pages as "full" Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 15:09   ` Rik van Riel
2014-05-06 15:09     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 03/17] mm: page_alloc: Use jump labels to avoid checking number_of_cpusets Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 15:10   ` Rik van Riel
2014-05-06 15:10     ` Rik van Riel
2014-05-06 20:23   ` Peter Zijlstra
2014-05-06 20:23     ` Peter Zijlstra
2014-05-06 22:21     ` Mel Gorman
2014-05-06 22:21       ` Mel Gorman
2014-05-07  9:04       ` Peter Zijlstra
2014-05-07  9:43         ` Mel Gorman
2014-05-07  9:43           ` Mel Gorman
2014-05-01  8:44 ` [PATCH 04/17] mm: page_alloc: Calculate classzone_idx once from the zonelist ref Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 16:01   ` Rik van Riel
2014-05-06 16:01     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 05/17] mm: page_alloc: Only check the zone id check if pages are buddies Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 16:48   ` Rik van Riel
2014-05-06 16:48     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 06/17] mm: page_alloc: Only check the alloc flags and gfp_mask for dirty once Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 17:24   ` Rik van Riel
2014-05-06 17:24     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 07/17] mm: page_alloc: Take the ALLOC_NO_WATERMARK check out of the fast path Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 17:25   ` Rik van Riel
2014-05-06 17:25     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 08/17] mm: page_alloc: Use word-based accesses for get/set pageblock bitmaps Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-02 22:34   ` Sasha Levin
2014-05-02 22:34     ` Sasha Levin
2014-05-04 13:14     ` Mel Gorman
2014-05-04 13:14       ` Mel Gorman
2014-05-05 12:40       ` Vlastimil Babka
2014-05-05 12:40         ` Vlastimil Babka
2014-05-06  9:13         ` Mel Gorman
2014-05-06  9:13           ` Mel Gorman
2014-05-06 14:42           ` Vlastimil Babka
2014-05-06 14:42             ` Vlastimil Babka
2014-05-06 15:12             ` Mel Gorman
2014-05-06 15:12               ` Mel Gorman
2014-05-06 20:34   ` Peter Zijlstra
2014-05-06 20:34     ` Peter Zijlstra
2014-05-06 22:24     ` Mel Gorman
2014-05-06 22:24       ` Mel Gorman
2014-05-01  8:44 ` [PATCH 09/17] mm: page_alloc: Reduce number of times page_to_pfn is called Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 18:47   ` Rik van Riel
2014-05-06 18:47     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 10/17] mm: page_alloc: Lookup pageblock migratetype with IRQs enabled during free Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 18:48   ` Rik van Riel
2014-05-06 18:48     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 11/17] mm: page_alloc: Use unsigned int for order in more places Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-01 14:35   ` Dave Hansen
2014-05-01 14:35     ` Dave Hansen
2014-05-01 15:11     ` Mel Gorman
2014-05-01 15:11       ` Mel Gorman
2014-05-01 15:38       ` Dave Hansen
2014-05-01 15:38         ` Dave Hansen
2014-05-06 18:49   ` Rik van Riel
2014-05-06 18:49     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 12/17] mm: page_alloc: Convert hot/cold parameter and immediate callers to bool Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 18:49   ` Rik van Riel
2014-05-06 18:49     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 13/17] mm: shmem: Avoid atomic operation during shmem_getpage_gfp Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-06 18:53   ` Rik van Riel
2014-05-06 18:53     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 14/17] mm: Do not use atomic operations when releasing pages Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-01 13:29   ` Johannes Weiner
2014-05-01 13:29     ` Johannes Weiner
2014-05-01 13:39     ` Mel Gorman
2014-05-01 13:39       ` Mel Gorman
2014-05-01 13:47       ` Johannes Weiner
2014-05-01 13:47         ` Johannes Weiner
2014-05-06 18:54   ` Rik van Riel
2014-05-06 18:54     ` Rik van Riel
2014-05-01  8:44 ` [PATCH 15/17] mm: Do not use unnecessary atomic operations when adding pages to the LRU Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-01 13:33   ` Johannes Weiner
2014-05-01 13:33     ` Johannes Weiner
2014-05-01 13:40     ` Mel Gorman
2014-05-01 13:40       ` Mel Gorman
2014-05-06 15:30   ` Vlastimil Babka
2014-05-06 15:30     ` Vlastimil Babka
2014-05-06 15:55     ` Mel Gorman
2014-05-06 15:55       ` Mel Gorman
2014-05-01  8:44 ` [PATCH 16/17] mm: Non-atomically mark page accessed during page cache allocation where possible Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-01  8:44 ` [PATCH 17/17] mm: filemap: Avoid unnecessary barries and waitqueue lookup in unlock_page fastpath Mel Gorman
2014-05-01  8:44   ` Mel Gorman
2014-05-05 10:50   ` Jan Kara
2014-05-05 10:50     ` Jan Kara
2014-05-07  9:03     ` Mel Gorman [this message]
2014-05-07  9:03       ` Mel Gorman
2014-05-06 20:30   ` Peter Zijlstra
2014-05-06 20:30     ` Peter Zijlstra

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140507090322.GD23991@suse.de \
    --to=mgorman@suse.de \
    --cc=hannes@cmpxchg.org \
    --cc=hughd@google.com \
    --cc=jack@suse.cz \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mhocko@suse.cz \
    --cc=vbabka@suse.cz \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.