Re: [PATCH -V3 -mm] mm, swap: Fix race between swapoff and some swap operations

All of lore.kernel.org
 help / color / mirror / Atom feed

From: "Huang\, Ying" <ying.huang@intel.com>
To: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	linux-mm@kvack.org, linux-kernel@vger.kernel.org,
	Hugh Dickins <hughd@google.com>, Minchan Kim <minchan@kernel.org>,
	Johannes Weiner <hannes@cmpxchg.org>,
	Tim Chen <tim.c.chen@linux.intel.com>, Shaohua Li <shli@fb.com>,
	Mel Gorman <mgorman@techsingularity.net>,
	JXrXme Glisse <jglisse@redhat.com>,
	Michal Hocko <mhocko@suse.com>,
	Andrea Arcangeli <aarcange@redhat.com>,
	David Rientjes <rientjes@google.com>,
	Rik van Riel <riel@redhat.com>, Jan Kara <jack@suse.cz>,
	Dave Jiang <dave.jiang@intel.com>, Aaron Lu <aaron.lu@intel.com>
Subject: Re: [PATCH -V3 -mm] mm, swap: Fix race between swapoff and some swap operations
Date: Tue, 19 Dec 2017 16:08:09 +0800	[thread overview]
Message-ID: <87lghzta1i.fsf@yhuang-dev.intel.com> (raw)
In-Reply-To: <20171219053650.GB7829@linux.vnet.ibm.com> (Paul E. McKenney's message of "Mon, 18 Dec 2017 21:36:50 -0800")

"Paul E. McKenney" <paulmck@linux.vnet.ibm.com> writes:

> On Tue, Dec 19, 2017 at 09:57:21AM +0800, Huang, Ying wrote:
>> "Paul E. McKenney" <paulmck@linux.vnet.ibm.com> writes:
>> 
>> > On Mon, Dec 18, 2017 at 03:41:41PM +0800, Huang, Ying wrote:
>> >> "Huang, Ying" <ying.huang@intel.com> writes:
>> >> And, it appears that if we replace smp_wmb() in _enable_swap_info() with
>> >> stop_machine() in some way, we can avoid smp_rmb() in get_swap_device().
>> >> This can reduce overhead in normal path further.  Can we get same effect
>> >> with RCU?  For example, use synchronize_rcu() instead of stop_machine()?
>> >> 
>> >> Hi, Paul, can you help me on this?
>> >
>> > If the key loads before and after the smp_rmb() are within the same
>> > RCU read-side critical section, -and- if one of the critical writes is
>> > before the synchronize_rcu() and the other critical write is after the
>> > synchronize_rcu(), then you normally don't need the smp_rmb().
>> >
>> > Otherwise, you likely do still need the smp_rmb().
>> 
>> My question may be too general, let make it more specific.  For the
>> following program,
>> 
>> "
>> int a;
>> int b;
>> 
>> void intialize(void)
>> {
>>         a = 1;
>>         synchronize_rcu();
>>         b = 2;
>> }
>> 
>> void test(void)
>> {
>>         int c;
>> 
>>         rcu_read_lock();
>>         c = b;
>>         /* ignored smp_rmb() */
>>         if (c)
>>                 pr_info("a=%d\n", a);
>>         rcu_read_unlock();
>> }
>> "
>> 
>> Is it possible for it to show
>> 
>> "
>> a=0
>> "
>> 
>> in kernel log?
>> 
>> 
>> If it couldn't, this could be a useful usage model of RCU to accelerate
>> hot path.
>
> This is not possible, and it can be verified using the Linux kernel
> memory model.  An introduction to an older version of this model may
> be found here (including an introduction to litmus tests and their
> output):
>
> 	https://lwn.net/Articles/718628/
> 	https://lwn.net/Articles/720550/
>
> The litmus test and its output are shown below.
>
> The reason it is not possible is that the entirety of test()'s RCU
> read-side critical section must do one of two things:
>
> 1.	Come before the return from initialize()'s synchronize_rcu().
> 2.	Come after the call to initialize()'s synchronize_rcu().
>
> Suppose test()'s load from "b" sees initialize()'s assignment.  Then
> some part of test()'s RCU read-side critical section came after
> initialize()'s call to synchronize_rcu(), which means that the entirety
> of test()'s RCU read-side critical section must come after initialize()'s
> call to synchronize_rcu().  Therefore, whenever "c" is non-zero, the
> pr_info() must see "a" non-zero.
>
> 							Thanx, Paul
>
> ------------------------------------------------------------------------
>
> C MP-o-sync-o+rl-o-ctl-o-rul
>
> {}
>
> P0(int *a, int *b)
> {
> 	WRITE_ONCE(*a, 1);
> 	synchronize_rcu();
> 	WRITE_ONCE(*b, 2);
> }
>
> P1(int *a, int *b)
> {
> 	int r0;
> 	int r1;
>
> 	rcu_read_lock();
> 	r0 = READ_ONCE(*b);
> 	if (r0)
> 		r1 = READ_ONCE(*a);
> 	rcu_read_unlock();
> }
>
> exists (1:r0=1 /\ 1:r1=0)
>
> ------------------------------------------------------------------------
>
> States 2
> 1:r0=0; 1:r1=0;
> 1:r0=2; 1:r1=1;
> No
> Witnesses
> Positive: 0 Negative: 2
> Condition exists (1:r0=1 /\ 1:r1=0)
> Observation MP-o-sync-o+rl-o-ctl-o-rul Never 0 2
> Time MP-o-sync-o+rl-o-ctl-o-rul 0.01
> Hash=b20eca2da50fa84b15e489502420ff56
>
> ------------------------------------------------------------------------
>
> The "Never 0 2" means that the condition cannot happen.

Thanks a lot for your detailed explanation!  That helps me much!

Best Regards,
Huang, Ying

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

WARNING: multiple messages have this Message-ID (diff)

From: "Huang\, Ying" <ying.huang@intel.com>
To: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
Cc: Andrew Morton <akpm@linux-foundation.org>, <linux-mm@kvack.org>,
	<linux-kernel@vger.kernel.org>, Hugh Dickins <hughd@google.com>,
	Minchan Kim <minchan@kernel.org>,
	Johannes Weiner <hannes@cmpxchg.org>,
	Tim Chen <tim.c.chen@linux.intel.com>, Shaohua Li <shli@fb.com>,
	Mel Gorman <mgorman@techsingularity.net>,
	JXrXme Glisse <jglisse@redhat.com>,
	"Michal Hocko" <mhocko@suse.com>,
	Andrea Arcangeli <aarcange@redhat.com>,
	"David Rientjes" <rientjes@google.com>,
	Rik van Riel <riel@redhat.com>, Jan Kara <jack@suse.cz>,
	Dave Jiang <dave.jiang@intel.com>, Aaron Lu <aaron.lu@intel.com>
Subject: Re: [PATCH -V3 -mm] mm, swap: Fix race between swapoff and some swap operations
Date: Tue, 19 Dec 2017 16:08:09 +0800	[thread overview]
Message-ID: <87lghzta1i.fsf@yhuang-dev.intel.com> (raw)
In-Reply-To: <20171219053650.GB7829@linux.vnet.ibm.com> (Paul E. McKenney's message of "Mon, 18 Dec 2017 21:36:50 -0800")

"Paul E. McKenney" <paulmck@linux.vnet.ibm.com> writes:

> On Tue, Dec 19, 2017 at 09:57:21AM +0800, Huang, Ying wrote:
>> "Paul E. McKenney" <paulmck@linux.vnet.ibm.com> writes:
>> 
>> > On Mon, Dec 18, 2017 at 03:41:41PM +0800, Huang, Ying wrote:
>> >> "Huang, Ying" <ying.huang@intel.com> writes:
>> >> And, it appears that if we replace smp_wmb() in _enable_swap_info() with
>> >> stop_machine() in some way, we can avoid smp_rmb() in get_swap_device().
>> >> This can reduce overhead in normal path further.  Can we get same effect
>> >> with RCU?  For example, use synchronize_rcu() instead of stop_machine()?
>> >> 
>> >> Hi, Paul, can you help me on this?
>> >
>> > If the key loads before and after the smp_rmb() are within the same
>> > RCU read-side critical section, -and- if one of the critical writes is
>> > before the synchronize_rcu() and the other critical write is after the
>> > synchronize_rcu(), then you normally don't need the smp_rmb().
>> >
>> > Otherwise, you likely do still need the smp_rmb().
>> 
>> My question may be too general, let make it more specific.  For the
>> following program,
>> 
>> "
>> int a;
>> int b;
>> 
>> void intialize(void)
>> {
>>         a = 1;
>>         synchronize_rcu();
>>         b = 2;
>> }
>> 
>> void test(void)
>> {
>>         int c;
>> 
>>         rcu_read_lock();
>>         c = b;
>>         /* ignored smp_rmb() */
>>         if (c)
>>                 pr_info("a=%d\n", a);
>>         rcu_read_unlock();
>> }
>> "
>> 
>> Is it possible for it to show
>> 
>> "
>> a=0
>> "
>> 
>> in kernel log?
>> 
>> 
>> If it couldn't, this could be a useful usage model of RCU to accelerate
>> hot path.
>
> This is not possible, and it can be verified using the Linux kernel
> memory model.  An introduction to an older version of this model may
> be found here (including an introduction to litmus tests and their
> output):
>
> 	https://lwn.net/Articles/718628/
> 	https://lwn.net/Articles/720550/
>
> The litmus test and its output are shown below.
>
> The reason it is not possible is that the entirety of test()'s RCU
> read-side critical section must do one of two things:
>
> 1.	Come before the return from initialize()'s synchronize_rcu().
> 2.	Come after the call to initialize()'s synchronize_rcu().
>
> Suppose test()'s load from "b" sees initialize()'s assignment.  Then
> some part of test()'s RCU read-side critical section came after
> initialize()'s call to synchronize_rcu(), which means that the entirety
> of test()'s RCU read-side critical section must come after initialize()'s
> call to synchronize_rcu().  Therefore, whenever "c" is non-zero, the
> pr_info() must see "a" non-zero.
>
> 							Thanx, Paul
>
> ------------------------------------------------------------------------
>
> C MP-o-sync-o+rl-o-ctl-o-rul
>
> {}
>
> P0(int *a, int *b)
> {
> 	WRITE_ONCE(*a, 1);
> 	synchronize_rcu();
> 	WRITE_ONCE(*b, 2);
> }
>
> P1(int *a, int *b)
> {
> 	int r0;
> 	int r1;
>
> 	rcu_read_lock();
> 	r0 = READ_ONCE(*b);
> 	if (r0)
> 		r1 = READ_ONCE(*a);
> 	rcu_read_unlock();
> }
>
> exists (1:r0=1 /\ 1:r1=0)
>
> ------------------------------------------------------------------------
>
> States 2
> 1:r0=0; 1:r1=0;
> 1:r0=2; 1:r1=1;
> No
> Witnesses
> Positive: 0 Negative: 2
> Condition exists (1:r0=1 /\ 1:r1=0)
> Observation MP-o-sync-o+rl-o-ctl-o-rul Never 0 2
> Time MP-o-sync-o+rl-o-ctl-o-rul 0.01
> Hash=b20eca2da50fa84b15e489502420ff56
>
> ------------------------------------------------------------------------
>
> The "Never 0 2" means that the condition cannot happen.

Thanks a lot for your detailed explanation!  That helps me much!

Best Regards,
Huang, Ying

next prev parent reply	other threads:[~2017-12-19  8:08 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-12-18  7:34 [PATCH -V3 -mm] mm, swap: Fix race between swapoff and some swap operations Huang, Ying
2017-12-18  7:34 ` Huang, Ying
2017-12-18  7:41 ` Huang, Ying
2017-12-18  7:41   ` Huang, Ying
2017-12-18 21:13   ` Junaid Shahid
2017-12-18 21:13     ` Junaid Shahid
2017-12-18 23:09   ` Paul E. McKenney
2017-12-18 23:09     ` Paul E. McKenney
2017-12-19  1:57     ` Huang, Ying
2017-12-19  5:36       ` Paul E. McKenney
2017-12-19  5:36         ` Paul E. McKenney
2017-12-19  8:08         ` Huang, Ying [this message]
2017-12-19  8:08           ` Huang, Ying

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87lghzta1i.fsf@yhuang-dev.intel.com \
    --to=ying.huang@intel.com \
    --cc=aarcange@redhat.com \
    --cc=aaron.lu@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=dave.jiang@intel.com \
    --cc=hannes@cmpxchg.org \
    --cc=hughd@google.com \
    --cc=jack@suse.cz \
    --cc=jglisse@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mgorman@techsingularity.net \
    --cc=mhocko@suse.com \
    --cc=minchan@kernel.org \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=riel@redhat.com \
    --cc=rientjes@google.com \
    --cc=shli@fb.com \
    --cc=tim.c.chen@linux.intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.