virtualization.lists.linux-foundation.org archive mirror
 help / color / mirror / Atom feed
From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Will Deacon <will.deacon@arm.com>
Cc: linux-mips@linux-mips.org, linux-ia64@vger.kernel.org,
	"Michael S. Tsirkin" <mst@redhat.com>,
	Peter Zijlstra <peterz@infradead.org>,
	virtualization@lists.linux-foundation.org,
	"H. Peter Anvin" <hpa@zytor.com>,
	sparclinux@vger.kernel.org, Ingo Molnar <mingo@kernel.org>,
	linux-arch@vger.kernel.org, linux-s390@vger.kernel.org,
	Russell King - ARM Linux <linux@arm.linux.org.uk>,
	user-mode-linux-devel@lists.sourceforge.net,
	linux-sh@vger.kernel.org, Michael Ellerman <mpe@ellerman.id.au>,
	x86@kernel.org, xen-devel@lists.xenproject.org,
	Ingo Molnar <mingo@elte.hu>,
	linux-xtensa@linux-xtensa.org, james.hogan@imgtec.com,
	Arnd Bergmann <arnd@arndb.de>,
	Stefano Stabellini <stefano.stabellini@eu.citrix.com>,
	adi-buildroot-devel@lists.sourceforge.net,
	Leonid Yegoshin <Leonid.Yegoshin@imgtec.com>,
	ddaney.cavm@gmail.com, Thomas Gleixner <tglx@linutronix.de>,
	linux-metag@vger.kernel.
Subject: Re: [v3,11/41] mips: reuse asm-generic/barrier.h
Date: Mon, 25 Jan 2016 22:12:11 -0800	[thread overview]
Message-ID: <20160126061211.GK4503@linux.vnet.ibm.com> (raw)
In-Reply-To: <20160125180234.GA26732@arm.com>

On Mon, Jan 25, 2016 at 06:02:34PM +0000, Will Deacon wrote:
> Hi Paul,
> 
> On Fri, Jan 15, 2016 at 09:39:12AM -0800, Paul E. McKenney wrote:
> > On Fri, Jan 15, 2016 at 09:55:54AM +0100, Peter Zijlstra wrote:
> > > On Thu, Jan 14, 2016 at 01:29:13PM -0800, Paul E. McKenney wrote:
> > > > So smp_mb() provides transitivity, as do pairs of smp_store_release()
> > > > and smp_read_acquire(), 
> > > 
> > > But they provide different grades of transitivity, which is where all
> > > the confusion lays.
> > > 
> > > smp_mb() is strongly/globally transitive, all CPUs will agree on the order.
> > > 
> > > Whereas the RCpc release+acquire is weakly so, only the two cpus
> > > involved in the handover will agree on the order.
> > 
> > Good point!
> > 
> > Using grace periods in place of smp_mb() also provides strong/global
> > transitivity, but also insanely high latencies.  ;-)
> > 
> > The patch below updates Documentation/memory-barriers.txt to define
> > local vs. global transitivity.  The corresponding ppcmem litmus test
> > is included below as well.
> > 
> > Should we start putting litmus tests for the various examples
> > somewhere, perhaps in a litmus-tests directory within each participating
> > architecture?  I have a pile of powerpc-related litmus tests on my laptop,
> > but they probably aren't doing all that much good there.
> 
> I too would like to have the litmus tests in the kernel so that we can
> refer to them from memory-barriers.txt. Ideally they wouldn't be targetted
> to a particular arch, however.

Agreed.  Working on it...

> > PPC local-transitive
> > ""
> > {
> > 0:r1=1; 0:r2=u; 0:r3=v; 0:r4=x; 0:r5=y; 0:r6=z;
> > 1:r1=1; 1:r2=u; 1:r3=v; 1:r4=x; 1:r5=y; 1:r6=z;
> > 2:r1=1; 2:r2=u; 2:r3=v; 2:r4=x; 2:r5=y; 2:r6=z;
> > 3:r1=1; 3:r2=u; 3:r3=v; 3:r4=x; 3:r5=y; 3:r6=z;
> > }
> >  P0           | P1           | P2           | P3           ;
> >  lwz r9,0(r4) | lwz r9,0(r5) | lwz r9,0(r6) | stw r1,0(r3) ;
> >  lwsync       | lwsync       | lwsync       | sync         ;
> >  stw r1,0(r2) | lwz r8,0(r3) | stw r1,0(r7) | lwz r9,0(r2) ;
> >  lwsync       | lwz r7,0(r2) |              |              ;
> >  stw r1,0(r5) | lwsync       |              |              ;
> >               | stw r1,0(r6) |              |              ;
> > exists
> > (* (0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r8=0 /\ 3:r9=0) *)
> > (* (0:r9=1 /\ 1:r9=1 /\ 2:r9=1) *)
> > (* (0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r7=0) *)
> > (0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r7=0)
> 
> i.e. we should rewrite this using READ_ONCE/WRITE_ONCE and smp_mb() etc.

Yep!

> > ------------------------------------------------------------------------
> > 
> > commit 2cb4e83a1b5c89c8e39b8a64bd89269d05913e41
> > Author: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
> > Date:   Fri Jan 15 09:30:42 2016 -0800
> > 
> >     documentation: Distinguish between local and global transitivity
> >     
> >     The introduction of smp_load_acquire() and smp_store_release() had
> >     the side effect of introducing a weaker notion of transitivity:
> >     The transitivity of full smp_mb() barriers is global, but that
> >     of smp_store_release()/smp_load_acquire() chains is local.  This
> >     commit therefore introduces the notion of local transitivity and
> >     gives an example.
> >     
> >     Reported-by: Peter Zijlstra <peterz@infradead.org>
> >     Reported-by: Will Deacon <will.deacon@arm.com>
> >     Signed-off-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
> > 
> > diff --git a/Documentation/memory-barriers.txt b/Documentation/memory-barriers.txt
> > index c66ba46d8079..d8109ed99342 100644
> > --- a/Documentation/memory-barriers.txt
> > +++ b/Documentation/memory-barriers.txt
> > @@ -1318,8 +1318,82 @@ or a level of cache, CPU 2 might have early access to CPU 1's writes.
> >  General barriers are therefore required to ensure that all CPUs agree
> >  on the combined order of CPU 1's and CPU 2's accesses.
> >  
> > -To reiterate, if your code requires transitivity, use general barriers
> > -throughout.
> > +General barriers provide "global transitivity", so that all CPUs will
> > +agree on the order of operations.  In contrast, a chain of release-acquire
> > +pairs provides only "local transitivity", so that only those CPUs on
> > +the chain are guaranteed to agree on the combined order of the accesses.
> 
> Thanks for having a go at this. I tried defining something axiomatically,
> but got stuck pretty quickly. In my scheme, I used "data-directed
> transitivity" instead of "local transitivity", since the latter seems to
> be a bit of a misnomer.

I figured that "local" meant local to the CPUs participating in the
release-acquire chain.  As opposed to smp_mb() chains where the ordering
is "global" as in visible to all CPUs, whether on the chain or not.
Does that help?

> > +For example, switching to C code in deference to Herman Hollerith:
> > +
> > +	int u, v, x, y, z;
> > +
> > +	void cpu0(void)
> > +	{
> > +		r0 = smp_load_acquire(&x);
> > +		WRITE_ONCE(u, 1);
> > +		smp_store_release(&y, 1);
> > +	}
> > +
> > +	void cpu1(void)
> > +	{
> > +		r1 = smp_load_acquire(&y);
> > +		r4 = READ_ONCE(v);
> > +		r5 = READ_ONCE(u);
> > +		smp_store_release(&z, 1);
> > +	}
> > +
> > +	void cpu2(void)
> > +	{
> > +		r2 = smp_load_acquire(&z);
> > +		smp_store_release(&x, 1);
> > +	}
> > +
> > +	void cpu3(void)
> > +	{
> > +		WRITE_ONCE(v, 1);
> > +		smp_mb();
> > +		r3 = READ_ONCE(u);
> > +	}
> > +
> > +Because cpu0(), cpu1(), and cpu2() participate in a local transitive
> > +chain of smp_store_release()/smp_load_acquire() pairs, the following
> > +outcome is prohibited:
> > +
> > +	r0 == 1 && r1 == 1 && r2 == 1
> > +
> > +Furthermore, because of the release-acquire relationship between cpu0()
> > +and cpu1(), cpu1() must see cpu0()'s writes, so that the following
> > +outcome is prohibited:
> > +
> > +	r1 == 1 && r5 == 0
> > +
> > +However, the transitivity of release-acquire is local to the participating
> > +CPUs and does not apply to cpu3().  Therefore, the following outcome
> > +is possible:
> > +
> > +	r0 == 0 && r1 == 1 && r2 == 1 && r3 == 0 && r4 == 0
> 
> I think you should be completely explicit and include r5 == 1 here, too.

Good point -- I added this as an additional outcome:

	r0 == 0 && r1 == 1 && r2 == 1 && r3 == 0 && r4 == 0 && r5 == 1

> Also -- where would you add the smp_mb__after_release_acquire to fix
> (i.e. forbid) this? Immediately after cpu1()'s read of y?

That sounds plausible, but we would first have to agree on exactly
what smp_mb__after_release_acquire() did.  ;-)

> > +Although cpu0(), cpu1(), and cpu2() will see their respective reads and
> > +writes in order, CPUs not involved in the release-acquire chain might
> > +well disagree on the order.  This disagreement stems from the fact that
> > +the weak memory-barrier instructions used to implement smp_load_acquire()
> > +and smp_store_release() are not required to order prior stores against
> > +subsequent loads in all cases.  This means that cpu3() can see cpu0()'s
> > +store to u as happening -after- cpu1()'s load from v, even though
> > +both cpu0() and cpu1() agree that these two operations occurred in the
> > +intended order.
> > +
> > +However, please keep in mind that smp_load_acquire() is not magic.
> > +In particular, it simply reads from its argument with ordering.  It does
> > +-not- ensure that any particular value will be read.  Therefore, the
> > +following outcome is possible:
> > +
> > +	r0 == 0 && r1 == 0 && r2 == 0 && r5 == 0
> > +
> > +Note that this outcome can happen even on a mythical sequentially
> > +consistent system where nothing is ever reordered.
> 
> I'm not sure this last bit is strictly necessary. If somebody thinks that
> acquire/release involve some sort of implicit synchronisation, I think
> they may have bigger problems with memory-barriers.txt.

Agreed.  But unless I add text like this occasionally, such people could
easily read through much of memory-barriers.txt and think that they did
in fact understand it.  So I have to occasionally trip an assertion in
their brain.  Or try to...  :-/

							Thanx, Paul

  parent reply	other threads:[~2016-01-26  6:12 UTC|newest]

Thread overview: 106+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <1452426622-4471-12-git-send-email-mst@redhat.com>
2016-01-12  1:14 ` [v3,11/41] mips: reuse asm-generic/barrier.h Leonid Yegoshin
     [not found] ` <56945366.2090504@imgtec.com>
2016-01-12  8:43   ` Michael S. Tsirkin
2016-01-12  9:27   ` Peter Zijlstra
2016-01-12 10:25     ` Peter Zijlstra
     [not found]     ` <20160112102555.GV6373@twins.programming.kicks-ass.net>
2016-01-12 10:40       ` Peter Zijlstra
2016-01-12 11:41         ` Will Deacon
2016-01-12 20:45           ` Leonid Yegoshin
     [not found]           ` <569565DA.2010903@imgtec.com>
2016-01-12 21:40             ` Peter Zijlstra
     [not found]             ` <20160112214003.GQ6357@twins.programming.kicks-ass.net>
2016-01-13  0:21               ` Leonid Yegoshin
2016-01-13 10:45             ` Will Deacon
     [not found]             ` <20160113104516.GE25458@arm.com>
2016-01-13 19:02               ` Leonid Yegoshin
     [not found]               ` <56969F4B.7070001@imgtec.com>
2016-01-13 20:48                 ` Peter Zijlstra
2016-01-13 20:58                   ` Leonid Yegoshin
2016-01-14 12:04                     ` Will Deacon
     [not found]                     ` <20160114120445.GB15828@arm.com>
2016-01-14 16:16                       ` Paul E. McKenney
2016-01-14 19:42                         ` Leonid Yegoshin
2016-01-14 20:15                           ` Peter Zijlstra
     [not found]                           ` <20160114201513.GI6357@twins.programming.kicks-ass.net>
2016-01-14 20:36                             ` Paul E. McKenney
2016-01-14 20:46                             ` Peter Zijlstra
2016-01-14 20:46                             ` Leonid Yegoshin
2016-01-14 21:34                               ` Paul E. McKenney
     [not found]                               ` <20160114213440.GG3818@linux.vnet.ibm.com>
2016-01-14 21:45                                 ` Leonid Yegoshin
2016-01-14 22:24                                   ` Paul E. McKenney
2016-01-14 23:04                                     ` Leonid Yegoshin
2016-01-14 20:12                       ` Leonid Yegoshin
     [not found]                       ` <56980145.5030901@imgtec.com>
2016-01-14 20:48                         ` Paul E. McKenney
2016-01-14 21:24                           ` Leonid Yegoshin
2016-01-14 22:20                             ` Paul E. McKenney
     [not found]                             ` <20160114222046.GH3818@linux.vnet.ibm.com>
2016-01-15  9:57                               ` Will Deacon
2016-01-15 18:54                                 ` Leonid Yegoshin
2016-01-26 10:24                               ` Peter Zijlstra
2016-01-26 10:32                                 ` Peter Zijlstra
2016-01-26 11:09                                   ` Will Deacon
     [not found]                                   ` <20160126110053.GA21553@arm.com>
2016-01-26 20:11                                     ` Paul E. McKenney
2016-01-27  8:35                                       ` [PATCH] documentation: Add disclaimer Peter Zijlstra
     [not found]                                       ` <20160127083546.GJ6357@twins.programming.kicks-ass.net>
2016-01-27 10:11                                         ` Will Deacon
2016-04-14 21:40                                         ` Paul E. McKenney
2016-01-27 14:57                                       ` David Howells
     [not found]                                       ` <15882.1453906627@warthog.procyon.org.uk>
2016-01-27 23:35                                         ` Paul E. McKenney
2016-01-28 20:02                                         ` David Howells
2016-04-14 21:40                                         ` Paul E. McKenney
2016-01-26 19:44                                 ` [v3,11/41] mips: reuse asm-generic/barrier.h Paul E. McKenney
2016-01-18  8:19                           ` Herbert Xu
     [not found]                           ` <20160118081929.GA30420@gondor.apana.org.au>
2016-01-18 15:46                             ` Paul E. McKenney
2016-01-26 16:52                               ` Boqun Feng
     [not found]                               ` <20160126165207.GB6029@fixme-laptop.cn.ibm.com>
2016-01-26 17:22                                 ` Peter Zijlstra
2016-01-26 19:44                                   ` Linus Torvalds
     [not found]                                   ` <CA+55aFzcC6C8imPs5vk4yH1Y2YHjnAdFM9HCkVs04COxuDQH6w@mail.gmail.com>
2016-01-26 20:10                                     ` Paul E. McKenney
     [not found]                                     ` <20160126201037.GU4503@linux.vnet.ibm.com>
2016-01-26 22:15                                       ` Linus Torvalds
     [not found]                                       ` <CA+55aFxjb+2rs2wVHtiSCcOzgMrE8H=yDeNcjyujPQudDCtLgw@mail.gmail.com>
2016-01-26 22:33                                         ` Linus Torvalds
     [not found]                                         ` <CA+55aFwxTJd+uibcxtZD3tGnj_n=LMwyAa0s8qyx_OF0OMWQkA@mail.gmail.com>
2016-01-26 23:29                                           ` Paul E. McKenney
2016-01-26 23:45                                             ` Linus Torvalds
     [not found]                                             ` <CA+55aFyD94yaA1QzXgfeO18T-czY3TVUi5n6r-9jOUObDeR-zQ@mail.gmail.com>
2016-01-27  0:57                                               ` Paul E. McKenney
2016-01-27  2:04                                             ` Boqun Feng
     [not found]                                             ` <20160127020447.GA1293@fixme-laptop.cn.ibm.com>
2016-01-27 23:30                                               ` Paul E. McKenney
2016-01-27  7:51                                           ` Peter Zijlstra
2016-01-27  8:05                                             ` Linus Torvalds
2016-01-26 19:51                                 ` Paul E. McKenney
2016-01-13 22:26               ` Leonid Yegoshin
     [not found]               ` <5696CF08.8080700@imgtec.com>
2016-01-14  9:24                 ` Michael S. Tsirkin
2016-01-14 12:14                 ` Will Deacon
     [not found]                 ` <20160114121449.GC15828@arm.com>
2016-01-14 19:28                   ` Leonid Yegoshin
     [not found]                   ` <5697F6D2.60409@imgtec.com>
2016-01-14 20:34                     ` Paul E. McKenney
2016-01-14 21:01                       ` Leonid Yegoshin
2016-01-14 21:29                         ` Paul E. McKenney
2016-01-14 21:36                           ` Leonid Yegoshin
2016-01-14 22:55                             ` Paul E. McKenney
2016-01-14 23:33                               ` Leonid Yegoshin
2016-01-15  0:47                                 ` Paul E. McKenney
2016-01-15  1:07                                   ` Leonid Yegoshin
2016-01-27 11:26                                     ` Maciej W. Rozycki
     [not found]                                     ` <alpine.DEB.2.00.1601271116520.5958@tp.orcam.me.uk>
2016-01-28  0:48                                       ` Leonid Yegoshin
2016-01-29 13:38                                         ` Maciej W. Rozycki
2016-01-28  0:58                                       ` Leonid Yegoshin
2016-01-27 10:40                                   ` Ralf Baechle
     [not found]                                   ` <20160127104032.GB2939@linux-mips.org>
2016-01-27 12:09                                     ` Maciej W. Rozycki
2016-01-15 10:24                               ` Will Deacon
2016-01-15 17:54                                 ` Paul E. McKenney
2016-01-15 19:28                                   ` Paul E. McKenney
2016-01-25 14:41                                     ` Will Deacon
     [not found]                                     ` <20160125144133.GB22927@arm.com>
2016-01-26  1:06                                       ` Paul E. McKenney
     [not found]                                       ` <20160126010646.GH4503@linux.vnet.ibm.com>
2016-01-26 12:10                                         ` Will Deacon
     [not found]                                         ` <20160126121010.GD21553@arm.com>
2016-01-26 23:37                                           ` Paul E. McKenney
2016-01-27 10:23                                             ` Will Deacon
2016-01-15  8:55                           ` Peter Zijlstra
     [not found]                           ` <20160115085554.GF3421@worktop>
2016-01-15  9:13                             ` Peter Zijlstra
2016-01-15 17:46                               ` Paul E. McKenney
2016-01-15 21:27                                 ` Peter Zijlstra
2016-01-15 21:58                                   ` Paul E. McKenney
     [not found]                                   ` <20160115215853.GC3818@linux.vnet.ibm.com>
2016-01-25 16:42                                     ` Will Deacon
2016-01-26  6:03                                       ` Paul E. McKenney
2016-01-26 10:19                                         ` Peter Zijlstra
2016-01-26 12:16                                         ` Will Deacon
2016-01-26 14:35                                           ` Boqun Feng
2016-01-26 19:58                                           ` Paul E. McKenney
     [not found]                                           ` <20160126195820.GS4503@linux.vnet.ibm.com>
2016-01-27 10:25                                             ` Will Deacon
2016-01-27 23:32                                               ` Paul E. McKenney
     [not found]                                         ` <20160126101927.GD6357@twins.programming.kicks-ass.net>
2016-01-26 20:13                                           ` Paul E. McKenney
2016-01-27  8:39                                             ` Peter Zijlstra
2016-01-15 17:39                             ` Paul E. McKenney
2016-01-15 21:29                               ` Peter Zijlstra
2016-01-15 22:01                                 ` Paul E. McKenney
2016-01-25 18:02                               ` Will Deacon
     [not found]                               ` <20160125180234.GA26732@arm.com>
2016-01-26  6:12                                 ` Paul E. McKenney [this message]
2016-01-26 10:15                                   ` Peter Zijlstra
     [not found]   ` <20160112103913-mutt-send-email-mst@redhat.com>
2016-01-12  9:51     ` Peter Zijlstra

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160126061211.GK4503@linux.vnet.ibm.com \
    --to=paulmck@linux.vnet.ibm.com \
    --cc=Leonid.Yegoshin@imgtec.com \
    --cc=adi-buildroot-devel@lists.sourceforge.net \
    --cc=arnd@arndb.de \
    --cc=ddaney.cavm@gmail.com \
    --cc=hpa@zytor.com \
    --cc=james.hogan@imgtec.com \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-metag@vger.kernel. \
    --cc=linux-mips@linux-mips.org \
    --cc=linux-s390@vger.kernel.org \
    --cc=linux-sh@vger.kernel.org \
    --cc=linux-xtensa@linux-xtensa.org \
    --cc=linux@arm.linux.org.uk \
    --cc=mingo@elte.hu \
    --cc=mingo@kernel.org \
    --cc=mpe@ellerman.id.au \
    --cc=mst@redhat.com \
    --cc=peterz@infradead.org \
    --cc=sparclinux@vger.kernel.org \
    --cc=stefano.stabellini@eu.citrix.com \
    --cc=tglx@linutronix.de \
    --cc=user-mode-linux-devel@lists.sourceforge.net \
    --cc=virtualization@lists.linux-foundation.org \
    --cc=will.deacon@arm.com \
    --cc=x86@kernel.org \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).