From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: linux-mips@linux-mips.org, linux-ia64@vger.kernel.org,
	"Michael S. Tsirkin" <mst@redhat.com>,
	Will Deacon <will.deacon@arm.com>,
	virtualization@lists.linux-foundation.org,
	"H. Peter Anvin" <hpa@zytor.com>,
	sparclinux@vger.kernel.org, Ingo Molnar <mingo@kernel.org>,
	linux-arch@vger.kernel.org, linux-s390@vger.kernel.org,
	Russell King - ARM Linux <linux@arm.linux.org.uk>,
	user-mode-linux-devel@lists.sourceforge.net,
	linux-sh@vger.kernel.org, Michael Ellerman <mpe@ellerman.id.au>,
	x86@kernel.org, xen-devel@lists.xenproject.org,
	Ingo Molnar <mingo@elte.hu>,
	linux-xtensa@linux-xtensa.org, james.hogan@imgtec.com,
	Arnd Bergmann <arnd@arndb.de>,
	Stefano Stabellini <stefano.stabellini@eu.citrix.com>,
	adi-buildroot-devel@lists.sourceforge.net,
	Leonid Yegoshin <Leonid.Yegoshin@imgtec.com>,
	ddaney.cavm@gmail.com, Thomas Gleixner <tglx@linutronix.de>,
	linux-metag@vger.kernel.org
Subject: Re: [v3,11/41] mips: reuse asm-generic/barrier.h
Date: Fri, 15 Jan 2016 09:39:12 -0800	[thread overview]
Message-ID: <20160115173912.GU3818@linux.vnet.ibm.com> (raw)
In-Reply-To: <20160115085554.GF3421@worktop>
On Fri, Jan 15, 2016 at 09:55:54AM +0100, Peter Zijlstra wrote:
> On Thu, Jan 14, 2016 at 01:29:13PM -0800, Paul E. McKenney wrote:
> > So smp_mb() provides transitivity, as do pairs of smp_store_release()
> > and smp_read_acquire(), 
> 
> But they provide different grades of transitivity, which is where all
> the confusion lays.
> 
> smp_mb() is strongly/globally transitive, all CPUs will agree on the order.
> 
> Whereas the RCpc release+acquire is weakly so, only the two cpus
> involved in the handover will agree on the order.
Good point!
Using grace periods in place of smp_mb() also provides strong/global
transitivity, but also insanely high latencies.  ;-)
The patch below updates Documentation/memory-barriers.txt to define
local vs. global transitivity.  The corresponding ppcmem litmus test
is included below as well.
Should we start putting litmus tests for the various examples
somewhere, perhaps in a litmus-tests directory within each participating
architecture?  I have a pile of powerpc-related litmus tests on my laptop,
but they probably aren't doing all that much good there.
							Thanx, Paul
------------------------------------------------------------------------
PPC local-transitive
""
{
0:r1=1; 0:r2=u; 0:r3=v; 0:r4=x; 0:r5=y; 0:r6=z;
1:r1=1; 1:r2=u; 1:r3=v; 1:r4=x; 1:r5=y; 1:r6=z;
2:r1=1; 2:r2=u; 2:r3=v; 2:r4=x; 2:r5=y; 2:r6=z;
3:r1=1; 3:r2=u; 3:r3=v; 3:r4=x; 3:r5=y; 3:r6=z;
}
 P0           | P1           | P2           | P3           ;
 lwz r9,0(r4) | lwz r9,0(r5) | lwz r9,0(r6) | stw r1,0(r3) ;
 lwsync       | lwsync       | lwsync       | sync         ;
 stw r1,0(r2) | lwz r8,0(r3) | stw r1,0(r7) | lwz r9,0(r2) ;
 lwsync       | lwz r7,0(r2) |              |              ;
 stw r1,0(r5) | lwsync       |              |              ;
              | stw r1,0(r6) |              |              ;
exists
(* (0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r8=0 /\ 3:r9=0) *)
(* (0:r9=1 /\ 1:r9=1 /\ 2:r9=1) *)
(* (0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r7=0) *)
(0:r9=0 /\ 1:r9=1 /\ 2:r9=1 /\ 1:r7=0)
------------------------------------------------------------------------
commit 2cb4e83a1b5c89c8e39b8a64bd89269d05913e41
Author: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Date:   Fri Jan 15 09:30:42 2016 -0800
    documentation: Distinguish between local and global transitivity
    
    The introduction of smp_load_acquire() and smp_store_release() had
    the side effect of introducing a weaker notion of transitivity:
    The transitivity of full smp_mb() barriers is global, but that
    of smp_store_release()/smp_load_acquire() chains is local.  This
    commit therefore introduces the notion of local transitivity and
    gives an example.
    
    Reported-by: Peter Zijlstra <peterz@infradead.org>
    Reported-by: Will Deacon <will.deacon@arm.com>
    Signed-off-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
diff --git a/Documentation/memory-barriers.txt b/Documentation/memory-barriers.txt
index c66ba46d8079..d8109ed99342 100644
--- a/Documentation/memory-barriers.txt
+++ b/Documentation/memory-barriers.txt
@@ -1318,8 +1318,82 @@ or a level of cache, CPU 2 might have early access to CPU 1's writes.
 General barriers are therefore required to ensure that all CPUs agree
 on the combined order of CPU 1's and CPU 2's accesses.
 
-To reiterate, if your code requires transitivity, use general barriers
-throughout.
+General barriers provide "global transitivity", so that all CPUs will
+agree on the order of operations.  In contrast, a chain of release-acquire
+pairs provides only "local transitivity", so that only those CPUs on
+the chain are guaranteed to agree on the combined order of the accesses.
+For example, switching to C code in deference to Herman Hollerith:
+
+	int u, v, x, y, z;
+
+	void cpu0(void)
+	{
+		r0 = smp_load_acquire(&x);
+		WRITE_ONCE(u, 1);
+		smp_store_release(&y, 1);
+	}
+
+	void cpu1(void)
+	{
+		r1 = smp_load_acquire(&y);
+		r4 = READ_ONCE(v);
+		r5 = READ_ONCE(u);
+		smp_store_release(&z, 1);
+	}
+
+	void cpu2(void)
+	{
+		r2 = smp_load_acquire(&z);
+		smp_store_release(&x, 1);
+	}
+
+	void cpu3(void)
+	{
+		WRITE_ONCE(v, 1);
+		smp_mb();
+		r3 = READ_ONCE(u);
+	}
+
+Because cpu0(), cpu1(), and cpu2() participate in a local transitive
+chain of smp_store_release()/smp_load_acquire() pairs, the following
+outcome is prohibited:
+
+	r0 == 1 && r1 == 1 && r2 == 1
+
+Furthermore, because of the release-acquire relationship between cpu0()
+and cpu1(), cpu1() must see cpu0()'s writes, so that the following
+outcome is prohibited:
+
+	r1 == 1 && r5 == 0
+
+However, the transitivity of release-acquire is local to the participating
+CPUs and does not apply to cpu3().  Therefore, the following outcome
+is possible:
+
+	r0 == 0 && r1 == 1 && r2 == 1 && r3 == 0 && r4 == 0
+
+Although cpu0(), cpu1(), and cpu2() will see their respective reads and
+writes in order, CPUs not involved in the release-acquire chain might
+well disagree on the order.  This disagreement stems from the fact that
+the weak memory-barrier instructions used to implement smp_load_acquire()
+and smp_store_release() are not required to order prior stores against
+subsequent loads in all cases.  This means that cpu3() can see cpu0()'s
+store to u as happening -after- cpu1()'s load from v, even though
+both cpu0() and cpu1() agree that these two operations occurred in the
+intended order.
+
+However, please keep in mind that smp_load_acquire() is not magic.
+In particular, it simply reads from its argument with ordering.  It does
+-not- ensure that any particular value will be read.  Therefore, the
+following outcome is possible:
+
+	r0 == 0 && r1 == 0 && r2 == 0 && r5 == 0
+
+Note that this outcome can happen even on a mythical sequentially
+consistent system where nothing is ever reordered.
+
+To reiterate, if your code requires global transitivity, use general
+barriers throughout.
 
 
 ========================
next prev parent reply	other threads:[~2016-01-15 17:39 UTC|newest]
Thread overview: 106+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <1452426622-4471-12-git-send-email-mst@redhat.com>
2016-01-12  1:14 ` [v3,11/41] mips: reuse asm-generic/barrier.h Leonid Yegoshin
     [not found] ` <56945366.2090504@imgtec.com>
2016-01-12  8:43   ` Michael S. Tsirkin
2016-01-12  9:27   ` Peter Zijlstra
2016-01-12 10:25     ` Peter Zijlstra
     [not found]     ` <20160112102555.GV6373@twins.programming.kicks-ass.net>
2016-01-12 10:40       ` Peter Zijlstra
2016-01-12 11:41         ` Will Deacon
2016-01-12 20:45           ` Leonid Yegoshin
     [not found]           ` <569565DA.2010903@imgtec.com>
2016-01-12 21:40             ` Peter Zijlstra
     [not found]             ` <20160112214003.GQ6357@twins.programming.kicks-ass.net>
2016-01-13  0:21               ` Leonid Yegoshin
2016-01-13 10:45             ` Will Deacon
     [not found]             ` <20160113104516.GE25458@arm.com>
2016-01-13 19:02               ` Leonid Yegoshin
     [not found]               ` <56969F4B.7070001@imgtec.com>
2016-01-13 20:48                 ` Peter Zijlstra
2016-01-13 20:58                   ` Leonid Yegoshin
2016-01-14 12:04                     ` Will Deacon
     [not found]                     ` <20160114120445.GB15828@arm.com>
2016-01-14 16:16                       ` Paul E. McKenney
2016-01-14 19:42                         ` Leonid Yegoshin
2016-01-14 20:15                           ` Peter Zijlstra
     [not found]                           ` <20160114201513.GI6357@twins.programming.kicks-ass.net>
2016-01-14 20:36                             ` Paul E. McKenney
2016-01-14 20:46                             ` Peter Zijlstra
2016-01-14 20:46                             ` Leonid Yegoshin
2016-01-14 21:34                               ` Paul E. McKenney
     [not found]                               ` <20160114213440.GG3818@linux.vnet.ibm.com>
2016-01-14 21:45                                 ` Leonid Yegoshin
2016-01-14 22:24                                   ` Paul E. McKenney
2016-01-14 23:04                                     ` Leonid Yegoshin
2016-01-14 20:12                       ` Leonid Yegoshin
     [not found]                       ` <56980145.5030901@imgtec.com>
2016-01-14 20:48                         ` Paul E. McKenney
2016-01-14 21:24                           ` Leonid Yegoshin
2016-01-14 22:20                             ` Paul E. McKenney
     [not found]                             ` <20160114222046.GH3818@linux.vnet.ibm.com>
2016-01-15  9:57                               ` Will Deacon
2016-01-15 18:54                                 ` Leonid Yegoshin
2016-01-26 10:24                               ` Peter Zijlstra
2016-01-26 10:32                                 ` Peter Zijlstra
2016-01-26 11:09                                   ` Will Deacon
     [not found]                                   ` <20160126110053.GA21553@arm.com>
2016-01-26 20:11                                     ` Paul E. McKenney
2016-01-27  8:35                                       ` [PATCH] documentation: Add disclaimer Peter Zijlstra
2016-01-27 14:57                                       ` David Howells
     [not found]                                       ` <15882.1453906627@warthog.procyon.org.uk>
2016-01-27 23:35                                         ` Paul E. McKenney
2016-01-28 20:02                                         ` David Howells
2016-04-14 21:40                                         ` Paul E. McKenney
     [not found]                                       ` <20160127083546.GJ6357@twins.programming.kicks-ass.net>
2016-01-27 10:11                                         ` Will Deacon
2016-04-14 21:40                                         ` Paul E. McKenney
2016-01-26 19:44                                 ` [v3,11/41] mips: reuse asm-generic/barrier.h Paul E. McKenney
2016-01-18  8:19                           ` Herbert Xu
     [not found]                           ` <20160118081929.GA30420@gondor.apana.org.au>
2016-01-18 15:46                             ` Paul E. McKenney
2016-01-26 16:52                               ` Boqun Feng
     [not found]                               ` <20160126165207.GB6029@fixme-laptop.cn.ibm.com>
2016-01-26 17:22                                 ` Peter Zijlstra
2016-01-26 19:44                                   ` Linus Torvalds
     [not found]                                   ` <CA+55aFzcC6C8imPs5vk4yH1Y2YHjnAdFM9HCkVs04COxuDQH6w@mail.gmail.com>
2016-01-26 20:10                                     ` Paul E. McKenney
     [not found]                                     ` <20160126201037.GU4503@linux.vnet.ibm.com>
2016-01-26 22:15                                       ` Linus Torvalds
     [not found]                                       ` <CA+55aFxjb+2rs2wVHtiSCcOzgMrE8H=yDeNcjyujPQudDCtLgw@mail.gmail.com>
2016-01-26 22:33                                         ` Linus Torvalds
     [not found]                                         ` <CA+55aFwxTJd+uibcxtZD3tGnj_n=LMwyAa0s8qyx_OF0OMWQkA@mail.gmail.com>
2016-01-26 23:29                                           ` Paul E. McKenney
2016-01-26 23:45                                             ` Linus Torvalds
     [not found]                                             ` <CA+55aFyD94yaA1QzXgfeO18T-czY3TVUi5n6r-9jOUObDeR-zQ@mail.gmail.com>
2016-01-27  0:57                                               ` Paul E. McKenney
2016-01-27  2:04                                             ` Boqun Feng
     [not found]                                             ` <20160127020447.GA1293@fixme-laptop.cn.ibm.com>
2016-01-27 23:30                                               ` Paul E. McKenney
2016-01-27  7:51                                           ` Peter Zijlstra
2016-01-27  8:05                                             ` Linus Torvalds
2016-01-26 19:51                                 ` Paul E. McKenney
2016-01-13 22:26               ` Leonid Yegoshin
     [not found]               ` <5696CF08.8080700@imgtec.com>
2016-01-14  9:24                 ` Michael S. Tsirkin
2016-01-14 12:14                 ` Will Deacon
     [not found]                 ` <20160114121449.GC15828@arm.com>
2016-01-14 19:28                   ` Leonid Yegoshin
     [not found]                   ` <5697F6D2.60409@imgtec.com>
2016-01-14 20:34                     ` Paul E. McKenney
2016-01-14 21:01                       ` Leonid Yegoshin
2016-01-14 21:29                         ` Paul E. McKenney
2016-01-14 21:36                           ` Leonid Yegoshin
2016-01-14 22:55                             ` Paul E. McKenney
2016-01-14 23:33                               ` Leonid Yegoshin
2016-01-15  0:47                                 ` Paul E. McKenney
2016-01-15  1:07                                   ` Leonid Yegoshin
2016-01-27 11:26                                     ` Maciej W. Rozycki
     [not found]                                     ` <alpine.DEB.2.00.1601271116520.5958@tp.orcam.me.uk>
2016-01-28  0:48                                       ` Leonid Yegoshin
2016-01-29 13:38                                         ` Maciej W. Rozycki
2016-01-28  0:58                                       ` Leonid Yegoshin
2016-01-27 10:40                                   ` Ralf Baechle
     [not found]                                   ` <20160127104032.GB2939@linux-mips.org>
2016-01-27 12:09                                     ` Maciej W. Rozycki
2016-01-15 10:24                               ` Will Deacon
2016-01-15 17:54                                 ` Paul E. McKenney
2016-01-15 19:28                                   ` Paul E. McKenney
2016-01-25 14:41                                     ` Will Deacon
     [not found]                                     ` <20160125144133.GB22927@arm.com>
2016-01-26  1:06                                       ` Paul E. McKenney
     [not found]                                       ` <20160126010646.GH4503@linux.vnet.ibm.com>
2016-01-26 12:10                                         ` Will Deacon
     [not found]                                         ` <20160126121010.GD21553@arm.com>
2016-01-26 23:37                                           ` Paul E. McKenney
2016-01-27 10:23                                             ` Will Deacon
2016-01-15  8:55                           ` Peter Zijlstra
     [not found]                           ` <20160115085554.GF3421@worktop>
2016-01-15  9:13                             ` Peter Zijlstra
2016-01-15 17:46                               ` Paul E. McKenney
2016-01-15 21:27                                 ` Peter Zijlstra
2016-01-15 21:58                                   ` Paul E. McKenney
     [not found]                                   ` <20160115215853.GC3818@linux.vnet.ibm.com>
2016-01-25 16:42                                     ` Will Deacon
2016-01-26  6:03                                       ` Paul E. McKenney
2016-01-26 10:19                                         ` Peter Zijlstra
2016-01-26 12:16                                         ` Will Deacon
2016-01-26 14:35                                           ` Boqun Feng
2016-01-26 19:58                                           ` Paul E. McKenney
     [not found]                                           ` <20160126195820.GS4503@linux.vnet.ibm.com>
2016-01-27 10:25                                             ` Will Deacon
2016-01-27 23:32                                               ` Paul E. McKenney
     [not found]                                         ` <20160126101927.GD6357@twins.programming.kicks-ass.net>
2016-01-26 20:13                                           ` Paul E. McKenney
2016-01-27  8:39                                             ` Peter Zijlstra
2016-01-15 17:39                             ` Paul E. McKenney [this message]
2016-01-15 21:29                               ` Peter Zijlstra
2016-01-15 22:01                                 ` Paul E. McKenney
2016-01-25 18:02                               ` Will Deacon
     [not found]                               ` <20160125180234.GA26732@arm.com>
2016-01-26  6:12                                 ` Paul E. McKenney
2016-01-26 10:15                                   ` Peter Zijlstra
     [not found]   ` <20160112103913-mutt-send-email-mst@redhat.com>
2016-01-12  9:51     ` Peter Zijlstra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox
  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):
  git send-email \
    --in-reply-to=20160115173912.GU3818@linux.vnet.ibm.com \
    --to=paulmck@linux.vnet.ibm.com \
    --cc=Leonid.Yegoshin@imgtec.com \
    --cc=adi-buildroot-devel@lists.sourceforge.net \
    --cc=arnd@arndb.de \
    --cc=ddaney.cavm@gmail.com \
    --cc=hpa@zytor.com \
    --cc=james.hogan@imgtec.com \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-metag@vger.kernel.org \
    --cc=linux-mips@linux-mips.org \
    --cc=linux-s390@vger.kernel.org \
    --cc=linux-sh@vger.kernel.org \
    --cc=linux-xtensa@linux-xtensa.org \
    --cc=linux@arm.linux.org.uk \
    --cc=mingo@elte.hu \
    --cc=mingo@kernel.org \
    --cc=mpe@ellerman.id.au \
    --cc=mst@redhat.com \
    --cc=peterz@infradead.org \
    --cc=sparclinux@vger.kernel.org \
    --cc=stefano.stabellini@eu.citrix.com \
    --cc=tglx@linutronix.de \
    --cc=user-mode-linux-devel@lists.sourceforge.net \
    --cc=virtualization@lists.linux-foundation.org \
    --cc=will.deacon@arm.com \
    --cc=x86@kernel.org \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY
  https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
  Be sure your reply has a Subject: header at the top and a blank line
  before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).