From mboxrd@z Thu Jan 1 00:00:00 1970 From: will.deacon@arm.com (Will Deacon) Date: Thu, 14 Jan 2016 12:14:50 +0000 Subject: [v3,11/41] mips: reuse asm-generic/barrier.h In-Reply-To: <5696CF08.8080700@imgtec.com> References: <1452426622-4471-12-git-send-email-mst@redhat.com> <56945366.2090504@imgtec.com> <20160112092711.GP6344@twins.programming.kicks-ass.net> <20160112102555.GV6373@twins.programming.kicks-ass.net> <20160112104012.GW6373@twins.programming.kicks-ass.net> <20160112114111.GB15737@arm.com> <569565DA.2010903@imgtec.com> <20160113104516.GE25458@arm.com> <5696CF08.8080700@imgtec.com> Message-ID: <20160114121449.GC15828@arm.com> To: linux-arm-kernel@lists.infradead.org List-Id: linux-arm-kernel.lists.infradead.org On Wed, Jan 13, 2016 at 02:26:16PM -0800, Leonid Yegoshin wrote: > On 01/13/2016 02:45 AM, Will Deacon wrote: > >> > >I don't think the address dependency is enough on its own. By that > >reasoning, the following variant (WRC+addr+addr) would work too: > > > > > >P0: > >Wx = 1 > > > >P1: > >Rx == 1 > >
> >Wy = 1 > > > >P2: > >Ry == 1 > > > >Rx = 0 > > > > > >So are you saying that this is also forbidden? > >Imagine that P0 and P1 are two threads that share a store buffer. What > >then? > > OK, I collected answers and it is: > > In MIPS R6 this test passes OK, I mean - P2: Rx = 1 if Ry is read as 1. > By design. > > However, it is unclear that happens in MIPS R2 1004K. How can it be unclear? If, for example, the outcome is permitted on that CPU, then your original reasoning for the WRC+sync+addr doesn't apply there and SYNC is not transitive. That's what I'm trying to get to the bottom of. Does the MIPS kernel target a particular CPU at compile time? > Moreover, there are voices against guarantee that it will be in future > and that voices point me to Documentation/memory-barriers.txt section "DATA > DEPENDENCY BARRIERS" examples which require SYNC_RMB between loading > address/index and using that for loading data based on that address or index > for shared data (look on CPU2 pseudo-code): > >To deal with this, a data dependency barrier or better must be inserted > >between the address load and the data load: > > > > CPU 1 CPU 2 > > =============== =============== > > { A == 1, B == 2, C = 3, P == &A, Q == &C } > > B = 4; > >