linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: "H. Peter Anvin" <hpa@zytor.com>
Cc: Jakub Jelinek <jakub@redhat.com>,
	One Thousand Gnomes <gnomes@lxorguk.ukuu.org.uk>,
	Tony Luck <tony.luck@intel.com>,
	linux-ia64@vger.kernel.org,
	Peter Hurley <peter@hurleysoftware.com>,
	Mikael Pettersson <mikpelinux@gmail.com>,
	Oleg Nesterov <oleg@redhat.com>,
	linux-kernel@vger.kernel.org,
	James Bottomley <James.Bottomley@HansenPartnership.com>,
	Paul Mackerras <paulus@samba.org>,
	linux-arch@vger.kernel.org, linuxppc-dev@lists.ozlabs.org,
	Miroslav Franc <mfranc@redhat.com>,
	Richard Henderson <rth@twiddle.net>
Subject: Re: bit fields && data tearing
Date: Sun, 7 Sep 2014 16:36:55 -0700	[thread overview]
Message-ID: <20140907233655.GR5001@linux.vnet.ibm.com> (raw)
In-Reply-To: <6092b453-e0c9-4f6d-922b-48bce988f774@email.android.com>

On Sun, Sep 07, 2014 at 04:17:30PM -0700, H. Peter Anvin wrote:
> I'm confused why storing 0x0102 would be a problem.  I think gcc does that even on other cpus.
> 
> More atomicity can't hurt, can it?

I must defer to James for any additional details on why PARISC systems
don't provide atomicity for partially overlapping stores.  ;-)

							Thanx, Paul

> On September 7, 2014 4:00:19 PM PDT, "Paul E. McKenney" <paulmck@linux.vnet.ibm.com> wrote:
> >On Sun, Sep 07, 2014 at 12:04:47PM -0700, James Bottomley wrote:
> >> On Sun, 2014-09-07 at 09:21 -0700, Paul E. McKenney wrote:
> >> > On Sat, Sep 06, 2014 at 10:07:22PM -0700, James Bottomley wrote:
> >> > > On Thu, 2014-09-04 at 21:06 -0700, Paul E. McKenney wrote:
> >> > > > On Thu, Sep 04, 2014 at 10:47:24PM -0400, Peter Hurley wrote:
> >> > > > > Hi James,
> >> > > > > 
> >> > > > > On 09/04/2014 10:11 PM, James Bottomley wrote:
> >> > > > > > On Thu, 2014-09-04 at 17:17 -0700, Paul E. McKenney wrote:
> >> > > > > >> +And there are anti-guarantees:
> >> > > > > >> +
> >> > > > > >> + (*) These guarantees do not apply to bitfields, because
> >compilers often
> >> > > > > >> +     generate code to modify these using non-atomic
> >read-modify-write
> >> > > > > >> +     sequences.  Do not attempt to use bitfields to
> >synchronize parallel
> >> > > > > >> +     algorithms.
> >> > > > > >> +
> >> > > > > >> + (*) Even in cases where bitfields are protected by
> >locks, all fields
> >> > > > > >> +     in a given bitfield must be protected by one lock. 
> >If two fields
> >> > > > > >> +     in a given bitfield are protected by different
> >locks, the compiler's
> >> > > > > >> +     non-atomic read-modify-write sequences can cause an
> >update to one
> >> > > > > >> +     field to corrupt the value of an adjacent field.
> >> > > > > >> +
> >> > > > > >> + (*) These guarantees apply only to properly aligned and
> >sized scalar
> >> > > > > >> +     variables.  "Properly sized" currently means "int"
> >and "long",
> >> > > > > >> +     because some CPU families do not support loads and
> >stores of
> >> > > > > >> +     other sizes.  ("Some CPU families" is currently
> >believed to
> >> > > > > >> +     be only Alpha 21064.  If this is actually the case,
> >a different
> >> > > > > >> +     non-guarantee is likely to be formulated.)
> >> > > > > > 
> >> > > > > > This is a bit unclear.  Presumably you're talking about
> >definiteness of
> >> > > > > > the outcome (as in what's seen after multiple stores to the
> >same
> >> > > > > > variable).
> >> > > > > 
> >> > > > > No, the last conditions refers to adjacent byte stores from
> >different
> >> > > > > cpu contexts (either interrupt or SMP).
> >> > > > > 
> >> > > > > > The guarantees are only for natural width on Parisc as
> >well,
> >> > > > > > so you would get a mess if you did byte stores to adjacent
> >memory
> >> > > > > > locations.
> >> > > > > 
> >> > > > > For a simple test like:
> >> > > > > 
> >> > > > > struct x {
> >> > > > > 	long a;
> >> > > > > 	char b;
> >> > > > > 	char c;
> >> > > > > 	char d;
> >> > > > > 	char e;
> >> > > > > };
> >> > > > > 
> >> > > > > void store_bc(struct x *p) {
> >> > > > > 	p->b = 1;
> >> > > > > 	p->c = 2;
> >> > > > > }
> >> > > > > 
> >> > > > > on parisc, gcc generates separate byte stores
> >> > > > > 
> >> > > > > void store_bc(struct x *p) {
> >> > > > >    0:	34 1c 00 02 	ldi 1,ret0
> >> > > > >    4:	0f 5c 12 08 	stb ret0,4(r26)
> >> > > > >    8:	34 1c 00 04 	ldi 2,ret0
> >> > > > >    c:	e8 40 c0 00 	bv r0(rp)
> >> > > > >   10:	0f 5c 12 0a 	stb ret0,5(r26)
> >> > > > > 
> >> > > > > which appears to confirm that on parisc adjacent byte data
> >> > > > > is safe from corruption by concurrent cpu updates; that is,
> >> > > > > 
> >> > > > > CPU 0                | CPU 1
> >> > > > >                      |
> >> > > > > p->b = 1             | p->c = 2
> >> > > > >                      |
> >> > > > > 
> >> > > > > will result in p->b == 1 && p->c == 2 (assume both values
> >> > > > > were 0 before the call to store_bc()).
> >> > > > 
> >> > > > What Peter said.  I would ask for suggestions for better
> >wording, but
> >> > > > I would much rather be able to say that single-byte reads and
> >writes
> >> > > > are atomic and that aligned-short reads and writes are also
> >atomic.
> >> > > > 
> >> > > > Thus far, it looks like we lose only very old Alpha systems, so
> >unless
> >> > > > I hear otherwise, I update my patch to outlaw these very old
> >systems.
> >> > > 
> >> > > This isn't universally true according to the architecture manual.
> > The
> >> > > PARISC CPU can make byte to long word stores atomic against the
> >memory
> >> > > bus but not against the I/O bus for instance.  Atomicity is a
> >property
> >> > > of the underlying substrate, not of the CPU.  Implying that
> >atomicity is
> >> > > a CPU property is incorrect.
> >> > 
> >> > OK, fair point.
> >> > 
> >> > But are there in-use-for-Linux PARISC memory fabrics (for normal
> >memory,
> >> > not I/O) that do not support single-byte and double-byte stores?
> >> 
> >> For aligned access, I believe that's always the case for the memory
> >bus
> >> (on both 32 and 64 bit systems).  However, it only applies to machine
> >> instruction loads and stores of the same width..  If you mix the
> >widths
> >> on the loads and stores, all bets are off.  That means you have to
> >> beware of the gcc penchant for coalescing loads and stores: if it
> >sees
> >> two adjacent byte stores it can coalesce them into a short store
> >> instead ... that screws up the atomicity guarantees.
> >
> >OK, that means that to make PARISC work reliably, we need to use
> >ACCESS_ONCE() for loads and stores that could have racing accesses.
> >If I understand correctly, this will -not- be needed for code guarded
> >by locks, even with Peter's examples.
> >
> >So if we have something like this:
> >
> >	struct foo {
> >		char a;
> >		char b;
> >	};
> >	struct foo *fp;
> >
> >then this code would be bad:
> >
> >	fp->a = 1;
> >	fp->b = 2;
> >
> >The reason is (as you say) that GCC would be happy to store 0x0102
> >(or vice versa, depending on endianness) to the pair.  We instead
> >need:
> >
> >	ACCESS_ONCE(fp->a) = 1;
> >	ACCESS_ONCE(fp->b) = 2;
> >
> >However, if the code is protected by locks, no problem:
> >
> >	struct foo {
> >		spinlock_t lock_a;
> >		spinlock_t lock_b;
> >		char a;
> >		char b;
> >	};
> >
> >Then it is OK to do the following:
> >
> >	spin_lock(fp->lock_a);
> >	fp->a = 1;
> >	spin_unlock(fp->lock_a);
> >	spin_lock(fp->lock_b);
> >	fp->b = 1;
> >	spin_unlock(fp->lock_b);
> >
> >Or even this, assuming ->lock_a precedes ->lock_b in the locking
> >hierarchy:
> >
> >	spin_lock(fp->lock_a);
> >	spin_lock(fp->lock_b);
> >	fp->a = 1;
> >	fp->b = 1;
> >	spin_unlock(fp->lock_a);
> >	spin_unlock(fp->lock_b);
> >
> >Here gcc might merge the assignments to fp->a and fp->b, but that is OK
> >because both locks are held, presumably preventing other assignments or
> >references to fp->a and fp->b.
> >
> >On the other hand, if either fp->a or fp->b are referenced outside of
> >their
> >respective locks, even once, then this last code fragment would still
> >need
> >ACCESS_ONCE() as follows:
> >
> >	spin_lock(fp->lock_a);
> >	spin_lock(fp->lock_b);
> >	ACCESS_ONCE(fp->a) = 1;
> >	ACCESS_ONCE(fp->b) = 1;
> >	spin_unlock(fp->lock_a);
> >	spin_unlock(fp->lock_b);
> >
> >Does that cover it?  If so, I will update memory-barriers.txt
> >accordingly.
> >
> >							Thanx, Paul
> 
> -- 
> Sent from my mobile phone.  Please pardon brevity and lack of formatting.
> 

  reply	other threads:[~2014-09-07 23:37 UTC|newest]

Thread overview: 103+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-12 18:13 bit fields && data tearing Oleg Nesterov
2014-07-12 20:51 ` Oleg Nesterov
2014-07-12 23:34   ` Benjamin Herrenschmidt
2014-07-13 12:29     ` Oleg Nesterov
2014-07-13 13:15     ` Peter Hurley
2014-07-13 22:25       ` Benjamin Herrenschmidt
2014-07-15 13:54         ` Peter Hurley
2014-07-15 15:02           ` Richard Henderson
2014-09-03 22:51 ` Peter Hurley
2014-09-03 23:11   ` Benjamin Herrenschmidt
2014-09-04  8:43     ` David Laight
2014-09-04  9:52       ` Benjamin Herrenschmidt
2014-09-04 22:14         ` H. Peter Anvin
2014-09-05  0:59           ` Peter Hurley
2014-09-05  2:08             ` H. Peter Anvin
2014-09-05 15:31               ` Peter Hurley
2014-09-05 15:41                 ` H. Peter Anvin
2014-09-08 17:52                   ` One Thousand Gnomes
2014-09-08 17:59                     ` H. Peter Anvin
2014-09-08 19:17                       ` One Thousand Gnomes
2014-09-09 11:18                         ` Peter Hurley
2014-09-08 22:47                       ` Peter Hurley
2014-09-09  1:59                         ` Paul E. McKenney
2014-09-09 11:14                         ` Peter Hurley
2014-09-11 10:04                         ` One Thousand Gnomes
2014-09-11 16:16                           ` Paul E. McKenney
2014-09-11 20:01                           ` Peter Hurley
2014-09-14 23:24                             ` One Thousand Gnomes
2014-09-22 19:51                               ` Paul E. McKenney
2014-09-23 18:19                               ` Peter Hurley
2014-09-23 18:39                                 ` One Thousand Gnomes
2014-09-08 18:13                     ` James Bottomley
2014-09-10 20:18                     ` H. Peter Anvin
2014-09-10 21:10                       ` Rob Landley
2014-09-05  2:08             ` H. Peter Anvin
2014-09-05  8:16               ` Michael Cree
2014-09-05 18:09                 ` Paul E. McKenney
2014-09-05 18:31                   ` Paul E. McKenney
2014-09-05 19:52                     ` Peter Zijlstra
2014-09-05 20:01                       ` Peter Hurley
2014-09-05 20:12                         ` Peter Zijlstra
2014-09-05 20:15                           ` H. Peter Anvin
2014-09-05 20:19                         ` Paul E. McKenney
2014-09-05 18:50                   ` Peter Hurley
2014-09-05 19:05                     ` Paul E. McKenney
2014-09-05 19:24                       ` Peter Hurley
2014-09-05 20:09                         ` Paul E. McKenney
2014-09-05 19:38                       ` Marc Gauthier
2014-09-05 20:14                         ` Peter Hurley
2014-09-05 20:34                           ` H. Peter Anvin
2014-09-05 20:42                             ` Michael Cree
2014-09-05 20:43                             ` Paul E. McKenney
2014-09-05 20:48                               ` Thomas Gleixner
2014-09-05 21:05                                 ` Paul E. McKenney
2014-09-05 20:39                           ` Michael Cree
2014-09-05 21:12                             ` Peter Hurley
2014-09-05 21:27                               ` Michael Cree
2014-09-05 20:42                           ` Paul E. McKenney
2014-09-04  8:57     ` Mikael Pettersson
2014-09-04  9:09       ` Jakub Jelinek
2014-09-04 12:24         ` Peter Hurley
2014-09-04 12:29           ` Jakub Jelinek
2014-09-04 16:50           ` One Thousand Gnomes
2014-09-04 19:42             ` Peter Hurley
2014-09-04 22:16               ` H. Peter Anvin
2014-09-05  0:17                 ` Paul E. McKenney
2014-09-05  1:57                   ` Peter Hurley
2014-09-05  2:11                   ` James Bottomley
2014-09-05  2:47                     ` Peter Hurley
2014-09-05  4:06                       ` Paul E. McKenney
2014-09-05  8:30                         ` David Laight
2014-09-05 12:31                           ` Peter Hurley
2014-09-05 12:37                             ` David Laight
2014-09-05 16:17                               ` Peter Hurley
2014-09-25 16:12                                 ` Pavel Machek
2014-09-07  5:07                         ` James Bottomley
2014-09-07 16:21                           ` Paul E. McKenney
2014-09-07 19:04                             ` James Bottomley
2014-09-07 20:41                               ` Peter Hurley
2014-09-08  5:50                                 ` James Bottomley
2014-09-08 20:45                                   ` Chris Metcalf
2014-09-08 22:43                                     ` James Bottomley
2014-09-09  2:27                                       ` H. Peter Anvin
2014-09-09  8:11                                         ` Arnd Bergmann
2014-09-08 23:30                                   ` Peter Hurley
2014-09-09  2:56                                     ` James Bottomley
2014-09-09  3:20                                       ` H. Peter Anvin
2014-09-09  4:30                                       ` H. Peter Anvin
2014-09-09 10:40                                       ` Peter Hurley
2014-09-10 21:48                                         ` James Bottomley
2014-09-10 23:50                                           ` Peter Hurley
2014-09-11 10:23                                           ` Will Deacon
2014-09-07 23:00                               ` Paul E. McKenney
2014-09-07 23:17                                 ` H. Peter Anvin
2014-09-07 23:36                                   ` Paul E. McKenney [this message]
2014-09-07 23:39                                     ` H. Peter Anvin
2014-09-08  5:56                                       ` James Bottomley
2014-09-08 18:12                                         ` H. Peter Anvin
2014-09-08 19:09                                           ` James Bottomley
2014-09-08 19:12                                             ` H. Peter Anvin
2014-09-08 22:39                                               ` James Bottomley
2014-09-09  2:30                                                 ` H. Peter Anvin
2014-09-08 19:12                                             ` H. Peter Anvin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140907233655.GR5001@linux.vnet.ibm.com \
    --to=paulmck@linux.vnet.ibm.com \
    --cc=James.Bottomley@HansenPartnership.com \
    --cc=gnomes@lxorguk.ukuu.org.uk \
    --cc=hpa@zytor.com \
    --cc=jakub@redhat.com \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linuxppc-dev@lists.ozlabs.org \
    --cc=mfranc@redhat.com \
    --cc=mikpelinux@gmail.com \
    --cc=oleg@redhat.com \
    --cc=paulus@samba.org \
    --cc=peter@hurleysoftware.com \
    --cc=rth@twiddle.net \
    --cc=tony.luck@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).