Re: bit fields && data tearing

linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed

From: Peter Hurley <peter@hurleysoftware.com>
To: James Bottomley <James.Bottomley@HansenPartnership.com>
Cc: Jakub Jelinek <jakub@redhat.com>,
	One Thousand Gnomes <gnomes@lxorguk.ukuu.org.uk>,
	linux-arch@vger.kernel.org, linux-ia64@vger.kernel.org,
	Mikael Pettersson <mikpelinux@gmail.com>,
	Oleg Nesterov <oleg@redhat.com>,
	linux-kernel@vger.kernel.org, Tony Luck <tony.luck@intel.com>,
	Paul Mackerras <paulus@samba.org>,
	"H. Peter Anvin" <hpa@zytor.com>,
	paulmck@linux.vnet.ibm.com, linuxppc-dev@lists.ozlabs.org,
	Miroslav Franc <mfranc@redhat.com>,
	Richard Henderson <rth@twiddle.net>
Subject: Re: bit fields && data tearing
Date: Wed, 10 Sep 2014 19:50:06 -0400	[thread overview]
Message-ID: <5410E3AE.8020609@hurleysoftware.com> (raw)
In-Reply-To: <1410385686.28237.5.camel@jarvis>

On 09/10/2014 05:48 PM, James Bottomley wrote:
> On Tue, 2014-09-09 at 06:40 -0400, Peter Hurley wrote:
>> On 09/08/2014 10:56 PM, James Bottomley wrote:
>>> On Mon, 2014-09-08 at 19:30 -0400, Peter Hurley wrote:
>>>> On 09/08/2014 01:50 AM, James Bottomley wrote:
>>>>>> But additionally, even if gcc combines adjacent writes _that are part
>>>>>> of the program flow_ then I believe the situation is no worse than
>>>>>> would otherwise exist.
>>>>>>
>>>>>> For instance, given the following:
>>>>>>
>>>>>> struct x {
>>>>>> 	spinlock_t lock;
>>>>>> 	long a;
>>>>>> 	byte b;
>>>>>> 	byte c;
>>>>>> };
>>>>>>
>>>>>> void locked_store_b(struct x *p)
>>>>>> {
>>>>>> 	spin_lock(&p->lock);
>>>>>> 	p->b = 1;
>>>>>> 	spin_unlock(&p->lock);
>>>>>> 	p->c = 2;
>>>>>> }
>>>>>>
>>>>>> Granted, the author probably expects ordered writes of
>>>>>> 	STORE B
>>>>>> 	STORE C
>>>>>> but that's not guaranteed because there is no memory barrier
>>>>>> ordering B before C.
>>>>>
>>>>> Yes, there is: loads and stores may not migrate into or out of critical
>>>>> sections.
>>>>
>>>> That's a common misconception.
>>>>
>>>> The processor is free to re-order this to:
>>>>
>>>> 	STORE C
>>>> 	STORE B
>>>> 	UNLOCK
>>>>
>>>> That's because the unlock() only guarantees that:
>>>>
>>>> Stores before the unlock in program order are guaranteed to complete
>>>> before the unlock completes. Stores after the unlock _may_ complete
>>>> before the unlock completes.
>>>>
>>>> My point was that even if compiler barriers had the same semantics
>>>> as memory barriers, the situation would be no worse. That is, code
>>>> that is sensitive to memory barriers (like the example I gave above)
>>>> would merely have the same fragility with one-way compiler barriers
>>>> (with respect to the compiler combining writes).
>>>>
>>>> That's what I meant by "no worse than would otherwise exist".
>>>
>>> Actually, that's not correct.  This is actually deja vu with me on the
>>> other side of the argument.  When we first did spinlocks on PA, I argued
>>> as you did: lock only a barrier for code after and unlock for code
>>> before.  The failing case is that you can have a critical section which
>>> performs an atomically required operation and a following unit which
>>> depends on it being performed.  If you begin the following unit before
>>> the atomic requirement, you may end up losing.  It turns out this kind
>>> of pattern is inherent in a lot of mail box device drivers: you need to
>>> set up the mailbox atomically then poke it.  Setup is usually atomic,
>>> deciding which mailbox to prime and actually poking it is in the
>>> following unit.  Priming often involves an I/O bus transaction and if
>>> you poke before priming, you get a misfire.
>>
>> Take it up with the man because this was discussed extensively last
>> year and it was decided that unlocks would not be full barriers.
>> Thus the changes to memory-barriers.txt that explicitly note this
>> and the addition of smp_mb__after_unlock_lock() (for two different
>> locks; an unlock followed by a lock on the same lock is a full barrier).
>>
>> Code that expects ordered writes after an unlock needs to explicitly
>> add the memory barrier.
> 
> I don't really care what ARM does; spin locks are full barriers on
> architectures that need them.  The driver problem we had that detected
> our semi permeable spinlocks was an LSI 53c875 which is enterprise class
> PCI, so presumably not relevant to ARM anyway.

Almost certainly ia64 arch_spin_unlock() is not a full barrier.

next prev parent reply	other threads:[~2014-09-10 23:50 UTC|newest]

Thread overview: 103+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-12 18:13 bit fields && data tearing Oleg Nesterov
2014-07-12 20:51 ` Oleg Nesterov
2014-07-12 23:34   ` Benjamin Herrenschmidt
2014-07-13 12:29     ` Oleg Nesterov
2014-07-13 13:15     ` Peter Hurley
2014-07-13 22:25       ` Benjamin Herrenschmidt
2014-07-15 13:54         ` Peter Hurley
2014-07-15 15:02           ` Richard Henderson
2014-09-03 22:51 ` Peter Hurley
2014-09-03 23:11   ` Benjamin Herrenschmidt
2014-09-04  8:43     ` David Laight
2014-09-04  9:52       ` Benjamin Herrenschmidt
2014-09-04 22:14         ` H. Peter Anvin
2014-09-05  0:59           ` Peter Hurley
2014-09-05  2:08             ` H. Peter Anvin
2014-09-05 15:31               ` Peter Hurley
2014-09-05 15:41                 ` H. Peter Anvin
2014-09-08 17:52                   ` One Thousand Gnomes
2014-09-08 17:59                     ` H. Peter Anvin
2014-09-08 19:17                       ` One Thousand Gnomes
2014-09-09 11:18                         ` Peter Hurley
2014-09-08 22:47                       ` Peter Hurley
2014-09-09  1:59                         ` Paul E. McKenney
2014-09-09 11:14                         ` Peter Hurley
2014-09-11 10:04                         ` One Thousand Gnomes
2014-09-11 16:16                           ` Paul E. McKenney
2014-09-11 20:01                           ` Peter Hurley
2014-09-14 23:24                             ` One Thousand Gnomes
2014-09-22 19:51                               ` Paul E. McKenney
2014-09-23 18:19                               ` Peter Hurley
2014-09-23 18:39                                 ` One Thousand Gnomes
2014-09-08 18:13                     ` James Bottomley
2014-09-10 20:18                     ` H. Peter Anvin
2014-09-10 21:10                       ` Rob Landley
2014-09-05  2:08             ` H. Peter Anvin
2014-09-05  8:16               ` Michael Cree
2014-09-05 18:09                 ` Paul E. McKenney
2014-09-05 18:31                   ` Paul E. McKenney
2014-09-05 19:52                     ` Peter Zijlstra
2014-09-05 20:01                       ` Peter Hurley
2014-09-05 20:12                         ` Peter Zijlstra
2014-09-05 20:15                           ` H. Peter Anvin
2014-09-05 20:19                         ` Paul E. McKenney
2014-09-05 18:50                   ` Peter Hurley
2014-09-05 19:05                     ` Paul E. McKenney
2014-09-05 19:24                       ` Peter Hurley
2014-09-05 20:09                         ` Paul E. McKenney
2014-09-05 19:38                       ` Marc Gauthier
2014-09-05 20:14                         ` Peter Hurley
2014-09-05 20:34                           ` H. Peter Anvin
2014-09-05 20:42                             ` Michael Cree
2014-09-05 20:43                             ` Paul E. McKenney
2014-09-05 20:48                               ` Thomas Gleixner
2014-09-05 21:05                                 ` Paul E. McKenney
2014-09-05 20:39                           ` Michael Cree
2014-09-05 21:12                             ` Peter Hurley
2014-09-05 21:27                               ` Michael Cree
2014-09-05 20:42                           ` Paul E. McKenney
2014-09-04  8:57     ` Mikael Pettersson
2014-09-04  9:09       ` Jakub Jelinek
2014-09-04 12:24         ` Peter Hurley
2014-09-04 12:29           ` Jakub Jelinek
2014-09-04 16:50           ` One Thousand Gnomes
2014-09-04 19:42             ` Peter Hurley
2014-09-04 22:16               ` H. Peter Anvin
2014-09-05  0:17                 ` Paul E. McKenney
2014-09-05  1:57                   ` Peter Hurley
2014-09-05  2:11                   ` James Bottomley
2014-09-05  2:47                     ` Peter Hurley
2014-09-05  4:06                       ` Paul E. McKenney
2014-09-05  8:30                         ` David Laight
2014-09-05 12:31                           ` Peter Hurley
2014-09-05 12:37                             ` David Laight
2014-09-05 16:17                               ` Peter Hurley
2014-09-25 16:12                                 ` Pavel Machek
2014-09-07  5:07                         ` James Bottomley
2014-09-07 16:21                           ` Paul E. McKenney
2014-09-07 19:04                             ` James Bottomley
2014-09-07 20:41                               ` Peter Hurley
2014-09-08  5:50                                 ` James Bottomley
2014-09-08 20:45                                   ` Chris Metcalf
2014-09-08 22:43                                     ` James Bottomley
2014-09-09  2:27                                       ` H. Peter Anvin
2014-09-09  8:11                                         ` Arnd Bergmann
2014-09-08 23:30                                   ` Peter Hurley
2014-09-09  2:56                                     ` James Bottomley
2014-09-09  3:20                                       ` H. Peter Anvin
2014-09-09  4:30                                       ` H. Peter Anvin
2014-09-09 10:40                                       ` Peter Hurley
2014-09-10 21:48                                         ` James Bottomley
2014-09-10 23:50                                           ` Peter Hurley [this message]
2014-09-11 10:23                                           ` Will Deacon
2014-09-07 23:00                               ` Paul E. McKenney
2014-09-07 23:17                                 ` H. Peter Anvin
2014-09-07 23:36                                   ` Paul E. McKenney
2014-09-07 23:39                                     ` H. Peter Anvin
2014-09-08  5:56                                       ` James Bottomley
2014-09-08 18:12                                         ` H. Peter Anvin
2014-09-08 19:09                                           ` James Bottomley
2014-09-08 19:12                                             ` H. Peter Anvin
2014-09-08 22:39                                               ` James Bottomley
2014-09-09  2:30                                                 ` H. Peter Anvin
2014-09-08 19:12                                             ` H. Peter Anvin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5410E3AE.8020609@hurleysoftware.com \
    --to=peter@hurleysoftware.com \
    --cc=James.Bottomley@HansenPartnership.com \
    --cc=gnomes@lxorguk.ukuu.org.uk \
    --cc=hpa@zytor.com \
    --cc=jakub@redhat.com \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linuxppc-dev@lists.ozlabs.org \
    --cc=mfranc@redhat.com \
    --cc=mikpelinux@gmail.com \
    --cc=oleg@redhat.com \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=paulus@samba.org \
    --cc=rth@twiddle.net \
    --cc=tony.luck@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).