From: Jamie Lokier <jamie@shareable.org>
To: Miklos Szeredi <miklos@szeredi.hu>
Cc: paulmck@linux.vnet.ibm.com, dhowells@redhat.com,
linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org
Subject: Re: memory barrier question
Date: Thu, 16 Sep 2010 17:50:18 +0100 [thread overview]
Message-ID: <20100916165018.GA26539@shareable.org> (raw)
In-Reply-To: <E1OwGz7-0007Cu-CC@pomaz-ex.szeredi.hu>
Miklos Szeredi wrote:
> On Thu, 16 Sep 2010, Paul E. McKenney wrote:
> > On Thu, Sep 16, 2010 at 03:30:56PM +0100, David Howells wrote:
> > > Miklos Szeredi <miklos@szeredi.hu> wrote:
> > >
> > > > Is the rmb() really needed?
> > > >
> > > > Take this code from fs/namei.c for example:
> > > >
> > > > inode = next.dentry->d_inode;
> > > > if (!inode)
> > > > goto out_dput;
> > > >
> > > > if (inode->i_op->follow_link) {
> > > >
> > > > It happily dereferences dentry->d_inode without a barrier after
> > > > checking it for non-null, while that d_inode might have just been
> > > > initialized on another CPU with a freshly created inode. There's
> > > > absolutely no synchornization with that on this side.
> > >
> > > Perhaps it's not necessary; once set, how likely is i_op to be changed once
> > > I_NEW is cleared?
> >
> > Are the path_get()s protecting this?
>
> No, when creating a file the dentry will go from negative to positive
> independently from lookup. The dentry can get instantiated with an
> inode between the path_get() and dereferencing ->d_inode.
>
> >
> > If there is no protection, then something like rcu_dereference() is
> > needed for the assignment from next.dentry->d_inode.
>
> Do I understand correctly that the problem is that a CPU may have a
> stale cache associated with *inode, one that was loaded before the
> write barrier took effect?
>
> Funny that such a bug could stay unnoticed in so often excercised
> code. Yeah I know it's alpha only.
When I first saw read_barrier_depends(), I thought it must be Alpha's
speculative execution, fetching memory out of order and confirming
it's valid later. I was really surprised to find out it's not that -
it's a quirk of the Alpha's cache/forwarding protocol. Others
presumably don't have it because they were designed with awareness of
this coding pattern.
But...
I wonder if it can happen on IA64 with it's funky memory-alias
compiler optimisations.
I wonder if it can happen on x86 and others, if the compiler decides
this is a valid transformation (it is with a single CPU):
Original code:
foo = global_ptr_to_foo;
foo_x = foo->x;
bar = global_ptr_to_bar;
bar_y = bar->y;
// use bar_y;
Transformed by compiler:
foo = global_ptr_to_foo;
foo_x = foo->x;
bar = global_ptr_to_bar;
bar_y = (__typeof__(bar->y))foo_x;
if ((void *)bar != (void *)foo)
bar_y = bar->y;
// use bar_y;
In other words, without a barrier, the compiler doesn't have to order
the executed bar->y dereference *instruction* after the bar =
global_ptr_to_bar instruction. Thus making it a compiler property,
not a CPU one.
There is no danger of dereferencing NULL in that example, but
dereferencing the values from the wrong object is just as wrong.
-- Jamie
next prev parent reply other threads:[~2010-09-16 16:50 UTC|newest]
Thread overview: 41+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-09-15 14:36 memory barrier question Miklos Szeredi
2010-09-15 19:12 ` Rafael J. Wysocki
2010-09-16 11:55 ` David Howells
2010-09-16 13:42 ` Miklos Szeredi
2010-09-16 13:42 ` Miklos Szeredi
2010-09-16 13:42 ` Miklos Szeredi
2010-09-16 14:30 ` David Howells
2010-09-16 15:03 ` Paul E. McKenney
2010-09-16 16:06 ` Miklos Szeredi
2010-09-16 16:37 ` Paul E. McKenney
2010-09-16 16:56 ` Miklos Szeredi
2010-09-16 17:09 ` James Bottomley
2010-09-16 17:17 ` Miklos Szeredi
2010-09-16 17:40 ` James Bottomley
2010-09-17 21:49 ` Benjamin Herrenschmidt
2010-09-17 23:12 ` Paul E. McKenney
2010-09-19 2:47 ` Benjamin Herrenschmidt
2010-09-19 15:26 ` Paul E. McKenney
2010-09-19 20:15 ` Miklos Szeredi
2010-09-19 21:59 ` Paul E. McKenney
2010-09-20 0:58 ` James Bottomley
2010-09-20 1:29 ` Paul E. McKenney
2010-09-20 16:01 ` Miklos Szeredi
2010-09-20 18:25 ` Paul E. McKenney
2010-09-20 18:25 ` Paul E. McKenney
2010-09-20 18:57 ` Paul E. McKenney
2010-09-20 20:26 ` Michael Cree
2010-09-20 20:40 ` Paul E. McKenney
2010-09-21 14:59 ` Paul E. McKenney
2010-09-22 18:41 ` Paul E. McKenney
2010-09-18 1:12 ` Alan Cox
2010-09-16 16:50 ` Jamie Lokier [this message]
2010-09-16 16:18 ` Peter Zijlstra
2010-09-16 16:18 ` Peter Zijlstra
2010-09-16 17:59 ` David Howells
-- strict thread matches above, loose matches on Subject: below --
2010-09-20 10:34 George Spelvin
2010-10-29 13:23 Memory " Tetsuo Handa
2010-10-29 15:09 ` Paul E. McKenney
2010-10-30 5:48 ` Tetsuo Handa
2010-10-30 6:11 ` Eric Dumazet
2010-10-30 12:40 ` Tetsuo Handa
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100916165018.GA26539@shareable.org \
--to=jamie@shareable.org \
--cc=dhowells@redhat.com \
--cc=linux-arch@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=miklos@szeredi.hu \
--cc=paulmck@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.