From: Al Viro <viro@ZenIV.linux.org.uk>
To: Sage Weil <sage@inktank.com>
Cc: linux-fsdevel@vger.kernel.org, ceph-devel@vger.kernel.org
Subject: Re: [ceph] locking fun with d_materialise_unique()
Date: Fri, 1 Feb 2013 01:14:16 +0000 [thread overview]
Message-ID: <20130201011416.GO4503@ZenIV.linux.org.uk> (raw)
In-Reply-To: <20130130144214.GJ4503@ZenIV.linux.org.uk>
On Wed, Jan 30, 2013 at 02:42:14PM +0000, Al Viro wrote:
> On Tue, Jan 29, 2013 at 01:03:23PM -0800, Sage Weil wrote:
>
> > We should drop teh mds_client.c hunk from your patch, and then do
> > something like the below. I'll put it in the ceph tree so we can do some
> > basic testing. Unfortunately, the abort case is a bit annoying to
> > trigger.. we'll need to write a new test for it.
>
> hunk dropped, the rest folded into your patch, resulting branch pushed...
BTW, moderately related issue - ceph_get_dentry_parent_inode() uses
look fishy. In ceph_rename() it seems to be pointless. We do have
the directory inode of parent of old_dentry - it's old_dir, so it's just
a fancy way to spell igrab(old_directory). And as far as I can tell, *all*
other callers are racy.
* link("a/foo", "b/bar") can happen in parallel with
rename("a/foo", "c/splat"). link(2) holds ->i_mutex on a/foo (so it can't race
with unlink) and on b/; rename(2) holds it on a/, c/ and c/splat (if the last
one exists). It also holds ->s_vfs_rename_sem, so cross-directory renames
are serialized. For normal filesystems that's enough and there ->link()
couldn't care less about a/; ceph_link() wants the inode of a/ for some
reason. If it *really* needs the parent of a/foo for the operation, the
current code is SOL - the directory we grab can cease being that parent
just as it's getting returned to ceph_link()
* ->d_revalidate() can happen in parallel with rename(). You
don't seem to be using the parent inode much in there, so that should
be reasonably easy to deal with.
* ceph_open() can race with rename(); ->atomic_open() is
called with parent locked, but ->open() isn't. AFAICS, open() with
O_TRUNC could step into that code...
* ceph_setattr() *definitely* can race with rename(); we have
the object itself locked, but not its parent (and I'm really surprised
by the need to know that parent for such operation).
* CEPH_IOC_SET_LAYOUT vs. rename() - no idea what that ioctl is
about, but opened files can be moved around, TYVM, even when they are
in the middle of ioctl(2).
* ->setxattr() and ->removexattr() - again, can happen in parallel
with rename(), and again I really wonder why do we need the parent directory
for such operations.
What's going on there?
next prev parent reply other threads:[~2013-02-01 1:14 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-29 4:52 [ceph] locking fun with d_materialise_unique() Al Viro
2013-01-29 5:20 ` Sage Weil
2013-01-29 8:00 ` Al Viro
2013-01-29 21:03 ` Sage Weil
2013-01-29 21:03 ` Sage Weil
2013-01-30 14:42 ` Al Viro
2013-02-01 1:14 ` Al Viro [this message]
2013-02-05 21:59 ` Sage Weil
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130201011416.GO4503@ZenIV.linux.org.uk \
--to=viro@zeniv.linux.org.uk \
--cc=ceph-devel@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=sage@inktank.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.