linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Anton Altaparmakov <aia21@cam.ac.uk>
To: Matthew Wilcox <matthew@wil.cx>
Cc: Al Viro <viro@ftp.linux.org.uk>,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: A missing i_mutex in rename? (Linux kernel 2.6.latest)
Date: Wed, 19 Apr 2006 13:51:21 +0100 (BST)	[thread overview]
Message-ID: <Pine.LNX.4.64.0604191328160.12158@hermes-1.csi.cam.ac.uk> (raw)
In-Reply-To: <20060419121826.GI24104@parisc-linux.org>

Hi Matthew,

Thanks for the quick reply.

On Wed, 19 Apr 2006, Matthew Wilcox wrote:
> On Wed, Apr 19, 2006 at 11:50:00AM +0100, Anton Altaparmakov wrote:
> > Both sys_unlink()/sys_rmdir() and sys_link() all end up taking the i_mutex 
> > on all parent directories and source/destination inodes before calling 
> > into the file system inode operations.
> > 
> > sys_rename() OTOH, does not take i_mutex on the old inode.  It only takes 
> > i_mutex on the two parent directories and on the target inode if it 
> > exists.
> > 
> > Why is this?  To me it seems that either sys_rename() must take i_mutex on 
> > the old inode or sys_unlink()/sys_rmdir(), sys_link(), etc do not need to 
> > hold the i_mutex.
> > 
> > What am I missing?
> 
> I believe the current locking scheme to be correct.  Reading
> Documentation/filesystems/directory-locking and pondering for a few
> minutes leads me to the following conclusions:
> 
>  - sys_rmdir() must take the lock on the parent directory and on the
>    victim.  If a different process is trying to create a file in the
>    victim, sys_rmdir() mustn't race with it.

Agreed.

>  - I don't immediately see a race that taking the lock on the victim of
>    sys_unlink() solves; however, for symmetry with sys_rmdir(), it seems
>    desirable.

I guess the symmetry thing is fair enough.

>  - sys_link() needs to lock the target to be sure it isn't removed and
>    replaced with a directory in the meantime.

Agreed.

>  - sys_rename() does not need to lock the old inode.  Since the parent
>    is already locked, the old inode can't be removed/renamed by a racing
>    process.  It doesn't matter if something's created or deleted from
>    within the old inode (if it's a directory), unlike rmdir().  It
>    doesn't need to be protected from a sys_link() race.

Agreed.

> If you need to lock the old inode inside ntfs for your own consistency
> purposes, that looks like it should be fine, but the VFS doesn't need to
> lock it for you.

Great, thanks.  That was my own conclusion also but it never hurts to be 
sure.  (-:

ntfs_rename() at the moment looks roughly like this:

if (target_inode) {
	if (S_ISDIR(target_inode->i_mode)
		ntfs_rmdir(target_dir_inode, target_dentry);
	else
		ntfs_unlink(target_dir_inode, target_dentry);
}
mutex_lock(&old_inode->i_mutex);
ntfs_link(old_dentry, target_dir_inode, target_dentry);
ntfs_unlink(old_dir_inode, old_dentry);
mutex_unlock(&old_inode->i_mutex);

Which is incredibly inefficient but very simple and works (with minimal 
special casing in ntfs_link() and ntfs_unlink() mostly so if old_inode is 
a directory we never get a link count greater one on the VFS inode) and I 
doubt sys_rename() is a very often invoked system call.  Normally, a 
sys_link() and sys_unlink() would take i_mutex on old_inode as shown in 
above code which is why I was wondering whether I should take it as shown 
above or whether I can just not worry and not take yet another lock in a 
code path where tons of locks are already being taken and released.

My conclusion is that the above code is safe even if I remove the 
mutex_lock()/mutex_unlock() around the ntfs_link()/ntfs_unlink() given 
that sys_unlink() only takes the lock for symmetry reasons and 
sys_link()'s need for locking is taken care of by the fact that 
sys_rename() has the lock on both parent directory inodes.

Would you agree?

Thanks a lot in advance!

Best regards,

	Anton
-- 
Anton Altaparmakov <aia21 at cam.ac.uk> (replace at with @)
Unix Support, Computing Service, University of Cambridge, CB2 3QH, UK
Linux NTFS maintainer, http://www.linux-ntfs.org/

  reply	other threads:[~2006-04-19 12:51 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-04-19 10:50 A missing i_mutex in rename? (Linux kernel 2.6.latest) Anton Altaparmakov
2006-04-19 12:18 ` Matthew Wilcox
2006-04-19 12:51   ` Anton Altaparmakov [this message]
2006-04-20 10:59     ` Al Viro
2006-04-20 12:24       ` Anton Altaparmakov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Pine.LNX.4.64.0604191328160.12158@hermes-1.csi.cam.ac.uk \
    --to=aia21@cam.ac.uk \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=matthew@wil.cx \
    --cc=viro@ftp.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).