From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Richard Weinberger <richard@sigma-star.at>
Cc: linux-fsdevel@vger.kernel.org, viro@zeniv.linux.org.uk,
hch@infradead.org, jeffm@suse.com, linux-kernel@vger.kernel.org
Subject: Re: [PATCH] vfs: Fix RCU usage in __propagate_umount()
Date: Wed, 30 Jul 2014 11:39:02 -0700 [thread overview]
Message-ID: <20140730183902.GA24687@linux.vnet.ibm.com> (raw)
In-Reply-To: <1406728756-32443-1-git-send-email-richard@sigma-star.at>
On Wed, Jul 30, 2014 at 03:59:16PM +0200, Richard Weinberger wrote:
> If we use the plain list_empty() we might not see the
> hlist_del_init_rcu() and therefore miss one member of the
> list.
>
> It fixes the following issue:
> $ unshare -m /usr/bin/sleep 10000 &
> $ mkdir -p foo/proc
> $ mount -t proc none foo/proc
> $ mount -t binfmt_misc none foo/proc/sys/fs/binfmt_misc
> $ umount -l foo/proc
> $ rmdir foo/proc
> rmdir: failed to remove ‘foo/proc’: Device or resource busy
>
> rmdir fails because the last entry in the RCU list, "proc", was
> not propagated as list_empty() still returned false instead of true.
>
> Signed-off-by: Richard Weinberger <richard@sigma-star.at>
> ---
> Hi!
>
> Please review this patch with care, the comments in rculist.h
> confused me like hell:
So there are some special cases where list_empty() can be applied to
an RCU-protected list. One such case is where only one task is
permitted to remove from the list (or, equivalently, where removal
is protected by a lock). Then if that task sees !list_empty(), it
knows that the list will remain non-empty because no one else is
removing things.
There are other special cases, but this one gives the general flavor.
Thanx, Paul
> First it says:
> /*
> * Why is there no list_empty_rcu()? Because list_empty() serves this
> * purpose. The list_empty() function fetches the RCU-protected pointer
> * and compares it to the address of the list head, but neither dereferences
> * this pointer itself nor provides this pointer to the caller. Therefore,
> * it is not necessary to use rcu_dereference(), so that list_empty() can
> * be used anywhere you would want to use a list_empty_rcu().
> */
>
> And later:
> /**
> * Where are list_empty_rcu() and list_first_entry_rcu()?
> *
> * Implementing those functions following their counterparts list_empty() and
> * list_first_entry() is not advisable because they lead to subtle race
> * conditions as the following snippet shows:
> *
> * if (!list_empty_rcu(mylist)) {
> * struct foo *bar = list_first_entry_rcu(mylist, struct foo, list_member);
> * do_something(bar);
> * }
> *
> * The list may not be empty when list_empty_rcu checks it, but it may be when
> * list_first_entry_rcu rereads the ->next pointer.
> *
> * Rereading the ->next pointer is not a problem for list_empty() and
> * list_first_entry() because they would be protected by a lock that blocks
> * writers.
> *
> * See list_first_or_null_rcu for an alternative.
> */
>
> To my understanding we cannot use list_empty() and have to use list_first_or_null_rcu(),
> or am I missing something?
>
> Thanks,
> //richard
>
> fs/pnode.c | 3 ++-
> 1 file changed, 2 insertions(+), 1 deletion(-)
>
> diff --git a/fs/pnode.c b/fs/pnode.c
> index 302bf22..883901c 100644
> --- a/fs/pnode.c
> +++ b/fs/pnode.c
> @@ -380,7 +380,8 @@ static void __propagate_umount(struct mount *mnt)
> * umount the child only if the child has no
> * other children
> */
> - if (child && list_empty(&child->mnt_mounts)) {
> + if (child && list_first_or_null_rcu(&child->mnt_mounts,
> + struct mount, mnt_mounts)) {
> hlist_del_init_rcu(&child->mnt_hash);
> hlist_add_before_rcu(&child->mnt_hash, &mnt->mnt_hash);
> }
> --
> 1.8.4.5
>
next prev parent reply other threads:[~2014-07-30 18:39 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-07-30 13:59 [PATCH] vfs: Fix RCU usage in __propagate_umount() Richard Weinberger
2014-07-30 14:20 ` Richard Weinberger
2014-07-30 18:39 ` Paul E. McKenney [this message]
2014-07-30 20:46 ` MNT_DETACH and mount namespace issue (was: Re: [PATCH] vfs: Fix RCU usage in __propagate_umount()) Richard Weinberger
2014-07-31 22:17 ` MNT_DETACH and mount namespace issue Richard Weinberger
2014-08-01 15:44 ` Ram Pai
2014-08-01 19:20 ` Richard Weinberger
2014-08-01 22:09 ` Eric W. Biederman
2014-08-04 8:40 ` Richard Weinberger
2014-08-04 16:46 ` Eric W. Biederman
2014-08-04 21:19 ` Richard Weinberger
2014-08-04 22:10 ` Ram Pai
2014-08-04 22:16 ` Richard Weinberger
2014-08-04 22:59 ` Eric W. Biederman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140730183902.GA24687@linux.vnet.ibm.com \
--to=paulmck@linux.vnet.ibm.com \
--cc=hch@infradead.org \
--cc=jeffm@suse.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=richard@sigma-star.at \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox