From: Kinglong Mee <kinglongmee@gmail.com>
To: "J. Bruce Fields" <bfields@fieldses.org>, NeilBrown <neilb@suse.de>
Cc: linux-fsdevel@vger.kernel.org,
"linux-nfs@vger.kernel.org" <linux-nfs@vger.kernel.org>,
Al Viro <viro@zeniv.linux.org.uk>,
Trond Myklebust <trond.myklebust@primarydata.com>,
kinglongmee@gmail.com
Subject: Re: [PATCH 4/4] nfsd: Pin to vfsmnt instead of mntget
Date: Mon, 11 May 2015 21:08:47 +0800 [thread overview]
Message-ID: <5550A9DF.1070908@gmail.com> (raw)
In-Reply-To: <20150508134744.GA23753@fieldses.org>
On 5/8/2015 9:47 PM, J. Bruce Fields wrote:
> On Fri, May 08, 2015 at 02:40:31PM +1000, NeilBrown wrote:
>> Thanks for this patch. It looks good!
>>
>> My only comment on the code is that I would really like to see a
>> "path_get_pin()" and "path_put_unpin()" rather than open coding:
>>
>>> + dget(item->ek_path.dentry);
>>> + pin_insert_group(&new->ek_pin, item->ek_path.mnt, NULL);
>>
>> and
>>
>>> + dput(key->ek_path.dentry);
>>> + pin_remove(&key->ek_pin);
>>
>>
>> But the question you raise is an important one: Exactly which filesystems
>> should be allowed to be unmounted?
>> This is a change in behaviour - is it one that people uniformly would want?
>>
>> The kernel doesn't currently know which file systems were explicitly listed
>> in /etc/exports, and which were found by following a 'crossmnt'.
>> It could guess and allow the unmounting of anything below a 'crossmnt', but I
>> wouldn't be comfortable with that - it is error prone.
>>
>> mountd does know what is in /etc/exports, and could tell the kernel.
>> For the expkey cache, we could always use path_get_pin.
>> For the export cache (where flags are available) we could use path_get
>> or path_get_pin depending on some new flag.
>>
>> I'm not really sure it is worth it. I would rather the filesystems could
>> always be unmounted. But doing that could possibly break someone's work
>> flow. Maybe.
>>
>> Or maybe I'm seeing problems where there aren't any.
>>
>> Anyone else have an opinion?
>
> The undisputed bug here was negative cache entries preventing unmount.
> So most conservative might be just to purge negative entries.
I'd like this,
if the cache is valid, user should not be allowed to umount the filesystem.
>
> Otherwise, the only guarantees I think we've really had is that we won't
> allow unmount if you hold any actual state on the filesystem (NLM locks,
> NFSv4 locks, opens, or delegations).
Those resources hold the reference of vfsmnt.
>
> If a filesystem is exported but no clients hold state on it, then it's
> currently mostly chance whether the unmount succeeds or not. So we're
> probably free to change the behavior in this case. I'd be inclined to
> allow the unmount, but haven't thought this through carefully.
If client mount a nfsserver succeed without holds state,
nfs server umounts the exported filesystem,
client also think the filesystem is valid, but it is umounted.
>
> It could also be useful to have the ability to force an unmount even in
> the presence of locks. That's not a safe default, but an
> "allow_force_unmount" export option might be useful.
>
> We might similarly be able to add some way for the kernel to distinguish
> explicit exports from crossmnt-found exports, but I'm not seeing the use
> case for that.
Agree, I don't think we needs that right now.
thanks,
Kinglong Mee
next prev parent reply other threads:[~2015-05-11 13:08 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-05-06 13:18 [PATCH 0/4] NFSD: Pin to vfsmount instead of mntget for export cache Kinglong Mee
2015-05-06 13:19 ` [PATCH 1/4] fs_pin: Fix uninitialized value in fs_pin Kinglong Mee
2015-05-06 13:19 ` Kinglong Mee
2015-05-07 19:43 ` J. Bruce Fields
2015-05-08 0:36 ` Kinglong Mee
2015-05-08 0:36 ` Kinglong Mee
2015-05-06 13:19 ` [PATCH 2/4] fs_pin: Export functions for specific filesystem Kinglong Mee
2015-05-06 13:19 ` Kinglong Mee
2015-05-06 13:20 ` [PATCH 3/4] sunrpc: New helper cache_force_expire for cache cleanup Kinglong Mee
2015-05-06 13:20 ` Kinglong Mee
2015-05-06 13:21 ` [PATCH 4/4] nfsd: Pin to vfsmnt instead of mntget Kinglong Mee
2015-05-06 13:21 ` Kinglong Mee
2015-05-08 4:40 ` NeilBrown
2015-05-08 13:47 ` J. Bruce Fields
2015-05-11 13:08 ` Kinglong Mee [this message]
2015-05-13 4:25 ` NeilBrown
2015-05-13 4:25 ` NeilBrown
2015-05-13 12:30 ` Kinglong Mee
2015-05-13 12:55 ` Kinglong Mee
2015-05-13 12:55 ` Kinglong Mee
2015-05-15 21:11 ` J. Bruce Fields
2015-05-15 21:11 ` J. Bruce Fields
2015-05-15 23:23 ` NeilBrown
2015-05-22 15:02 ` Kinglong Mee
2015-05-22 16:03 ` J. Bruce Fields
2015-05-15 21:09 ` J. Bruce Fields
2015-05-15 21:09 ` J. Bruce Fields
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5550A9DF.1070908@gmail.com \
--to=kinglongmee@gmail.com \
--cc=bfields@fieldses.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-nfs@vger.kernel.org \
--cc=neilb@suse.de \
--cc=trond.myklebust@primarydata.com \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.