From: NeilBrown <neilb@suse.com>
To: "J. Bruce Fields" <bfields@fieldses.org>
Cc: Steve Dickson <SteveD@redhat.com>,
Linux NFS Mailing list <linux-nfs@vger.kernel.org>
Subject: Re: [PATCH 6/8] mountd: don't add paths to non-mounted export points to pseudo-root
Date: Mon, 25 Jul 2016 17:22:09 +1000 [thread overview]
Message-ID: <87zip641y6.fsf@notabene.neil.brown.name> (raw)
In-Reply-To: <20160721173350.GD27148@fieldses.org>
[-- Attachment #1: Type: text/plain, Size: 9325 bytes --]
On Fri, Jul 22 2016, J. Bruce Fields wrote:
> On Wed, Jul 20, 2016 at 08:59:30AM +1000, NeilBrown wrote:
>> On Tue, Jul 19 2016, J. Bruce Fields wrote:
>>
>> > On Thu, Jul 14, 2016 at 12:26:43PM +1000, NeilBrown wrote:
>> >> export points with the "mountpoint" flag should not be exported
>> >> if they aren't mounted.
>> >> They shouldn't even appear in the pseudo-root filesystem.
>> >> So add an appropriate check to v4root_set().
>> >>
>> >> This means that the v4root might need to be recomputed whenever a
>> >> filesystem is mounted or unmounted. So when there are export points
>> >> with the "mountpoint" flag, check for changes in the mount table.
>> >> This is done be measuring the size of /proc/mounts.
>> >
>> > Surely there's some more reliable measurement--could we track some data
>> > about the mountpoint itself, maybe?
>>
>> We could. But it would be more complex code for very little gain.
>> I did consider using select() on /proc/mounts to get a notification
>> whenever anything changes. What we be more reliable but more difficult.
>> I also considered calculating an SHA1, or maybe just a crc32 on the
>> contents of /proc/mounts. But then I realised that the size was very
>> easy and very nearly as reliable.
>
> So we don't care enough about the mountpoint option enough to make it
> work 100% reliably?
>
> If we expect too few users for there to be a real chance of hitting the
> bad case here, then I wonder again whether the whole feature is worth
> the trouble.
>
>> > But I'd still like some more justification for this change in logic.
>> > Does anyone currently use the "mp" option? If not, could we just
>> > deprecate it? If so, can we really get away with changing it this way?
>>
>> I have a customer complaining that it doesn't work as advertised for
>> NFSv4. So presumably they have a use-case, though I haven't asked for
>> details on exactly why they want it.
>
> I'd be inclined to ask for more details about the use case before
> continuing.
I asked, and found the answer quite helpful. So thanks for prompting
that.
For NFSv2/3, If I list "/export/foo" in /etc/exports, but /export/foo
fails to mount during boot, then a client which tries to mount
"/export/foo" will get a file handle on /export (probably the root
filesystem).
Unless subtree_check is set (which we don't like) this effectively means
that the whole root filesystem is potentially exported, if the client can
determine the filehandles.
Once the problem is fixed, the filesystem is mounted, and "exportfs -r"
is run (possibly by reboot), the root filesystem will no longer be
exported, so that filehandle that the client has becomes stale (this is
the particlar symptom the customer mentioned).
I think it is safe to argue that having 'mount' fail is safer than
having it succeed, present an empty directory, and then have that
directory suddenly become stale at some later time.
For NFSv4 the root filesystem is always exported, but usually as the
'pseudo-root', being read-only and files being completely unavailable.
If /export/foo is not mounted but /export/foo is exported, then at least
part of the root filesystem will be exported (potentially) r/w.
I'm not sure what happens with filehandles. A filehandle from the pseudo-root
filesystem has fsid=0. A filehandle from a properly exported directory
on the root filesystem might not - I'd have to check another day.
So you might not get the 'stale file handles', but would might still get
unexpected access to the root filesystem.
So I think this justifies maintaining (and maybe even encouraging) the
'mountpoint' export option.
For NFSv4 it is probably OK for the to-be-mounted-on directory to be visible,
but firmly 'pseudo'. So I can probably drop my elegant /proc/mounts
change detector which you aren't fond of.
When we get a filehandle for a filesystem which isn't currently mounted
the current code sends no response, so clients hang. My latest patch
sends ESTALE, so client gets an error.
I wonder if we could arrange to make just the exported root look like an
empty (pseudo-root-style) directory. Then when the filesystem gets
mounted the directory morphs into the real thing..
For files on the filesystem I could probably be convinced either way,
unless testing shows some unpleasant behaviour.
Are you convinced? At all?
Thanks,
NeilBrown
>
> As it is I'm inclined towards some plan like documenting "mountpoint" as
> deprecated, warning on use, and taking it out eventually.
>
> --b.
>
>> I actually think this is the most useful of the changes. It means that
>> if a filesystem isn't mounted, it isn't even visible over NFSv4.
>>
>> After all, the reality is that people export filesystems, not names in
>> their namespace. NFSv4 tries to make it all look like the same thing,
>> and there is some justification for that. But I think a lot of people
>> think about it as filesystems being exported, and the mountpoint option
>> allows that thought to be expressed in the configuration.
>>
>> NeilBrown
>>
>>
>> >
>> > --b.
>> >
>> >>
>> >> Signed-off-by: NeilBrown <neilb@suse.com>
>> >> ---
>> >> support/include/v4root.h | 2 +-
>> >> utils/mountd/auth.c | 29 +++++++++++++++++++++++++++--
>> >> utils/mountd/v4root.c | 11 ++++++++++-
>> >> 3 files changed, 38 insertions(+), 4 deletions(-)
>> >>
>> >> diff --git a/support/include/v4root.h b/support/include/v4root.h
>> >> index 706c15c70d95..406fd4e43e5a 100644
>> >> --- a/support/include/v4root.h
>> >> +++ b/support/include/v4root.h
>> >> @@ -10,6 +10,6 @@
>> >> #define V4ROOT_H
>> >>
>> >> extern int v4root_needed;
>> >> -extern void v4root_set(void);
>> >> +extern void v4root_set(int *mountpoints_checked);
>> >>
>> >> #endif /* V4ROOT_H */
>> >> diff --git a/utils/mountd/auth.c b/utils/mountd/auth.c
>> >> index 0881d9a6edba..5bd7e6622307 100644
>> >> --- a/utils/mountd/auth.c
>> >> +++ b/utils/mountd/auth.c
>> >> @@ -77,6 +77,29 @@ check_useipaddr(void)
>> >> cache_flush(1);
>> >> }
>> >>
>> >> +static int mountpoints_changed(void)
>> >> +{
>> >> + static int last_size = 0;
>> >> + int size;
>> >> + int fd;
>> >> + char buf[4096];
>> >> + int n;
>> >> +
>> >> + fd = open("/proc/mounts", O_RDONLY);
>> >> + if (fd < 0)
>> >> + /* ignore mountpoint changes if we cannot read /proc/mounts */
>> >> + return 0;
>> >> + size = 0;
>> >> + while ((n = read(fd, buf, sizeof(buf))) > 0)
>> >> + size += n;
>> >> + if (n < 0)
>> >> + return 0;
>> >> + if (size == last_size)
>> >> + return 0;
>> >> + last_size = size;
>> >> + return 1;
>> >> +}
>> >> +
>> >> unsigned int
>> >> auth_reload()
>> >> {
>> >> @@ -84,6 +107,7 @@ auth_reload()
>> >> static ino_t last_inode;
>> >> static int last_fd = -1;
>> >> static unsigned int counter;
>> >> + static int mountpoints_checked = 0;
>> >> int fd;
>> >>
>> >> if ((fd = open(_PATH_ETAB, O_RDONLY)) < 0) {
>> >> @@ -91,7 +115,8 @@ auth_reload()
>> >> } else if (fstat(fd, &stb) < 0) {
>> >> xlog(L_FATAL, "couldn't stat %s", _PATH_ETAB);
>> >> close(fd);
>> >> - } else if (last_fd != -1 && stb.st_ino == last_inode) {
>> >> + } else if (last_fd != -1 && stb.st_ino == last_inode &&
>> >> + (!mountpoints_checked || !mountpoints_changed())) {
>> >> /* We opened the etab file before, and its inode
>> >> * number hasn't changed since then.
>> >> */
>> >> @@ -114,7 +139,7 @@ auth_reload()
>> >> memset(&my_client, 0, sizeof(my_client));
>> >> xtab_export_read();
>> >> check_useipaddr();
>> >> - v4root_set();
>> >> + v4root_set(&mountpoints_checked);
>> >>
>> >> ++counter;
>> >>
>> >> diff --git a/utils/mountd/v4root.c b/utils/mountd/v4root.c
>> >> index d52172592823..1a5778f9c7de 100644
>> >> --- a/utils/mountd/v4root.c
>> >> +++ b/utils/mountd/v4root.c
>> >> @@ -183,7 +183,7 @@ static int v4root_add_parents(nfs_export *exp)
>> >> * looking for components of the v4 mount.
>> >> */
>> >> void
>> >> -v4root_set()
>> >> +v4root_set(int *mountpoints_checked)
>> >> {
>> >> nfs_export *exp;
>> >> int i;
>> >> @@ -193,6 +193,7 @@ v4root_set()
>> >> if (!v4root_support())
>> >> return;
>> >>
>> >> + *mountpoints_checked = 0;
>> >> for (i = 0; i < MCL_MAXTYPES; i++) {
>> >> for (exp = exportlist[i].p_head; exp; exp = exp->m_next) {
>> >> if (exp->m_export.e_flags & NFSEXP_V4ROOT)
>> >> @@ -202,6 +203,14 @@ v4root_set()
>> >> */
>> >> continue;
>> >>
>> >> + if (exp->m_export.e_mountpoint) {
>> >> + *mountpoints_checked = 1;
>> >> + if (!is_mountpoint(exp->m_export.e_mountpoint[0]?
>> >> + exp->m_export.e_mountpoint:
>> >> + exp->m_export.e_path))
>> >> + continue;
>> >> + }
>> >> +
>> >> if (strcmp(exp->m_export.e_path, "/") == 0 &&
>> >> !(exp->m_export.e_flags & NFSEXP_FSID)) {
>> >> /* Force '/' to be exported as fsid == 0*/
>> >>
>> >>
>> >> --
>> >> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
>> >> the body of a message to majordomo@vger.kernel.org
>> >> More majordomo info at http://vger.kernel.org/majordomo-info.html
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 818 bytes --]
next prev parent reply other threads:[~2016-07-25 7:22 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-07-14 2:26 [PATCH 0/8] Assorted mount-related nfs-utils patches NeilBrown
2016-07-14 2:26 ` [PATCH 8/8] mount: use a public address for IPv6 callback NeilBrown
2016-07-14 2:26 ` [PATCH 6/8] mountd: don't add paths to non-mounted export points to pseudo-root NeilBrown
2016-07-18 20:32 ` J. Bruce Fields
2016-07-19 8:00 ` Chuck Lever
2016-07-19 22:59 ` NeilBrown
2016-07-21 17:33 ` J. Bruce Fields
2016-07-25 7:22 ` NeilBrown [this message]
2016-07-28 20:54 ` J. Bruce Fields
2016-07-14 2:26 ` [PATCH 7/8] mount: don't treat temporary name resolution failure as permanent NeilBrown
2016-07-19 23:01 ` NeilBrown
2016-07-14 2:26 ` [PATCH 1/8] nfs.man: clarify effect of 'retry' option NeilBrown
2016-07-14 2:26 ` [PATCH 3/8] mountd: remove 'dev_missing' checks NeilBrown
2016-07-18 20:01 ` J. Bruce Fields
2016-07-19 22:50 ` NeilBrown
2016-07-21 17:24 ` J. Bruce Fields
2016-08-11 2:51 ` NeilBrown
2016-08-16 15:21 ` J. Bruce Fields
2016-08-18 1:32 ` NeilBrown
2016-08-18 2:57 ` Chuck Lever
2016-08-19 1:31 ` NeilBrown
2016-08-18 13:57 ` J. Bruce Fields
2016-08-19 1:28 ` NeilBrown
2016-08-19 17:27 ` J. Bruce Fields
2016-07-14 2:26 ` [PATCH 5/8] mountd: Don't export unmounted exports to NFSv4 NeilBrown
2016-07-14 2:26 ` [PATCH 2/8] mountd: remove the --exports-file option NeilBrown
2016-07-18 16:19 ` J. Bruce Fields
2016-07-14 2:26 ` [PATCH 4/8] mountd: cause attempts to access unmounted exportpoints to return ESTALE NeilBrown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87zip641y6.fsf@notabene.neil.brown.name \
--to=neilb@suse.com \
--cc=SteveD@redhat.com \
--cc=bfields@fieldses.org \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).