From: "J. Bruce Fields" <bfields@fieldses.org>
To: Steve Dickson <SteveD@redhat.com>
Cc: Chuck Lever <chuck.lever@oracle.com>,
Phil Endecott
<phil_bnaqb_endecott-wZDNlLIRyE5g9hUCZPvPmw@public.gmane.org>,
linux-nfs@vger.kernel.org
Subject: Re: Make sm-notify faster if there are no servers to notify
Date: Thu, 4 Dec 2008 16:10:57 -0500 [thread overview]
Message-ID: <20081204211057.GC9593@fieldses.org> (raw)
In-Reply-To: <49183A12.7010707-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
On Mon, Nov 10, 2008 at 08:41:38AM -0500, Steve Dickson wrote:
> J. Bruce Fields wrote:
> > On Wed, Oct 29, 2008 at 04:30:32PM -0400, Chuck Lever wrote:
> >> I assume sync() is required because this logic performs a rename as well
> >> as a simple write?
> >
> > I think an fsync() on the containing directory (together with an fsync()
> > of the file itself) would do the job if you wanted to avoid the globaly
> > sync(). I don't think ext3 is capable of doing anything finer-grained
> > than a whole-filesystem sync, though, so this doesn't help many people
> > in practice right now.
> >
> > In any case, the rename adds an extra level of safety by ensuring the
> > nsm state is updated atomically, so we shouldn't get rid of it.
> >
> >>> Anyway, I think the nsm state updating shouldn't matter if you don't
> >>> even have any peers to notify.
> >> It probably does matter.
> >>
> >> When a system is initially installed, it likely does not have a state
> >> file in /var/lib/nfs. This may be harmless if it's not present;
> >> rpc.statd probably does the right thing in this case.
> >
> > The "right thing" in that case would be, I guess, to create a state file
> > with "0" in it. It doesn't do that. So this patch *does* break stuff.
> > Oops!
> >
> > So should we revert it and do something else, or patch statd to create
> > a new state file if necessary?
> I have to agree with Chuck, that managing the state file from one
> place is desirable... Although does it make sense to move that management
> into a init script? Maybe insuring the state is different before any
> daemons are started? Would we still need the sync()?
>
> >
> >> However, the rest of the logic in nsm_get_state() is needed to bump the
> >> system's state value properly after every reboot. It may be
> >> inconsequential if there were no mounts or no NFS clients during the
> >> last reboot, but this is subtle. I wouldn't bet on it.
> >
> > If the state is only every communicated to hosts by notifications, then
> > if we're not notifying, the update of the state can't matter.
> I had the same notion... If there are no clients to notify, why would
> any clients care if our state changed?? With that said.... I do have a
> sinking feeling that this patch will come back to bite us...
> It was just too easy of a fix... :-)
Any progress on this? I don't think we can release in the current
state, since as far as I can tell that means on a new system, unless the
install scripts create /var/lib/nfs/state, neither sm-notify nor statd
ever writes to /proc/sys/fs/nfs/nsm_local_state, and (without testing)
it looks to me like that means lockd defaults to a state of 0, which is
nonsense?
One possible compromise that I think would be correct might be to skip
the sync in the case where there are no peers to notify, instead of
exiting early in that case. As long as a sync happens at some point
before we grant any locks (which statd can guarantee), I think that will
work....
--b.
next prev parent reply other threads:[~2008-12-04 21:11 UTC|newest]
Thread overview: 29+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-10-29 0:13 Make sm-notify faster if there are no servers to notify Phil Endecott
[not found] ` <1225239200402-YnoLgZYwwYuCbKHnblo0pmrPP3OPMK55cpQHUIT47Ck@public.gmane.org>
2008-10-29 17:18 ` J. Bruce Fields
2008-10-29 17:30 ` Phil Endecott
[not found] ` <1225301403729-YnoLgZYwwYuCbKHnblo0pmrPP3OPMK55cpQHUIT47Ck@public.gmane.org>
2008-10-29 17:37 ` J. Bruce Fields
2008-10-29 17:45 ` Phil Endecott
[not found] ` <1225302305994-YnoLgZYwwYuCbKHnblo0pmrPP3OPMK55cpQHUIT47Ck@public.gmane.org>
2008-10-29 18:41 ` J. Bruce Fields
2008-10-29 20:30 ` Chuck Lever
2008-10-29 21:11 ` J. Bruce Fields
2008-11-09 19:25 ` J. Bruce Fields
2008-11-10 0:52 ` Chuck Lever
2008-11-10 0:55 ` J. Bruce Fields
2008-11-10 1:00 ` Chuck Lever
2008-11-10 9:40 ` Phil Endecott
2008-11-10 13:41 ` Steve Dickson
[not found] ` <49183A12.7010707-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
2008-12-04 21:10 ` J. Bruce Fields [this message]
2008-12-05 3:34 ` Neil Brown
[not found] ` <18744.41310.635618.148281-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>
2008-12-05 3:58 ` J. Bruce Fields
2008-12-05 13:26 ` Steve Dickson
[not found] ` <49392C14.7000709-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
2008-12-05 16:38 ` J. Bruce Fields
2008-12-05 17:29 ` J. Bruce Fields
2008-12-05 18:41 ` Chuck Lever
2008-12-06 2:42 ` J. Bruce Fields
2008-12-08 19:50 ` Chuck Lever
2008-12-11 22:44 ` J. Bruce Fields
2008-12-05 19:12 ` Steve Dickson
[not found] ` <49397D1B.3000701-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
2008-12-06 2:49 ` J. Bruce Fields
2008-12-05 18:42 ` Steve Dickson
[not found] ` <4939760A.9050304-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
2008-12-06 3:08 ` J. Bruce Fields
2008-10-31 17:52 ` Steve Dickson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20081204211057.GC9593@fieldses.org \
--to=bfields@fieldses.org \
--cc=SteveD@redhat.com \
--cc=chuck.lever@oracle.com \
--cc=linux-nfs@vger.kernel.org \
--cc=phil_bnaqb_endecott-wZDNlLIRyE5g9hUCZPvPmw@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.