From: Malahal Naineni <malahal@us.ibm.com>
To: Jeff Layton <jlayton@redhat.com>
Cc: "Myklebust, Trond" <Trond.Myklebust@netapp.com>,
"linux-nfs@vger.kernel.org" <linux-nfs@vger.kernel.org>,
"Schumaker, Bryan" <Bryan.Schumaker@netapp.com>
Subject: Re: corruption due to loss of lock
Date: Thu, 11 Jul 2013 10:39:59 -0500 [thread overview]
Message-ID: <20130711153959.GA15010@us.ibm.com> (raw)
In-Reply-To: <20130711112036.03ffe1cc@tlielax.poochiereds.net>
Jeff Layton [jlayton@redhat.com] wrote:
> On Thu, 11 Jul 2013 14:33:02 +0000
> "Myklebust, Trond" <Trond.Myklebust@netapp.com> wrote:
>
> > On Thu, 2013-07-11 at 10:28 -0400, Jeff Layton wrote:
> > > On Thu, 11 Jul 2013 14:19:10 +0000
> > > "Myklebust, Trond" <Trond.Myklebust@netapp.com> wrote:
> > >
> > > > On Thu, 2013-07-11 at 07:13 -0400, Jeff Layton wrote:
> > > > > On Thu, 13 Jun 2013 13:47:37 -0500
> > > > > Malahal Naineni <malahal@us.ibm.com> wrote:
> > > > >
> > > > > > Hi Trond,
> > > > > >
> > > > > > I saw Bryan's patches here https://patchwork.kernel.org/patch/987402/
> > > > > > that fix issues after loss of a lock. What is the status on this patch
> > > > > > set? Do they need more work? We have an application that uses range
> > > > > > locks on a file. Two threads from two different clients end up writing
> > > > > > to the same a file due to this bug after a lease expiry from a client.
> > > > > >
> > > > > > Regards, Malahal.
> > > > >
> > > > > (cc'ing Bryan since he did the original set)
> > > > >
> > > > > Yeah, this set would be a nice thing to have. A couple of comments:
> > > > >
> > > > > - I still think it would be best to make SIGLOST its own signal, but as
> > > > > Bryan points out, it would need to be larger than SIGRTMAX. I'm
> > > > > not sure that's possible on all arches with the way the RT signals
> > > > > were done. It's probably worth investigating that though before
> > > > > settling on SIGIO since it would be hard to change that retroactively.
> > > > >
> > > > > - This is not really a v4.1 specific thing. It should also be done for
> > > > > v4.0 and v2/3, though the latter two really need to be done within
> > > > > lockd.
> > > >
> > > > SIGLOST is not part of any standard. It is a hack that has been adopted
> > > > by IBM and Solaris.
> > > >
> > > > The POSIXly correct way to do this is to use EBADF to warn the
> > > > application that the file descriptor is no longer valid (in the sense
> > > > that the server is no longer honouring the lock) and EIO in order to
> > > > warn it that data may have been lost.
> > > >
> > >
> > > It is a hack...I won't argue there
> > >
> > > I'm not sure that returning errors is really the best approach though.
> > > In some cases, the fd may be fine. It may only be the lock that has
> > > been lost.
> > >
> > > With a signal, the program has more of a choice as to whether it cares
> > > about lost locks and is more immediate when the problem occurs. An
> > > error code seems like it might cause a lot of grief for programs that
> > > aren't expecting that sort of behavior.
> >
> > EBADF is a error that has an obvious meaning in POSIX: you need to
> > reopen the file and re-establish any locks.
>
> Well, EBADF means "Bad file descriptor". Consider the v2/3 case -- the
> fd might still be usable, it's only my lock that has been lost. One
> might consider that to mean that we shouldn't use that fd anymore, but
> that's a behavioral change any way you slice it...
>
> > How is that not better than
> > receiving a signal they won't be expecting? Consider that we'd have to
> > overload SIGIO, which has a completely different meaning in POSIX...
> >
>
> That's the main reason that I think we want a new signal for this. The
> default on SIGLOST should be to ignore it, and then that would allow
> processes to opt-in to paying attention to it.
We should split that patchset into two.
1. we should return EBADF/EIO (debatable which one) for operations that
require lock after loss of lock.
2. sending a signal (SIGIO/SIGLOST).
The first one is critical to avoid corruption, and second one is needed
for graceful recovery.
Regards, Malahal.
next prev parent reply other threads:[~2013-07-11 15:40 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-13 18:47 corruption due to loss of lock Malahal Naineni
2013-07-11 11:13 ` Jeff Layton
2013-07-11 14:12 ` Malahal Naineni
2013-07-11 14:19 ` Myklebust, Trond
2013-07-11 14:28 ` Jeff Layton
2013-07-11 14:33 ` Myklebust, Trond
2013-07-11 15:20 ` Jeff Layton
2013-07-11 15:39 ` Malahal Naineni [this message]
2013-07-11 16:28 ` Myklebust, Trond
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130711153959.GA15010@us.ibm.com \
--to=malahal@us.ibm.com \
--cc=Bryan.Schumaker@netapp.com \
--cc=Trond.Myklebust@netapp.com \
--cc=jlayton@redhat.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).