From: "J. Bruce Fields" <bfields@fieldses.org>
To: Jeff Layton <jlayton@redhat.com>
Cc: Neil Brown <neilb@suse.de>,
linux-nfs@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH - take 2] knfsd: nfsd: Handle ERESTARTSYS from syscalls.
Date: Fri, 20 Jun 2008 13:50:36 -0400 [thread overview]
Message-ID: <20080620175036.GC563@fieldses.org> (raw)
In-Reply-To: <20080619063824.00ca6381-RtJpwOs3+0O+kQycOl6kW4xkIHaj4LzF@public.gmane.org>
On Thu, Jun 19, 2008 at 06:38:24AM -0400, Jeff Layton wrote:
> On Thu, 19 Jun 2008 12:29:16 +1000
> Neil Brown <neilb@suse.de> wrote:
>
> > On Wednesday June 18, jlayton@redhat.com wrote:
> > >
> > > No objection to the patch, but what signal was being sent to nfsd when
> > > you saw this? If it's anything but a SIGKILL, then I wonder if we have
> > > a race that we need to deal with. My understanding is that we have nfsd
> > > flip between 2 sigmasks to prevent anything but a SIGKILL from being
> > > delivered while we're handling the local filesystem operation.
> >
> > SuSE /etc/init.d/nfsserver does
> >
> > killproc -n -KILL nfsd
> >
> > so it looks like a SIGKILL.
> >
> >
> > >
> > > From nfsd():
> > >
> > > ----------[snip]-----------
> > > sigprocmask(SIG_SETMASK, &shutdown_mask, NULL);
> > >
> > > /*
> > > * Find a socket with data available and call its
> > > * recvfrom routine.
> > > */
> > > while ((err = svc_recv(rqstp, 60*60*HZ)) == -EAGAIN)
> > > ;
> > > if (err < 0)
> > > break;
> > > update_thread_usage(atomic_read(&nfsd_busy));
> > > atomic_inc(&nfsd_busy);
> > >
> > > /* Lock the export hash tables for reading. */
> > > exp_readlock();
> > >
> > > /* Process request with signals blocked. */
> > > sigprocmask(SIG_SETMASK, &allowed_mask, NULL);
> > >
> > > svc_process(rqstp);
> > >
> > > ----------[snip]-----------
> > >
> > > What happens if this catches a SIGINT after the err<0 check, but before
> > > the mask is set to allowed_mask? Does svc_process() then get called with
> > > a signal pending?
> >
> > Yes, I suspect it does.
> >
> > I wonder why we have all this mucking about this signal masks anyway.
> > Anyone have any ideas about what it actually achieves?
> >
>
> HCH asked me the same question when I did the conversion to kthreads.
> My interpretation (based on guesswork here) was that we wanted to
> distinguish between SIGKILL and other allowed signals. A SIGKILL is
> allowed to interrupt the underlying I/O, but other signals should not.
>
> The question to answer here, I suppose, is whether masking a pending
> signal is sufficient to make signal_pending() return false. If I'm
> looking correctly then the answer should be "yes".
Just looking out of curiosity: signal_pending() checks whether some
thread_info->flags has TIF_SIGPENDING set.
sigprocmask() sets current->blocked to the given set, then calls
recalc_sigpending(), which (ignoring some freezer and SIGSTOP code that
I don't understand), clears TIF_SIGPENDING if any pending signals are in
the newly blocked set. So, yes.
--b.
> So I don't think we
> have a race here after all. I suspect that if SuSE used a different
> signal here, that would prevent this from happening. For the record,
> both RHEL and Fedora's init scripts use SIGINT for this.
WARNING: multiple messages have this Message-ID (diff)
From: "J. Bruce Fields" <bfields@fieldses.org>
To: Jeff Layton <jlayton@redhat.com>
Cc: Neil Brown <neilb@suse.de>,
linux-nfs@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH - take 2] knfsd: nfsd: Handle ERESTARTSYS from syscalls.
Date: Fri, 20 Jun 2008 13:50:36 -0400 [thread overview]
Message-ID: <20080620175036.GC563@fieldses.org> (raw)
In-Reply-To: <20080619063824.00ca6381@tleilax.poochiereds.net>
On Thu, Jun 19, 2008 at 06:38:24AM -0400, Jeff Layton wrote:
> On Thu, 19 Jun 2008 12:29:16 +1000
> Neil Brown <neilb@suse.de> wrote:
>
> > On Wednesday June 18, jlayton@redhat.com wrote:
> > >
> > > No objection to the patch, but what signal was being sent to nfsd when
> > > you saw this? If it's anything but a SIGKILL, then I wonder if we have
> > > a race that we need to deal with. My understanding is that we have nfsd
> > > flip between 2 sigmasks to prevent anything but a SIGKILL from being
> > > delivered while we're handling the local filesystem operation.
> >
> > SuSE /etc/init.d/nfsserver does
> >
> > killproc -n -KILL nfsd
> >
> > so it looks like a SIGKILL.
> >
> >
> > >
> > > From nfsd():
> > >
> > > ----------[snip]-----------
> > > sigprocmask(SIG_SETMASK, &shutdown_mask, NULL);
> > >
> > > /*
> > > * Find a socket with data available and call its
> > > * recvfrom routine.
> > > */
> > > while ((err = svc_recv(rqstp, 60*60*HZ)) == -EAGAIN)
> > > ;
> > > if (err < 0)
> > > break;
> > > update_thread_usage(atomic_read(&nfsd_busy));
> > > atomic_inc(&nfsd_busy);
> > >
> > > /* Lock the export hash tables for reading. */
> > > exp_readlock();
> > >
> > > /* Process request with signals blocked. */
> > > sigprocmask(SIG_SETMASK, &allowed_mask, NULL);
> > >
> > > svc_process(rqstp);
> > >
> > > ----------[snip]-----------
> > >
> > > What happens if this catches a SIGINT after the err<0 check, but before
> > > the mask is set to allowed_mask? Does svc_process() then get called with
> > > a signal pending?
> >
> > Yes, I suspect it does.
> >
> > I wonder why we have all this mucking about this signal masks anyway.
> > Anyone have any ideas about what it actually achieves?
> >
>
> HCH asked me the same question when I did the conversion to kthreads.
> My interpretation (based on guesswork here) was that we wanted to
> distinguish between SIGKILL and other allowed signals. A SIGKILL is
> allowed to interrupt the underlying I/O, but other signals should not.
>
> The question to answer here, I suppose, is whether masking a pending
> signal is sufficient to make signal_pending() return false. If I'm
> looking correctly then the answer should be "yes".
Just looking out of curiosity: signal_pending() checks whether some
thread_info->flags has TIF_SIGPENDING set.
sigprocmask() sets current->blocked to the given set, then calls
recalc_sigpending(), which (ignoring some freezer and SIGSTOP code that
I don't understand), clears TIF_SIGPENDING if any pending signals are in
the newly blocked set. So, yes.
--b.
> So I don't think we
> have a race here after all. I suspect that if SuSE used a different
> signal here, that would prevent this from happening. For the record,
> both RHEL and Fedora's init scripts use SIGINT for this.
next prev parent reply other threads:[~2008-06-20 17:50 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20080619101025.24263.patches@notabene>
2008-06-19 0:11 ` [PATCH - take 2] knfsd: nfsd: Handle ERESTARTSYS from syscalls NeilBrown
2008-06-19 1:09 ` Jeff Layton
2008-06-19 2:29 ` Neil Brown
2008-06-19 10:38 ` Jeff Layton
[not found] ` <20080619063824.00ca6381-RtJpwOs3+0O+kQycOl6kW4xkIHaj4LzF@public.gmane.org>
2008-06-20 17:50 ` J. Bruce Fields [this message]
2008-06-20 17:50 ` J. Bruce Fields
2008-06-23 0:20 ` Neil Brown
[not found] ` <18526.60523.584503.68076-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>
2008-06-23 0:52 ` Jeff Layton
2008-06-23 0:52 ` Jeff Layton
2008-06-23 23:55 ` [PATCH - take 2] knfsd: nfsd: Handle ERESTARTSYS from syscalls. (and possible kthread_stop changes) Neil Brown
[not found] ` <18528.14347.525397.917553-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>
2008-06-30 12:35 ` Jeff Layton
2008-06-30 12:35 ` Jeff Layton
[not found] ` <20080630083536.680b093a-xSBYVWDuneFaJnirhKH9O4GKTjYczspe@public.gmane.org>
2008-06-30 17:10 ` J. Bruce Fields
2008-06-30 17:10 ` J. Bruce Fields
2008-06-30 18:09 ` Jeff Layton
[not found] ` <20080630140946.34154d4c-xSBYVWDuneFaJnirhKH9O4GKTjYczspe@public.gmane.org>
2008-06-30 19:29 ` J. Bruce Fields
2008-06-30 19:29 ` J. Bruce Fields
2008-06-20 17:34 ` [PATCH - take 2] knfsd: nfsd: Handle ERESTARTSYS from syscalls J. Bruce Fields
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080620175036.GC563@fieldses.org \
--to=bfields@fieldses.org \
--cc=jlayton@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-nfs@vger.kernel.org \
--cc=neilb@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.