From: "Paul E. McKenney" <paulmck@us.ibm.com>
To: Oleg Nesterov <oleg@tv-sign.ru>
Cc: akpm@osdl.org, linux-kernel@vger.kernel.org, dipankar@in.ibm.com,
mingo@elte.hu, suzannew@cs.pdx.edu
Subject: Re: [PATCH] Additional/catchup RCU signal fixes for -mm
Date: Sat, 5 Nov 2005 17:00:04 -0800 [thread overview]
Message-ID: <20051106010004.GB20178@us.ibm.com> (raw)
In-Reply-To: <436CDEAF.E236BC40@tv-sign.ru>
On Sat, Nov 05, 2005 at 07:32:47PM +0300, Oleg Nesterov wrote:
> "Paul E. McKenney" wrote:
> >
> > @@ -1386,7 +1387,7 @@ send_sigqueue(int sig, struct sigqueue *
> > {
> > unsigned long flags;
> > int ret = 0;
> > - struct sighand_struct *sh = p->sighand;
> > + struct sighand_struct *sh;
> >
> > BUG_ON(!(q->flags & SIGQUEUE_PREALLOC));
> >
> > @@ -1405,7 +1406,15 @@ send_sigqueue(int sig, struct sigqueue *
> > goto out_err;
> > }
> >
> > +retry:
> > + sh = rcu_dereference(p->sighand);
> > +
> > spin_lock_irqsave(&sh->siglock, flags);
> > + if (p->sighand != sh) {
> > + /* We raced with exec() in a multithreaded process... */
> > + spin_unlock_irqrestore(&sh->siglock, flags);
> > + goto retry;
>
> p->sighand can't be changed, de_thread calls exit_itimers() before
> changing ->sighand. But I still think it can be NULL, and send_sigqueue()
> should return -1 in that case.
OK, I put the NULL check in with my previous patch.
And you are absolutely right in the de_thread() case. I need to add
more cases to steamroller...
> > @@ -1464,15 +1473,8 @@ send_group_sigqueue(int sig, struct sigq
> >
> > BUG_ON(!(q->flags & SIGQUEUE_PREALLOC));
> >
> > - while (!read_trylock(&tasklist_lock)) {
> > - if (!p->sighand)
> > - return -1;
> > - cpu_relax();
> > - }
> > - if (unlikely(!p->sighand)) {
> > - ret = -1;
> > - goto out_err;
> > - }
> > + read_lock(&tasklist_lock);
> > + /* Since it_lock is held, p->sighand cannot be NULL. */
> > spin_lock_irqsave(&p->sighand->siglock, flags);
>
> Again, I think the comment is wrong.
>
> However, now I think we really have a race with exec, and this race was not
> introduced by your patches!
This patch was not mine, though I guess that it is by now. ;-)
> If !thread_group_leader() does exec de_thread() calls release_task(->group_leader)
> before calling exit_itimers(). This means that send_group_sigqueue() which
> always has p == ->group_leader parameter can oops here.
But in that case, __exit_sighand(->group_leader) would have been called,
so ->sighand would be NULL. And none of this can change while we are holding
tasklist_lock.
If we don't want to be hitting the exec()ed task with a signal, the
thing to do would be to drop the signal, as in the attached patch.
I believe that this is an acceptable approach, since had the timer
fired slightly later, it would have been disabled, right?
Thoughts?
Thanx, Paul
Signed-off-by: <paulmck@us.ibm.com>
diff -urpNa -X dontdiff linux-2.6.14-mm0-fix-2/kernel/signal.c linux-2.6.14-mm0-fix-3/kernel/signal.c
--- linux-2.6.14-mm0-fix-2/kernel/signal.c 2005-11-05 15:05:38.000000000 -0800
+++ linux-2.6.14-mm0-fix-3/kernel/signal.c 2005-11-05 16:27:52.000000000 -0800
@@ -1481,6 +1481,10 @@ send_group_sigqueue(int sig, struct sigq
read_lock(&tasklist_lock);
while (p->group_leader != p)
p = p->group_leader;
+ if (p->sighand == NULL) {
+ ret = 1;
+ goto out_err;
+ }
spin_lock_irqsave(&p->sighand->siglock, flags);
handle_stop_signal(sig, p);
next prev parent reply other threads:[~2005-11-06 1:00 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-11-05 1:36 [PATCH] Additional/catchup RCU signal fixes for -mm Paul E. McKenney
2005-11-05 16:32 ` Oleg Nesterov
2005-11-06 1:00 ` Paul E. McKenney [this message]
2005-11-06 14:17 ` Oleg Nesterov
2005-11-06 14:46 ` Oleg Nesterov
2005-11-06 23:02 ` Paul E. McKenney
2005-11-06 14:32 ` Posix timers vs exec problems Oleg Nesterov
2005-11-07 18:12 ` [PATCH] fix de_thread() vs send_group_sigqueue() race Oleg Nesterov
2005-11-08 20:36 ` Chris Wright
2005-11-08 20:55 ` Linus Torvalds
2005-11-16 23:26 ` [PATCH] sigaction should clear all signals on SIG_IGN, not just < 32 George Anzinger
2005-11-22 1:09 ` Thread group exec race -> null pointer... HELP George Anzinger
2005-11-22 14:45 ` Oleg Nesterov
2005-11-23 20:30 ` George Anzinger
2005-11-25 15:03 ` Ingo Molnar
2005-11-22 19:20 ` [PATCH] fix do_wait() vs exec() race Oleg Nesterov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20051106010004.GB20178@us.ibm.com \
--to=paulmck@us.ibm.com \
--cc=akpm@osdl.org \
--cc=dipankar@in.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=oleg@tv-sign.ru \
--cc=suzannew@cs.pdx.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox