From: Oleg Nesterov <oleg@tv-sign.ru>
To: Roland McGrath <roland@redhat.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH 2/2] ptrace children revamp
Date: Sat, 5 Apr 2008 18:06:56 +0400 [thread overview]
Message-ID: <20080405140656.GA109@tv-sign.ru> (raw)
In-Reply-To: <20080404210033.F26CC26F8DC@magilla.localdomain>
On 04/04, Roland McGrath wrote:
>
> > Unless I missed something again, this is not 100% right.
> >
> > Suppose that we are ->real_parent, the child was traced by us, we ignore
> > SIGCHLD, and we are doing the threaded reparenting. In that case we should
> > release the child if it is dead, like the current code does.
>
> I tried to construct a test case to demonstrate this behavior on the
> existing kernel. I couldn't manage to get it to do this. Are you this
> is "like the current code does"? I'll send you my test case attempts
> off-list and we can try together to get the right test to show this. If
> it turns out that it does not really behave this way now and so my code
> is not introducing any regression, then (as usual) I'd rather fix this
> separately after the cleanup.
Heh. You are right, the current kernel has the same (minor) bug.
I beleive it was introduced by b2b2cbc4b2a2f389442549399a993a8306420baf.
Before this patch we notified the new parent even if it is from the same
thread group. If SIGCHLD is ignored, do_notify_parent() sets ->exit_signal = -1,
and then forget_original_parent() notices this and adds the child to ptrace_dead.
Now we don't notify the parent, and so the child "hangs".
However, now I think your patch adds a more serious problem, we can really
leak a zombie. Suppose that that we are ->real_parent, the child was traced
by us, it is zombie now, and the child is not a group leader. No matter who
is the new parent, no matter what is the state of SIGCHLD, we must not reparent
the child: nobody can release it except us. It is not traced any longer,
its ->exit_signal == -1, eligible_child() doesn't like this.
No?
> > Even if we don't ignore SIGCHLD, we should notify our thread-group, but
> > reparent_thread() skips do_notify_parent() when the new parent is from
> > is from the same thread-group.
>
> Why do you think this is wrong? The notification is always group-wide
> (a SIGCHLD to the group, and wakeup of the group-shared wait_chldexit).
Well, I was thinking about another thread (the new parent) sleeping in
do_wait(__WNOTHREAD)... not sure this really matters, though.
But,
> So there already was a notification, and a second one seems wrong to me.
Yes, I missed this. The second signal doesn't look right.
> > Note also that reparent_thread() has a very old bug. When it sees the
> > EXIT_ZOMBIE task, it notifies the new parent, but doesn't check if the task
> > becomes detached because the new parent ignores SIGCHLD. Currently this means
> > that init must have a handler for SIGCHLD or we leak zombies.
>
> I had noticed that. After the cleanup, we can look into fixing that.
> As usual, I am first going for a set of cleanup patches that changes
> exactly nothing semantically.
Yes, yes, I agree. I just wanted to be sure you didn't miss this bug.
Oleg.
next prev parent reply other threads:[~2008-04-05 15:01 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-03-29 3:34 [PATCH 1/2] do_wait reorganization Roland McGrath
2008-03-29 3:35 ` [PATCH 2/2] ptrace children revamp Roland McGrath
2008-03-29 10:39 ` Oleg Nesterov
2008-03-29 13:10 ` Oleg Nesterov
2008-03-29 14:37 ` Oleg Nesterov
2008-04-04 21:00 ` Roland McGrath
2008-04-05 14:06 ` Oleg Nesterov [this message]
2008-04-09 20:15 ` Roland McGrath
2008-04-13 14:24 ` Oleg Nesterov
2008-04-15 1:41 ` Roland McGrath
2008-03-29 10:35 ` [PATCH 1/2] do_wait reorganization Oleg Nesterov
2008-03-31 3:54 ` Roland McGrath
2008-03-29 16:16 ` Linus Torvalds
2008-03-31 3:27 ` Roland McGrath
2008-03-31 3:57 ` [PATCH 1/3] " Roland McGrath
2008-03-31 3:59 ` [PATCH 2/3] ptrace children revamp Roland McGrath
2008-03-31 9:12 ` Oleg Nesterov
2008-03-31 3:59 ` [PATCH 3/3] do_wait: return security_task_wait() error code in place of -ECHILD Roland McGrath
2008-03-31 11:03 ` Oleg Nesterov
2008-03-31 20:20 ` Roland McGrath
2008-03-31 8:51 ` [PATCH 1/3] do_wait reorganization Oleg Nesterov
2008-03-31 20:29 ` Roland McGrath
2008-03-31 20:07 ` Oleg Nesterov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080405140656.GA109@tv-sign.ru \
--to=oleg@tv-sign.ru \
--cc=akpm@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=roland@redhat.com \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox