public inbox for linux-kernel@vger.kernel.org
* [BUG] 2.4.x RT signal leak with kupdated (and maybe others)
From: Benjamin Herrenschmidt @ 2003-09-30 16:27 UTC (permalink / raw)
  To: Linux Kernel list; +Cc: Marcelo Tosatti

Hi!

I finally figured out why, on a friend's machine, nr_queued_signals
keeps growing until it reaches nr_max_signals. Once that happens, RT
signals can no longer be queued, which for example causes
do_notify_parent() to fail (libpthread uses sig 33, which is typically
RTMIN+1), leading to all sorts of zombies floating around, etc.

The problem is a bug in kupdated (possibly shared by other kernel code
manipulating a task's tsk->pending.signal mask "by hand") that gets
triggered, in this case, by the infamous noflushd, but other culprits
are possible.

The bug is simple: the SIGSTOP sent to kupdated gets queued (allocated
and queued, actually), since nowadays we try to queue one instance of a
non-RT signal as well.

However, when "receiving" it, kupdated "manually" clears it from the
pending signal mask and does _not_ dequeue it. Thus, that signal stays
forever in kupdated's signal queue: it is never deallocated and
nr_queued_signals is never decremented.

Actually, further SIGSTOPs will stack up there as well: since kupdated
clears the bit in tsk->pending.signal, further queuing won't "notice"
that one is already there.
That clearing also prevents handle_stop_signal() from flushing it from
the queue when SIGCONT is received.
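
To make the mechanism concrete, here is a toy user-space model of it.
This is only a sketch, not kernel code: every toy_* name is made up for
illustration, the single unsigned long stands in for
tsk->pending.signal, and the "skip queuing if the bit is already set"
rule is a simplification of what the send path does for non-RT signals.

```c
#include <assert.h>
#include <stdlib.h>

#define TOY_SIGSTOP 19

/* Hypothetical stand-ins for the per-task queue and pending mask. */
struct toy_sigqueue { struct toy_sigqueue *next; int sig; };
struct toy_pending {
    unsigned long mask;          /* models tsk->pending.signal */
    struct toy_sigqueue *head;   /* models the queued entries */
};

int toy_nr_queued;               /* models nr_queued_signals */

unsigned long toy_bit(int sig)
{
    assert(sig >= 1 && sig <= 31);
    return 1UL << (sig - 1);
}

/* Send a non-RT signal: allocate one entry unless the bit is already set. */
void toy_send(struct toy_pending *p, int sig)
{
    struct toy_sigqueue *q;

    if (p->mask & toy_bit(sig))
        return;                  /* already pending: nothing new is queued */
    q = malloc(sizeof(*q));
    if (q) {
        q->sig = sig;
        q->next = p->head;
        p->head = q;
        toy_nr_queued++;
    }
    p->mask |= toy_bit(sig);
}

/* The "by hand" handling: clear the bit, leave the queue entry alone. */
void toy_clear_by_hand(struct toy_pending *p, int sig)
{
    p->mask &= ~toy_bit(sig);
}

/* Models a full queue flush: free every entry and drop the counter. */
void toy_flush(struct toy_pending *p)
{
    while (p->head) {
        struct toy_sigqueue *q = p->head;
        p->head = q->next;
        free(q);
        toy_nr_queued--;
    }
    p->mask = 0;
}

/* Send + clear-by-hand 'rounds' times; return the counter before cleanup. */
int toy_leak_after(int rounds)
{
    struct toy_pending p = { 0, NULL };
    int i, leaked;

    toy_nr_queued = 0;
    for (i = 0; i < rounds; i++) {
        toy_send(&p, TOY_SIGSTOP);
        toy_clear_by_hand(&p, TOY_SIGSTOP);
    }
    leaked = toy_nr_queued;
    toy_flush(&p);               /* only so the demo itself does not leak */
    return leaked;
}
```

In this model each send/clear round leaves one more entry behind, so
toy_leak_after(n) returns n (barring malloc failure): the counter only
ever goes up until something flushes the whole queue.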

The only thing I can see that could get rid of those signals is
flush_sigqueue(), but of course this is never called for a kernel
thread like kupdated.

So there is a clear bug in kupdated. I suppose the fix is to call
something like dequeue_signal() from kupdated instead of hacking
tsk->pending.signal. I need to test a fix before I post a patch.
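
As a rough illustration of why dequeuing would cure it, here is a toy
user-space sketch. It is not the untested patch and not the real
dequeue_signal() interface: all toy_* names are hypothetical, and the
mask and queue are simplified stand-ins for the task's pending state.

```c
#include <assert.h>
#include <stdlib.h>

#define TOY_SIGSTOP 19

/* Hypothetical stand-ins for the per-task queue and pending mask. */
struct toy_sigqueue { struct toy_sigqueue *next; int sig; };
struct toy_pending {
    unsigned long mask;          /* models tsk->pending.signal */
    struct toy_sigqueue *head;   /* models the queued entries */
};

int toy_nr_queued;               /* models nr_queued_signals */

unsigned long toy_bit(int sig)
{
    assert(sig >= 1 && sig <= 31);
    return 1UL << (sig - 1);
}

/* Send a non-RT signal: allocate one entry unless the bit is already set. */
void toy_send(struct toy_pending *p, int sig)
{
    struct toy_sigqueue *q;

    if (p->mask & toy_bit(sig))
        return;
    q = malloc(sizeof(*q));
    if (q) {
        q->sig = sig;
        q->next = p->head;
        p->head = q;
        toy_nr_queued++;
    }
    p->mask |= toy_bit(sig);
}

/*
 * Proper consumption: unlink and free the queued entries for 'sig',
 * drop the counter for each, and only then clear the pending bit.
 * Returns 1 if the signal was pending, 0 otherwise.
 */
int toy_dequeue(struct toy_pending *p, int sig)
{
    struct toy_sigqueue **pp = &p->head;
    int was_pending = (p->mask & toy_bit(sig)) != 0;

    while (*pp) {
        if ((*pp)->sig == sig) {
            struct toy_sigqueue *q = *pp;
            *pp = q->next;
            free(q);
            toy_nr_queued--;
        } else {
            pp = &(*pp)->next;
        }
    }
    p->mask &= ~toy_bit(sig);
    return was_pending;
}

/* Send + dequeue 'rounds' times; return the counter afterwards. */
int toy_count_after_dequeue(int rounds)
{
    struct toy_pending p = { 0, NULL };
    int i;

    toy_nr_queued = 0;
    for (i = 0; i < rounds; i++) {
        toy_send(&p, TOY_SIGSTOP);
        toy_dequeue(&p, TOY_SIGSTOP);
    }
    return toy_nr_queued;
}
```

Here the counter returns to zero after every round, because the queue
entry and the mask bit are always dropped together.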

Do we have similar bugs in other bits of the kernel "manipulating"
the pending signal mask this way? I don't know what other code may do
here, so if you know of something like that, please speak up.

Ben.



