From: Oleg Nesterov <oleg@tv-sign.ru>
To: john stultz <johnstul@us.ibm.com>
Cc: Ingo Molnar <mingo@elte.hu>, Thomas Gleixner <tglx@linutronix.de>,
Steven Rostedt <rostedt@goodmis.org>,
linux-kernel@vger.kernel.org, Roland McGrath <roland@redhat.com>
Subject: Re: [RFC][PATCH] Avoid race w/ posix-cpu-timer and exiting tasks
Date: Wed, 14 Jun 2006 06:49:09 +0400 [thread overview]
Message-ID: <20060614024909.GA563@oleg> (raw)
john stultz wrote:
>
> Hey Ingo,
> We've occasionally come across OOPSes in posix-cpu-timer thread (as
> well as tripping over the BUG_ON(tsk->exit_state there) where it appears
> the task we're processing exits out on us while we're using it.
>
> Thus this fix tries to avoid running the posix-cpu-timers on a task that
> is exiting.
>
> --- 2.6-rt/kernel/posix-cpu-timers.c 2006-06-11 15:38:58.000000000 -0500
> +++ devrt/kernel/posix-cpu-timers.c 2006-06-12 10:52:20.000000000 -0500
> @@ -1290,12 +1290,15 @@
>
> #undef UNEXPIRED
>
> - BUG_ON(tsk->exit_state);
> -
> /*
> * Double-check with locks held.
> */
> read_lock(&tasklist_lock);
> + /* Make sure the task doesn't exit under us. */
> + if(unlikely(tsk->exit_state)) {
> + read_unlock(&tasklist_lock);
> + return;
> + }
> spin_lock(&tsk->sighand->siglock);
I strongly believe this BUG_ON() is indeed wrong, and I did a similar patch
a long ago:
[PATCH] posix-timers: remove false BUG_ON() from run_posix_cpu_timers()
Commit 3de463c7d9d58f8cf3395268230cb20a4c15bffa
However it was reverted due to some unclear problems (I think those problems
were not related to this patch).
Instead this patch was added:
[PATCH] Yet more posix-cpu-timer fixes
Commit 3de463c7d9d58f8cf3395268230cb20a4c15bffa
and I still think this patch is not correct.
Quoting myself:
>
> Roland McGrath wrote:
> >
> > @@ -566,6 +566,9 @@ static void arm_timer(struct k_itimer *t
> > struct cpu_timer_list *next;
> > unsigned long i;
> >
> > + if (CPUCLOCK_PERTHREAD(timer->it_clock) && (p->flags & PF_EXITING))
> > + return;
> > +
>
> Why CPUCLOCK_PERTHREAD() ?.
>
> Also, this is racy, no? Why should arm_timer() see PF_EXITING which is
> set on another cpu without any barriers/locking? After all, arm_timer()
> can test PF_EXITING before do_exit() sets this flag, but set ->it_xxx_expires
> after do_exit() resets it.
Oleg.
next reply other threads:[~2006-06-13 22:49 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-06-14 2:49 Oleg Nesterov [this message]
2006-06-13 23:05 ` [RFC][PATCH] Avoid race w/ posix-cpu-timer and exiting tasks john stultz
2006-06-15 1:24 ` Oleg Nesterov
-- strict thread matches above, loose matches on Subject: below --
2006-06-13 0:25 john stultz
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20060614024909.GA563@oleg \
--to=oleg@tv-sign.ru \
--cc=johnstul@us.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=roland@redhat.com \
--cc=rostedt@goodmis.org \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox