From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Steven Rostedt <rostedt@goodmis.org>
Cc: linux-kernel@vger.kernel.org, ngo Molnar <mingo@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
stable@vger.kernel.org
Subject: Re: [PATCH 4/4] tracing: Remove RCU work arounds from stack tracer
Date: Sat, 23 Sep 2017 10:15:58 -0700 [thread overview]
Message-ID: <20170923171558.GD3521@linux.vnet.ibm.com> (raw)
In-Reply-To: <20170923072204.7662af1c@gandalf.local.home>
On Sat, Sep 23, 2017 at 07:22:04AM -0400, Steven Rostedt wrote:
> On Fri, 22 Sep 2017 23:07:37 -0700
> "Paul E. McKenney" <paulmck@linux.vnet.ibm.com> wrote:
>
> > OK, how about the following?
> >
> > It turns out that functions called from rcu_irq_enter() can
> > be subject to various kinds of tracing, which can result in
> > rcu_irq_enter() being invoked recursively. This recursion
> > causes RCU to not have been watching when it should have,
> > resulting in lockdep-RCU splats. Switching from rcu_irq_enter()
> > to rcu_nmi_enter() still resulted in failures because of calls
> > to rcu_irq_enter() between the rcu_nmi_enter() and its matching
> > rcu_nmi_exit(). Such calls again cause RCU to not be watching
> > when it should have been.
> >
> > In particular, the stack tracer called rcu_irq_enter()
> > unconditionally, which is problematic when RCU's last
> > not-watching-to-watching transition was carried out by
> > rcu_nmi_enter(), as will be the case when tracing uses
> > rcu_nmi_enter() to cause RCU to start watching the current CPU.
> > In that case, rcu_irq_enter() actually switches RCU back to
> > the not-watching state for this CPU, which results in lockdep
> > splats complaining about rcu_read_lock() being used on an idle
> > (not-watched) CPU. The first patch of this series addressed
> > this problem by having rcu_irq_enter() and rcu_irq_exit()
> > refrain from doing anything when rcu_nmi_enter() caused RCU to
> > start watching this CPU. The third patch in this series caused
> > save_stack_trace() to invoke rcu_nmi_enter() and rcu_nmi_exit()
> > as needed, so this fourth patch now removes the rcu_irq_enter()
> > and rcu_irq_exit() from within the stack tracer.
>
> I think it's a bit too much ;-) I believe it talks too much about the
> RCU internals, and the bug will be lost on us mere mortals.
>
> >
> > > Actually, thinking about this more, this doesn't need to go in stable.
> > > As recursive rcu_irq_enter() calls should not hurt, and you now allow
> > > rcu_irq_enter() to be called even after a rcu_nmi_enter() right?
> >
> > Yes, it is now the case that rcu_irq_enter() can be called even after
> > an rcu_nmi_enter() exited idle, because rcu_irq_enter() now checks for
> > this and takes an early exit if so.
> >
> > But what is it about older kernels prevents the tracing-induced recursive
> > calls to rcu_irq_enter()?
>
> Ah, the bug is if rcu_irq_enter() is called, and the stack trace
> triggers then. OK, lets keep it simple, and just say this.
>
>
> Currently the stack tracer calls rcu_irq_enter() to make sure RCU
> is watching when it records a stack trace. But if the stack tracer
> is triggered while tracing inside of a rcu_irq_enter(), calling
> rcu_irq_enter() unconditionally can be problematic.
>
> The reason for having rcu_irq_enter() in the first place has been
> fixed from within the saving of the stack trace code, and there's no
> reason for doing it in the stack tracer itself. Just remove it.
>
> Simple and to the point.
Works for me!
Thanx, Paul
next prev parent reply other threads:[~2017-09-23 17:16 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-09-22 22:15 [PATCH 0/4] rcu/tracing/extable: Fix stack dump when RCU is not watching Steven Rostedt
2017-09-22 22:15 ` [PATCH 1/4] rcu: Allow for page faults in NMI handlers Steven Rostedt
2017-09-22 22:15 ` [PATCH 2/4] extable: Consolidate *kernel_text_address() functions Steven Rostedt
2017-09-22 22:40 ` Paul E. McKenney
2017-09-22 22:15 ` [PATCH 3/4] extable: Enable RCU if it is not watching in kernel_text_address() Steven Rostedt
2017-09-22 22:28 ` Josh Poimboeuf
2017-09-23 1:12 ` [PATCH 3/4 v2] " Steven Rostedt
2017-09-22 22:44 ` [PATCH 3/4] " Paul E. McKenney
2017-09-23 1:09 ` Steven Rostedt
2017-09-22 22:15 ` [PATCH 4/4] tracing: Remove RCU work arounds from stack tracer Steven Rostedt
2017-09-22 22:54 ` Paul E. McKenney
2017-09-23 1:27 ` Steven Rostedt
2017-09-23 6:07 ` Paul E. McKenney
2017-09-23 11:22 ` Steven Rostedt
2017-09-23 17:15 ` Paul E. McKenney [this message]
-- strict thread matches above, loose matches on Subject: below --
2017-09-23 20:56 [PATCH 0/4] [GIT PULL] tracing/rcu: Fix save_stack_trace() called when RCU is not watching Steven Rostedt
2017-09-23 20:56 ` [PATCH 4/4] tracing: Remove RCU work arounds from stack tracer Steven Rostedt
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170923171558.GD3521@linux.vnet.ibm.com \
--to=paulmck@linux.vnet.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@kernel.org \
--cc=rostedt@goodmis.org \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox