From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Frederic Weisbecker <fweisbec@gmail.com>
Cc: LKML <linux-kernel@vger.kernel.org>, Ingo Molnar <mingo@elte.hu>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Thomas Gleixner <tglx@linutronix.de>,
Milton Miller <miltonm@bga.com>
Subject: Re: [PATCH 1/4] rcu: Detect uses of rcu read side in extended quiescent states
Date: Sat, 18 Jun 2011 09:36:24 -0700 [thread overview]
Message-ID: <20110618163624.GC2238@linux.vnet.ibm.com> (raw)
In-Reply-To: <20110618161032.GB9266@somewhere>
On Sat, Jun 18, 2011 at 06:10:36PM +0200, Frederic Weisbecker wrote:
> On Sat, Jun 18, 2011 at 09:04:27AM -0700, Paul E. McKenney wrote:
> > On Sat, Jun 18, 2011 at 04:23:58PM +0200, Frederic Weisbecker wrote:
> > > On Fri, Jun 17, 2011 at 04:19:03PM -0700, Paul E. McKenney wrote:
> > > > On Fri, Jun 10, 2011 at 02:50:43AM +0200, Frederic Weisbecker wrote:
> > > > > On Thu, Jun 09, 2011 at 05:23:50PM -0700, Paul E. McKenney wrote:
> > > > > > On Fri, Jun 10, 2011 at 01:47:24AM +0200, Frederic Weisbecker wrote:
> > > > > > > Detect uses of rcu that are not supposed to happen when we
> > > > > > > are in an extended quiescent state.
> > > > > > >
> > > > > > > This can happen for example if we use rcu between the time we
> > > > > > > stop the tick and the time we restart it. Or inside an irq that
> > > > > > > didn't use rcu_irq_enter,exit() or other possible kind of rcu API
> > > > > > > misuse.
> > > > > > >
> > > > > > > v2: Rebase against latest rcu changes, handle tiny RCU as well
> > > > > >
> > > > > > Good idea on checking for RCU read-side critical sections happening
> > > > > > in dyntick-idle periods!
> > > > > >
> > > > > > But wouldn't it be better to put the checks in rcu_read_lock() and
> > > > > > friends? The problem I see with putting them in rcu_dereference_check()
> > > > > > is that someone can legitimately do something like the following
> > > > > > while in dyntick-idle mode:
> > > > > >
> > > > > > spin_lock(&mylock);
> > > > > > /* do a bunch of stuff */
> > > > > > p = rcu_dereference_check(myrcuptr, lockdep_is_held(&mylock));
> > > > > >
> > > > > > The logic below would complain about this usage, despite the fact
> > > > > > that it is perfectly safe because the update-side lock is held.
> > > > > >
> > > > > > Make sense, or am I missing something?
> > > > > >
> > > > > > Thanx, Paul
> > > > >
> > > > > I'm an idiot. I put my check in rcu_dereference_check() on purpose because
> > > > > it's always called from places that check one of the rcu locks are held,
> > > > > but I forgot that's also used for custom conditions with the _check()
> > > > > things.
> > > > >
> > > > > That said, putting the check in rcu_read_lock() and alike would only work
> > > > > with rcu_read_lock() itself. Few users of rcu_read_lock_sched() actually
> > > > > call it explicitely but rely on irq disabled or preempt disabled. And I can't put the
> > > > > checks there as it's fine to disabled irqs in dyntick idle.
> > > > >
> > > > > What about the below? (untested yet)
> > > > >
> > > > > And I would print the state of dynticks-idle mode in the final lockdep warning.
> > > >
> > > > Printing the dynticks-idle mode would be quite good!
> > > >
> > > > However, it is possible to have an RCU read-side critical section that does
> > > > not have an rcu_dereference() or an rcu_read_lock_held(). So I do believe
> > > > that we really do need rcu_read_lock() and friends to do this checking.
> > >
> > > Right, then we need to check everything: rcu_read_lock() and friends in case
> > > we have no rcu_read_lock_held() check made (ie: no rcu_dereference_check()),
> > > but also rcu_read_lock_held()/rcu_read_lock_sched_held()/... because preempt_disable(),
> > > local_irq_disable(), local_bh_disable() can't be checked so for rcu sched and rcu bh
> > > we can only check the ...held() things.
> >
> > Good point. To make sure I understand, we have different approaches
> > for the different types of RCU. We need to instrument the following
> > primitives:
> >
> > 1. rcu_read_lock() because a stray rcu_dereference() will already
> > be caught by PROVE_RCU.
> > 2. rcu_read_lock_bh_held() because it is OK to do local_bh_disable()
> > in dyntick-idle mode, so we cannot prohibit all of the
> > read-acquisition cases. We can also instrument rcu_read_lock_bh()
> > to catch RCU-bh read-side critical sections that don't happen
> > to contain rcu_dereference_bh().
> > 3. rcu_read_lock_sched_held() because it is OK to do preempt_disable()
> > and local_irq_save() from dyntick-idle mode, so we again
> > cannot prohibit all of the read-acquisition cases. We can
> > also instrument rcu_read_lock_sched() to catch RCU-sched
> > read-side critical sections that don't happen to contain
> > rcu_dereference_sched().
>
> Exactly!
>
> > 4. srcu_read_lock() because a stray srcu_dereference() will already
> > by caught by PROVE_RCU.
>
> Well I have no idea how srcu works so I'll first focus on the rcu part :)
Fair enough! ;-)
> > We miss a few cases, for example, an RCU-sched read-side critical
> > section that uses local_irq_disable(), but that also does not contain
> > an rcu_dereference_sched(). But still this sounds quite worthwhile.
>
> Right.
>
> So I'll send a v3 that takes the above point into accounts.
Looking forward to seeing it!
Thanx, Paul
next prev parent reply other threads:[~2011-06-18 16:36 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-06-09 23:47 [PATCH 0/4 v2] rcu: Detect rcu uses under extended quiescent state, and fix some Frederic Weisbecker
2011-06-09 23:47 ` [PATCH 1/4] rcu: Detect uses of rcu read side in extended quiescent states Frederic Weisbecker
2011-06-10 0:23 ` Paul E. McKenney
2011-06-10 0:50 ` Frederic Weisbecker
2011-06-17 23:19 ` Paul E. McKenney
2011-06-18 14:23 ` Frederic Weisbecker
2011-06-18 16:04 ` Paul E. McKenney
2011-06-18 16:10 ` Frederic Weisbecker
2011-06-18 16:36 ` Paul E. McKenney [this message]
2011-06-09 23:47 ` [PATCH 2/4] nohz: Split extended quiescent state handling from nohz switch Frederic Weisbecker
2011-06-09 23:47 ` [PATCH 3/4] x86: Don't call idle notifier inside rcu extended QS Frederic Weisbecker
2011-06-09 23:47 ` [PATCH 4/4] x86: Call idle_exit() after irq_enter() Frederic Weisbecker
-- strict thread matches above, loose matches on Subject: below --
2011-06-06 3:10 [PATCH 0/4] rcu: Detect rcu uses under extended quiescent state, and fix some Frederic Weisbecker
2011-06-06 3:10 ` [PATCH 1/4] rcu: Detect uses of rcu read side in extended quiescent states Frederic Weisbecker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110618163624.GC2238@linux.vnet.ibm.com \
--to=paulmck@linux.vnet.ibm.com \
--cc=a.p.zijlstra@chello.nl \
--cc=fweisbec@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=miltonm@bga.com \
--cc=mingo@elte.hu \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).