public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: nicolas saenz julienne <nsaenz@kernel.org>
To: Frederic Weisbecker <frederic@kernel.org>
Cc: LKML <linux-kernel@vger.kernel.org>,
	Peter Zijlstra <peterz@infradead.org>,
	Phil Auld <pauld@redhat.com>, Alex Belits <abelits@marvell.com>,
	Xiongfeng Wang <wangxiongfeng2@huawei.com>,
	Neeraj Upadhyay <quic_neeraju@quicinc.com>,
	Thomas Gleixner <tglx@linutronix.de>,
	Yu Liao <liaoyu15@huawei.com>, Boqun Feng <boqun.feng@gmail.com>,
	"Paul E . McKenney" <paulmck@kernel.org>,
	Marcelo Tosatti <mtosatti@redhat.com>,
	Paul Gortmaker <paul.gortmaker@windriver.com>,
	Uladzislau Rezki <uladzislau.rezki@sony.com>,
	Joel Fernandes <joel@joelfernandes.org>,
	Mark Rutland <mark.rutland@arm.com>
Subject: Re: [PATCH 20/21] rcu/context_tracking: Merge dynticks counter and context tracking states
Date: Tue, 31 May 2022 18:15:36 +0200	[thread overview]
Message-ID: <bcc4c9fa41ace6f9d98d88d24d1bd67a469cbfeb.camel@kernel.org> (raw)
In-Reply-To: <20220531142335.GA1272449@lothringen>

On Tue, 2022-05-31 at 16:23 +0200, Frederic Weisbecker wrote:
> On Mon, May 30, 2022 at 08:02:57PM +0200, nicolas saenz julienne wrote:
> > Hi Frederic,
> > 
> > On Thu, 2022-05-19 at 16:58 +0200, Frederic Weisbecker wrote:
> > > Updating the context tracking state and the RCU dynticks counter
> > > atomically in a single operation is a first step towards improving CPU
> > > isolation. This makes the context tracking state updates fully ordered
> > > and therefore allow for later enhancements such as postponing some work
> > > while a task is running isolated in userspace until it ever comes back
> > > to the kernel.
> > > 
> > > The state field becomes divided in two parts:
> > > 
> > > 1) Two Lower bits for context tracking state:
> > > 
> > > 	CONTEXT_KERNEL = 0
> > >    	CONTEXT_IDLE = 1,
> > > 	CONTEXT_USER = 2,
> > > 	CONTEXT_GUEST = 3,
> > > 
> > > 2) Higher bits for RCU eqs dynticks counting:
> > > 
> > >     RCU_DYNTICKS_IDX = 4
> > > 
> > >    The dynticks counting is always incremented by this value.
> > >    (state & RCU_DYNTICKS_IDX) means we are NOT in an extended quiescent
> > >    state. This makes the chance for a collision more likely between two
> > >    RCU dynticks snapshots but wrapping up 28 bits of eqs dynticks
> > >    increments still takes some bad luck (also rdp.dynticks_snap could be
> > >    converted from int to long?)
> > > 
> > > Some RCU eqs functions have been renamed to better reflect their broader
> > > scope that now include context tracking state.
> > > 
> > > Signed-off-by: Frederic Weisbecker <frederic@kernel.org>
> > > Cc: Paul E. McKenney <paulmck@kernel.org>
> > > Cc: Peter Zijlstra <peterz@infradead.org>
> > > Cc: Thomas Gleixner <tglx@linutronix.de>
> > > Cc: Neeraj Upadhyay <quic_neeraju@quicinc.com>
> > > Cc: Uladzislau Rezki <uladzislau.rezki@sony.com>
> > > Cc: Joel Fernandes <joel@joelfernandes.org>
> > > Cc: Boqun Feng <boqun.feng@gmail.com>
> > > Cc: Nicolas Saenz Julienne <nsaenz@kernel.org>
> > > Cc: Marcelo Tosatti <mtosatti@redhat.com>
> > > Cc: Xiongfeng Wang <wangxiongfeng2@huawei.com>
> > > Cc: Yu Liao<liaoyu15@huawei.com>
> > > Cc: Phil Auld <pauld@redhat.com>
> > > Cc: Paul Gortmaker<paul.gortmaker@windriver.com>
> > > Cc: Alex Belits <abelits@marvell.com>
> > > ---
> > 
> > While working on a feature on top of this series (IPI deferral stuff) I believe
> > I've found a discrepancy on how context state is being updated:
> > 
> >  - When servicing an IRQ from user-space, we increment dynticks, and clear the
> >    ct state to show we're in-kernel.
> > 
> >  - When servicing an IRQ from idle/guest or an NMI from any context we only
> >    increment the dynticks counter. The ct state remains unchanged.
> 
> Hmm, an IRQ from userspace does:
> 
>     ct_user_enter()
>     //run in user
>         //-----IRQ
>         ct_user_exit()
>         ct_irq_enter()
>         ct_irq_exit()
>         ct_user_enter()
>     //run in user
> 
> An IRQ from guest does:
> 
>     for (;;) {
>          context_tracking_guest_enter()
>         //vmrun
> 	//IRQ pending
>         #VMEXIT
>         context_tracking_guest_exit()
> 	local_irq_enable()
>         ct_irq_enter()
>         ct_irq_exit()
> 	local_irq_disable()
>     }
> 
> 
>     (although I see there is an "sti" right before "vmrun" so it looks
>     possible to have ct_irq_enter() after context_tracking_guest_enter()
>     if a host IRQ fires between the sti and the vmrun though I might be
>     missing some kvm subtelty).
> 
> An IRQ from idle does just:
> 
>     ct_idle_enter()
>         //IRQ
>         ct_irq_enter()
>         ct_irq_exit()
>     ct_idle_exit()
> 
> So guest looks mostly ok to me (except for the little sti before vmrun for
> which I have a doubt).

Yes, shouldn't have mentioned guests. I got carried away.

> But idle at least is an exception and CONTEXT_IDLE will remain during the
> interrupt handling. It's not that trivial to handle the idle case because
> ct_irq_exit() needs to know that it is called between ct_idle_enter() and
> ct_idle_exit().

Just for the record, this behaviour was already here regardless of this series,
so it's not something it needs to fix.

Something like this should work, right?

     ct_idle_enter()
         //IRQ or NMI
	 if (__ct_state() == CONTEXT_IDLE)
	     ct_idle_exit()
         ct_irq_enter()
	    ...
         ct_irq_exit()
	 if (needs_update_state()) //using irqentry_state_t for ex.
	     ct_idle_entry()
     ct_idle_exit()

Note that it's not a big issue as we can work around this behaviour by checking
through dynticks whether a CPU is really idle.

Do you think it's worth fixing nonetheless?

Regards,
Nicolas

  reply	other threads:[~2022-05-31 16:15 UTC|newest]

Thread overview: 27+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-05-19 14:58 [PATCH 00/21] rcu/context-tracking: Merge RCU eqs-dynticks counter to context tracking v3 Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 01/21] context_tracking: Remove unused context_tracking_in_user() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 02/21] rcu: Tag rcu_irq_*_irqson() as noinstr Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 03/21] context_tracking: Add a note about noinstr VS unsafe context tracking functions Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 04/21] context_tracking: Rename __context_tracking_enter/exit() to __ct_user_enter/exit() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 05/21] context_tracking: Rename context_tracking_user_enter/exit() to user_enter/exit_callable() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 06/21] context_tracking: Rename context_tracking_enter/exit() to ct_user_enter/exit() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 07/21] context_tracking: Rename context_tracking_cpu_set() to ct_cpu_track_user() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 08/21] context_tracking: Split user tracking Kconfig Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 09/21] context_tracking: Take idle eqs entrypoints over RCU Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 10/21] context_tracking: Take IRQ " Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 11/21] context_tracking: Take NMI " Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 12/21] rcu/context-tracking: Remove rcu_irq_enter/exit() Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 13/21] rcu/context_tracking: Move dynticks counter to context tracking Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 14/21] rcu/context_tracking: Move dynticks_nesting " Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 15/21] rcu/context_tracking: Move dynticks_nmi_nesting " Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 16/21] rcu/context-tracking: Move deferred nocb resched " Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 17/21] rcu/context-tracking: Move RCU-dynticks internal functions to context_tracking Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 18/21] rcu/context-tracking: Remove unused and/or unecessary middle functions Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 19/21] context_tracking: Convert state to atomic_t Frederic Weisbecker
2022-05-19 14:58 ` [PATCH 20/21] rcu/context_tracking: Merge dynticks counter and context tracking states Frederic Weisbecker
2022-05-30 18:02   ` nicolas saenz julienne
2022-05-31 14:23     ` Frederic Weisbecker
2022-05-31 16:15       ` nicolas saenz julienne [this message]
2022-06-08 14:29         ` Frederic Weisbecker
2022-06-08 17:43           ` nicolas saenz julienne
2022-05-19 14:58 ` [PATCH 21/21] MAINTAINERS: Add Paul as context tracking maintainer Frederic Weisbecker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bcc4c9fa41ace6f9d98d88d24d1bd67a469cbfeb.camel@kernel.org \
    --to=nsaenz@kernel.org \
    --cc=abelits@marvell.com \
    --cc=boqun.feng@gmail.com \
    --cc=frederic@kernel.org \
    --cc=joel@joelfernandes.org \
    --cc=liaoyu15@huawei.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mark.rutland@arm.com \
    --cc=mtosatti@redhat.com \
    --cc=paul.gortmaker@windriver.com \
    --cc=pauld@redhat.com \
    --cc=paulmck@kernel.org \
    --cc=peterz@infradead.org \
    --cc=quic_neeraju@quicinc.com \
    --cc=tglx@linutronix.de \
    --cc=uladzislau.rezki@sony.com \
    --cc=wangxiongfeng2@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox