From: Catalin Marinas <catalin.marinas@arm.com>
To: "Okanovic, Haris" <harisokn@amazon.com>
Cc: "kvm@vger.kernel.org" <kvm@vger.kernel.org>,
"rafael@kernel.org" <rafael@kernel.org>,
"sudeep.holla@arm.com" <sudeep.holla@arm.com>,
"joao.m.martins@oracle.com" <joao.m.martins@oracle.com>,
"ankur.a.arora@oracle.com" <ankur.a.arora@oracle.com>,
"dave.hansen@linux.intel.com" <dave.hansen@linux.intel.com>,
"konrad.wilk@oracle.com" <konrad.wilk@oracle.com>,
"wanpengli@tencent.com" <wanpengli@tencent.com>,
"cl@gentwo.org" <cl@gentwo.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"mingo@redhat.com" <mingo@redhat.com>,
"maobibo@loongson.cn" <maobibo@loongson.cn>,
"pbonzini@redhat.com" <pbonzini@redhat.com>,
"tglx@linutronix.de" <tglx@linutronix.de>,
"misono.tomohiro@fujitsu.com" <misono.tomohiro@fujitsu.com>,
"daniel.lezcano@linaro.org" <daniel.lezcano@linaro.org>,
"arnd@arndb.de" <arnd@arndb.de>,
"lenb@kernel.org" <lenb@kernel.org>,
"will@kernel.org" <will@kernel.org>,
"hpa@zytor.com" <hpa@zytor.com>,
"peterz@infradead.org" <peterz@infradead.org>,
"boris.ostrovsky@oracle.com" <boris.ostrovsky@oracle.com>,
"vkuznets@redhat.com" <vkuznets@redhat.com>,
"bp@alien8.de" <bp@alien8.de>,
"linux-pm@vger.kernel.org" <linux-pm@vger.kernel.org>,
"linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
"mtosatti@redhat.com" <mtosatti@redhat.com>,
"x86@kernel.org" <x86@kernel.org>,
"mark.rutland@arm.com" <mark.rutland@arm.com>
Subject: Re: [PATCH 1/5] asm-generic: add smp_vcond_load_relaxed()
Date: Wed, 6 Nov 2024 19:55:46 +0000 [thread overview]
Message-ID: <ZyvJwjfKgnqMpM9P@arm.com> (raw)
In-Reply-To: <b62d938111c6ce52b91d0f2e3922857c5d4ef253.camel@amazon.com>
On Wed, Nov 06, 2024 at 06:13:35PM +0000, Okanovic, Haris wrote:
> On Wed, 2024-11-06 at 11:08 +0000, Catalin Marinas wrote:
> > On Tue, Nov 05, 2024 at 12:30:37PM -0600, Haris Okanovic wrote:
> > > diff --git a/include/asm-generic/barrier.h b/include/asm-generic/barrier.h
> > > index d4f581c1e21d..112027eabbfc 100644
> > > --- a/include/asm-generic/barrier.h
> > > +++ b/include/asm-generic/barrier.h
> > > @@ -256,6 +256,31 @@ do { \
> > > })
> > > #endif
> > >
> > > +/**
> > > + * smp_vcond_load_relaxed() - (Spin) wait until an expected value at address
> > > + * with no ordering guarantees. Spins until `(*addr & mask) == val` or
> > > + * `nsecs` elapse, and returns the last observed `*addr` value.
> > > + *
> > > + * @nsecs: timeout in nanoseconds
> >
> > FWIW, I don't mind the relative timeout, it makes the API easier to use.
> > Yes, it may take longer in absolute time if the thread is scheduled out
> > before local_clock_noinstr() is read but the same can happen in the
> > caller anyway. It's similar to udelay(), it can take longer if the
> > thread is scheduled out.
> >
> > > + * @addr: pointer to an integer
> > > + * @mask: a bit mask applied to read values
> > > + * @val: Expected value with mask
> > > + */
> > > +#ifndef smp_vcond_load_relaxed
> > > +#define smp_vcond_load_relaxed(nsecs, addr, mask, val) ({ \
> > > + const u64 __start = local_clock_noinstr(); \
> > > + u64 __nsecs = (nsecs); \
> > > + typeof(addr) __addr = (addr); \
> > > + typeof(*__addr) __mask = (mask); \
> > > + typeof(*__addr) __val = (val); \
> > > + typeof(*__addr) __cur; \
> > > + smp_cond_load_relaxed(__addr, ( \
> > > + (VAL & __mask) == __val || \
> > > + local_clock_noinstr() - __start > __nsecs \
> > > + )); \
> > > +})
> >
> > The generic implementation has the same problem as Ankur's current
> > series. smp_cond_load_relaxed() can't wait on anything other than the
> > variable at __addr. If it goes into a WFE, there's nothing executed to
> > read the timer and check for progress. Any generic implementation of
> > such function would have to use cpu_relax() and polling.
>
> How would the caller enter wfe()? Can you give a specific scenario that
> you're concerned about?
Let's take the arm64 example with the event stream disabled. Without the
subsequent patches implementing smp_vcond_load_relaxed(), just expand
the arm64 smp_cond_load_relaxed() implementation in the above macro. If
the timer check doesn't trigger an exit from the loop,
__cmpwait_relaxed() only waits on the variable to change its value,
nothing to do with the timer.
> This code already reduces to a relaxed poll, something like this:
>
> ```
> start = clock();
> while((READ_ONCE(*addr) & mask) != val && (clock() - start) < nsecs) {
> cpu_relax();
> }
> ```
Well, that's if you also use the generic implementation of
smp_cond_load_relaxed() but have you checked all the other architectures
that don't do something similar to the arm64 wfe (riscv comes close)?
Even if all other architectures just use a cpu_relax(), that's still
abusing the smp_cond_load_relaxed() semantics. And what if one places
another loop in their __cmpwait()? That's allowed because you are
supposed to wait on a single variable to change not on multiple states.
--
Catalin
next prev parent reply other threads:[~2024-11-06 19:55 UTC|newest]
Thread overview: 77+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-09-25 23:24 [PATCH v8 00/11] Enable haltpoll on arm64 Ankur Arora
2024-09-25 23:24 ` [PATCH v8 01/11] cpuidle/poll_state: poll via smp_cond_load_relaxed() Ankur Arora
2024-10-15 12:04 ` Catalin Marinas
2024-10-15 16:42 ` Christoph Lameter (Ampere)
2024-10-15 16:50 ` Catalin Marinas
2024-10-15 17:17 ` Christoph Lameter (Ampere)
2024-10-15 17:40 ` Catalin Marinas
2024-10-15 21:53 ` Ankur Arora
2024-10-15 22:28 ` Christoph Lameter (Ampere)
2024-10-16 7:06 ` Ankur Arora
2024-10-17 16:54 ` Christoph Lameter (Ampere)
2024-10-17 18:36 ` Ankur Arora
2024-10-15 22:40 ` Christoph Lameter (Ampere)
2024-10-16 9:54 ` Catalin Marinas
2024-10-17 16:56 ` Christoph Lameter (Ampere)
2024-10-17 18:15 ` Catalin Marinas
2024-10-17 19:34 ` Ankur Arora
2024-10-15 21:32 ` Ankur Arora
2024-10-16 6:20 ` maobibo
2024-10-16 10:06 ` Catalin Marinas
2024-10-16 15:13 ` Okanovic, Haris
2024-10-16 17:04 ` Ankur Arora
2024-10-16 18:04 ` Okanovic, Haris
2024-10-17 14:01 ` Catalin Marinas
2024-10-17 22:47 ` Ankur Arora
2024-10-18 11:05 ` Catalin Marinas
2024-10-18 19:00 ` Ankur Arora
2024-10-21 12:02 ` Catalin Marinas
2024-09-25 23:24 ` [PATCH v8 02/11] cpuidle: rename ARCH_HAS_CPU_RELAX to ARCH_HAS_OPTIMIZED_POLL Ankur Arora
2024-09-25 23:24 ` [PATCH v8 03/11] Kconfig: move ARCH_HAS_OPTIMIZED_POLL to arch/Kconfig Ankur Arora
2024-09-25 23:24 ` [PATCH v8 04/11] cpuidle-haltpoll: define arch_haltpoll_want() Ankur Arora
2024-09-25 23:24 ` [PATCH v8 05/11] governors/haltpoll: drop kvm_para_available() check Ankur Arora
2024-09-25 23:24 ` [PATCH v8 06/11] cpuidle-haltpoll: condition on ARCH_CPUIDLE_HALTPOLL Ankur Arora
2024-09-25 23:24 ` [PATCH v8 07/11] arm64: define TIF_POLLING_NRFLAG Ankur Arora
2024-09-25 23:24 ` [PATCH v8 08/11] arm64: idle: export arch_cpu_idle Ankur Arora
2024-09-25 23:24 ` [PATCH v8 09/11] arm64: select ARCH_HAS_OPTIMIZED_POLL Ankur Arora
2024-10-14 22:48 ` Christoph Lameter (Ampere)
2024-09-25 23:24 ` [PATCH v8 10/11] cpuidle/poll_state: limit POLL_IDLE_RELAX_COUNT on arm64 Ankur Arora
2024-09-25 23:24 ` [PATCH v8 11/11] arm64: support cpuidle-haltpoll Ankur Arora
2024-10-02 22:42 ` Okanovic, Haris
2024-10-03 3:29 ` Ankur Arora
2024-10-14 22:49 ` Christoph Lameter (Ampere)
2024-10-15 1:49 ` Ankur Arora
2024-10-16 15:13 ` Okanovic, Haris
2024-10-09 2:37 ` [PATCH v8 00/11] Enable haltpoll on arm64 zhenglifeng (A)
2024-10-15 1:53 ` Ankur Arora
2024-10-14 22:54 ` Christoph Lameter (Ampere)
2024-10-15 12:36 ` Marc Zyngier
2024-10-16 21:55 ` Ankur Arora
2024-10-17 8:19 ` Marc Zyngier
2024-10-17 18:35 ` Ankur Arora
2024-10-22 22:01 ` Ankur Arora
2024-11-05 18:30 ` Haris Okanovic
2024-11-05 18:30 ` [PATCH 1/5] asm-generic: add smp_vcond_load_relaxed() Haris Okanovic
2024-11-05 19:36 ` Christoph Lameter (Ampere)
2024-11-06 17:06 ` Okanovic, Haris
2024-11-06 11:08 ` Catalin Marinas
2024-11-06 18:13 ` Okanovic, Haris
2024-11-06 19:55 ` Catalin Marinas [this message]
2024-11-06 20:31 ` Okanovic, Haris
2024-11-06 11:39 ` Will Deacon
2024-11-06 17:18 ` Okanovic, Haris
2024-11-05 18:30 ` [PATCH 2/5] arm64: add __READ_ONCE_EX() Haris Okanovic
2024-11-05 19:39 ` Christoph Lameter (Ampere)
2024-11-06 17:37 ` Okanovic, Haris
2024-11-06 11:43 ` Will Deacon
2024-11-06 17:09 ` Okanovic, Haris
2024-11-09 9:49 ` David Laight
2024-11-05 18:30 ` [PATCH 3/5] arm64: refactor delay() to enable polling for value Haris Okanovic
2024-11-05 19:42 ` Christoph Lameter (Ampere)
2024-11-06 17:42 ` Okanovic, Haris
2024-11-06 9:18 ` Catalin Marinas
2024-11-06 17:38 ` Okanovic, Haris
2024-11-05 18:30 ` [PATCH 4/5] arm64: add smp_vcond_load_relaxed() Haris Okanovic
2024-11-05 18:30 ` [PATCH 5/5] cpuidle: implement poll_idle() using smp_vcond_load_relaxed() Haris Okanovic
2024-11-05 19:45 ` Christoph Lameter (Ampere)
2024-11-05 18:49 ` [PATCH v8 00/11] Enable haltpoll on arm64 Ankur Arora
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ZyvJwjfKgnqMpM9P@arm.com \
--to=catalin.marinas@arm.com \
--cc=ankur.a.arora@oracle.com \
--cc=arnd@arndb.de \
--cc=boris.ostrovsky@oracle.com \
--cc=bp@alien8.de \
--cc=cl@gentwo.org \
--cc=daniel.lezcano@linaro.org \
--cc=dave.hansen@linux.intel.com \
--cc=harisokn@amazon.com \
--cc=hpa@zytor.com \
--cc=joao.m.martins@oracle.com \
--cc=konrad.wilk@oracle.com \
--cc=kvm@vger.kernel.org \
--cc=lenb@kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=maobibo@loongson.cn \
--cc=mark.rutland@arm.com \
--cc=mingo@redhat.com \
--cc=misono.tomohiro@fujitsu.com \
--cc=mtosatti@redhat.com \
--cc=pbonzini@redhat.com \
--cc=peterz@infradead.org \
--cc=rafael@kernel.org \
--cc=sudeep.holla@arm.com \
--cc=tglx@linutronix.de \
--cc=vkuznets@redhat.com \
--cc=wanpengli@tencent.com \
--cc=will@kernel.org \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.