All of lore.kernel.org
 help / color / mirror / Atom feed
From: Leonardo Bras <leobras@redhat.com>
To: Guo Ren <guoren@kernel.org>
Cc: wefu@redhat.com, keescook@chromium.org, peterz@infradead.org,
	unicorn_wang@outlook.com, atishp@atishpatra.org,
	chao.wei@sophgo.com, bjorn@rivosinc.com,
	linux-kernel@vger.kernel.org, xiaoguang.xing@sophgo.com,
	conor.dooley@microchip.com, Leonardo Bras <leobras@redhat.com>,
	palmer@dabbelt.com, jszhang@kernel.org, paul.walmsley@sifive.com,
	Guo Ren <guoren@linux.alibaba.com>,
	panqinglin2020@iscas.ac.cn, linux-riscv@lists.infradead.org,
	wuwei2016@iscas.ac.cn, Andrew Jones <ajones@ventanamicro.com>
Subject: Re: [PATCH V2 3/3] riscv: xchg: Prefetch the destination word for sc.w
Date: Wed,  3 Jan 2024 16:45:39 -0300	[thread overview]
Message-ID: <ZZW5Y85OdibCu58h@LeoBras> (raw)
In-Reply-To: <CAJF2gTQ7Oo8UKdPRs0GAAUsh9mDCgGucS8g8kuPzByaWVOtigw@mail.gmail.com>

On Wed, Jan 03, 2024 at 02:15:45PM +0800, Guo Ren wrote:
> On Tue, Jan 2, 2024 at 7:19 PM Andrew Jones <ajones@ventanamicro.com> wrote:
> >
> > On Sun, Dec 31, 2023 at 03:29:53AM -0500, guoren@kernel.org wrote:
> > > From: Guo Ren <guoren@linux.alibaba.com>
> > >
> > > The cost of changing a cacheline from shared to exclusive state can be
> > > significant, especially when this is triggered by an exclusive store,
> > > since it may result in having to retry the transaction.
> > >
> > > This patch makes use of prefetch.w to prefetch cachelines for write
> > > prior to lr/sc loops when using the xchg_small atomic routine.
> > >
> > > This patch is inspired by commit: 0ea366f5e1b6 ("arm64: atomics:
> > > prefetch the destination word for write prior to stxr").
> > >
> > > Signed-off-by: Guo Ren <guoren@linux.alibaba.com>
> > > Signed-off-by: Guo Ren <guoren@kernel.org>
> > > ---
> > >  arch/riscv/include/asm/cmpxchg.h | 4 +++-
> > >  1 file changed, 3 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/arch/riscv/include/asm/cmpxchg.h b/arch/riscv/include/asm/cmpxchg.h
> > > index 26cea2395aae..d7b9d7951f08 100644
> > > --- a/arch/riscv/include/asm/cmpxchg.h
> > > +++ b/arch/riscv/include/asm/cmpxchg.h
> > > @@ -10,6 +10,7 @@
> > >
> > >  #include <asm/barrier.h>
> > >  #include <asm/fence.h>
> > > +#include <asm/processor.h>
> > >
> > >  #define __arch_xchg_masked(prepend, append, r, p, n)                 \
> >
> > Are you sure this is based on v6.7-rc7? Because I don't see this macro.
> Oh, it is based on Leobras' patches. I would remove it in the next of version.

I would say this next :)

> 
> >
> > >  ({                                                                   \
> > > @@ -23,6 +24,7 @@
> > >                                                                       \
> > >       __asm__ __volatile__ (                                          \
> > >              prepend                                                  \
> > > +            PREFETCHW_ASM(%5)                                        \
> > >              "0:      lr.w %0, %2\n"                                  \
> > >              "        and  %1, %0, %z4\n"                             \
> > >              "        or   %1, %1, %z3\n"                             \
> > > @@ -30,7 +32,7 @@
> > >              "        bnez %1, 0b\n"                                  \
> > >              append                                                   \
> > >              : "=&r" (__retx), "=&r" (__rc), "+A" (*(__ptr32b))       \
> > > -            : "rJ" (__newx), "rJ" (~__mask)                          \
> > > +            : "rJ" (__newx), "rJ" (~__mask), "rJ" (__ptr32b)         \
> >
> > I'm pretty sure we don't want to allow the J constraint for __ptr32b.
> >
> > >              : "memory");                                             \
> > >                                                                       \
> > >       r = (__typeof__(*(p)))((__retx & __mask) >> __s);               \
> > > --
> > > 2.40.1
> > >
> >
> > Thanks,
> > drew
> 
> 
> 
> -- 
> Best Regards
>  Guo Ren
> 

Nice patch :)
Any reason it's not needed in __arch_cmpxchg_masked(), and __arch_cmpxchg() ?

Thanks!
Leo


_______________________________________________
linux-riscv mailing list
linux-riscv@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-riscv

WARNING: multiple messages have this Message-ID (diff)
From: Leonardo Bras <leobras@redhat.com>
To: Guo Ren <guoren@kernel.org>
Cc: Leonardo Bras <leobras@redhat.com>,
	Andrew Jones <ajones@ventanamicro.com>,
	paul.walmsley@sifive.com, palmer@dabbelt.com,
	panqinglin2020@iscas.ac.cn, bjorn@rivosinc.com,
	conor.dooley@microchip.com, peterz@infradead.org,
	keescook@chromium.org, wuwei2016@iscas.ac.cn,
	xiaoguang.xing@sophgo.com, chao.wei@sophgo.com,
	unicorn_wang@outlook.com, uwu@icenowy.me, jszhang@kernel.org,
	wefu@redhat.com, atishp@atishpatra.org,
	linux-riscv@lists.infradead.org, linux-kernel@vger.kernel.org,
	Guo Ren <guoren@linux.alibaba.com>
Subject: Re: [PATCH V2 3/3] riscv: xchg: Prefetch the destination word for sc.w
Date: Wed,  3 Jan 2024 16:45:39 -0300	[thread overview]
Message-ID: <ZZW5Y85OdibCu58h@LeoBras> (raw)
In-Reply-To: <CAJF2gTQ7Oo8UKdPRs0GAAUsh9mDCgGucS8g8kuPzByaWVOtigw@mail.gmail.com>

On Wed, Jan 03, 2024 at 02:15:45PM +0800, Guo Ren wrote:
> On Tue, Jan 2, 2024 at 7:19 PM Andrew Jones <ajones@ventanamicro.com> wrote:
> >
> > On Sun, Dec 31, 2023 at 03:29:53AM -0500, guoren@kernel.org wrote:
> > > From: Guo Ren <guoren@linux.alibaba.com>
> > >
> > > The cost of changing a cacheline from shared to exclusive state can be
> > > significant, especially when this is triggered by an exclusive store,
> > > since it may result in having to retry the transaction.
> > >
> > > This patch makes use of prefetch.w to prefetch cachelines for write
> > > prior to lr/sc loops when using the xchg_small atomic routine.
> > >
> > > This patch is inspired by commit: 0ea366f5e1b6 ("arm64: atomics:
> > > prefetch the destination word for write prior to stxr").
> > >
> > > Signed-off-by: Guo Ren <guoren@linux.alibaba.com>
> > > Signed-off-by: Guo Ren <guoren@kernel.org>
> > > ---
> > >  arch/riscv/include/asm/cmpxchg.h | 4 +++-
> > >  1 file changed, 3 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/arch/riscv/include/asm/cmpxchg.h b/arch/riscv/include/asm/cmpxchg.h
> > > index 26cea2395aae..d7b9d7951f08 100644
> > > --- a/arch/riscv/include/asm/cmpxchg.h
> > > +++ b/arch/riscv/include/asm/cmpxchg.h
> > > @@ -10,6 +10,7 @@
> > >
> > >  #include <asm/barrier.h>
> > >  #include <asm/fence.h>
> > > +#include <asm/processor.h>
> > >
> > >  #define __arch_xchg_masked(prepend, append, r, p, n)                 \
> >
> > Are you sure this is based on v6.7-rc7? Because I don't see this macro.
> Oh, it is based on Leobras' patches. I would remove it in the next of version.

I would say this next :)

> 
> >
> > >  ({                                                                   \
> > > @@ -23,6 +24,7 @@
> > >                                                                       \
> > >       __asm__ __volatile__ (                                          \
> > >              prepend                                                  \
> > > +            PREFETCHW_ASM(%5)                                        \
> > >              "0:      lr.w %0, %2\n"                                  \
> > >              "        and  %1, %0, %z4\n"                             \
> > >              "        or   %1, %1, %z3\n"                             \
> > > @@ -30,7 +32,7 @@
> > >              "        bnez %1, 0b\n"                                  \
> > >              append                                                   \
> > >              : "=&r" (__retx), "=&r" (__rc), "+A" (*(__ptr32b))       \
> > > -            : "rJ" (__newx), "rJ" (~__mask)                          \
> > > +            : "rJ" (__newx), "rJ" (~__mask), "rJ" (__ptr32b)         \
> >
> > I'm pretty sure we don't want to allow the J constraint for __ptr32b.
> >
> > >              : "memory");                                             \
> > >                                                                       \
> > >       r = (__typeof__(*(p)))((__retx & __mask) >> __s);               \
> > > --
> > > 2.40.1
> > >
> >
> > Thanks,
> > drew
> 
> 
> 
> -- 
> Best Regards
>  Guo Ren
> 

Nice patch :)
Any reason it's not needed in __arch_cmpxchg_masked(), and __arch_cmpxchg() ?

Thanks!
Leo


  reply	other threads:[~2024-01-03 19:46 UTC|newest]

Thread overview: 84+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-12-31  8:29 [PATCH V2 0/3] riscv: Add Zicbop & prefetchw support guoren
2023-12-31  8:29 ` guoren
2023-12-31  8:29 ` [PATCH V2 1/3] riscv: Add Zicbop instruction definitions & cpufeature guoren
2023-12-31  8:29   ` guoren
2024-01-02 10:32   ` Andrew Jones
2024-01-02 10:32     ` Andrew Jones
2024-01-03  6:13     ` Guo Ren
2024-01-03  6:13       ` Guo Ren
2024-01-03  6:49       ` Andrew Jones
2024-01-03  6:49         ` Andrew Jones
2024-01-03 19:44         ` Andrew Jones
2024-01-03 19:44           ` Andrew Jones
2024-01-03 19:06     ` Leonardo Bras
2024-01-03 19:06       ` Leonardo Bras
2024-01-03  9:31   ` Clément Léger
2024-01-03  9:31     ` Clément Léger
2024-01-03 12:00     ` Andrew Jones
2024-01-03 12:00       ` Andrew Jones
2024-01-11 10:31       ` Clément Léger
2024-01-11 10:31         ` Clément Léger
2024-01-11 10:45         ` Andrew Jones
2024-01-11 10:45           ` Andrew Jones
2024-01-11 10:49           ` Clément Léger
2024-01-11 10:49             ` Clément Léger
2024-01-11 11:12             ` Conor Dooley
2024-01-11 11:12               ` Conor Dooley
2024-01-03 18:52   ` Leonardo Bras
2024-01-03 18:52     ` Leonardo Bras
2024-01-03 19:29     ` Andrew Jones
2024-01-03 19:29       ` Andrew Jones
2024-01-03 20:33       ` Leonardo Bras
2024-01-03 20:33         ` Leonardo Bras
2024-01-04  9:47         ` Andrew Jones
2024-01-04  9:47           ` Andrew Jones
2024-01-04 15:03           ` Leonardo Bras
2024-01-04 15:03             ` Leonardo Bras
2024-01-04 16:40             ` Andrew Jones
2024-01-04 16:40               ` Andrew Jones
2024-01-04 17:43               ` Leonardo Bras
2024-01-04 17:43                 ` Leonardo Bras
2024-01-05 13:24                 ` Andrew Jones
2024-01-05 13:24                   ` Andrew Jones
2024-01-08 14:34                   ` Leonardo Bras
2024-01-08 14:34                     ` Leonardo Bras
2024-01-08 15:24                     ` Andrew Jones
2024-01-08 15:24                       ` Andrew Jones
2024-01-08 16:14                       ` Leonardo Bras
2024-01-08 16:14                         ` Leonardo Bras
2024-01-03 19:48   ` Andrew Jones
2024-01-03 19:48     ` Andrew Jones
2024-01-03 20:34     ` Leonardo Bras
2024-01-03 20:34       ` Leonardo Bras
2023-12-31  8:29 ` [PATCH V2 2/3] riscv: Add ARCH_HAS_PRETCHW support with Zibop guoren
2023-12-31  8:29   ` guoren
2024-01-01  2:29   ` Guo Ren
2024-01-01  2:29     ` Guo Ren
2024-01-03 19:04     ` Leonardo Bras
2024-01-03 19:04       ` Leonardo Bras
2024-01-02 10:45   ` Andrew Jones
2024-01-02 10:45     ` Andrew Jones
2024-01-03  6:19     ` Guo Ren
2024-01-03  6:19       ` Guo Ren
2024-01-03 19:56       ` Andrew Jones
2024-01-03 19:56         ` Andrew Jones
2024-01-05 13:31     ` Andrew Jones
2024-01-05 13:31       ` Andrew Jones
2023-12-31  8:29 ` [PATCH V2 3/3] riscv: xchg: Prefetch the destination word for sc.w guoren
2023-12-31  8:29   ` guoren
2024-01-02 11:18   ` Andrew Jones
2024-01-02 11:18     ` Andrew Jones
2024-01-03  6:15     ` Guo Ren
2024-01-03  6:15       ` Guo Ren
2024-01-03 19:45       ` Leonardo Bras [this message]
2024-01-03 19:45         ` Leonardo Bras
2024-01-04  1:24         ` Guo Ren
2024-01-04  1:24           ` Guo Ren
2024-01-04  3:56           ` Leonardo Bras
2024-01-04  3:56             ` Leonardo Bras
2024-01-04  8:14             ` Guo Ren
2024-01-04  8:14               ` Guo Ren
2024-01-04 14:17               ` Leonardo Bras
2024-01-04 14:17                 ` Leonardo Bras
2024-01-05  1:13                 ` Guo Ren
2024-01-05  1:13                   ` Guo Ren

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZZW5Y85OdibCu58h@LeoBras \
    --to=leobras@redhat.com \
    --cc=ajones@ventanamicro.com \
    --cc=atishp@atishpatra.org \
    --cc=bjorn@rivosinc.com \
    --cc=chao.wei@sophgo.com \
    --cc=conor.dooley@microchip.com \
    --cc=guoren@kernel.org \
    --cc=guoren@linux.alibaba.com \
    --cc=jszhang@kernel.org \
    --cc=keescook@chromium.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-riscv@lists.infradead.org \
    --cc=palmer@dabbelt.com \
    --cc=panqinglin2020@iscas.ac.cn \
    --cc=paul.walmsley@sifive.com \
    --cc=peterz@infradead.org \
    --cc=unicorn_wang@outlook.com \
    --cc=wefu@redhat.com \
    --cc=wuwei2016@iscas.ac.cn \
    --cc=xiaoguang.xing@sophgo.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.