* [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
@ 2025-01-27 9:38 Xu Lu
2025-01-27 10:13 ` [tip: irq/urgent] irqchip/riscv: Ensure ordering of memory " tip-bot2 for Xu Lu
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Xu Lu @ 2025-01-27 9:38 UTC (permalink / raw)
To: tglx, anup, charlie, paul.walmsley, palmer
Cc: lihangjing, xieyongji, linux-riscv, linux-kernel, Xu Lu
RISC-V distinguishes between normal memory accesses and device I/O and
uses FENCE instruction to order them as viewed by othe RISC-V harts and
external devices or coprocessors. The FENCE instruction can order any
combination of device input(I), device output(O), memory reads(R) and
memory writes(W). For example, 'fence w, o' can be used to ensure all
memory writes from instructions preceding the FENCE instruction appear
earlier in the global memory order than device output writes from
instructions after the FENCE instruction.
RISC-V issues IPI by writing certain value to IMSIC/ACLINT MMIO
registers, which is regarded as device output operation. However, the
existing implementation of IMSIC/ACLINT driver issues IPI via
writel_relaxed(), which does not guarantee the order of device output
operation and preceding memory writes. Then the hart receiving IPI may
not have seen the latest data yet.
This commit fixes this by replacing writel_relaxed() with writel() when
issuing IPI, which will use 'fence w, o' to ensure all previous writes
made by current hart are visible to other harts before they receive
the IPI.
Signed-off-by: Xu Lu <luxu.kernel@bytedance.com>
---
drivers/irqchip/irq-riscv-imsic-early.c | 2 +-
drivers/irqchip/irq-thead-c900-aclint-sswi.c | 2 +-
2 files changed, 2 insertions(+), 2 deletions(-)
diff --git a/drivers/irqchip/irq-riscv-imsic-early.c b/drivers/irqchip/irq-riscv-imsic-early.c
index c5c2e6929a2f..275df5005705 100644
--- a/drivers/irqchip/irq-riscv-imsic-early.c
+++ b/drivers/irqchip/irq-riscv-imsic-early.c
@@ -27,7 +27,7 @@ static void imsic_ipi_send(unsigned int cpu)
{
struct imsic_local_config *local = per_cpu_ptr(imsic->global.local, cpu);
- writel_relaxed(IMSIC_IPI_ID, local->msi_va);
+ writel(IMSIC_IPI_ID, local->msi_va);
}
static void imsic_ipi_starting_cpu(void)
diff --git a/drivers/irqchip/irq-thead-c900-aclint-sswi.c b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
index b0e366ade427..8ff6e7a1363b 100644
--- a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
+++ b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
@@ -31,7 +31,7 @@ static DEFINE_PER_CPU(void __iomem *, sswi_cpu_regs);
static void thead_aclint_sswi_ipi_send(unsigned int cpu)
{
- writel_relaxed(0x1, per_cpu(sswi_cpu_regs, cpu));
+ writel(0x1, per_cpu(sswi_cpu_regs, cpu));
}
static void thead_aclint_sswi_ipi_clear(void)
--
2.20.1
^ permalink raw reply related [flat|nested] 7+ messages in thread
* [tip: irq/urgent] irqchip/riscv: Ensure ordering of memory writes and IPI writes
2025-01-27 9:38 [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes Xu Lu
@ 2025-01-27 10:13 ` tip-bot2 for Xu Lu
2025-01-27 10:33 ` [PATCH RESEND] irqchip: riscv: Order normal " Thomas Gleixner
2025-01-27 16:21 ` Arnd Bergmann
2 siblings, 0 replies; 7+ messages in thread
From: tip-bot2 for Xu Lu @ 2025-01-27 10:13 UTC (permalink / raw)
To: linux-tip-commits; +Cc: Xu Lu, Thomas Gleixner, x86, linux-kernel, maz
The following commit has been merged into the irq/urgent branch of tip:
Commit-ID: 825c78e6a60c309a59d18d5ac5968aa79cef0bd6
Gitweb: https://git.kernel.org/tip/825c78e6a60c309a59d18d5ac5968aa79cef0bd6
Author: Xu Lu <luxu.kernel@bytedance.com>
AuthorDate: Mon, 27 Jan 2025 17:38:46 +08:00
Committer: Thomas Gleixner <tglx@linutronix.de>
CommitterDate: Mon, 27 Jan 2025 11:07:03 +01:00
irqchip/riscv: Ensure ordering of memory writes and IPI writes
RISC-V distinguishes between memory accesses and device I/O and uses FENCE
instruction to order them as viewed by other RISC-V harts and external
devices or coprocessors. The FENCE instruction can order any combination of
device input(I), device output(O), memory reads(R) and memory
writes(W). For example, 'fence w, o' is used to ensure all memory writes
from instructions preceding the FENCE instruction appear earlier in the
global memory order than device output writes from instructions after the
FENCE instruction.
RISC-V issues IPIs by writing to the IMSIC/ACLINT MMIO registers, which is
regarded as device output operation. However, the existing implementation
of the IMSIC/ACLINT drivers issue the IPI via writel_relaxed(), which does
not guarantee the order of device output operation and preceding memory
writes. As a consequence the hart receiving the IPI might not observe the
IPI related data.
Fix this by replacing writel_relaxed() with writel() when issuing IPIs,
which uses 'fence w, o' to ensure all previous writes made by the current
hart are visible to other harts before they receive the IPI.
Signed-off-by: Xu Lu <luxu.kernel@bytedance.com>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Link: https://lore.kernel.org/all/20250127093846.98625-1-luxu.kernel@bytedance.com
---
drivers/irqchip/irq-riscv-imsic-early.c | 2 +-
drivers/irqchip/irq-thead-c900-aclint-sswi.c | 2 +-
2 files changed, 2 insertions(+), 2 deletions(-)
diff --git a/drivers/irqchip/irq-riscv-imsic-early.c b/drivers/irqchip/irq-riscv-imsic-early.c
index c5c2e69..275df50 100644
--- a/drivers/irqchip/irq-riscv-imsic-early.c
+++ b/drivers/irqchip/irq-riscv-imsic-early.c
@@ -27,7 +27,7 @@ static void imsic_ipi_send(unsigned int cpu)
{
struct imsic_local_config *local = per_cpu_ptr(imsic->global.local, cpu);
- writel_relaxed(IMSIC_IPI_ID, local->msi_va);
+ writel(IMSIC_IPI_ID, local->msi_va);
}
static void imsic_ipi_starting_cpu(void)
diff --git a/drivers/irqchip/irq-thead-c900-aclint-sswi.c b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
index b0e366a..8ff6e7a 100644
--- a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
+++ b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
@@ -31,7 +31,7 @@ static DEFINE_PER_CPU(void __iomem *, sswi_cpu_regs);
static void thead_aclint_sswi_ipi_send(unsigned int cpu)
{
- writel_relaxed(0x1, per_cpu(sswi_cpu_regs, cpu));
+ writel(0x1, per_cpu(sswi_cpu_regs, cpu));
}
static void thead_aclint_sswi_ipi_clear(void)
^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
2025-01-27 9:38 [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes Xu Lu
2025-01-27 10:13 ` [tip: irq/urgent] irqchip/riscv: Ensure ordering of memory " tip-bot2 for Xu Lu
@ 2025-01-27 10:33 ` Thomas Gleixner
2025-01-27 14:26 ` [External] " Xu Lu
2025-01-27 16:21 ` Arnd Bergmann
2 siblings, 1 reply; 7+ messages in thread
From: Thomas Gleixner @ 2025-01-27 10:33 UTC (permalink / raw)
To: Xu Lu, anup, charlie, paul.walmsley, palmer
Cc: lihangjing, xieyongji, linux-riscv, linux-kernel, Xu Lu
On Mon, Jan 27 2025 at 17:38, Xu Lu wrote:
This is not a RESEND. The change log has been modified, no?
The prefix is incorrect. See
https://www.kernel.org/doc/html/latest/process/maintainer-tip.html
> RISC-V distinguishes between normal memory accesses and device I/O and
What is a normal memory write? Are there abnormal memory writes too?
> uses FENCE instruction to order them as viewed by othe RISC-V harts and
> external devices or coprocessors. The FENCE instruction can order any
> combination of device input(I), device output(O), memory reads(R) and
> memory writes(W). For example, 'fence w, o' can be used to ensure all
Can be? It _is_ used, no?
> memory writes from instructions preceding the FENCE instruction appear
> earlier in the global memory order than device output writes from
> instructions after the FENCE instruction.
>
> RISC-V issues IPI by writing certain value to IMSIC/ACLINT MMIO
> registers, which is regarded as device output operation. However, the
> existing implementation of IMSIC/ACLINT driver issues IPI via
> writel_relaxed(), which does not guarantee the order of device output
> operation and preceding memory writes. Then the hart receiving IPI may
> not have seen the latest data yet.
>
> This commit fixes this by replacing writel_relaxed() with writel()
> when
'This commit' is equally wrong as 'This patch'. See Documentation/process/
> issuing IPI, which will use 'fence w, o' to ensure all previous writes
> made by current hart are visible to other harts before they receive
> the IPI.
I've fixed it up for you.
Thanks,
tglx
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [External] Re: [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
2025-01-27 10:33 ` [PATCH RESEND] irqchip: riscv: Order normal " Thomas Gleixner
@ 2025-01-27 14:26 ` Xu Lu
0 siblings, 0 replies; 7+ messages in thread
From: Xu Lu @ 2025-01-27 14:26 UTC (permalink / raw)
To: Thomas Gleixner
Cc: anup, charlie, paul.walmsley, palmer, lihangjing, xieyongji,
linux-riscv, linux-kernel
On Mon, Jan 27, 2025 at 6:33 PM Thomas Gleixner <tglx@linutronix.de> wrote:
>
> On Mon, Jan 27 2025 at 17:38, Xu Lu wrote:
>
> This is not a RESEND. The change log has been modified, no?
Sure, the change log has been modified. I will pay attention next time.
>
> The prefix is incorrect. See
>
> https://www.kernel.org/doc/html/latest/process/maintainer-tip.html
>
> > RISC-V distinguishes between normal memory accesses and device I/O and
>
> What is a normal memory write? Are there abnormal memory writes too?
Sorry for the misleading. By normal memory write, I mean memory writes
and want to distinguish it from MMIO writes.
>
> > uses FENCE instruction to order them as viewed by othe RISC-V harts and
> > external devices or coprocessors. The FENCE instruction can order any
> > combination of device input(I), device output(O), memory reads(R) and
> > memory writes(W). For example, 'fence w, o' can be used to ensure all
>
> Can be? It _is_ used, no?
Yes, it _is_ used. 'Can be' is not accurate.
>
> > memory writes from instructions preceding the FENCE instruction appear
> > earlier in the global memory order than device output writes from
> > instructions after the FENCE instruction.
> >
> > RISC-V issues IPI by writing certain value to IMSIC/ACLINT MMIO
> > registers, which is regarded as device output operation. However, the
> > existing implementation of IMSIC/ACLINT driver issues IPI via
> > writel_relaxed(), which does not guarantee the order of device output
> > operation and preceding memory writes. Then the hart receiving IPI may
> > not have seen the latest data yet.
> >
> > This commit fixes this by replacing writel_relaxed() with writel()
> > when
>
> 'This commit' is equally wrong as 'This patch'. See Documentation/process/
Thanks very much. I will check the documents.
Best Regards,
Xu Lu
>
> > issuing IPI, which will use 'fence w, o' to ensure all previous writes
> > made by current hart are visible to other harts before they receive
> > the IPI.
>
> I've fixed it up for you.
>
> Thanks,
>
> tglx
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
2025-01-27 9:38 [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes Xu Lu
2025-01-27 10:13 ` [tip: irq/urgent] irqchip/riscv: Ensure ordering of memory " tip-bot2 for Xu Lu
2025-01-27 10:33 ` [PATCH RESEND] irqchip: riscv: Order normal " Thomas Gleixner
@ 2025-01-27 16:21 ` Arnd Bergmann
2025-01-27 16:37 ` [External] " Xu Lu
2 siblings, 1 reply; 7+ messages in thread
From: Arnd Bergmann @ 2025-01-27 16:21 UTC (permalink / raw)
To: Xu Lu, Thomas Gleixner, Anup Patel, Charlie Jenkins,
Paul Walmsley, Palmer Dabbelt
Cc: lihangjing, Xie Yongji, linux-riscv, linux-kernel
On Mon, Jan 27, 2025, at 10:38, Xu Lu wrote:
> RISC-V distinguishes between normal memory accesses and device I/O and
> uses FENCE instruction to order them as viewed by othe RISC-V harts and
> external devices or coprocessors. The FENCE instruction can order any
> combination of device input(I), device output(O), memory reads(R) and
> memory writes(W). For example, 'fence w, o' can be used to ensure all
> memory writes from instructions preceding the FENCE instruction appear
> earlier in the global memory order than device output writes from
> instructions after the FENCE instruction.
There is nothing risc-v specific in here really, it's just a bug
in the driver: writel() means access the mmio register with appropriate
barriers, while writel_relaxed() is a special case that should
only ever be used if a particular function is sensitive to
performance and never needs to be serialized.
> diff --git a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> index b0e366ade427..8ff6e7a1363b 100644
> --- a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> +++ b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> @@ -31,7 +31,7 @@ static DEFINE_PER_CPU(void __iomem *, sswi_cpu_regs);
>
> static void thead_aclint_sswi_ipi_send(unsigned int cpu)
> {
> - writel_relaxed(0x1, per_cpu(sswi_cpu_regs, cpu));
> + writel(0x1, per_cpu(sswi_cpu_regs, cpu));
> }
>
> static void thead_aclint_sswi_ipi_clear(void)
> --
thead_aclint_sswi_ipi_clear() seems to have the same bug,
it also uses the _relaxed() version for no apparent reason.
Arnd
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [External] Re: [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
2025-01-27 16:21 ` Arnd Bergmann
@ 2025-01-27 16:37 ` Xu Lu
2025-01-27 19:07 ` Arnd Bergmann
0 siblings, 1 reply; 7+ messages in thread
From: Xu Lu @ 2025-01-27 16:37 UTC (permalink / raw)
To: Arnd Bergmann
Cc: Thomas Gleixner, Anup Patel, Charlie Jenkins, Paul Walmsley,
Palmer Dabbelt, lihangjing, Xie Yongji, linux-riscv, linux-kernel
On Tue, Jan 28, 2025 at 12:23 AM Arnd Bergmann <arnd@arndb.de> wrote:
>
> On Mon, Jan 27, 2025, at 10:38, Xu Lu wrote:
> > RISC-V distinguishes between normal memory accesses and device I/O and
> > uses FENCE instruction to order them as viewed by othe RISC-V harts and
> > external devices or coprocessors. The FENCE instruction can order any
> > combination of device input(I), device output(O), memory reads(R) and
> > memory writes(W). For example, 'fence w, o' can be used to ensure all
> > memory writes from instructions preceding the FENCE instruction appear
> > earlier in the global memory order than device output writes from
> > instructions after the FENCE instruction.
>
> There is nothing risc-v specific in here really, it's just a bug
> in the driver: writel() means access the mmio register with appropriate
> barriers, while writel_relaxed() is a special case that should
> only ever be used if a particular function is sensitive to
> performance and never needs to be serialized.
>
> > diff --git a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> > b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> > index b0e366ade427..8ff6e7a1363b 100644
> > --- a/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> > +++ b/drivers/irqchip/irq-thead-c900-aclint-sswi.c
> > @@ -31,7 +31,7 @@ static DEFINE_PER_CPU(void __iomem *, sswi_cpu_regs);
> >
> > static void thead_aclint_sswi_ipi_send(unsigned int cpu)
> > {
> > - writel_relaxed(0x1, per_cpu(sswi_cpu_regs, cpu));
> > + writel(0x1, per_cpu(sswi_cpu_regs, cpu));
> > }
> >
> > static void thead_aclint_sswi_ipi_clear(void)
> > --
>
> thead_aclint_sswi_ipi_clear() seems to have the same bug,
> it also uses the _relaxed() version for no apparent reason.
Hi Arnd,
There seems no need to modify thead_aclint_sswi_ipi_clear() as it only
clears pending IPI on current hart. No other harts require to see
strict order between preceding memory writes and this ACLINT MMIO
write. Please correct me if I missed anything.
Thanks,
Xu Lu
>
> Arnd
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [External] Re: [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes
2025-01-27 16:37 ` [External] " Xu Lu
@ 2025-01-27 19:07 ` Arnd Bergmann
0 siblings, 0 replies; 7+ messages in thread
From: Arnd Bergmann @ 2025-01-27 19:07 UTC (permalink / raw)
To: Xu Lu
Cc: Thomas Gleixner, Anup Patel, Charlie Jenkins, Paul Walmsley,
Palmer Dabbelt, lihangjing, Xie Yongji, linux-riscv, linux-kernel
On Mon, Jan 27, 2025, at 17:37, Xu Lu wrote:
> On Tue, Jan 28, 2025 at 12:23 AM Arnd Bergmann <arnd@arndb.de> wrote:
>> On Mon, Jan 27, 2025, at 10:38, Xu Lu wrote:
>>
>> thead_aclint_sswi_ipi_clear() seems to have the same bug,
>> it also uses the _relaxed() version for no apparent reason.
>
>
> There seems no need to modify thead_aclint_sswi_ipi_clear() as it only
> clears pending IPI on current hart. No other harts require to see
> strict order between preceding memory writes and this ACLINT MMIO
> write. Please correct me if I missed anything.
My point was that you should always default to the normal operations,
since the relaxed ones keep causing problems. If you still think it's
important to use writel_relaxed() in thead_aclint_sswi_ipi_clear(),
please add a comment explaining how you have shown that it's correct
and why it matters, otherwise convert them both.
Arnd
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2025-01-27 19:08 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-01-27 9:38 [PATCH RESEND] irqchip: riscv: Order normal writes and IPI writes Xu Lu
2025-01-27 10:13 ` [tip: irq/urgent] irqchip/riscv: Ensure ordering of memory " tip-bot2 for Xu Lu
2025-01-27 10:33 ` [PATCH RESEND] irqchip: riscv: Order normal " Thomas Gleixner
2025-01-27 14:26 ` [External] " Xu Lu
2025-01-27 16:21 ` Arnd Bergmann
2025-01-27 16:37 ` [External] " Xu Lu
2025-01-27 19:07 ` Arnd Bergmann
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox