From: David Laight <david.laight.linux@gmail.com>
To: Paul Walmsley <pjw@kernel.org>
Cc: Feng Jiang <jiangfeng@kylinos.cn>,
palmer@dabbelt.com, aou@eecs.berkeley.edu, alex@ghiti.fr,
samuel.holland@sifive.com, charlie@rivosinc.com,
conor.dooley@microchip.com, linux-riscv@lists.infradead.org,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH] riscv: lib: optimize strlen loop efficiency
Date: Thu, 15 Jan 2026 11:19:47 +0000 [thread overview]
Message-ID: <20260115111947.54929ed0@pumpkin> (raw)
In-Reply-To: <d356705a-843c-06dc-38a3-77eae7d2ef59@kernel.org>
On Wed, 14 Jan 2026 19:03:17 -0700 (MST)
Paul Walmsley <pjw@kernel.org> wrote:
> On Thu, 18 Dec 2025, Feng Jiang wrote:
>
> > Optimize the generic strlen implementation by using a pre-decrement
> > pointer. This reduces the loop body from 4 instructions to 3 and
> > eliminates the unconditional jump ('j').
> >
> > Old loop (4 instructions, 2 branches):
> > 1: lbu t0, 0(t1); beqz t0, 2f; addi t1, t1, 1; j 1b
> >
> > New loop (3 instructions, 1 branch):
> > 1: addi t1, t1, 1; lbu t0, 0(t1); bnez t0, 1b
Is that a change to the generic C code?
Testing (++sc)[-1] might do the trick without requiring the extra read
of the first location.
> >
> > This change improves execution efficiency and reduces branch pressure
> > for systems without the Zbb extension.
>
> Looks reasonable; do you have any benchmarks on hardware that you can
> share? Any reason why this patch stands alone and isn't rolled up as part
> of your "optimize string function" series?
For 64bit you can do a lot better (in C) by loading 64bit words and doing
the correct 'shift and mask' sequence to detect a zero byte.
It usually isn't worth in for 32bit.
Does need to handle a mis-aligned base - eg by masking the bits off
the base pointer and or'ing in non-zero values to the value read from
the base pointer.
David
_______________________________________________
linux-riscv mailing list
linux-riscv@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-riscv
next prev parent reply other threads:[~2026-01-15 11:20 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-12-18 3:26 [PATCH] riscv: lib: optimize strlen loop efficiency Feng Jiang
2026-01-15 2:03 ` Paul Walmsley
2026-01-15 3:23 ` Feng Jiang
2026-01-24 8:14 ` Paul Walmsley
2026-01-26 3:05 ` Feng Jiang
2026-01-15 11:19 ` David Laight [this message]
2026-01-15 18:46 ` David Laight
2026-01-26 2:52 ` Feng Jiang
2026-01-28 18:59 ` David Laight
2026-01-29 8:34 ` Feng Jiang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260115111947.54929ed0@pumpkin \
--to=david.laight.linux@gmail.com \
--cc=alex@ghiti.fr \
--cc=aou@eecs.berkeley.edu \
--cc=charlie@rivosinc.com \
--cc=conor.dooley@microchip.com \
--cc=jiangfeng@kylinos.cn \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-riscv@lists.infradead.org \
--cc=palmer@dabbelt.com \
--cc=pjw@kernel.org \
--cc=samuel.holland@sifive.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox