From: David Laight <david.laight.linux@gmail.com>
To: Nicolas Pitre <nico@fluxnic.net>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Eric Dumazet <edumazet@google.com>,
linux-kernel <linux-kernel@vger.kernel.org>,
netdev@vger.kernel.org, Jakub Kicinski <kuba@kernel.org>,
Eric Dumazet <eric.dumazet@gmail.com>,
Paolo Abeni <pabeni@redhat.com>
Subject: Re: [PATCH] compiler_types: Introduce inline_for_performance
Date: Mon, 19 Jan 2026 19:03:41 +0000 [thread overview]
Message-ID: <20260119190341.39c3d04c@pumpkin> (raw)
In-Reply-To: <681985ss-q84n-r802-90pq-0837pr1463p5@syhkavp.arg>
On Mon, 19 Jan 2026 10:47:51 -0500 (EST)
Nicolas Pitre <nico@fluxnic.net> wrote:
> On Sun, 18 Jan 2026, David Laight wrote:
>
> > On 32bit you probably don't want to inline __arch_xprod_64(), but you do
> > want to pass (bias ? m : 0) and may want separate functions for the
> > 'no overflow' case (if it is common enough to worry about).
>
> You do want to inline it. Performance quickly degrades otherwise.
If it isn't inlined you want a real C function in div.c (or similar),
not the compiler generating a separate body in the object file of each
file that uses it.
That is just the worst of both worlds.
> Numbers are in the commit log where I introduced that change.
>
> And __arch_xprod_64() exists only for 32bit btw.
I wonder how much of a mess gcc makes of that code.
I added asm functions for u64 mul_add(u32 a, u32 b, u32 c) calculating
a * b + c without explicit zero extending any of the 32 bit values.
Without that gcc runs out of registers and starts spilling to stack
instead of just generating 'mul; add; adc $0'.
I could only find the definition in the header file - may not have
looked hard enough.
But 64bit systems without a 64x64=>128 multiply (ie without u128
support) also need the 'multiply in 32bit chunks' code.
And common code is fine with u128 support (ignoring old compilers
that generate a call on 64bit mips even though it has exactly the
instruction you want).
David
next prev parent reply other threads:[~2026-01-19 19:03 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-01-18 15:24 [PATCH] compiler_types: Introduce inline_for_performance Eric Dumazet
2026-01-18 15:32 ` Florian Westphal
2026-01-18 15:39 ` Eric Dumazet
2026-01-18 18:36 ` kernel test robot
2026-01-18 22:33 ` David Laight
2026-01-18 19:47 ` Andrew Morton
2026-01-18 20:38 ` Eric Dumazet
2026-01-18 22:58 ` David Laight
2026-01-19 0:01 ` Andrew Morton
2026-01-19 9:33 ` David Laight
2026-01-19 10:25 ` Eric Dumazet
2026-01-19 10:33 ` Eric Dumazet
2026-01-19 10:50 ` David Laight
2026-01-19 15:47 ` Nicolas Pitre
2026-01-19 19:03 ` David Laight [this message]
2026-01-19 19:44 ` Nicolas Pitre
2026-01-18 21:04 ` kernel test robot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260119190341.39c3d04c@pumpkin \
--to=david.laight.linux@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=edumazet@google.com \
--cc=eric.dumazet@gmail.com \
--cc=kuba@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=nico@fluxnic.net \
--cc=pabeni@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.