From: David Laight <David.Laight@ACULAB.COM>
To: 'Dave Hansen' <dave.hansen@intel.com>,
'Noah Goldstein' <goldstein.w.n@gmail.com>,
Eric Dumazet <edumazet@google.com>
Cc: "tglx@linutronix.de" <tglx@linutronix.de>,
"mingo@redhat.com" <mingo@redhat.com>,
Borislav Petkov <bp@alien8.de>,
"dave.hansen@linux.intel.com" <dave.hansen@linux.intel.com>,
X86 ML <x86@kernel.org>, "hpa@zytor.com" <hpa@zytor.com>,
"peterz@infradead.org" <peterz@infradead.org>,
"alexanderduyck@fb.com" <alexanderduyck@fb.com>,
open list <linux-kernel@vger.kernel.org>,
netdev <netdev@vger.kernel.org>
Subject: RE: [PATCH] x86/lib: Remove the special case for odd-aligned buffers in csum_partial.c
Date: Mon, 13 Dec 2021 15:37:15 +0000 [thread overview]
Message-ID: <96b6a476c4154da3bd04996139cd8a6d@AcuMS.aculab.com> (raw)
In-Reply-To: <52edd5fd-daa0-729b-4646-43450552d2ab@intel.com>
From: Dave Hansen
> Sent: 13 December 2021 15:02
.c
>
> On 12/13/21 6:43 AM, David Laight wrote:
> > There is no need to special case the very unusual odd-aligned buffers.
> > They are no worse than 4n+2 aligned buffers.
> >
> > Signed-off-by: David Laight <david.laight@aculab.com>
> > ---
> >
> > On an i7-7700 misaligned buffers add 2 or 3 clocks (in 115) to a 512 byte
> > checksum.
> > That is just measuring the main loop with an lfence prior to rdpmc to
> > read PERF_COUNT_HW_CPU_CYCLES.
>
> I'm a bit confused by this changelog.
>
> Are you saying that the patch causes a (small) performance regression?
>
> Are you also saying that the optimization here is not worth it because
> it saves 15 lines of code? Or that the misalignment checks themselves
> add 2 or 3 cycles, and this is an *optimization*?
I'm saying that it can't be worth optimising for a misaligned
buffer because the cost of the buffer being misaligned is so small.
So the test for a misaligned buffer are going to cost more than
and plausible gain.
Not only that the buffer will never be odd aligned at all.
The code is left in from a previous version that did do aligned
word reads - so had to do extra for odd alignment.
Note that code is doing misaligned reads for the more likely 4n+2
aligned ethernet receive buffers.
I doubt that even a test for that would be worthwhile even if you
were checksumming full sized ethernet packets.
So the change is deleting code that is never actually executed
from the hot path.
David
-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)
next prev parent reply other threads:[~2021-12-13 15:37 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-12-13 14:43 [PATCH] x86/lib: Remove the special case for odd-aligned buffers in csum_partial.c David Laight
2021-12-13 15:02 ` Dave Hansen
2021-12-13 15:37 ` David Laight [this message]
2021-12-13 15:56 ` Eric Dumazet
2021-12-13 16:16 ` David Laight
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=96b6a476c4154da3bd04996139cd8a6d@AcuMS.aculab.com \
--to=david.laight@aculab.com \
--cc=alexanderduyck@fb.com \
--cc=bp@alien8.de \
--cc=dave.hansen@intel.com \
--cc=dave.hansen@linux.intel.com \
--cc=edumazet@google.com \
--cc=goldstein.w.n@gmail.com \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=netdev@vger.kernel.org \
--cc=peterz@infradead.org \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox