From: David Laight <David.Laight@ACULAB.COM>
To: 'David Howells' <dhowells@redhat.com>,
Linus Torvalds <torvalds@linux-foundation.org>
Cc: Al Viro <viro@zeniv.linux.org.uk>, Jens Axboe <axboe@kernel.dk>,
"Christoph Hellwig" <hch@list.de>,
Christian Brauner <christian@brauner.io>,
"Matthew Wilcox" <willy@infradead.org>,
Jeff Layton <jlayton@kernel.org>,
"linux-fsdevel@vger.kernel.org" <linux-fsdevel@vger.kernel.org>,
"linux-block@vger.kernel.org" <linux-block@vger.kernel.org>,
"linux-mm@kvack.org" <linux-mm@kvack.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: RE: [PATCH v3 2/2] iov_iter: Don't deal with iter->copy_mc in memcpy_from_iter_mc()
Date: Fri, 18 Aug 2023 15:42:05 +0000 [thread overview]
Message-ID: <d8fce3c159b04fdca65cc4d5c307854d@AcuMS.aculab.com> (raw)
In-Reply-To: <2058762.1692371971@warthog.procyon.org.uk>
From: David Howells
> Sent: Friday, August 18, 2023 4:20 PM
>
> Linus Torvalds <torvalds@linux-foundation.org> wrote:
>
> > > Although I'm not sure the bit-fields really help.
> > > There are 8 bytes at the start of the structure, might as well
> > > use them :-)
> >
> > Actuallyç I wrote the patch that way because it seems to improve code
> > generation.
> >
> > The bitfields are generally all set together as just plain one-time
> > constants at initialization time, and gcc sees that it's a full byte
> > write. And the reason 'data_source' is not a bitfield is that it's not
> > a constant at iov_iter init time (it's an argument to all the init
> > functions), so having that one as a separate byte at init time is good
> > for code generation when you don't need to mask bits or anything like
> > that.
> >
> > And once initialized, having things be dense and doing all the
> > compares with a bitwise 'and' instead of doing them as some value
> > compare again tends to generate good code.
>
> Actually... I said that switch(enum) seemed to generate suboptimal code...
> However, if the enum is renumbered such that the constants are in the same
> order as in the switch() it generates better code.
Hmmm.. the order of the switch labels really shouldn't matter.
The advantage of the if-chain is that you can optimise for
the most common case.
> So we want this order:
>
> enum iter_type {
> ITER_UBUF,
> ITER_IOVEC,
> ITER_BVEC,
> ITER_KVEC,
> ITER_XARRAY,
> ITER_DISCARD,
> };
Will gcc actually code this version without pessimising it?
if (likely(type <= ITER_IOVEC) {
if (likely(type != ITER_IOVEC))
iterate_ubuf();
else
iterate_iovec();
} else if (likely(type) <= ITER_KVEC)) {
if (type == ITER_KVEC)
iterate_kvec();
else
iterate_bvec();
} else if (type == ITER_XARRAY) {
iterate_xarrar()
} else {
discard;
}
But I bet you can't stop it replicating the compares.
(especially with the likely().
That has two mis-predicted (are they ever right!) branches in the
common user-copy versions and three in the common kernel ones.
In some architectures you might get the default 'fall through'
to the UBUF code if the branches aren't predictable.
But I believe current x86 cpu never do static prediction.
So you always lose :-)
...
> static inline bool user_backed_iter(const struct iov_iter *i)
> {
> return iter_is_ubuf(i) || iter_is_iovec(i);
> }
>
> which gcc just changes into something like a "CMP $1" and a "JA".
That makes sense...
> Comparing Linus's bit patch (+ is better) to renumbering the switch (- is
> better):
>
....
> iov_iter_init inc 0x27 -> 0x31 +0xa
Are you hitting the gcc bug that loads the constant from memory?
> I think there may be more savings to be made if I go and convert more of the
> functions to using switch().
Size isn't everything, the code needs to be optimised for the hot paths.
David
-
Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK
Registration No: 1397386 (Wales)
next prev parent reply other threads:[~2023-08-18 15:42 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-08-16 12:07 [PATCH v3 0/2] iov_iter: Convert the iterator macros into inline funcs David Howells
2023-08-16 12:07 ` [PATCH v3 1/2] iov_iter: Convert iterate*() to " David Howells
2023-08-16 12:07 ` [PATCH v3 2/2] iov_iter: Don't deal with iter->copy_mc in memcpy_from_iter_mc() David Howells
2023-08-16 12:28 ` David Laight
2023-08-16 13:00 ` David Howells
2023-08-16 14:19 ` David Laight
2023-08-16 18:50 ` Linus Torvalds
2023-08-16 20:35 ` David Howells
2023-08-17 4:18 ` Linus Torvalds
2023-08-17 8:41 ` David Laight
2023-08-17 14:38 ` Linus Torvalds
2023-08-17 15:16 ` David Laight
2023-08-17 15:31 ` Linus Torvalds
2023-08-17 16:06 ` David Laight
2023-08-18 15:19 ` David Howells
2023-08-18 15:42 ` David Laight [this message]
2023-08-18 16:48 ` David Howells
2023-08-18 21:39 ` David Laight
2023-08-18 11:42 ` David Howells
2023-08-18 12:16 ` David Laight
2023-08-18 12:26 ` Matthew Wilcox
2023-08-18 12:41 ` David Laight
2023-08-18 13:33 ` David Howells
2023-08-18 11:39 ` David Howells
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=d8fce3c159b04fdca65cc4d5c307854d@AcuMS.aculab.com \
--to=david.laight@aculab.com \
--cc=axboe@kernel.dk \
--cc=christian@brauner.io \
--cc=dhowells@redhat.com \
--cc=hch@list.de \
--cc=jlayton@kernel.org \
--cc=linux-block@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=torvalds@linux-foundation.org \
--cc=viro@zeniv.linux.org.uk \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).