From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: peter.maydell@linaro.org
Subject: [PATCH 0/3] tcg: Improve vector tail clearing
Date: Sat, 18 Apr 2020 08:56:48 -0700 [thread overview]
Message-ID: <20200418155651.3901-1-richard.henderson@linaro.org> (raw)
Something I noticed while looking at AdvSIMD dumps, while
testing changes common with SVE2.
If we're going to load a zero into a vector register for
clearing the high bits of the SVE register, we might as
well use that zero to store the 8 bytes at the top of the
AdvSIMD register as well.
Output assembly goes from e.g.
00: 48 c7 85 08 10 00 00 00 movq $0x0,0x1008(%rbp)
00 00 00
0b: c5 f9 ef c0 vpxor %xmm0,%xmm0,%xmm0
0f: c5 fe 7f 85 10 10 00 00 vmovdqu %ymm0,0x1010(%rbp)
17: c5 fa 7f 85 30 10 00 00 vmovdqu %xmm0,0x1030(%rbp)
to
00: c5 f9 ef c0 vpxor %xmm0,%xmm0,%xmm0
04: c5 f9 d6 85 08 10 00 00 vmovq %xmm0,0x1008(%rbp)
0c: c5 fe 7f 85 10 10 00 00 vmovdqu %ymm0,0x1010(%rbp)
14: c5 fa 7f 85 30 10 00 00 vmovdqu %xmm0,0x1030(%rbp)
Saves a few bytes now, and more when we can do better with
loading constants into registers, where we can share the
vpxor between instructions.
The target/arm patches are not aided by the tcg patch, but
are not dependent on it.
r~
Richard Henderson (3):
tcg: Improve vector tail clearing
target/arm: Use tcg_gen_gvec_mov for clear_vec_high
target/arm: Use clear_vec_high more effectively
target/arm/translate-a64.c | 69 ++++++++++++++++++--------------
tcg/tcg-op-gvec.c | 82 +++++++++++++++++++++++++++++---------
2 files changed, 101 insertions(+), 50 deletions(-)
--
2.20.1
next reply other threads:[~2020-04-18 15:58 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-04-18 15:56 Richard Henderson [this message]
2020-04-18 15:56 ` [PATCH 1/3] tcg: Improve vector tail clearing Richard Henderson
2020-04-20 15:25 ` Alex Bennée
2020-04-18 15:56 ` [PATCH 2/3] target/arm: Use tcg_gen_gvec_mov for clear_vec_high Richard Henderson
2020-04-20 15:29 ` Alex Bennée
2020-04-18 15:56 ` [PATCH 3/3] target/arm: Use clear_vec_high more effectively Richard Henderson
2020-04-20 15:32 ` Alex Bennée
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200418155651.3901-1-richard.henderson@linaro.org \
--to=richard.henderson@linaro.org \
--cc=peter.maydell@linaro.org \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).