From: "Alex Bennée" <alex.bennee@linaro.org>
To: Richard Henderson <richard.henderson@linaro.org>
Cc: qemu-devel@nongnu.org
Subject: Re: [PATCH 08/20] tcg/i386: Implement avx512 variable shifts
Date: Tue, 01 Feb 2022 20:33:29 +0000
Message-ID: <87h79iwcmb.fsf@linaro.org>
In-Reply-To: <20211218194250.247633-9-richard.henderson@linaro.org>

Richard Henderson <richard.henderson@linaro.org> writes:

> AVX512VL has VPSRAVQ, and
> AVX512BW has VPSLLVW, VPSRAVW, VPSRLVW.
>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

> ---
> tcg/i386/tcg-target.c.inc | 32 ++++++++++++++++++++++++--------
> 1 file changed, 24 insertions(+), 8 deletions(-)
>
> diff --git a/tcg/i386/tcg-target.c.inc b/tcg/i386/tcg-target.c.inc
> index 316e550b38..7b9302fcc2 100644
> --- a/tcg/i386/tcg-target.c.inc
> +++ b/tcg/i386/tcg-target.c.inc
> @@ -418,9 +418,13 @@ static bool tcg_target_const_match(int64_t val, TCGType type, int ct)
> #define OPC_VPBROADCASTQ (0x59 | P_EXT38 | P_DATA16)
> #define OPC_VPERMQ (0x00 | P_EXT3A | P_DATA16 | P_VEXW)
> #define OPC_VPERM2I128 (0x46 | P_EXT3A | P_DATA16 | P_VEXL)
> +#define OPC_VPSLLVW (0x12 | P_EXT38 | P_DATA16 | P_VEXW | P_EVEX)
> #define OPC_VPSLLVD (0x47 | P_EXT38 | P_DATA16)
> #define OPC_VPSLLVQ (0x47 | P_EXT38 | P_DATA16 | P_VEXW)
> +#define OPC_VPSRAVW (0x11 | P_EXT38 | P_DATA16 | P_VEXW | P_EVEX)
> #define OPC_VPSRAVD (0x46 | P_EXT38 | P_DATA16)
> +#define OPC_VPSRAVQ (0x46 | P_EXT38 | P_DATA16 | P_VEXW | P_EVEX)
> +#define OPC_VPSRLVW (0x10 | P_EXT38 | P_DATA16 | P_VEXW | P_EVEX)
> #define OPC_VPSRLVD (0x45 | P_EXT38 | P_DATA16)
> #define OPC_VPSRLVQ (0x45 | P_EXT38 | P_DATA16 | P_VEXW)
> #define OPC_VZEROUPPER (0x77 | P_EXT)
> @@ -2742,16 +2746,13 @@ static void tcg_out_vec_op(TCGContext *s, TCGOpcode opc,
> OPC_PMAXUB, OPC_PMAXUW, OPC_PMAXUD, OPC_UD2
> };
> static int const shlv_insn[4] = {
> - /* TODO: AVX512 adds support for MO_16. */
> - OPC_UD2, OPC_UD2, OPC_VPSLLVD, OPC_VPSLLVQ
> + OPC_UD2, OPC_VPSLLVW, OPC_VPSLLVD, OPC_VPSLLVQ
> };
> static int const shrv_insn[4] = {
> - /* TODO: AVX512 adds support for MO_16. */
> - OPC_UD2, OPC_UD2, OPC_VPSRLVD, OPC_VPSRLVQ
> + OPC_UD2, OPC_VPSRLVW, OPC_VPSRLVD, OPC_VPSRLVQ
> };
> static int const sarv_insn[4] = {
> - /* TODO: AVX512 adds support for MO_16, MO_64. */
> - OPC_UD2, OPC_UD2, OPC_VPSRAVD, OPC_UD2
> + OPC_UD2, OPC_VPSRAVW, OPC_VPSRAVD, OPC_VPSRAVQ
> };
> static int const shls_insn[4] = {
> OPC_UD2, OPC_PSLLW, OPC_PSLLD, OPC_PSLLQ
> @@ -3242,9 +3243,24 @@ int tcg_can_emit_vec_op(TCGOpcode opc, TCGType type, unsigned vece)
>
> case INDEX_op_shlv_vec:
> case INDEX_op_shrv_vec:
> - return have_avx2 && vece >= MO_32;
> + switch (vece) {
> + case MO_16:
> + return have_avx512bw;
> + case MO_32:
> + case MO_64:
> + return have_avx2;
> + }
> + return 0;
> case INDEX_op_sarv_vec:
> - return have_avx2 && vece == MO_32;
> + switch (vece) {
> + case MO_16:
> + return have_avx512bw;
> + case MO_32:
> + return have_avx2;
> + case MO_64:
> + return have_avx512vl;
> + }
> + return 0;
> case INDEX_op_rotlv_vec:
> case INDEX_op_rotrv_vec:
> return have_avx2 && vece >= MO_32 ? -1 : 0;
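
For anyone following along, the element-size dispatch in the
tcg_can_emit_vec_op hunk can be read in isolation as the sketch below.
It is not the QEMU code: struct host_features and the can_emit_* helpers
are hypothetical names invented here, standing in for the have_avx2,
have_avx512bw and have_avx512vl flags the patch tests, and the MO_* enum
is redeclared locally so the snippet compiles on its own.

```c
#include <stdbool.h>

/* Element sizes as log2(bytes), redeclared locally for this sketch. */
enum { MO_8, MO_16, MO_32, MO_64 };

/* Hypothetical stand-in for the host feature flags tested by the patch. */
struct host_features {
    bool avx2;
    bool avx512bw;
    bool avx512vl;
};

/* Variable logical shifts (shlv/shrv): 1 if directly supported, else 0. */
static int can_emit_shlv_shrv(const struct host_features *f, unsigned vece)
{
    switch (vece) {
    case MO_16:
        return f->avx512bw;     /* VPSLLVW / VPSRLVW */
    case MO_32:
    case MO_64:
        return f->avx2;         /* VPSLLVD/Q, VPSRLVD/Q */
    }
    return 0;                   /* MO_8: no variable byte shift */
}

/* Variable arithmetic shift (sarv): 1 if directly supported, else 0. */
static int can_emit_sarv(const struct host_features *f, unsigned vece)
{
    switch (vece) {
    case MO_16:
        return f->avx512bw;     /* VPSRAVW */
    case MO_32:
        return f->avx2;         /* VPSRAVD */
    case MO_64:
        return f->avx512vl;     /* VPSRAVQ */
    }
    return 0;                   /* MO_8: no variable byte shift */
}
```

With only avx2 set, can_emit_sarv() accepts MO_32 alone; MO_16 and MO_64
additionally need avx512bw and avx512vl respectively, which mirrors the
three cases in the hunk above.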
--
Alex Bennée