Re: [PATCH V2 2/4] x86/insn: Add AVX-512 support to the instruction decoder

public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed

From: Adrian Hunter <adrian.hunter@intel.com>
To: Borislav Petkov <bp@alien8.de>, Masami Hiramatsu <mhiramat@kernel.org>
Cc: <linux-kernel@vger.kernel.org>,
	Arnaldo Carvalho de Melo <acme@kernel.org>,
	Jiri Olsa <jolsa@redhat.com>,
	Dan Williams <dan.j.williams@intel.com>,
	Ingo Molnar <mingo@redhat.com>, "H. Peter Anvin" <hpa@zytor.com>,
	Thomas Gleixner <tglx@linutronix.de>,
	Andy Lutomirski <luto@amacapital.net>, X86 ML <x86@kernel.org>
Subject: Re: [PATCH V2 2/4] x86/insn: Add AVX-512 support to the instruction decoder
Date: Mon, 1 Dec 2025 20:15:19 +0200	[thread overview]
Message-ID: <1e6bfa0c-6733-4de2-80ae-5bc08ccbf58b@intel.com> (raw)
In-Reply-To: <20251201112526.GBaS17JhPrvYGiWv3L@fat_crate.local>

On 01/12/2025 13:25, Borislav Petkov wrote:
> On Sun, Nov 30, 2025 at 05:05:28PM +0100, Borislav Petkov wrote:
>> Resurrecting a very old thread...
>>
>> On Wed, Jul 20, 2016 at 11:30:35AM +0300, Adrian Hunter wrote:
>>> Add support for Intel's AVX-512 instructions to the instruction decoder.
>>>
>>> AVX-512 instructions are documented in Intel Architecture Instruction Set
>>> Extensions Programming Reference (February 2016).
>>>
>>> AVX-512 instructions are identified by a EVEX prefix which, for the purpose
>>> of instruction decoding, can be treated as though it were a 4-byte VEX
>>> prefix.
>>>
>>> Existing instructions which can now accept an EVEX prefix need not be
>>> further annotated in the op code map (x86-opcode-map.txt). In the case of
>>> new instructions, the op code map is updated accordingly.
>>>
>>> Also add associated Mask Instructions that are used to manipulate mask
>>> registers used in AVX-512 instructions.
>>>
>>> 'perf tools' instruction decoder is updated in a subsequent patch.  And a
>>> representative set of instructions is added to the perf tools new
>>> instructions test in a subsequent patch.
>>>
>>> Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
>>> ---
>>>  arch/x86/include/asm/inat.h          |  17 ++-
>>>  arch/x86/include/asm/insn.h          |  12 +-
>>>  arch/x86/lib/insn.c                  |  18 ++-
>>>  arch/x86/lib/x86-opcode-map.txt      | 263 +++++++++++++++++++++++------------
>>>  arch/x86/tools/gen-insn-attr-x86.awk |  11 +-
>>>  5 files changed, 220 insertions(+), 101 deletions(-)
>>>
>>> +78: VMREAD Ey,Gy | vcvttps2udq/pd2udq Vx,Wpd (evo) | vcvttsd2usi Gv,Wx (F2),(ev) | vcvttss2usi Gv,Wx (F3),(ev) | vcvttps2uqq/pd2uqq Vx,Wx (66),(ev)
>>> +79: VMWRITE Gy,Ey | vcvtps2udq/pd2udq Vx,Wpd (evo) | vcvtsd2usi Gv,Wx (F2),(ev) | vcvtss2usi Gv,Wx (F3),(ev) | vcvtps2uqq/pd2uqq Vx,Wx (66),(ev)
>>
>> This is all fine and dandy but those (ev*) flags cause the escape table to
>> have INAT_EVEXONLY as a flag:
>>
>> const insn_attr_t inat_escape_table_1_1[INAT_OPCODE_TABLE_SIZE] = {
>> 	...
>>
>> 		[0x79] = INAT_MODRM | INAT_VEXOK | INAT_EVEXONLY,
>>
>> };
>>
>> except that that opcode is not EVEX only. Intel's VMREAD and VMWRITE are *not*
>> EVEX insns and AMD has there EXTRQ and INSERTQ with prefixes 66 and F2
>> respectively which are SSE4a and both are not EVEX.
>>
>> The VMREAD and VMWRITE decoding happens to work out by pure chance because
>> those are without a prefix and the check for prefix id in
>> inat_get_escape_attribute() happens to not select that escape table.
>>
>> So the first thing that comes to mind is excluding opcodes like 0x79 which can
>> be mixed type from that inat_must_vex() enforcement...?
>>
>> Masami, any other ideas?
> 
> This hack seems to do the trick. We probably should take a look at all the
> insn tables and if there are more opcodes like that, to turn the mixed bool
> below into a proper flag:
> 
> 
> diff --git a/tools/arch/x86/lib/insn.c b/tools/arch/x86/lib/insn.c
> index 1d1c57c74d1f..e3216da11a7c 100644
> --- a/tools/arch/x86/lib/insn.c
> +++ b/tools/arch/x86/lib/insn.c
> @@ -276,6 +276,7 @@ int insn_get_prefixes(struct insn *insn)
>  int insn_get_opcode(struct insn *insn)
>  {
>  	struct insn_field *opcode = &insn->opcode;
> +	bool mixed = false;
>  	int pfx_id, ret;
>  	insn_byte_t op;
>  
> @@ -348,13 +359,25 @@ int insn_get_opcode(struct insn *insn)
>  	while (inat_is_escape(insn->attr)) {
>  		/* Get escaped opcode */
>  		op = get_next(insn_byte_t, insn);
> +
>  		opcode->bytes[opcode->nbytes++] = op;
>  		pfx_id = insn_last_prefix_id(insn);
> +
> +		printf("%s: escaped op: 0x%x, pfx_id (insn table: none, 66, f3, f2): 0x%x, attr: 0x%x\n",
> +			__func__, op, pfx_id, insn->attr);
> +
>  		insn->attr = inat_get_escape_attribute(op, pfx_id, insn->attr);
> +
> +		printf("got attr: 0x%x\n", insn->attr);
>  	}
>  
> -	if (inat_must_vex(insn->attr)) {
> +	mixed = (opcode->bytes[0] == 0xf) && (opcode->bytes[1] == 0x79);
> +
> +	printf("%s: must_vex, mixed: %d\n", __func__, mixed);
> +
> +	if (inat_must_vex(insn->attr) && !mixed) {
>  		/* This instruction is bad */
> +		printf("%s: must_vex bad\n", __func__);
>  		insn->attr = 0;
>  		return -EINVAL;
>  	}
> diff --git a/tools/arch/x86/lib/x86-opcode-map.txt b/tools/arch/x86/lib/x86-opcode-map.txt
> index 0139b864ceef..d059c8e63bfe 100644
> --- a/tools/arch/x86/lib/x86-opcode-map.txt
> +++ b/tools/arch/x86/lib/x86-opcode-map.txt
> @@ -474,7 +474,7 @@ AVXcode: 1
>  # Note: Remove (v), because vzeroall and vzeroupper becomes emms without VEX.
>  77: emms | vzeroupper | vzeroall
>  78: VMREAD Ey,Gy | vcvttps2udq/pd2udq Vx,Wpd (evo) | vcvttsd2usi Gv,Wx (F2),(ev) | vcvttss2usi Gv,Wx (F3),(ev) | vcvttps2uqq/pd2uqq Vx,Wx (66),(ev)
> -79: VMWRITE Gy,Ey | vcvtps2udq/pd2udq Vx,Wpd (evo) | vcvtsd2usi Gv,Wx (F2),(ev) | vcvtss2usi Gv,Wx (F3),(ev) | vcvtps2uqq/pd2uqq Vx,Wx (66),(ev) | EXTRQ
> +79: VMWRITE Gy,Ey | EXTRQ Vo,Uo (66) | vcvtps2udq/pd2udq Vx,Wpd (evo) | vcvtsd2usi Gv,Wx (F2),(ev) | vcvtss2usi Gv,Wx (F3),(ev) | vcvtps2uqq/pd2uqq Vx,Wx (66),(ev)

EXTRQ Vo,Uo (66) has a mandatory 66 prefix like vcvtps2uqq/pd2uqq Vx,Wx (66),(ev) so they end up on the same attribute table, but (ev) results in INAT_EVEXONLY which is unwanted.

Changing that from (ev) to (evo) is probably all that is needed e.g.

+79: VMWRITE Gy,Ey | EXTRQ Vo,Uo (66) | vcvtps2udq/pd2udq Vx,Wpd (evo) | vcvtsd2usi Gv,Wx (F2),(ev) | vcvtss2usi Gv,Wx (F3),(ev) | vcvtps2uqq/pd2uqq Vx,Wx (66),(evo)

next prev parent reply	other threads:[~2025-12-01 18:15 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-07-20  8:30 [PATCH V2 0/4] x86/insn: perf tools: Add AVX-512 support to the instruction decoder Adrian Hunter
2016-07-20  8:30 ` [PATCH V2 1/4] x86/insn: perf tools: Fix vcvtph2ps instruction decoding Adrian Hunter
2016-07-25 18:08   ` [tip:perf/core] " tip-bot for Adrian Hunter
2016-07-20  8:30 ` [PATCH V2 2/4] x86/insn: Add AVX-512 support to the instruction decoder Adrian Hunter
2016-07-25 18:08   ` [tip:perf/core] " tip-bot for Adrian Hunter
2025-11-30 16:05   ` [PATCH V2 2/4] " Borislav Petkov
2025-12-01 11:25     ` Borislav Petkov
2025-12-01 18:15       ` Adrian Hunter [this message]
2025-12-01 19:01         ` Borislav Petkov
2025-12-02  6:26           ` Adrian Hunter
2025-12-02 10:47             ` Borislav Petkov
2016-07-20  8:30 ` [PATCH V2 3/4] perf tools: Add AVX-512 support to the instruction decoder used by Intel PT Adrian Hunter
2016-07-25 18:09   ` [tip:perf/core] " tip-bot for Adrian Hunter
2016-07-20  8:30 ` [PATCH V2 4/4] perf tools: Add AVX-512 instructions to the new instructions test Adrian Hunter
2016-07-25 18:09   ` [tip:perf/core] " tip-bot for Adrian Hunter
2016-07-20 18:11 ` [PATCH V2 0/4] x86/insn: perf tools: Add AVX-512 support to the instruction decoder Masami Hiramatsu
2016-07-21 11:18   ` Adrian Hunter
2016-07-21 12:39     ` Arnaldo Carvalho de Melo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1e6bfa0c-6733-4de2-80ae-5bc08ccbf58b@intel.com \
    --to=adrian.hunter@intel.com \
    --cc=acme@kernel.org \
    --cc=bp@alien8.de \
    --cc=dan.j.williams@intel.com \
    --cc=hpa@zytor.com \
    --cc=jolsa@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=luto@amacapital.net \
    --cc=mhiramat@kernel.org \
    --cc=mingo@redhat.com \
    --cc=tglx@linutronix.de \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox