qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2 0/3] target/arm: SME1/SVE2 fixes
@ 2025-06-22 21:35 Richard Henderson
  2025-06-22 21:35 ` [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority Richard Henderson
                   ` (2 more replies)
  0 siblings, 3 replies; 9+ messages in thread
From: Richard Henderson @ 2025-06-22 21:35 UTC (permalink / raw)
  To: qemu-devel

Supercedes: 20250622175052.180728-1-richard.henderson@linaro.org
("target/arm: Fix SME vs AdvSIMD exception priority")

A couple of fixes for EC_SMETRAP, plus some insns that missed
being updated for non-streaming.


r~


Richard Henderson (3):
  target/arm: Fix SME vs AdvSIMD exception priority
  target/arm: Fix sve_access_check for SME
  target/arm: Fix 128-bit element ZIP, UZP, TRN

 target/arm/tcg/translate-a64.c | 30 +++++++++++++++++-------
 target/arm/tcg/translate-sve.c | 43 ++++++++++++++++++++++++----------
 2 files changed, 51 insertions(+), 22 deletions(-)

-- 
2.43.0



^ permalink raw reply	[flat|nested] 9+ messages in thread

* [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority
  2025-06-22 21:35 [PATCH v2 0/3] target/arm: SME1/SVE2 fixes Richard Henderson
@ 2025-06-22 21:35 ` Richard Henderson
  2025-06-24 13:05   ` Peter Maydell
  2025-06-22 21:35 ` [PATCH v2 2/3] target/arm: Fix sve_access_check for SME Richard Henderson
  2025-06-22 21:35 ` [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN Richard Henderson
  2 siblings, 1 reply; 9+ messages in thread
From: Richard Henderson @ 2025-06-22 21:35 UTC (permalink / raw)
  To: qemu-devel; +Cc: qemu-stable

We failed to raise an exception when
sme_excp_el == 0 and fp_excp_el == 1.

Cc: qemu-stable@nongnu.org
Fixes: 3d74825f4d6 ("target/arm: Add SME enablement checks")
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/translate-a64.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index ac80f572a2..bb49a2ce90 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -1494,7 +1494,8 @@ bool sme_enabled_check(DisasContext *s)
      * to be zero when fp_excp_el has priority.  This is because we need
      * sme_excp_el by itself for cpregs access checks.
      */
-    if (!s->fp_excp_el || s->sme_excp_el < s->fp_excp_el) {
+    if (s->sme_excp_el
+        && (!s->fp_excp_el || s->sme_excp_el <= s->fp_excp_el)) {
         bool ret = sme_access_check(s);
         s->fp_access_checked = (ret ? 1 : -1);
         return ret;
-- 
2.43.0



^ permalink raw reply related	[flat|nested] 9+ messages in thread

* [PATCH v2 2/3] target/arm: Fix sve_access_check for SME
  2025-06-22 21:35 [PATCH v2 0/3] target/arm: SME1/SVE2 fixes Richard Henderson
  2025-06-22 21:35 ` [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority Richard Henderson
@ 2025-06-22 21:35 ` Richard Henderson
  2025-06-24 13:58   ` Peter Maydell
  2025-06-22 21:35 ` [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN Richard Henderson
  2 siblings, 1 reply; 9+ messages in thread
From: Richard Henderson @ 2025-06-22 21:35 UTC (permalink / raw)
  To: qemu-devel; +Cc: qemu-stable

Do not assume SME implies SVE.  Ensure that the
non-streaming check is present along the SME path,
since it is not implied by sme_*_enabled_check.

Cc: qemu-stable@nongnu.org
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/translate-a64.c | 27 +++++++++++++++++++--------
 1 file changed, 19 insertions(+), 8 deletions(-)

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index bb49a2ce90..d7b0c81773 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -1387,11 +1387,8 @@ static bool fp_access_check_only(DisasContext *s)
     return true;
 }
 
-static bool fp_access_check(DisasContext *s)
+static bool nonstreaming_check(DisasContext *s)
 {
-    if (!fp_access_check_only(s)) {
-        return false;
-    }
     if (s->sme_trap_nonstreaming && s->is_nonstreaming) {
         gen_exception_insn(s, 0, EXCP_UDEF,
                            syn_smetrap(SME_ET_Streaming, false));
@@ -1400,6 +1397,11 @@ static bool fp_access_check(DisasContext *s)
     return true;
 }
 
+static bool fp_access_check(DisasContext *s)
+{
+    return fp_access_check_only(s) && nonstreaming_check(s);
+}
+
 /*
  * Return <0 for non-supported element sizes, with MO_16 controlled by
  * FEAT_FP16; return 0 for fp disabled; otherwise return >0 for success.
@@ -1450,11 +1452,20 @@ static int fp_access_check_vector_hsd(DisasContext *s, bool is_q, MemOp esz)
  */
 bool sve_access_check(DisasContext *s)
 {
-    if (s->pstate_sm || !dc_isar_feature(aa64_sve, s)) {
-        bool ret;
+    bool ret;
 
-        assert(dc_isar_feature(aa64_sme, s));
-        ret = sme_sm_enabled_check(s);
+    switch (dc_isar_feature(aa64_sme, s)) {
+    case true:
+        if (s->pstate_sm) {
+            ret = sme_enabled_check(s);
+        } else if (!dc_isar_feature(aa64_sve, s)) {
+            ret = sme_sm_enabled_check(s);
+        } else {
+            break;
+        }
+        if (ret) {
+            ret = nonstreaming_check(s);
+        }
         s->sve_access_checked = (ret ? 1 : -1);
         return ret;
     }
-- 
2.43.0



^ permalink raw reply related	[flat|nested] 9+ messages in thread

* [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN
  2025-06-22 21:35 [PATCH v2 0/3] target/arm: SME1/SVE2 fixes Richard Henderson
  2025-06-22 21:35 ` [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority Richard Henderson
  2025-06-22 21:35 ` [PATCH v2 2/3] target/arm: Fix sve_access_check for SME Richard Henderson
@ 2025-06-22 21:35 ` Richard Henderson
  2025-06-24 14:59   ` Peter Maydell
  2 siblings, 1 reply; 9+ messages in thread
From: Richard Henderson @ 2025-06-22 21:35 UTC (permalink / raw)
  To: qemu-devel; +Cc: qemu-stable

We missed the instructions UDEF when the vector size is too small.
We missed marking the instructions non-streaming with SME.

Cc: qemu-stable@nongnu.org
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/translate-sve.c | 43 ++++++++++++++++++++++++----------
 1 file changed, 30 insertions(+), 13 deletions(-)

diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
index f3cf028cb9..588a5b006b 100644
--- a/target/arm/tcg/translate-sve.c
+++ b/target/arm/tcg/translate-sve.c
@@ -2352,6 +2352,23 @@ TRANS_FEAT(PUNPKHI, aa64_sve, do_perm_pred2, a, 1, gen_helper_sve_punpk_p)
  *** SVE Permute - Interleaving Group
  */
 
+static bool do_interleave_q(DisasContext *s, gen_helper_gvec_3 *fn,
+                            arg_rrr_esz *a, int data)
+{
+    if (sve_access_check(s)) {
+        unsigned vsz = vec_full_reg_size(s);
+        if (vsz < 32) {
+            unallocated_encoding(s);
+        } else {
+            tcg_gen_gvec_3_ool(vec_full_reg_offset(s, a->rd),
+                               vec_full_reg_offset(s, a->rn),
+                               vec_full_reg_offset(s, a->rm),
+                               vsz, vsz, data, fn);
+        }
+    }
+    return true;
+}
+
 static gen_helper_gvec_3 * const zip_fns[4] = {
     gen_helper_sve_zip_b, gen_helper_sve_zip_h,
     gen_helper_sve_zip_s, gen_helper_sve_zip_d,
@@ -2361,11 +2378,11 @@ TRANS_FEAT(ZIP1_z, aa64_sve, gen_gvec_ool_arg_zzz,
 TRANS_FEAT(ZIP2_z, aa64_sve, gen_gvec_ool_arg_zzz,
            zip_fns[a->esz], a, vec_full_reg_size(s) / 2)
 
-TRANS_FEAT(ZIP1_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_zip_q, a, 0)
-TRANS_FEAT(ZIP2_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_zip_q, a,
-           QEMU_ALIGN_DOWN(vec_full_reg_size(s), 32) / 2)
+TRANS_FEAT_NONSTREAMING(ZIP1_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_zip_q, a, 0)
+TRANS_FEAT_NONSTREAMING(ZIP2_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_zip_q, a,
+                        QEMU_ALIGN_DOWN(vec_full_reg_size(s), 32) / 2)
 
 static gen_helper_gvec_3 * const uzp_fns[4] = {
     gen_helper_sve_uzp_b, gen_helper_sve_uzp_h,
@@ -2377,10 +2394,10 @@ TRANS_FEAT(UZP1_z, aa64_sve, gen_gvec_ool_arg_zzz,
 TRANS_FEAT(UZP2_z, aa64_sve, gen_gvec_ool_arg_zzz,
            uzp_fns[a->esz], a, 1 << a->esz)
 
-TRANS_FEAT(UZP1_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_uzp_q, a, 0)
-TRANS_FEAT(UZP2_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_uzp_q, a, 16)
+TRANS_FEAT_NONSTREAMING(UZP1_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_uzp_q, a, 0)
+TRANS_FEAT_NONSTREAMING(UZP2_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_uzp_q, a, 16)
 
 static gen_helper_gvec_3 * const trn_fns[4] = {
     gen_helper_sve_trn_b, gen_helper_sve_trn_h,
@@ -2392,10 +2409,10 @@ TRANS_FEAT(TRN1_z, aa64_sve, gen_gvec_ool_arg_zzz,
 TRANS_FEAT(TRN2_z, aa64_sve, gen_gvec_ool_arg_zzz,
            trn_fns[a->esz], a, 1 << a->esz)
 
-TRANS_FEAT(TRN1_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_trn_q, a, 0)
-TRANS_FEAT(TRN2_q, aa64_sve_f64mm, gen_gvec_ool_arg_zzz,
-           gen_helper_sve2_trn_q, a, 16)
+TRANS_FEAT_NONSTREAMING(TRN1_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_trn_q, a, 0)
+TRANS_FEAT_NONSTREAMING(TRN2_q, aa64_sve_f64mm, do_interleave_q,
+                        gen_helper_sve2_trn_q, a, 16)
 
 /*
  *** SVE Permute Vector - Predicated Group
-- 
2.43.0



^ permalink raw reply related	[flat|nested] 9+ messages in thread

* Re: [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority
  2025-06-22 21:35 ` [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority Richard Henderson
@ 2025-06-24 13:05   ` Peter Maydell
  0 siblings, 0 replies; 9+ messages in thread
From: Peter Maydell @ 2025-06-24 13:05 UTC (permalink / raw)
  To: Richard Henderson; +Cc: qemu-devel, qemu-stable

On Sun, 22 Jun 2025 at 22:36, Richard Henderson
<richard.henderson@linaro.org> wrote:
>
> We failed to raise an exception when
> sme_excp_el == 0 and fp_excp_el == 1.
>
> Cc: qemu-stable@nongnu.org
> Fixes: 3d74825f4d6 ("target/arm: Add SME enablement checks")
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>  target/arm/tcg/translate-a64.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)


Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

thanks
-- PMM


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [PATCH v2 2/3] target/arm: Fix sve_access_check for SME
  2025-06-22 21:35 ` [PATCH v2 2/3] target/arm: Fix sve_access_check for SME Richard Henderson
@ 2025-06-24 13:58   ` Peter Maydell
  2025-06-24 14:41     ` Richard Henderson
  0 siblings, 1 reply; 9+ messages in thread
From: Peter Maydell @ 2025-06-24 13:58 UTC (permalink / raw)
  To: Richard Henderson; +Cc: qemu-devel, qemu-stable

On Sun, 22 Jun 2025 at 22:36, Richard Henderson
<richard.henderson@linaro.org> wrote:
>
> Do not assume SME implies SVE.  Ensure that the
> non-streaming check is present along the SME path,
> since it is not implied by sme_*_enabled_check.
>
> Cc: qemu-stable@nongnu.org
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>  target/arm/tcg/translate-a64.c | 27 +++++++++++++++++++--------
>  1 file changed, 19 insertions(+), 8 deletions(-)
>
> diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
> index bb49a2ce90..d7b0c81773 100644
> --- a/target/arm/tcg/translate-a64.c
> +++ b/target/arm/tcg/translate-a64.c
> @@ -1387,11 +1387,8 @@ static bool fp_access_check_only(DisasContext *s)
>      return true;
>  }
>
> -static bool fp_access_check(DisasContext *s)
> +static bool nonstreaming_check(DisasContext *s)
>  {
> -    if (!fp_access_check_only(s)) {
> -        return false;
> -    }
>      if (s->sme_trap_nonstreaming && s->is_nonstreaming) {
>          gen_exception_insn(s, 0, EXCP_UDEF,
>                             syn_smetrap(SME_ET_Streaming, false));
> @@ -1400,6 +1397,11 @@ static bool fp_access_check(DisasContext *s)
>      return true;
>  }
>
> +static bool fp_access_check(DisasContext *s)
> +{
> +    return fp_access_check_only(s) && nonstreaming_check(s);
> +}
> +
>  /*
>   * Return <0 for non-supported element sizes, with MO_16 controlled by
>   * FEAT_FP16; return 0 for fp disabled; otherwise return >0 for success.
> @@ -1450,11 +1452,20 @@ static int fp_access_check_vector_hsd(DisasContext *s, bool is_q, MemOp esz)
>   */
>  bool sve_access_check(DisasContext *s)
>  {
> -    if (s->pstate_sm || !dc_isar_feature(aa64_sve, s)) {
> -        bool ret;
> +    bool ret;
>
> -        assert(dc_isar_feature(aa64_sme, s));
> -        ret = sme_sm_enabled_check(s);
> +    switch (dc_isar_feature(aa64_sme, s)) {
> +    case true:

Why this rather than
       if (dc_isar_feature(aa64_sme, s)) {

?

> +        if (s->pstate_sm) {
> +            ret = sme_enabled_check(s);
> +        } else if (!dc_isar_feature(aa64_sve, s)) {
> +            ret = sme_sm_enabled_check(s);
> +        } else {
> +            break;
> +        }
> +        if (ret) {
> +            ret = nonstreaming_check(s);
> +        }
>          s->sve_access_checked = (ret ? 1 : -1);
>          return ret;
>      }

Otherwise
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

thanks
-- PMM


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [PATCH v2 2/3] target/arm: Fix sve_access_check for SME
  2025-06-24 13:58   ` Peter Maydell
@ 2025-06-24 14:41     ` Richard Henderson
  0 siblings, 0 replies; 9+ messages in thread
From: Richard Henderson @ 2025-06-24 14:41 UTC (permalink / raw)
  To: Peter Maydell; +Cc: qemu-devel, qemu-stable

On 6/24/25 06:58, Peter Maydell wrote:
>> +    switch (dc_isar_feature(aa64_sme, s)) {
>> +    case true:
> 
> Why this rather than
>         if (dc_isar_feature(aa64_sme, s)) {
> 
> ?
> 
>> +        if (s->pstate_sm) {
>> +            ret = sme_enabled_check(s);
>> +        } else if (!dc_isar_feature(aa64_sve, s)) {
>> +            ret = sme_sm_enabled_check(s);
>> +        } else {
>> +            break;
>> +        }

I used break instead of a goto, or replicating


>> +        if (ret) {
>> +            ret = nonstreaming_check(s);
>> +        }
>>           s->sve_access_checked = (ret ? 1 : -1);
>>           return ret;

this block.


r~


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN
  2025-06-22 21:35 ` [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN Richard Henderson
@ 2025-06-24 14:59   ` Peter Maydell
  2025-06-24 19:04     ` Richard Henderson
  0 siblings, 1 reply; 9+ messages in thread
From: Peter Maydell @ 2025-06-24 14:59 UTC (permalink / raw)
  To: Richard Henderson; +Cc: qemu-devel, qemu-stable

On Sun, 22 Jun 2025 at 22:36, Richard Henderson
<richard.henderson@linaro.org> wrote:
>
> We missed the instructions UDEF when the vector size is too small.

I think this part also applies to FMMLA_d and trans_LD1RO_zprr
and trans_LD1RO_zprr ?

We should also probably put something in so that if the
user asks for a CPU with a max SVE VL of 128 then we disable
FEAT_F64MM.

> We missed marking the instructions non-streaming with SME.
>
> Cc: qemu-stable@nongnu.org
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>  target/arm/tcg/translate-sve.c | 43 ++++++++++++++++++++++++----------
>  1 file changed, 30 insertions(+), 13 deletions(-)

These code changes look correct to me.

thanks
-- PMM


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN
  2025-06-24 14:59   ` Peter Maydell
@ 2025-06-24 19:04     ` Richard Henderson
  0 siblings, 0 replies; 9+ messages in thread
From: Richard Henderson @ 2025-06-24 19:04 UTC (permalink / raw)
  To: Peter Maydell; +Cc: qemu-devel, qemu-stable

On 6/24/25 07:59, Peter Maydell wrote:
> On Sun, 22 Jun 2025 at 22:36, Richard Henderson
> <richard.henderson@linaro.org> wrote:
>>
>> We missed the instructions UDEF when the vector size is too small.
> 
> I think this part also applies to FMMLA_d and trans_LD1RO_zprr
> and trans_LD1RO_zprr ?

LD1RO is handled in do_ldro; the address calculation is done beforehand, but that's easily 
discarded with the exception.

Fixed FMMLA.

> We should also probably put something in so that if the
> user asks for a CPU with a max SVE VL of 128 then we disable
> FEAT_F64MM.

Done.


r~


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2025-06-24 19:05 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-06-22 21:35 [PATCH v2 0/3] target/arm: SME1/SVE2 fixes Richard Henderson
2025-06-22 21:35 ` [PATCH v2 1/3] target/arm: Fix SME vs AdvSIMD exception priority Richard Henderson
2025-06-24 13:05   ` Peter Maydell
2025-06-22 21:35 ` [PATCH v2 2/3] target/arm: Fix sve_access_check for SME Richard Henderson
2025-06-24 13:58   ` Peter Maydell
2025-06-24 14:41     ` Richard Henderson
2025-06-22 21:35 ` [PATCH v2 3/3] target/arm: Fix 128-bit element ZIP, UZP, TRN Richard Henderson
2025-06-24 14:59   ` Peter Maydell
2025-06-24 19:04     ` Richard Henderson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).