* [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract
@ 2025-07-18 17:38 Richard Henderson
2025-07-18 19:11 ` Peter Maydell
` (2 more replies)
0 siblings, 3 replies; 5+ messages in thread
From: Richard Henderson @ 2025-07-18 17:38 UTC (permalink / raw)
To: qemu-devel; +Cc: pierrick.bouvier, philmd
There is no such thing as vector extract.
Fixes: 932522a9ddc1 ("tcg/optimize: Fold and to extract during optimize")
Resolves: https://gitlab.com/qemu-project/qemu/-/issues/3036
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
tcg/optimize.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/tcg/optimize.c b/tcg/optimize.c
index 62a128bc9b..3638ab9fea 100644
--- a/tcg/optimize.c
+++ b/tcg/optimize.c
@@ -1454,7 +1454,7 @@ static bool fold_and(OptContext *ctx, TCGOp *op)
a_mask = t1->z_mask & ~t2->o_mask;
if (!fold_masks_zosa_int(ctx, op, z_mask, o_mask, s_mask, a_mask)) {
- if (ti_is_const(t2)) {
+ if (op->opc == INDEX_op_and && ti_is_const(t2)) {
/*
* Canonicalize on extract, if valid. This aids x86 with its
* 2 operand MOVZBL and 2 operand AND, selecting the TCGOpcode
--
2.43.0
^ permalink raw reply related [flat|nested] 5+ messages in thread* Re: [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract 2025-07-18 17:38 [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract Richard Henderson @ 2025-07-18 19:11 ` Peter Maydell 2025-07-18 21:26 ` Richard Henderson 2025-07-18 20:10 ` Pierrick Bouvier 2025-07-19 13:05 ` Peter Maydell 2 siblings, 1 reply; 5+ messages in thread From: Peter Maydell @ 2025-07-18 19:11 UTC (permalink / raw) To: Richard Henderson; +Cc: qemu-devel, pierrick.bouvier, philmd On Fri, 18 Jul 2025 at 18:46, Richard Henderson <richard.henderson@linaro.org> wrote: > > There is no such thing as vector extract. > > Fixes: 932522a9ddc1 ("tcg/optimize: Fold and to extract during optimize") > Resolves: https://gitlab.com/qemu-project/qemu/-/issues/3036 > Signed-off-by: Richard Henderson <richard.henderson@linaro.org> > --- > tcg/optimize.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/tcg/optimize.c b/tcg/optimize.c > index 62a128bc9b..3638ab9fea 100644 > --- a/tcg/optimize.c > +++ b/tcg/optimize.c > @@ -1454,7 +1454,7 @@ static bool fold_and(OptContext *ctx, TCGOp *op) > a_mask = t1->z_mask & ~t2->o_mask; > > if (!fold_masks_zosa_int(ctx, op, z_mask, o_mask, s_mask, a_mask)) { > - if (ti_is_const(t2)) { > + if (op->opc == INDEX_op_and && ti_is_const(t2)) { > /* > * Canonicalize on extract, if valid. This aids x86 with its > * 2 operand MOVZBL and 2 operand AND, selecting the TCGOpcod How does the fold_masks_zosa stuff work for vector operations here? The masks are only 64 bits but the value we're working with is wider than that, right? thanks -- PMM ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract 2025-07-18 19:11 ` Peter Maydell @ 2025-07-18 21:26 ` Richard Henderson 0 siblings, 0 replies; 5+ messages in thread From: Richard Henderson @ 2025-07-18 21:26 UTC (permalink / raw) To: Peter Maydell; +Cc: qemu-devel, pierrick.bouvier, philmd On 7/18/25 12:11, Peter Maydell wrote: > On Fri, 18 Jul 2025 at 18:46, Richard Henderson > <richard.henderson@linaro.org> wrote: >> >> There is no such thing as vector extract. >> >> Fixes: 932522a9ddc1 ("tcg/optimize: Fold and to extract during optimize") >> Resolves: https://gitlab.com/qemu-project/qemu/-/issues/3036 >> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> >> --- >> tcg/optimize.c | 2 +- >> 1 file changed, 1 insertion(+), 1 deletion(-) >> >> diff --git a/tcg/optimize.c b/tcg/optimize.c >> index 62a128bc9b..3638ab9fea 100644 >> --- a/tcg/optimize.c >> +++ b/tcg/optimize.c >> @@ -1454,7 +1454,7 @@ static bool fold_and(OptContext *ctx, TCGOp *op) >> a_mask = t1->z_mask & ~t2->o_mask; >> >> if (!fold_masks_zosa_int(ctx, op, z_mask, o_mask, s_mask, a_mask)) { >> - if (ti_is_const(t2)) { >> + if (op->opc == INDEX_op_and && ti_is_const(t2)) { >> /* >> * Canonicalize on extract, if valid. This aids x86 with its >> * 2 operand MOVZBL and 2 operand AND, selecting the TCGOpcod > > > How does the fold_masks_zosa stuff work for vector operations here? > The masks are only 64 bits but the value we're working with is > wider than that, right? For vectors, the known one/zero bits stem from immediate operands. All vector immediates are dup_const, that is, some replication of uint64_t or smaller element. There is no way to represent (__vector uin8_t){ 1,2,3,4,5,6,7,8,9,a,b,c,d,e,f } with tcg at the moment. Thus we can treat this sort of simple vector optimization as replications of uint64_t. Any other operation resets the masks to "unknown" state. r~ ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract 2025-07-18 17:38 [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract Richard Henderson 2025-07-18 19:11 ` Peter Maydell @ 2025-07-18 20:10 ` Pierrick Bouvier 2025-07-19 13:05 ` Peter Maydell 2 siblings, 0 replies; 5+ messages in thread From: Pierrick Bouvier @ 2025-07-18 20:10 UTC (permalink / raw) To: Richard Henderson, qemu-devel; +Cc: philmd On 7/18/25 10:38 AM, Richard Henderson wrote: > There is no such thing as vector extract. > > Fixes: 932522a9ddc1 ("tcg/optimize: Fold and to extract during optimize") > Resolves: https://gitlab.com/qemu-project/qemu/-/issues/3036 > Signed-off-by: Richard Henderson <richard.henderson@linaro.org> > --- > tcg/optimize.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/tcg/optimize.c b/tcg/optimize.c > index 62a128bc9b..3638ab9fea 100644 > --- a/tcg/optimize.c > +++ b/tcg/optimize.c > @@ -1454,7 +1454,7 @@ static bool fold_and(OptContext *ctx, TCGOp *op) > a_mask = t1->z_mask & ~t2->o_mask; > > if (!fold_masks_zosa_int(ctx, op, z_mask, o_mask, s_mask, a_mask)) { > - if (ti_is_const(t2)) { > + if (op->opc == INDEX_op_and && ti_is_const(t2)) { > /* > * Canonicalize on extract, if valid. This aids x86 with its > * 2 operand MOVZBL and 2 operand AND, selecting the TCGOpcode This solves the failures observed, thanks. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Tested-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract 2025-07-18 17:38 [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract Richard Henderson 2025-07-18 19:11 ` Peter Maydell 2025-07-18 20:10 ` Pierrick Bouvier @ 2025-07-19 13:05 ` Peter Maydell 2 siblings, 0 replies; 5+ messages in thread From: Peter Maydell @ 2025-07-19 13:05 UTC (permalink / raw) To: Richard Henderson; +Cc: qemu-devel, pierrick.bouvier, philmd On Fri, 18 Jul 2025 at 18:46, Richard Henderson <richard.henderson@linaro.org> wrote: > > There is no such thing as vector extract. > > Fixes: 932522a9ddc1 ("tcg/optimize: Fold and to extract during optimize") > Resolves: https://gitlab.com/qemu-project/qemu/-/issues/3036 > Signed-off-by: Richard Henderson <richard.henderson@linaro.org> > --- > tcg/optimize.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) Reviewed-by: Peter Maydell <peter.maydell@linaro.org> thanks -- PMM ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2025-07-19 13:07 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2025-07-18 17:38 [PATCH for-10.1] tcg/optimize: Don't fold INDEX_op_and_vec to extract Richard Henderson 2025-07-18 19:11 ` Peter Maydell 2025-07-18 21:26 ` Richard Henderson 2025-07-18 20:10 ` Pierrick Bouvier 2025-07-19 13:05 ` Peter Maydell
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).