* [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores
@ 2023-10-17 12:38 Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 1/2] target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST Philippe Mathieu-Daudé
` (2 more replies)
0 siblings, 3 replies; 5+ messages in thread
From: Philippe Mathieu-Daudé @ 2023-10-17 12:38 UTC (permalink / raw)
To: qemu-devel, Richard Henderson; +Cc: Song Gao, Philippe Mathieu-Daudé
RFC because unsure and untested...
Based-on: <20231013175109.124308-1-richard.henderson@linaro.org>
tcg: Add tcg_gen_{ld,st}_i128
Philippe Mathieu-Daudé (2):
target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST
target/loongarch: Use i128 for 128-bit load/store in XVLD
target/loongarch/translate.c | 12 ++++++
target/loongarch/insn_trans/trans_vec.c.inc | 46 +++++++--------------
2 files changed, 28 insertions(+), 30 deletions(-)
--
2.41.0
^ permalink raw reply [flat|nested] 5+ messages in thread
* [RFC PATCH 1/2] target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST
2023-10-17 12:38 [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores Philippe Mathieu-Daudé
@ 2023-10-17 12:38 ` Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD Philippe Mathieu-Daudé
2023-10-18 1:17 ` [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores gaosong
2 siblings, 0 replies; 5+ messages in thread
From: Philippe Mathieu-Daudé @ 2023-10-17 12:38 UTC (permalink / raw)
To: qemu-devel, Richard Henderson; +Cc: Song Gao, Philippe Mathieu-Daudé
Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
---
target/loongarch/translate.c | 6 +++++
target/loongarch/insn_trans/trans_vec.c.inc | 30 +++++++--------------
2 files changed, 15 insertions(+), 21 deletions(-)
diff --git a/target/loongarch/translate.c b/target/loongarch/translate.c
index 21f4db6fbd..c6edfc800f 100644
--- a/target/loongarch/translate.c
+++ b/target/loongarch/translate.c
@@ -61,6 +61,12 @@ static inline void set_vreg64(TCGv_i64 src, int regno, int index)
offsetof(CPULoongArchState, fpr[regno].vreg.D(index)));
}
+static inline void get_vreg128(TCGv_i128 dest, int regno, int index)
+{
+ tcg_gen_ld_i128(dest, tcg_env,
+ offsetof(CPULoongArchState, fpr[regno].vreg.Q(index)));
+}
+
static inline int plus_1(DisasContext *ctx, int x)
{
return x + 1;
diff --git a/target/loongarch/insn_trans/trans_vec.c.inc b/target/loongarch/insn_trans/trans_vec.c.inc
index 98f856bb29..dd41f5e48e 100644
--- a/target/loongarch/insn_trans/trans_vec.c.inc
+++ b/target/loongarch/insn_trans/trans_vec.c.inc
@@ -5285,7 +5285,6 @@ static bool trans_vst(DisasContext *ctx, arg_vr_i *a)
{
TCGv addr;
TCGv_i128 val;
- TCGv_i64 ah, al;
if (!avail_LSX(ctx)) {
return false;
@@ -5297,14 +5296,10 @@ static bool trans_vst(DisasContext *ctx, arg_vr_i *a)
addr = gpr_src(ctx, a->rj, EXT_NONE);
val = tcg_temp_new_i128();
- ah = tcg_temp_new_i64();
- al = tcg_temp_new_i64();
addr = make_address_i(ctx, addr, a->imm);
- get_vreg64(ah, a->vd, 1);
- get_vreg64(al, a->vd, 0);
- tcg_gen_concat_i64_i128(val, al, ah);
+ get_vreg128(val, a->vd, 0);
tcg_gen_qemu_st_i128(val, addr, ctx->mem_idx, MO_128 | MO_TE);
return true;
@@ -5342,7 +5337,6 @@ static bool trans_vldx(DisasContext *ctx, arg_vrr *a)
static bool trans_vstx(DisasContext *ctx, arg_vrr *a)
{
TCGv addr, src1, src2;
- TCGv_i64 ah, al;
TCGv_i128 val;
if (!avail_LSX(ctx)) {
@@ -5356,13 +5350,9 @@ static bool trans_vstx(DisasContext *ctx, arg_vrr *a)
src1 = gpr_src(ctx, a->rj, EXT_NONE);
src2 = gpr_src(ctx, a->rk, EXT_NONE);
val = tcg_temp_new_i128();
- ah = tcg_temp_new_i64();
- al = tcg_temp_new_i64();
addr = make_address_x(ctx, src1, src2);
- get_vreg64(ah, a->vd, 1);
- get_vreg64(al, a->vd, 0);
- tcg_gen_concat_i64_i128(val, al, ah);
+ get_vreg128(val, a->vd, 0);
tcg_gen_qemu_st_i128(val, addr, ctx->mem_idx, MO_128 | MO_TE);
return true;
@@ -5484,18 +5474,16 @@ static void gen_xvld(DisasContext *ctx, int vreg, TCGv addr)
static void gen_xvst(DisasContext * ctx, int vreg, TCGv addr)
{
- int i;
+ MemOp mop = MO_128 | MO_TE;
TCGv temp = tcg_temp_new();
- TCGv dest = tcg_temp_new();
+ TCGv_i128 dest = tcg_temp_new_i128();
- get_vreg64(dest, vreg, 0);
- tcg_gen_qemu_st_i64(dest, addr, ctx->mem_idx, MO_TEUQ);
+ get_vreg128(dest, vreg, 0);
+ tcg_gen_qemu_st_i128(dest, addr, ctx->mem_idx, mop);
- for (i = 1; i < 4; i++) {
- tcg_gen_addi_tl(temp, addr, 8 * i);
- get_vreg64(dest, vreg, i);
- tcg_gen_qemu_st_i64(dest, temp, ctx->mem_idx, MO_TEUQ);
- }
+ tcg_gen_addi_tl(temp, addr, 16);
+ get_vreg128(dest, vreg, 1);
+ tcg_gen_qemu_st_i128(dest, temp, ctx->mem_idx, mop);
}
TRANS(xvld, LASX, gen_lasx_memory, gen_xvld)
--
2.41.0
^ permalink raw reply related [flat|nested] 5+ messages in thread
* [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD
2023-10-17 12:38 [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 1/2] target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST Philippe Mathieu-Daudé
@ 2023-10-17 12:38 ` Philippe Mathieu-Daudé
2023-10-17 13:59 ` Richard Henderson
2023-10-18 1:17 ` [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores gaosong
2 siblings, 1 reply; 5+ messages in thread
From: Philippe Mathieu-Daudé @ 2023-10-17 12:38 UTC (permalink / raw)
To: qemu-devel, Richard Henderson; +Cc: Song Gao, Philippe Mathieu-Daudé
Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
---
target/loongarch/translate.c | 6 ++++++
target/loongarch/insn_trans/trans_vec.c.inc | 16 +++++++---------
2 files changed, 13 insertions(+), 9 deletions(-)
diff --git a/target/loongarch/translate.c b/target/loongarch/translate.c
index c6edfc800f..89e35770b7 100644
--- a/target/loongarch/translate.c
+++ b/target/loongarch/translate.c
@@ -67,6 +67,12 @@ static inline void get_vreg128(TCGv_i128 dest, int regno, int index)
offsetof(CPULoongArchState, fpr[regno].vreg.Q(index)));
}
+static inline void set_vreg128(TCGv_i128 src, int regno, int index)
+{
+ tcg_gen_st_i128(src, tcg_env,
+ offsetof(CPULoongArchState, fpr[regno].vreg.Q(index)));
+}
+
static inline int plus_1(DisasContext *ctx, int x)
{
return x + 1;
diff --git a/target/loongarch/insn_trans/trans_vec.c.inc b/target/loongarch/insn_trans/trans_vec.c.inc
index dd41f5e48e..c8b7bfa67f 100644
--- a/target/loongarch/insn_trans/trans_vec.c.inc
+++ b/target/loongarch/insn_trans/trans_vec.c.inc
@@ -5458,18 +5458,16 @@ static bool gen_lasx_memory(DisasContext *ctx, arg_vr_i *a,
static void gen_xvld(DisasContext *ctx, int vreg, TCGv addr)
{
- int i;
+ MemOp mop = MO_128 | MO_TE;
TCGv temp = tcg_temp_new();
- TCGv dest = tcg_temp_new();
+ TCGv_i128 dest = tcg_temp_new_i128();
- tcg_gen_qemu_ld_i64(dest, addr, ctx->mem_idx, MO_TEUQ);
- set_vreg64(dest, vreg, 0);
+ tcg_gen_qemu_ld_i128(dest, addr, ctx->mem_idx, mop);
+ set_vreg128(dest, vreg, 0);
- for (i = 1; i < 4; i++) {
- tcg_gen_addi_tl(temp, addr, 8 * i);
- tcg_gen_qemu_ld_i64(dest, temp, ctx->mem_idx, MO_TEUQ);
- set_vreg64(dest, vreg, i);
- }
+ tcg_gen_addi_tl(temp, addr, 16);
+ tcg_gen_qemu_ld_i128(dest, temp, ctx->mem_idx, mop);
+ set_vreg128(dest, vreg, 1);
}
static void gen_xvst(DisasContext * ctx, int vreg, TCGv addr)
--
2.41.0
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD
2023-10-17 12:38 ` [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD Philippe Mathieu-Daudé
@ 2023-10-17 13:59 ` Richard Henderson
0 siblings, 0 replies; 5+ messages in thread
From: Richard Henderson @ 2023-10-17 13:59 UTC (permalink / raw)
To: Philippe Mathieu-Daudé, qemu-devel; +Cc: Song Gao
On 10/17/23 05:38, Philippe Mathieu-Daudé wrote:
> Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
> ---
> target/loongarch/translate.c | 6 ++++++
> target/loongarch/insn_trans/trans_vec.c.inc | 16 +++++++---------
> 2 files changed, 13 insertions(+), 9 deletions(-)
>
> diff --git a/target/loongarch/translate.c b/target/loongarch/translate.c
> index c6edfc800f..89e35770b7 100644
> --- a/target/loongarch/translate.c
> +++ b/target/loongarch/translate.c
> @@ -67,6 +67,12 @@ static inline void get_vreg128(TCGv_i128 dest, int regno, int index)
> offsetof(CPULoongArchState, fpr[regno].vreg.Q(index)));
> }
>
> +static inline void set_vreg128(TCGv_i128 src, int regno, int index)
> +{
> + tcg_gen_st_i128(src, tcg_env,
> + offsetof(CPULoongArchState, fpr[regno].vreg.Q(index)));
> +}
> +
> static inline int plus_1(DisasContext *ctx, int x)
> {
> return x + 1;
> diff --git a/target/loongarch/insn_trans/trans_vec.c.inc b/target/loongarch/insn_trans/trans_vec.c.inc
> index dd41f5e48e..c8b7bfa67f 100644
> --- a/target/loongarch/insn_trans/trans_vec.c.inc
> +++ b/target/loongarch/insn_trans/trans_vec.c.inc
> @@ -5458,18 +5458,16 @@ static bool gen_lasx_memory(DisasContext *ctx, arg_vr_i *a,
>
> static void gen_xvld(DisasContext *ctx, int vreg, TCGv addr)
> {
> - int i;
> + MemOp mop = MO_128 | MO_TE;
> TCGv temp = tcg_temp_new();
> - TCGv dest = tcg_temp_new();
> + TCGv_i128 dest = tcg_temp_new_i128();
>
> - tcg_gen_qemu_ld_i64(dest, addr, ctx->mem_idx, MO_TEUQ);
> - set_vreg64(dest, vreg, 0);
> + tcg_gen_qemu_ld_i128(dest, addr, ctx->mem_idx, mop);
> + set_vreg128(dest, vreg, 0);
>
> - for (i = 1; i < 4; i++) {
> - tcg_gen_addi_tl(temp, addr, 8 * i);
> - tcg_gen_qemu_ld_i64(dest, temp, ctx->mem_idx, MO_TEUQ);
> - set_vreg64(dest, vreg, i);
> - }
> + tcg_gen_addi_tl(temp, addr, 16);
> + tcg_gen_qemu_ld_i128(dest, temp, ctx->mem_idx, mop);
> + set_vreg128(dest, vreg, 1);
> }
>
> static void gen_xvst(DisasContext * ctx, int vreg, TCGv addr)
Missing the matching change to trans_vld.
r~
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores
2023-10-17 12:38 [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 1/2] target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD Philippe Mathieu-Daudé
@ 2023-10-18 1:17 ` gaosong
2 siblings, 0 replies; 5+ messages in thread
From: gaosong @ 2023-10-18 1:17 UTC (permalink / raw)
To: Philippe Mathieu-Daudé, qemu-devel, Richard Henderson
在 2023/10/17 下午8:38, Philippe Mathieu-Daudé 写道:
> RFC because unsure and untested...
>
> Based-on: <20231013175109.124308-1-richard.henderson@linaro.org>
> tcg: Add tcg_gen_{ld,st}_i128
>
> Philippe Mathieu-Daudé (2):
> target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST
> target/loongarch: Use i128 for 128-bit load/store in XVLD
>
> target/loongarch/translate.c | 12 ++++++
> target/loongarch/insn_trans/trans_vec.c.inc | 46 +++++++--------------
> 2 files changed, 28 insertions(+), 30 deletions(-)
>
I had tested it, you can drop the 'RFC' title.
Tested-by: Song Gao <gaosong@loongson.cn>
Thanks.
Song Gao
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2023-10-18 1:18 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-10-17 12:38 [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 1/2] target/loongarch: Use i128 for 128-bit load/store in VST[X]/XVST Philippe Mathieu-Daudé
2023-10-17 12:38 ` [RFC PATCH 2/2] target/loongarch: Use i128 for 128-bit load/store in XVLD Philippe Mathieu-Daudé
2023-10-17 13:59 ` Richard Henderson
2023-10-18 1:17 ` [RFC PATCH 0/2] target/loongarch: Use i128 for 128-bit loads/stores gaosong
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).