From: "Alex Bennée" <alex.bennee@linaro.org>
To: Richard Henderson <richard.henderson@linaro.org>
Cc: qemu-devel@nongnu.org, peter.maydell@linaro.org
Subject: Re: [Qemu-devel] [PATCH 8/9] target/arm: Implement FP data-processing (2 source) for fp16
Date: Tue, 01 May 2018 12:13:38 +0100 [thread overview]
Message-ID: <87muxjaasd.fsf@linaro.org> (raw)
In-Reply-To: <20180425012300.14698-9-richard.henderson@linaro.org>
Richard Henderson <richard.henderson@linaro.org> writes:
> We missed all of the scalar fp16 binary operations.
>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
> target/arm/translate-a64.c | 73 ++++++++++++++++++++++++++++++++++++++++++++++
> 1 file changed, 73 insertions(+)
>
> diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
> index 794ede7222..11b90b7eb0 100644
> --- a/target/arm/translate-a64.c
> +++ b/target/arm/translate-a64.c
> @@ -532,6 +532,14 @@ static TCGv_i32 read_fp_sreg(DisasContext *s, int reg)
> return v;
> }
>
> +static TCGv_i32 read_fp_hreg(DisasContext *s, int reg)
> +{
> + TCGv_i32 v = tcg_temp_new_i32();
> +
> + tcg_gen_ld16u_i32(v, cpu_env, fp_reg_offset(s, reg, MO_16));
> + return v;
> +}
> +
> /* Clear the bits above an N-bit vector, for N = (is_q ? 128 : 64).
> * If SVE is not enabled, then there are only 128 bits in the vector.
> */
> @@ -4968,6 +4976,61 @@ static void handle_fp_2src_double(DisasContext *s, int opcode,
> tcg_temp_free_i64(tcg_res);
> }
>
> +/* Floating-point data-processing (2 source) - half precision */
> +static void handle_fp_2src_half(DisasContext *s, int opcode,
> + int rd, int rn, int rm)
> +{
> + TCGv_i32 tcg_op1;
> + TCGv_i32 tcg_op2;
> + TCGv_i32 tcg_res;
> + TCGv_ptr fpst;
> +
> + tcg_res = tcg_temp_new_i32();
> + fpst = get_fpstatus_ptr(true);
> + tcg_op1 = read_fp_hreg(s, rn);
> + tcg_op2 = read_fp_hreg(s, rm);
> +
> + switch (opcode) {
> + case 0x0: /* FMUL */
> + gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x1: /* FDIV */
> + gen_helper_advsimd_divh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x2: /* FADD */
> + gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x3: /* FSUB */
> + gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x4: /* FMAX */
> + gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x5: /* FMIN */
> + gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x6: /* FMAXNM */
> + gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x7: /* FMINNM */
> + gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
> + break;
> + case 0x8: /* FNMUL */
> + gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
> + tcg_gen_xori_i32(tcg_res, tcg_res, 0x8000);
> + break;
> + default:
> + g_assert_not_reached();
> + }
> +
> + write_fp_sreg(s, rd, tcg_res);
If we are going to the trouble of adding a read_fp_hreg() we might as
well do the same for the write case. Then we can convert the various:
read_vec_element_i32(s, tcg_vm, rm, 0, MO_16);
that we used before.
> +
> + tcg_temp_free_ptr(fpst);
> + tcg_temp_free_i32(tcg_op1);
> + tcg_temp_free_i32(tcg_op2);
> + tcg_temp_free_i32(tcg_res);
> +}
> +
> /* Floating point data-processing (2 source)
> * 31 30 29 28 24 23 22 21 20 16 15 12 11 10 9 5 4 0
> * +---+---+---+-----------+------+---+------+--------+-----+------+------+
> @@ -5000,6 +5063,16 @@ static void disas_fp_2src(DisasContext *s, uint32_t insn)
> }
> handle_fp_2src_double(s, opcode, rd, rn, rm);
> break;
> + case 3:
> + if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
> + unallocated_encoding(s);
> + return;
> + }
> + if (!fp_access_check(s)) {
> + return;
> + }
> + handle_fp_2src_half(s, opcode, rd, rn, rm);
> + break;
> default:
> unallocated_encoding(s);
> }
Otherwise:
Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
--
Alex Bennée
next prev parent reply other threads:[~2018-05-01 11:13 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-04-25 1:22 [Qemu-devel] [PATCH 0/9] target/arm: Fixups for ARM_FEATURE_V8_FP16 Richard Henderson
2018-04-25 1:22 ` [Qemu-devel] [PATCH 1/9] target/arm: Implement vector shifted SCVF/UCVF for fp16 Richard Henderson
2018-04-27 16:04 ` Alex Bennée
2018-04-29 14:44 ` Richard Henderson
2018-04-29 15:27 ` Peter Maydell
2018-04-25 1:22 ` [Qemu-devel] [PATCH 2/9] target/arm: Implement vector shifted FCVT " Richard Henderson
2018-04-30 15:55 ` Alex Bennée
2018-04-25 1:22 ` [Qemu-devel] [PATCH 3/9] target/arm: Fix float16 to/from int16 Richard Henderson
2018-05-01 10:10 ` Alex Bennée
2018-04-25 1:22 ` [Qemu-devel] [PATCH 4/9] target/arm: Clear SVE high bits for FMOV Richard Henderson
2018-05-01 10:44 ` Alex Bennée
2018-04-25 1:22 ` [Qemu-devel] [PATCH 5/9] target/arm: Implement FMOV (general) for fp16 Richard Henderson
2018-04-25 1:31 ` Philippe Mathieu-Daudé
2018-04-25 8:40 ` Richard Henderson
2018-04-25 1:22 ` [Qemu-devel] [PATCH 6/9] target/arm: Implement FCVT (scalar, integer) " Richard Henderson
2018-05-01 10:55 ` Alex Bennée
2018-04-25 1:22 ` [Qemu-devel] [PATCH 7/9] target/arm: Implement FCVT (scalar, fixed-point) " Richard Henderson
2018-05-01 10:57 ` Alex Bennée
2018-04-25 1:22 ` [Qemu-devel] [PATCH 8/9] target/arm: Implement FP data-processing (2 source) " Richard Henderson
2018-05-01 11:13 ` Alex Bennée [this message]
2018-05-02 18:28 ` Richard Henderson
2018-05-02 18:47 ` Richard Henderson
2018-04-25 1:23 ` [Qemu-devel] [PATCH 9/9] target/arm: Implement FP data-processing (3 " Richard Henderson
2018-05-01 11:21 ` Alex Bennée
2018-05-02 18:49 ` Richard Henderson
2018-04-25 1:35 ` [Qemu-devel] [PATCH 0/9] target/arm: Fixups for ARM_FEATURE_V8_FP16 no-reply
2018-04-25 9:14 ` Alex Bennée
2018-04-27 17:22 ` Alex Bennée
2018-04-27 18:55 ` Alex Bennée
2018-04-27 19:50 ` Alex Bennée
2018-05-11 2:17 ` Richard Henderson
2018-05-11 21:13 ` Alex Bennée
2018-05-01 15:47 ` Alex Bennée
2018-05-01 18:35 ` Richard Henderson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87muxjaasd.fsf@linaro.org \
--to=alex.bennee@linaro.org \
--cc=peter.maydell@linaro.org \
--cc=qemu-devel@nongnu.org \
--cc=richard.henderson@linaro.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).