From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:45088)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <rth7680@gmail.com>) id 1YYKF5-0006sr-PJ
	for qemu-devel@nongnu.org; Wed, 18 Mar 2015 16:07:08 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <rth7680@gmail.com>) id 1YYKEz-0007KK-To
	for qemu-devel@nongnu.org; Wed, 18 Mar 2015 16:07:03 -0400
Received: from mail-qc0-x22f.google.com ([2607:f8b0:400d:c01::22f]:34008)
	by eggs.gnu.org with esmtp (Exim 4.71)
	(envelope-from <rth7680@gmail.com>) id 1YYKEz-0007Jo-PP
	for qemu-devel@nongnu.org; Wed, 18 Mar 2015 16:06:57 -0400
Received: by qcaz10 with SMTP id z10so48835484qca.1
	for <qemu-devel@nongnu.org>; Wed, 18 Mar 2015 13:06:56 -0700 (PDT)
Sender: Richard Henderson <rth7680@gmail.com>
Message-ID: <5509DADC.500@twiddle.net>
Date: Wed, 18 Mar 2015 13:06:52 -0700
From: Richard Henderson <rth@twiddle.net>
MIME-Version: 1.0
References: <5509A8CE.2040905@hotmail.com>
	<BLU436-SMTP11894BE395C33A1748732BB9000@phx.gbl>
In-Reply-To: <BLU436-SMTP11894BE395C33A1748732BB9000@phx.gbl>
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
Subject: Re: [Qemu-devel] [PATCH 1/6 v6] target-tilegx: Firstly add TILE-Gx
 with minimized features
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
To: Chen Gang <xili_gchen_5257@hotmail.com>, Peter Maydell <peter.maydell@linaro.org>, Chris Metcalf <cmetcalf@ezchip.com>, =?UTF-8?B?QW5kcmVhcyBGw6RyYmVy?= <afaerber@suse.de>, Riku Voipio <riku.voipio@iki.fi>, "walt@tilera.com" <walt@tilera.com>
Cc: qemu-devel <qemu-devel@nongnu.org>

On 03/18/2015 09:34 AM, Chen Gang wrote:
> +static void gen_fnop(void)
> +{
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "(f)nop\n");
> +}
> +
> +static void gen_cmpltui(struct DisasContext *dc,
> +                        uint8_t rdst, uint8_t rsrc, int8_t imm8)
> +{
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "cmpltui r%d, r%d, %d\n",
> +                  rdst, rsrc, imm8);
> +    tcg_gen_setcondi_i64(TCG_COND_LTU, dest_gr(dc, rdst), load_gr(dc, rsrc),
> +                        (uint64_t)imm8);
> +}

Wow, this is a lot more than before.  Good progress.

> +/*
> + * The related functional description for bfextu in isa document:
> + *
> + * uint64_t mask = 0;
> + * mask = (-1ULL) ^ ((-1ULL << ((BFEnd - BFStart) & 63)) << 1);
> + * uint64_t rot_src = (((uint64_t) rf[SrcA]) >> BFStart)
> + *                    | (rf[SrcA] << (64 - BFStart));
> + * rf[Dest] = rot_src & mask;
> + */
> +static void gen_bfextu(struct DisasContext *dc, uint8_t rdst, uint8_t rsrc,
> +                       int8_t start, int8_t end)
> +{
> +    TCGv mask = tcg_temp_new_i64();
> +    TCGv tmp = dest_gr(dc, rdst);
> +
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "bfextu r%d, r%d, %d, %d\n",
> +                  rdst, rsrc, start, end);
> +
> +    tcg_gen_movi_i64(tmp, -1ULL);
> +    tcg_gen_movi_i64(mask, end);
> +
> +    tcg_gen_subi_i64(mask, mask, start);
> +    tcg_gen_andi_i64(mask, mask, 63);
> +    tcg_gen_shl_i64(mask, tmp, mask);
> +    tcg_gen_shli_i64(mask, mask, 1);
> +    tcg_gen_xori_i64(mask, mask, -1ULL);

This computation of MASK is only dependent on START and END, which are known at
translation time.  Thus you should perform this computation at translation time
here in C, rather than defer the computation with TCG opcodes.

> +    tcg_gen_rotli_i64(tmp, load_gr(dc, rsrc), start);
> +    tcg_gen_and_i64(tmp, tmp, mask);

Which then makes this tcg_gen_andi_i64.

> +static void gen_mulx(struct DisasContext *dc,
> +                     uint8_t rdst, uint8_t rsrc, uint8_t rsrcb)
> +{
> +    TCGv tmp = tcg_temp_new_i64();
> +    TCGv vdst = dest_gr(dc, rdst);
> +
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "mulx r%d, r%d, r%d\n", rdst, rsrc, rsrcb);
> +
> +    tcg_gen_ext32s_i64(vdst, load_gr(dc, rsrc));
> +    tcg_gen_ext32s_i64(tmp, load_gr(dc, rsrcb));
> +
> +    tcg_gen_mul_i64(tmp, vdst, tmp);
> +    tcg_gen_ext32s_i64(vdst, tmp);

Note that you don't need the extensions prior to the MUL.  The high 32-bits of
the inputs don't affect the low 32-bits of the product.

> +static void gen_shl16insli(struct DisasContext *dc,
> +                           uint8_t rdst, uint8_t rsrc, uint16_t uimm16)
> +{
> +    TCGv vdst = dest_gr(dc, rdst);
> +
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "shl16insli r%d, r%d, %llx\n",
> +                  rdst, rsrc, (long long)uimm16);

Do not cast to long long.  Print with just %x.

> +static int gen_beqz(struct DisasContext *dc, uint8_t rsrc, int32_t off)
> +{
> +    qemu_log_mask(CPU_LOG_TB_IN_ASM, "beqz(t) r%d, %d\n", rsrc, off);
> +
> +    dc->jmp.dest = tcg_temp_new_i64();
> +    dc->jmp.val1 = tcg_temp_new_i64();
> +    dc->jmp.val2 = tcg_temp_new_i64();
> +
> +    dc->jmp.cond = TCG_COND_EQ;
> +    tcg_gen_movi_i64(dc->jmp.dest,
> +                     dc->pc + (int64_t)off * TILEGX_BUNDLE_SIZE_IN_BYTES);

It will be helpful for the disassembly if you print the destination of the
branch rather than the offset.

> +    qemu_log_mask(LOG_UNIMP, "UNIMP rrr_1_opcode_y0, %16.16llx\n", bundle);

You shouldn't use "llx".  You need to use PRIx64 instead.  This will enable
proper operation on Windows.

This patch set is just about ready for approval (though I don't imagine it will
be merged until the qemu 2.4 development cycle).


r~