From: "Alex Bennée" <alex.bennee@linaro.org>
To: Richard Henderson <rth@twiddle.net>
Cc: qemu-devel@nongnu.org, aurelien@aurel32.net
Subject: Re: [Qemu-devel] [PATCH 01/16] tcg: Merge opcode arguments into TCGOp
Date: Mon, 26 Jun 2017 15:44:26 +0100 [thread overview]
Message-ID: <878tkeyfjp.fsf@linaro.org> (raw)
In-Reply-To: <20170621024831.26019-2-rth@twiddle.net>
Richard Henderson <rth@twiddle.net> writes:
> Rather than have a separate buffer of 10*max_ops entries,
> give each opcode 10 entries. The result is actually a bit
> smaller and should have slightly more cache locality.
>
> Signed-off-by: Richard Henderson <rth@twiddle.net>
> ---
<snip>
The changes look fine, some questions bellow:
> index 9e37722..720e04e 100644
> --- a/tcg/tcg.h
> +++ b/tcg/tcg.h
> @@ -51,8 +51,6 @@
> #define OPC_BUF_SIZE 640
> #define OPC_MAX_SIZE (OPC_BUF_SIZE - MAX_OP_PER_INSTR)
>
> -#define OPPARAM_BUF_SIZE (OPC_BUF_SIZE * MAX_OPC_PARAM)
> -
> #define CPU_TEMP_BUF_NLONGS 128
>
> /* Default target word size to pointer size. */
> @@ -613,33 +611,29 @@ typedef struct TCGTempSet {
> #define SYNC_ARG 1
> typedef uint16_t TCGLifeData;
>
> -/* The layout here is designed to avoid crossing of a 32-bit boundary.
> - If we do so, gcc adds padding, expanding the size to 12. */
> +/* The layout here is designed to avoid crossing of a 32-bit
> boundary. */
This isn't correct now? Do we mean we now aim to be cache line aligned?
> typedef struct TCGOp {
> TCGOpcode opc : 8; /* 8 */
>
> - /* Index of the prev/next op, or 0 for the end of the list. */
> - unsigned prev : 10; /* 18 */
> - unsigned next : 10; /* 28 */
> -
> /* The number of out and in parameter for a call. */
> - unsigned calli : 4; /* 32 */
> - unsigned callo : 2; /* 34 */
> + unsigned calli : 4; /* 12 */
> + unsigned callo : 2; /* 14 */
> + unsigned : 2; /* 16 */
>
> - /* Index of the arguments for this op, or 0 for zero-operand ops. */
> - unsigned args : 14; /* 48 */
> + /* Index of the prev/next op, or 0 for the end of the list. */
> + unsigned prev : 16; /* 32 */
> + unsigned next : 16; /* 48 */
>
> /* Lifetime data of the operands. */
> unsigned life : 16; /* 64 */
> +
> + /* Arguments for the opcode. */
> + TCGArg args[MAX_OPC_PARAM];
> } TCGOp;
>
> /* Make sure operands fit in the bitfields above. */
> QEMU_BUILD_BUG_ON(NB_OPS > (1 << 8));
> -QEMU_BUILD_BUG_ON(OPC_BUF_SIZE > (1 << 10));
> -QEMU_BUILD_BUG_ON(OPPARAM_BUF_SIZE > (1 << 14));
> -
> -/* Make sure that we don't overflow 64 bits without noticing. */
> -QEMU_BUILD_BUG_ON(sizeof(TCGOp) > 8);
> +QEMU_BUILD_BUG_ON(OPC_BUF_SIZE > (1 << 16));
OPC_BUF_SIZE is statically assigned, we don't seem to be taking notice
of sizeof(TCGOp) anymore here. In fact OPC_BUF_SIZE is really
MAX_TCG_OPS right?
I see TCGArg is currently target_ulong. Is this because we never leak
the host size details into generated code safe for the statically
assigned env_ptr?
I mention this because in looking at modelling SIMD registers I'm going
to need to carry a host ptr around in TCG registers that can be passed
to helpers and the like.
>
> struct TCGContext {
> uint8_t *pool_cur, *pool_end;
> @@ -691,7 +685,6 @@ struct TCGContext {
> #endif
>
> int gen_next_op_idx;
> - int gen_next_parm_idx;
>
> /* Code generation. Note that we specifically do not use tcg_insn_unit
> here, because there's too much arithmetic throughout that relies
> @@ -723,7 +716,6 @@ struct TCGContext {
> TCGTemp *reg_to_temp[TCG_TARGET_NB_REGS];
>
> TCGOp gen_op_buf[OPC_BUF_SIZE];
> - TCGArg gen_opparam_buf[OPPARAM_BUF_SIZE];
>
> uint16_t gen_insn_end_off[TCG_MAX_INSNS];
> target_ulong gen_insn_data[TCG_MAX_INSNS][TARGET_INSN_START_WORDS];
> @@ -734,8 +726,7 @@ extern bool parallel_cpus;
>
> static inline void tcg_set_insn_param(int op_idx, int arg, TCGArg v)
> {
> - int op_argi = tcg_ctx.gen_op_buf[op_idx].args;
> - tcg_ctx.gen_opparam_buf[op_argi + arg] = v;
> + tcg_ctx.gen_op_buf[op_idx].args[arg] = v;
> }
>
> /* The number of opcodes emitted so far. */
--
Alex Bennée
next prev parent reply other threads:[~2017-06-26 14:43 UTC|newest]
Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-06-21 2:48 [Qemu-devel] [PATCH 00/16] Cleanups within TCG middle-end Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 01/16] tcg: Merge opcode arguments into TCGOp Richard Henderson
2017-06-26 14:44 ` Alex Bennée [this message]
2017-06-26 14:55 ` Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 02/16] tcg: Propagate args to op->args in optimizer Richard Henderson
2017-06-26 14:53 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 03/16] tcg: Propagate args to op->args in tcg.c Richard Henderson
2017-06-26 15:02 ` Alex Bennée
2017-06-26 15:07 ` Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 04/16] tcg: Propagate TCGOp down to allocators Richard Henderson
2017-06-26 15:08 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 05/16] tcg: Introduce arg_temp Richard Henderson
2017-06-26 16:37 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 06/16] tcg: Add temp_global bit to TCGTemp Richard Henderson
2017-06-27 8:39 ` Alex Bennée
2017-06-27 16:17 ` Richard Henderson
2017-06-28 8:52 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 07/16] tcg: Return NULL temp for TCG_CALL_DUMMY_ARG Richard Henderson
2017-06-27 8:47 ` Alex Bennée
2017-06-27 16:36 ` Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 08/16] tcg: Introduce temp_arg Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 09/16] tcg: Use per-temp state data in liveness Richard Henderson
2017-06-27 8:57 ` Alex Bennée
2017-06-27 16:39 ` Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 10/16] tcg: Avoid loops against variable bounds Richard Henderson
2017-06-27 9:01 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 11/16] tcg: Change temp_allocate_frame arg to TCGTemp Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 12/16] tcg: Remove unused TCG_CALL_DUMMY_TCGV Richard Henderson
2017-06-27 9:42 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 13/16] tcg: Export temp_idx Richard Henderson
2017-06-27 9:46 ` Alex Bennée
2017-06-27 16:43 ` Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 14/16] tcg: Use per-temp state data in optimize Richard Henderson
2017-06-27 9:59 ` Alex Bennée
2017-06-21 2:48 ` [Qemu-devel] [PATCH 15/16] tcg: Define separate structures for TCGv_* Richard Henderson
2017-06-21 2:48 ` [Qemu-devel] [PATCH 16/16] tcg: Store pointers to temporaries directly in TCGArg Richard Henderson
2017-06-21 3:43 ` [Qemu-devel] [PATCH 00/16] Cleanups within TCG middle-end no-reply
2017-06-26 16:49 ` Alex Bennée
2017-06-26 17:47 ` Richard Henderson
2017-06-26 19:19 ` Alex Bennée
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=878tkeyfjp.fsf@linaro.org \
--to=alex.bennee@linaro.org \
--cc=aurelien@aurel32.net \
--cc=qemu-devel@nongnu.org \
--cc=rth@twiddle.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).