qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v4 00/11] Hexagon (target/hexagon) performance and bug fixes
@ 2022-11-08  4:05 Taylor Simpson
  2022-11-08  4:05 ` [PATCH v4 01/11] Hexagon (target/hexagon) Add pkt and insn to DisasContext Taylor Simpson
                   ` (10 more replies)
  0 siblings, 11 replies; 19+ messages in thread
From: Taylor Simpson @ 2022-11-08  4:05 UTC (permalink / raw)
  To: qemu-devel
  Cc: tsimpson, richard.henderson, philmd, ale, anjo, bcain,
	quic_mathbern

1)
Performance improvement
Add pkt and insn to DisasContext
Many functions need information from all 3 structures, so merge
them together.

2)
Bug fix
Fix predicated assignment to .tmp and .cur


3)
Performance improvement
Add overrides for S2_asr_r_r_sat/S2_asl_r_r_sat
These functions will not be handled by idef-parser

4-11)
The final 8 patches improve change-of-flow handling.

Currently, we set the PC to a new address before exiting a TB.  The
ultimate goal is to use direct block chaining.  However, several steps
are needed along the way.

4)
When a packet has more than one change-of-flow (COF) instruction, only
the first one taken is considered.  The runtime bookkeeping is only
needed when there is more than one COF instruction in a packet.

5, 6)
Remove PC and next_PC from the runtime state and always use a
translation-time constant.  Note that next_PC is used by call instructions
to set LR and by conditional COF instructions to set the fall-through
address.

7, 8, 9)
Add helper overrides for COF instructions.  In particular, we must
distinguish those that use a PC-relative address for the destination.
These are candidates for direct block chaining later.

10)
Use direct block chaining for packets that have a single PC-relative
COF instruction.  Instead of generating the code while processing the
instruction, we record the effect in DisasContext and generate the code
during gen_end_tb.

11)
Use direct block chaining for tight loops.  We look for TBs that end
with an endloop0 that will branch back to the TB start address.


**** Changes in V4 ****
Address feedback from Richard Henderson <richard.henderson@linaro.org>
    Rewrite gen_sat_i64 to be branchless
    Rewrite gen_sar to be branchless
    Rewrite gen_shl_sat to be branchless
    Pass branch condition instead of doing xor when false
    Use hex_branch_taken to hold the predicate for direct branches
    Remove HexStateFlags and use "hw/registerfields.h" macros instead

**** Changes in V3 ****
Merge previously emailed patches into single series
Merge functions that check if vreg is preloaded

**** Changes in V2 ****
Update test case to use both true and false predicates
Add fix for .cur
Simplify test in need_pkt_has_multi_cof
Address feedback from Matheus Tavares Bernardino <quic_mathbern@quicinc.com>
    Rearrange new-value-jump overrides
    Simplify gen_write_new_pc_addr


Taylor Simpson (11):
  Hexagon (target/hexagon) Add pkt and insn to DisasContext
  Hexagon (target/hexagon) Fix predicated assignment to .tmp and .cur
  Hexagon (target/hexagon) Add overrides for
    S2_asr_r_r_sat/S2_asl_r_r_sat
  Hexagon (target/hexagon) Only use branch_taken when packet has multi
    cof
  Hexagon (target/hexagon) Remove PC from the runtime state
  Hexagon (target/hexagon) Remove next_PC from runtime state
  Hexagon (target/hexagon) Add overrides for direct call instructions
  Hexagon (target/hexagon) Add overrides for compound compare and jump
  Hexagon (target/hexagon) Add overrides for various forms of jump
  Hexagon (target/hexagon) Use direct block chaining for direct
    jump/branch
  Hexagon (target/hexagon) Use direct block chaining for tight loops

 target/hexagon/cpu.h                |  14 +-
 target/hexagon/gen_tcg.h            | 414 +++++++++++++++++++++++++++-
 target/hexagon/gen_tcg_hvx.h        |   6 +-
 target/hexagon/insn.h               |   9 +-
 target/hexagon/macros.h             |  16 +-
 target/hexagon/mmvec/macros.h       |   4 +-
 target/hexagon/translate.h          |  20 +-
 target/hexagon/decode.c             |  15 +-
 target/hexagon/genptr.c             | 392 +++++++++++++++++++++++++-
 target/hexagon/op_helper.c          |  28 +-
 target/hexagon/translate.c          | 231 +++++++++++-----
 tests/tcg/hexagon/hvx_misc.c        |  72 +++++
 tests/tcg/hexagon/usr.c             |  30 +-
 target/hexagon/gen_helper_funcs.py  |  13 +-
 target/hexagon/gen_helper_protos.py |  14 +-
 target/hexagon/gen_tcg_funcs.py     |  38 ++-
 target/hexagon/hex_common.py        |  29 +-
 17 files changed, 1207 insertions(+), 138 deletions(-)

-- 
2.17.1


^ permalink raw reply	[flat|nested] 19+ messages in thread

end of thread, other threads:[~2022-11-09 15:36 UTC | newest]

Thread overview: 19+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-11-08  4:05 [PATCH v4 00/11] Hexagon (target/hexagon) performance and bug fixes Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 01/11] Hexagon (target/hexagon) Add pkt and insn to DisasContext Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 02/11] Hexagon (target/hexagon) Fix predicated assignment to .tmp and .cur Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 03/11] Hexagon (target/hexagon) Add overrides for S2_asr_r_r_sat/S2_asl_r_r_sat Taylor Simpson
2022-11-08  7:19   ` Richard Henderson
2022-11-08 15:41     ` Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 04/11] Hexagon (target/hexagon) Only use branch_taken when packet has multi cof Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 05/11] Hexagon (target/hexagon) Remove PC from the runtime state Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 06/11] Hexagon (target/hexagon) Remove next_PC from " Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 07/11] Hexagon (target/hexagon) Add overrides for direct call instructions Taylor Simpson
2022-11-08  7:20   ` Richard Henderson
2022-11-08  4:05 ` [PATCH v4 08/11] Hexagon (target/hexagon) Add overrides for compound compare and jump Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 09/11] Hexagon (target/hexagon) Add overrides for various forms of jump Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 10/11] Hexagon (target/hexagon) Use direct block chaining for direct jump/branch Taylor Simpson
2022-11-08  7:24   ` Richard Henderson
2022-11-08 15:41     ` Taylor Simpson
2022-11-09  1:41       ` Richard Henderson
2022-11-09 15:35         ` Taylor Simpson
2022-11-08  4:05 ` [PATCH v4 11/11] Hexagon (target/hexagon) Use direct block chaining for tight loops Taylor Simpson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).