All of lore.kernel.org
 help / color / mirror / Atom feed
From: Aurelien Jarno <aurelien@aurel32.net>
To: Richard Henderson <rth@twiddle.net>
Cc: qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH v5 08/19] tcg-arm: Improve constant generation
Date: Mon, 22 Apr 2013 11:07:08 +0200	[thread overview]
Message-ID: <20130422090708.GE16361@ohm.aurel32.net> (raw)
In-Reply-To: <1364769305-3687-9-git-send-email-rth@twiddle.net>

On Sun, Mar 31, 2013 at 03:34:54PM -0700, Richard Henderson wrote:
> Try fully rotated arguments to mov and mvn before trying movt
> or full decomposition.  Begin decomposition with mvn when it
> looks like it'll help.  Examples include
> 
> -:        mov   r9, #0x00000fa0
> -:        orr   r9, r9, #0x000ee000
> -:        orr   r9, r9, #0x0ff00000
> -:        orr   r9, r9, #0xf0000000
> +:        mvn   r9, #0x0000005f
> +:        eor   r9, r9, #0x00011000
> 
> Signed-off-by: Richard Henderson <rth@twiddle.net>
> ---
>  tcg/arm/tcg-target.c | 67 ++++++++++++++++++++++++++++++++++------------------
>  1 file changed, 44 insertions(+), 23 deletions(-)
> 
> diff --git a/tcg/arm/tcg-target.c b/tcg/arm/tcg-target.c
> index 9e8c97c..1f38795 100644
> --- a/tcg/arm/tcg-target.c
> +++ b/tcg/arm/tcg-target.c
> @@ -427,15 +427,31 @@ static inline void tcg_out_dat_imm(TCGContext *s,
>                      (rn << 16) | (rd << 12) | im);
>  }
>  
> -static inline void tcg_out_movi32(TCGContext *s,
> -                int cond, int rd, uint32_t arg)
> -{
> -    /* TODO: This is very suboptimal, we can easily have a constant
> -     * pool somewhere after all the instructions.  */
> -    if ((int)arg < 0 && (int)arg >= -0x100) {
> -        tcg_out_dat_imm(s, cond, ARITH_MVN, rd, 0, (~arg) & 0xff);
> -    } else if (use_armv7_instructions) {
> -        /* use movw/movt */
> +static void tcg_out_movi32(TCGContext *s, int cond, int rd, uint32_t arg)
> +{
> +    int rot, opc, rn;
> +
> +    /* For armv7, make sure not to use movw+movt when mov/mvn would do.
> +       Speed things up by only checking when movt would be required.
> +       Prior to armv7, have one go at fully rotated immediates before
> +       doing the decomposition thing below.  */
> +    if (!use_armv7_instructions || (arg & 0xffff0000)) {
> +        rot = encode_imm(arg);
> +        if (rot >= 0) {
> +            tcg_out_dat_imm(s, cond, ARITH_MOV, rd, 0,
> +                            rotl(arg, rot) | (rot << 7));
> +            return;
> +        }
> +        rot = encode_imm(~arg);
> +        if (rot >= 0) {
> +            tcg_out_dat_imm(s, cond, ARITH_MVN, rd, 0,
> +                            rotl(~arg, rot) | (rot << 7));
> +            return;
> +        }
> +    }
> +
> +    /* Use movw + movt.  */
> +    if (use_armv7_instructions) {
>          /* movw */
>          tcg_out32(s, (cond << 28) | 0x03000000 | (rd << 12)
>                    | ((arg << 4) & 0x000f0000) | (arg & 0xfff));
> @@ -444,22 +460,27 @@ static inline void tcg_out_movi32(TCGContext *s,
>              tcg_out32(s, (cond << 28) | 0x03400000 | (rd << 12)
>                        | ((arg >> 12) & 0x000f0000) | ((arg >> 16) & 0xfff));
>          }
> -    } else {
> -        int opc = ARITH_MOV;
> -        int rn = 0;
> -
> -        do {
> -            int i, rot;
> -
> -            i = ctz32(arg) & ~1;
> -            rot = ((32 - i) << 7) & 0xf00;
> -            tcg_out_dat_imm(s, cond, opc, rd, rn, ((arg >> i) & 0xff) | rot);
> -            arg &= ~(0xff << i);
> +        return;
> +    }
>  
> -            opc = ARITH_ORR;
> -            rn = rd;
> -        } while (arg);
> +    /* TODO: This is very suboptimal, we can easily have a constant
> +       pool somewhere after all the instructions.  */
> +    opc = ARITH_MOV;
> +    rn = 0;
> +    /* If we have lots of leading 1's, we can shorten the sequence by
> +       beginning with mvn and then clearing higher bits with eor.  */
> +    if (clz32(~arg) > clz32(arg)) {
> +        opc = ARITH_MVN, arg = ~arg;
>      }
> +    do {
> +        int i = ctz32(arg) & ~1;
> +        rot = ((32 - i) << 7) & 0xf00;
> +        tcg_out_dat_imm(s, cond, opc, rd, rn, ((arg >> i) & 0xff) | rot);
> +        arg &= ~(0xff << i);
> +
> +        opc = ARITH_EOR;
> +        rn = rd;
> +    } while (arg);
>  }
>  
>  static inline void tcg_out_dat_rI(TCGContext *s, int cond, int opc, TCGArg dst,

Reviewed-by: Aurelien Jarno <aurelien@aurel32.net>

-- 
Aurelien Jarno                          GPG: 1024D/F1BCDB73
aurelien@aurel32.net                 http://www.aurel32.net

  reply	other threads:[~2013-04-22  9:07 UTC|newest]

Thread overview: 44+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-03-31 22:34 [Qemu-devel] [PATCH v5 00/19] tcg-arm improvements Richard Henderson
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 01/19] tcg-arm: Fix local stack frame Richard Henderson
2013-04-21 10:22   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 02/19] tcg: Log the contents of the prologue with -d out_asm Richard Henderson
2013-04-21 10:22   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 03/19] tcg-arm: Use bic to implement and with constant Richard Henderson
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 04/19] tcg-arm: Handle negated constant arguments to and/sub Richard Henderson
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 05/19] tcg-arm: Allow constant first argument to sub Richard Henderson
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 06/19] tcg-arm: Use tcg_out_dat_rIN for compares Richard Henderson
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 07/19] tcg-arm: Handle constant arguments to add2/sub2 Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 08/19] tcg-arm: Improve constant generation Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno [this message]
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 09/19] tcg-arm: Implement deposit for armv7 Richard Henderson
2013-04-21 10:35   ` Aurelien Jarno
2013-04-21 16:58     ` Richard Henderson
2013-04-22  9:08       ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 10/19] tcg-arm: Implement division instructions Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 11/19] tcg-arm: Use TCG_REG_TMP name for the tcg temporary Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 12/19] tcg-arm: Use R12 " Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno
2013-03-31 22:34 ` [Qemu-devel] [PATCH v5 13/19] tcg-arm: Cleanup multiply subroutines Richard Henderson
2013-04-22  9:07   ` Aurelien Jarno
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 14/19] tcg-arm: Cleanup most primitive load store subroutines Richard Henderson
2013-04-22  9:53   ` Aurelien Jarno
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 15/19] tcg-arm: Split out tcg_out_tlb_read Richard Henderson
2013-04-22  9:54   ` Aurelien Jarno
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 16/19] tcg-arm: Improve scheduling of tcg_out_tlb_read Richard Henderson
2013-04-22  9:55   ` Aurelien Jarno
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 17/19] tcg-arm: Use movi32 + blx for calls on v7 Richard Henderson
2013-04-22  9:55   ` Aurelien Jarno
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 18/19] tcg-arm: Convert to CONFIG_QEMU_LDST_OPTIMIZATION Richard Henderson
2013-04-22 12:59   ` Aurelien Jarno
2013-04-22 14:39     ` Richard Henderson
2013-04-23  6:44       ` Aurelien Jarno
2013-04-23  8:13         ` Richard Henderson
2013-04-23  8:18           ` Aurelien Jarno
2013-04-23  8:48             ` Richard Henderson
2013-03-31 22:35 ` [Qemu-devel] [PATCH v5 19/19] tcg-arm: Tidy exit_tb Richard Henderson
2013-04-22 13:00   ` Aurelien Jarno
2013-04-09 11:37 ` [Qemu-devel] [PATCH v5 00/19] tcg-arm improvements Richard Henderson
2013-04-17 14:04   ` Richard Henderson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20130422090708.GE16361@ohm.aurel32.net \
    --to=aurelien@aurel32.net \
    --cc=qemu-devel@nongnu.org \
    --cc=rth@twiddle.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.