All of lore.kernel.org
 help / color / mirror / Atom feed
From: Gerd Hoffmann <kraxel@redhat.com>
To: Paolo Bonzini <pbonzini@redhat.com>
Cc: Wenchao Xia <xiawenc@linux.vnet.ibm.com>, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH 1.3] build: compile translate.o at -O1 optimization
Date: Tue, 27 Nov 2012 13:09:10 +0100	[thread overview]
Message-ID: <50B4AD66.9030106@redhat.com> (raw)
In-Reply-To: <50B489C7.7080608@redhat.com>

[-- Attachment #1: Type: text/plain, Size: 367 bytes --]

  Hi,

> Note that there is probably one or two GCC options that can be
> fine-tuned to avoid the explosion instead of just -O1 vs. -O2.  If you
> have an affected machine (F18) you can help by compiling translate.c
> with -O2 -ftime-report.  I planned to do this today, but I first need to
> install a F18 virtual machine.

[x] done, see attachment

cheers,
  Gerd


[-- Attachment #2: translate-times.txt --]
[-- Type: text/plain, Size: 11683 bytes --]

  CC    i386-softmmu/target-i386/translate.o

Execution times (seconds)
 phase setup             :   0.17 (100%) usr   0.06 (100%) sys   1.20 (98%) wall    2527 kB (99%) ggc
 phase finalize          :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 2%) wall       0 kB ( 0%) ggc
 TOTAL                 :   0.17             0.06             1.23               2543 kB

Execution times (seconds)
 phase setup             :   0.00 ( 0%) usr   0.01 ( 0%) sys   0.11 ( 0%) wall    1077 kB ( 0%) ggc
 phase parsing           :   0.22 ( 1%) usr   0.41 ( 4%) sys   0.71 ( 1%) wall   13597 kB ( 1%) ggc
 phase cgraph            :  39.98 (99%) usr   9.43 (96%) sys 122.41 (99%) wall 1680634 kB (99%) ggc
 phase generate          :  39.98 (99%) usr   9.43 (96%) sys 122.42 (99%) wall 1680635 kB (99%) ggc
 phase finalize          :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall       0 kB ( 0%) ggc
 garbage collection      :   0.31 ( 1%) usr   0.37 ( 4%) sys  29.45 (24%) wall       0 kB ( 0%) ggc
 callgraph construction  :   0.05 ( 0%) usr   0.04 ( 0%) sys   0.05 ( 0%) wall    6753 kB ( 0%) ggc
 callgraph optimization  :   0.00 ( 0%) usr   0.01 ( 0%) sys   0.04 ( 0%) wall    1401 kB ( 0%) ggc
 varpool construction    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     282 kB ( 0%) ggc
 ipa cp                  :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     384 kB ( 0%) ggc
 ipa reference           :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 ipa pure const          :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 ipa SRA                 :   0.01 ( 0%) usr   0.01 ( 0%) sys   0.06 ( 0%) wall    3613 kB ( 0%) ggc
 ipa free lang data      :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall       0 kB ( 0%) ggc
 cfg construction        :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     488 kB ( 0%) ggc
 cfg cleanup             :   0.36 ( 1%) usr   0.01 ( 0%) sys   0.74 ( 1%) wall    1440 kB ( 0%) ggc
 trivially dead code     :   0.17 ( 0%) usr   0.00 ( 0%) sys   0.17 ( 0%) wall       0 kB ( 0%) ggc
 df scan insns           :   0.09 ( 0%) usr   0.01 ( 0%) sys   0.08 ( 0%) wall      38 kB ( 0%) ggc
 df multiple defs        :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall       0 kB ( 0%) ggc
 df reaching defs        :   0.10 ( 0%) usr   0.01 ( 0%) sys   0.30 ( 0%) wall       0 kB ( 0%) ggc
 df live regs            :   0.97 ( 2%) usr   0.06 ( 1%) sys   2.96 ( 2%) wall       0 kB ( 0%) ggc
 df live&initialized regs:   0.46 ( 1%) usr   0.02 ( 0%) sys   0.69 ( 1%) wall       0 kB ( 0%) ggc
 df use-def / def-use chains:   0.06 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall       0 kB ( 0%) ggc
 df reg dead/unused notes:   0.31 ( 1%) usr   0.02 ( 0%) sys   0.43 ( 0%) wall    2796 kB ( 0%) ggc
 register information    :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall       0 kB ( 0%) ggc
 alias analysis          :   0.22 ( 1%) usr   0.02 ( 0%) sys   0.34 ( 0%) wall    7380 kB ( 0%) ggc
 alias stmt walking      :   0.14 ( 0%) usr   0.07 ( 1%) sys   0.27 ( 0%) wall    2333 kB ( 0%) ggc
 register scan           :   0.03 ( 0%) usr   0.01 ( 0%) sys   0.03 ( 0%) wall       3 kB ( 0%) ggc
 rebuild jump labels     :   0.10 ( 0%) usr   0.00 ( 0%) sys   0.09 ( 0%) wall       0 kB ( 0%) ggc
 preprocessing           :   0.07 ( 0%) usr   0.15 ( 2%) sys   0.16 ( 0%) wall    1496 kB ( 0%) ggc
 lexical analysis        :   0.01 ( 0%) usr   0.13 ( 1%) sys   0.16 ( 0%) wall       0 kB ( 0%) ggc
 parser (global)         :   0.03 ( 0%) usr   0.04 ( 0%) sys   0.11 ( 0%) wall    6890 kB ( 0%) ggc
 parser struct body      :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall     486 kB ( 0%) ggc
 parser enumerator list  :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     352 kB ( 0%) ggc
 parser function body    :   0.04 ( 0%) usr   0.02 ( 0%) sys   0.10 ( 0%) wall    1263 kB ( 0%) ggc
 parser inl. func. body  :   0.07 ( 0%) usr   0.07 ( 1%) sys   0.13 ( 0%) wall    3107 kB ( 0%) ggc
 inline heuristics       :   0.11 ( 0%) usr   0.02 ( 0%) sys   0.16 ( 0%) wall    3669 kB ( 0%) ggc
 integration             :   0.34 ( 1%) usr   0.31 ( 3%) sys   0.56 ( 0%) wall   41530 kB ( 2%) ggc
 tree gimplify           :   0.04 ( 0%) usr   0.02 ( 0%) sys   0.15 ( 0%) wall    3964 kB ( 0%) ggc
 tree eh                 :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       2 kB ( 0%) ggc
 tree CFG construction   :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall    1582 kB ( 0%) ggc
 tree CFG cleanup        :   0.14 ( 0%) usr   0.01 ( 0%) sys   0.05 ( 0%) wall     456 kB ( 0%) ggc
 tree tail merge         :   0.04 ( 0%) usr   0.01 ( 0%) sys   0.03 ( 0%) wall       6 kB ( 0%) ggc
 tree VRP                :   0.31 ( 1%) usr   0.02 ( 0%) sys   0.40 ( 0%) wall    6708 kB ( 0%) ggc
 tree copy propagation   :   0.12 ( 0%) usr   0.01 ( 0%) sys   0.18 ( 0%) wall     937 kB ( 0%) ggc
 tree find ref. vars     :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     127 kB ( 0%) ggc
 tree PTA                :   0.52 ( 1%) usr   0.28 ( 3%) sys   0.86 ( 1%) wall    2343 kB ( 0%) ggc
 tree SSA rewrite        :   0.05 ( 0%) usr   0.03 ( 0%) sys   0.14 ( 0%) wall    5755 kB ( 0%) ggc
 tree SSA other          :   0.01 ( 0%) usr   0.01 ( 0%) sys   0.02 ( 0%) wall     127 kB ( 0%) ggc
 tree SSA incremental    :   0.13 ( 0%) usr   0.02 ( 0%) sys   0.12 ( 0%) wall     680 kB ( 0%) ggc
 tree operand scan       :   0.19 ( 0%) usr   0.13 ( 1%) sys   0.37 ( 0%) wall   22560 kB ( 1%) ggc
 dominator optimization  :   0.12 ( 0%) usr   0.01 ( 0%) sys   0.14 ( 0%) wall    7169 kB ( 0%) ggc
 tree CCP                :   0.17 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall     670 kB ( 0%) ggc
 tree PHI const/copy prop:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree reassociation      :   0.03 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     155 kB ( 0%) ggc
 tree PRE                :   0.18 ( 0%) usr   0.04 ( 0%) sys   0.26 ( 0%) wall    5335 kB ( 0%) ggc
 tree FRE                :   0.42 ( 1%) usr   0.10 ( 1%) sys   0.51 ( 0%) wall   12668 kB ( 1%) ggc
 tree code sinking       :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      77 kB ( 0%) ggc
 tree linearize phis     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       2 kB ( 0%) ggc
 tree forward propagate  :   0.06 ( 0%) usr   0.02 ( 0%) sys   0.04 ( 0%) wall    2240 kB ( 0%) ggc
 tree conservative DCE   :   0.08 ( 0%) usr   0.05 ( 1%) sys   0.12 ( 0%) wall       4 kB ( 0%) ggc
 tree aggressive DCE     :   0.04 ( 0%) usr   0.03 ( 0%) sys   0.10 ( 0%) wall    1656 kB ( 0%) ggc
 tree buildin call DCE   :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 tree DSE                :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall       0 kB ( 0%) ggc
 tree loop invariant motion:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       1 kB ( 0%) ggc
 complete unrolling      :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     182 kB ( 0%) ggc
 tree iv optimization    :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     105 kB ( 0%) ggc
 tree loop init          :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall     149 kB ( 0%) ggc
 tree SSA uncprop        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 tree rename SSA copies  :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall       0 kB ( 0%) ggc
 tree STMT verifier      :   0.00 ( 0%) usr   0.01 ( 0%) sys   0.00 ( 0%) wall       0 kB ( 0%) ggc
 dominance computation   :   0.11 ( 0%) usr   0.00 ( 0%) sys   0.10 ( 0%) wall       0 kB ( 0%) ggc
 out of ssa              :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall      35 kB ( 0%) ggc
 expand vars             :   0.07 ( 0%) usr   0.01 ( 0%) sys   0.07 ( 0%) wall    3675 kB ( 0%) ggc
 expand                  :   0.26 ( 1%) usr   0.07 ( 1%) sys   0.32 ( 0%) wall   23890 kB ( 1%) ggc
 post expand cleanups    :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall    1633 kB ( 0%) ggc
 varconst                :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 jump                    :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 forward prop            :   0.13 ( 0%) usr   0.03 ( 0%) sys   0.16 ( 0%) wall    2816 kB ( 0%) ggc
 CSE                     :   0.38 ( 1%) usr   0.03 ( 0%) sys   0.62 ( 1%) wall    2433 kB ( 0%) ggc
 dead code elimination   :   0.08 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall       0 kB ( 0%) ggc
 dead store elim1        :   0.15 ( 0%) usr   0.01 ( 0%) sys   0.19 ( 0%) wall    2431 kB ( 0%) ggc
 dead store elim2        :   0.12 ( 0%) usr   0.00 ( 0%) sys   0.15 ( 0%) wall    1876 kB ( 0%) ggc
 loop analysis           :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      80 kB ( 0%) ggc
 loop invariant motion   :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.16 ( 0%) wall       1 kB ( 0%) ggc
 CPROP                   :   0.38 ( 1%) usr   0.03 ( 0%) sys   0.58 ( 0%) wall    2579 kB ( 0%) ggc
 PRE                     :  26.78 (67%) usr   7.04 (71%) sys  72.36 (59%) wall 1444332 kB (85%) ggc
 CSE 2                   :   0.20 ( 0%) usr   0.01 ( 0%) sys   0.18 ( 0%) wall     653 kB ( 0%) ggc
 branch prediction       :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.02 ( 0%) wall     319 kB ( 0%) ggc
 combiner                :   0.35 ( 1%) usr   0.01 ( 0%) sys   0.50 ( 0%) wall    6538 kB ( 0%) ggc
 if-conversion           :   0.30 ( 1%) usr   0.00 ( 0%) sys   0.65 ( 1%) wall     496 kB ( 0%) ggc
 regmove                 :   0.06 ( 0%) usr   0.00 ( 0%) sys   0.12 ( 0%) wall     224 kB ( 0%) ggc
 integrated RA           :   1.03 ( 3%) usr   0.02 ( 0%) sys   1.15 ( 1%) wall   26022 kB ( 2%) ggc
 reload                  :   0.51 ( 1%) usr   0.00 ( 0%) sys   0.58 ( 0%) wall    1732 kB ( 0%) ggc
 reload CSE regs         :   0.40 ( 1%) usr   0.00 ( 0%) sys   0.74 ( 1%) wall    3965 kB ( 0%) ggc
 ree                     :   0.03 ( 0%) usr   0.01 ( 0%) sys   0.05 ( 0%) wall     162 kB ( 0%) ggc
 thread pro- & epilogue  :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.04 ( 0%) wall     822 kB ( 0%) ggc
 if-conversion 2         :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall      84 kB ( 0%) ggc
 combine stack adjustments:   0.01 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 peephole 2              :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     228 kB ( 0%) ggc
 hard reg cprop          :   0.05 ( 0%) usr   0.01 ( 0%) sys   0.10 ( 0%) wall       9 kB ( 0%) ggc
 scheduling 2            :   0.47 ( 1%) usr   0.29 ( 3%) sys   1.15 ( 1%) wall     328 kB ( 0%) ggc
 machine dep reorg       :   0.05 ( 0%) usr   0.00 ( 0%) sys   0.07 ( 0%) wall      44 kB ( 0%) ggc
 reorder blocks          :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.05 ( 0%) wall     867 kB ( 0%) ggc
 final                   :   0.20 ( 0%) usr   0.02 ( 0%) sys   0.33 ( 0%) wall    2060 kB ( 0%) ggc
 symout                  :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 0%) wall       0 kB ( 0%) ggc
 rest of compilation     :   0.26 ( 1%) usr   0.02 ( 0%) sys   0.54 ( 0%) wall    1438 kB ( 0%) ggc
 remove unused locals    :   0.36 ( 1%) usr   0.00 ( 0%) sys   0.41 ( 0%) wall       0 kB ( 0%) ggc
 address taken           :   0.04 ( 0%) usr   0.00 ( 0%) sys   0.08 ( 0%) wall       0 kB ( 0%) ggc
 unaccounted todo        :   0.11 ( 0%) usr   0.03 ( 0%) sys   0.12 ( 0%) wall       0 kB ( 0%) ggc
 rebuild frequencies     :   0.01 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall       1 kB ( 0%) ggc
 repair loop structures  :   0.02 ( 0%) usr   0.00 ( 0%) sys   0.03 ( 0%) wall      23 kB ( 0%) ggc
 TOTAL                 :  40.21             9.85           123.33            1695325 kB

  reply	other threads:[~2012-11-27 12:09 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-27  8:36 [Qemu-devel] [PATCH 1.3] build: compile translate.o at -O1 optimization Paolo Bonzini
2012-11-27  9:27 ` Wenchao Xia
2012-11-27  9:37   ` Paolo Bonzini
2012-11-27 12:09     ` Gerd Hoffmann [this message]
  -- strict thread matches above, loose matches on Subject: below --
2012-11-27  8:34 Paolo Bonzini
2012-11-27 13:24 ` Peter Maydell
2012-11-27 13:45   ` Gerd Hoffmann
2012-11-27 13:46     ` Peter Maydell
2012-11-27 14:10       ` Paolo Bonzini
2012-11-27 13:49     ` 陳韋任 (Wei-Ren Chen)
2012-11-27 15:05       ` Paolo Bonzini
2012-11-27 16:24       ` Andreas Färber
2012-11-27 15:53 ` Avi Kivity
2012-11-27 16:02   ` Paolo Bonzini
2012-11-27 16:49   ` Markus Armbruster
2012-11-27 16:55     ` Paolo Bonzini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50B4AD66.9030106@redhat.com \
    --to=kraxel@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=xiawenc@linux.vnet.ibm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.