All of lore.kernel.org
 help / color / mirror / Atom feed
From: Arnaldo Carvalho de Melo <acme@kernel.org>
To: Taeung Song <treeze.taeung@gmail.com>
Cc: Arnaldo Carvalho de Melo <arnaldo.melo@gmail.com>,
	linux-kernel@vger.kernel.org, Jiri Olsa <jolsa@kernel.org>,
	Namhyung Kim <namhyung@kernel.org>,
	Ingo Molnar <mingo@kernel.org>,
	Peter Zijlstra <peterz@infradead.org>,
	Wang Nan <wangnan0@huawei.com>,
	Masami Hiramatsu <mhiramat@kernel.org>,
	Jiri Olsa <jolsa@redhat.com>
Subject: Re: [PATCH 2/4] perf annotate: Avoid division by zero when calculating percent
Date: Tue, 21 Mar 2017 11:14:07 -0300	[thread overview]
Message-ID: <20170321141407.GB3641@kernel.org> (raw)
In-Reply-To: <939347e4-593c-4ef6-37d9-daa2fee3aed8@gmail.com>

Em Tue, Mar 21, 2017 at 07:20:20AM +0900, Taeung Song escreveu:
> And,
> I tested by perf-stat on the same situation as below.
> 
>   $ perf stat -e "{cycles,page-faults,branch-misses}" ./old <input.txt
>   6623856

Please always try to spell out all the steps needed to get to some
result, for instance, in this case the info above, that you are asking
for three counters to be recorded at once probably has the key to
reproduce this, as I think that you may run your workload and sometimes
not get one page fault, leading tho that division by zero, but I have to
try to reproduce it now that I have this clue.

Thanks,

- Arnaldo

 
>    Performance counter stats for './old':
> 
>        472,007,763      cycles                            (99.85%)
>                 71      page-faults                       (99.85%)
>            220,073      branch-misses                     (99.85%)
> 
>        0.170768608 seconds time elapsed
> 
> Many times, the number of samples 'page-faults' was 68 ~ 71.
> In spite of it, how did the below 'h->sum' is zero..
> 
> util/annotate.c:1660~1661
> 
> 1660        h = annotation__histogram(notes, evidx + k);
> 1661        src_line->samples[k].percent = 100.0 * h->addr[i] / h->sum;
> 
> 
> This patch just add if statement 'if (h->sum)' to handle the case
> that h->sum is zero. But now I wonder how h->sum could be zero..
> 
> I'll dig the problem to find the root cause of it, too !
> 
> Thanks,
> Taeung
> 
> On 03/21/2017 07:11 AM, Taeung Song wrote:
> > Hi Arnaldo :)
> > 
> > Here the perf.data is,
> > https://www.dropbox.com/s/nr4nnv8g3cipluf/perf.data?dl=1&pl=1
> > 
> > I tested as below.
> > 
> >   $ perf record -e "{cycles,page-faults,branch-misses}" ./old <input.txt
> > 
> >   $ perf annotate --stdio -l -f 2> /dev/null | grep -i nan | head -3
> >    29.04    -nan    1.52 old_pack_knapsack.c:34
> >    28.27    -nan    0.00 old_pack_knapsack.c:38
> >    16.37    -nan    0.00 old_pack_knapsack.c:37
> > 
> > 
> > Thanks,
> > Taeung
> > 
> > On 03/21/2017 03:15 AM, Arnaldo Carvalho de Melo wrote:
> > > Em Mon, Mar 20, 2017 at 11:56:55AM +0900, Taeung Song escreveu:
> > > > Currently perf-annotate with --print-line can print
> > > > -nan(0x8000000000000) because of division by zero
> > > > when calculating percent.
> > > > 
> > > > So if a sum of samples is zero, skip calculating percent.
> > > 
> > > Tried to reproduce it here, couldn't, syswide record:
> > > 
> > > [root@jouet ~]# perf evlist -v
> > > cycles: size: 112, { sample_period, sample_freq }: 4000, sample_type:
> > > IP|TID|TIME|CPU|PERIOD, disabled: 1, inherit: 1, mmap: 1, comm: 1,
> > > freq: 1, task: 1, sample_id_all: 1, exclude_guest: 1, mmap2: 1,
> > > comm_exec: 1
> > > [root@jouet ~]# perf annotate --stdio -l 2> /dev/null  | grep -i nan
> > > [root@jouet ~]#
> > > 
> > > Can you please send me a perf.data file with this problem? I have to go
> > > thru the code to see how this can take place...
> > > 
> > > - Arnaldo
> > > 
> > > 
> > > > Before:
> > > > 
> > > >     $ perf annotate --stdio -l
> > > > 
> > > > Sorted summary for file /home/taeung/workspace/a.out
> > > > ----------------------------------------------
> > > > 
> > > >    32.89    -nan    7.04 a.c:38
> > > >    25.14    -nan    0.00 a.c:34
> > > >    16.26    -nan   56.34 a.c:31
> > > >    15.88    -nan    1.41 a.c:37
> > > >     5.67    -nan    0.00 a.c:39
> > > >     1.13    -nan   35.21 a.c:26
> > > >     0.95    -nan    0.00 a.c:44
> > > >     0.57    -nan    0.00 a.c:32
> > > >  Percent                 |      Source code & Disassembly of a.out
> > > > for cycles (529 samples)
> > > > -----------------------------------------------------------------------------------------
> > > > 
> > > >                          :
> > > > ...
> > > > 
> > > >  a.c:26    0.57    -nan    4.23 :         40081a:       mov
> > > > %edi,-0x24(%rbp)
> > > >  a.c:26    0.00    -nan    9.86 :         40081d:       mov
> > > > %rsi,-0x30(%rbp)
> > > > 
> > > > ...
> > > > 
> > > > After:
> > > > 
> > > >     $ perf annotate --stdio -l
> > > > 
> > > > Sorted summary for file /home/taeung/workspace/a.out
> > > > ----------------------------------------------
> > > > 
> > > >    32.89    0.00    7.04 a.c:38
> > > >    25.14    0.00    0.00 a.c:34
> > > >    16.26    0.00   56.34 a.c:31
> > > >    15.88    0.00    1.41 a.c:37
> > > >     5.67    0.00    0.00 a.c:39
> > > >     1.13    0.00   35.21 a.c:26
> > > >     0.95    0.00    0.00 a.c:44
> > > >     0.57    0.00    0.00 a.c:32
> > > >  Percent                 |      Source code & Disassembly of old for
> > > > cycles (529 samples)
> > > > -----------------------------------------------------------------------------------------
> > > > 
> > > >                          :
> > > > ...
> > > > 
> > > > a.c:26    0.57    0.00    4.23 :         40081a:       mov
> > > > %edi,-0x24(%rbp)
> > > > a.c:26    0.00    0.00    9.86 :         40081d:       mov
> > > > %rsi,-0x30(%rbp)
> > > > 
> > > > ...
> > > > 
> > > > Cc: Namhyung Kim <namhyung@kernel.org>
> > > > Cc: Jiri Olsa <jolsa@redhat.com>
> > > > Signed-off-by: Taeung Song <treeze.taeung@gmail.com>
> > > > ---
> > > >  tools/perf/util/annotate.c | 10 +++++++---
> > > >  1 file changed, 7 insertions(+), 3 deletions(-)
> > > > 
> > > > diff --git a/tools/perf/util/annotate.c b/tools/perf/util/annotate.c
> > > > index fc91c6b..9bb43cd 100644
> > > > --- a/tools/perf/util/annotate.c
> > > > +++ b/tools/perf/util/annotate.c
> > > > @@ -1665,11 +1665,15 @@ static int symbol__get_source_line(struct
> > > > symbol *sym, struct map *map,
> > > >          src_line->nr_pcnt = nr_pcnt;
> > > > 
> > > >          for (k = 0; k < nr_pcnt; k++) {
> > > > +            double percent = 0.0;
> > > > +
> > > >              h = annotation__histogram(notes, evidx + k);
> > > > -            src_line->samples[k].percent = 100.0 * h->addr[i] / h->sum;
> > > > +            if (h->sum)
> > > > +                percent = 100.0 * h->addr[i] / h->sum;
> > > > 
> > > > -            if (src_line->samples[k].percent > percent_max)
> > > > -                percent_max = src_line->samples[k].percent;
> > > > +            if (percent > percent_max)
> > > > +                percent_max = percent;
> > > > +            src_line->samples[k].percent = percent;
> > > >          }
> > > > 
> > > >          if (percent_max <= 0.5)
> > > > --
> > > > 2.7.4

  reply	other threads:[~2017-03-21 14:14 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-03-20  2:56 [PATCH 0/4] perf annotate: Bugfixes Taeung Song
2017-03-20  2:56 ` [PATCH 1/4] perf annotate: Use build-id dir when reading link name Taeung Song
2017-03-20  2:56 ` [PATCH 2/4] perf annotate: Avoid division by zero when calculating percent Taeung Song
2017-03-20 18:15   ` Arnaldo Carvalho de Melo
2017-03-20 22:11     ` Taeung Song
2017-03-20 22:20       ` Taeung Song
2017-03-21 14:14         ` Arnaldo Carvalho de Melo [this message]
2017-03-21 14:21           ` Arnaldo Carvalho de Melo
2017-03-21 14:36             ` Taeung Song
2017-03-22 12:00             ` Taeung Song
2017-03-20  2:56 ` [PATCH 3/4] perf annotate: Fix missing setting nr samples on source_line Taeung Song
2017-03-20  2:56 ` [PATCH 4/4] perf annotate: More exactly grep -v of the objdump command Taeung Song
2017-03-21 14:37   ` Arnaldo Carvalho de Melo
2017-03-21 16:19     ` Taeung Song
2017-03-21 16:19     ` Taeung Song
2017-03-21 18:29       ` Arnaldo Carvalho de Melo
2017-03-21 18:32         ` Arnaldo Carvalho de Melo
2017-03-22  7:32           ` Taeung Song
2017-03-24 18:45   ` [tip:perf/core] " tip-bot for Taeung Song

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20170321141407.GB3641@kernel.org \
    --to=acme@kernel.org \
    --cc=arnaldo.melo@gmail.com \
    --cc=jolsa@kernel.org \
    --cc=jolsa@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mhiramat@kernel.org \
    --cc=mingo@kernel.org \
    --cc=namhyung@kernel.org \
    --cc=peterz@infradead.org \
    --cc=treeze.taeung@gmail.com \
    --cc=wangnan0@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.