From: Jiri Olsa <jolsa@redhat.com>
To: namhyung@kernel.org
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>,
Ingo Molnar <mingo@kernel.org>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Mark Rutland <mark.rutland@arm.com>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
LKML <linux-kernel@vger.kernel.org>,
Stephane Eranian <eranian@google.com>,
Ian Rogers <irogers@google.com>, Al Grant <al.grant@foss.arm.com>,
Adrian Hunter <adrian.hunter@intel.com>
Subject: Re: [PATCH] perf inject: Flush ordered events on FINISHED_ROUND
Date: Thu, 8 Oct 2020 11:06:54 +0200 [thread overview]
Message-ID: <20201008090654.GA656950@krava> (raw)
In-Reply-To: <20201006054032.GA1724372@google.com>
On Tue, Oct 06, 2020 at 02:40:32PM +0900, namhyung@kernel.org wrote:
> On Tue, Oct 06, 2020 at 11:39:49AM +0900, namhyung@kernel.org wrote:
> > > > On Fri, Oct 02, 2020 at 10:03:17PM +0900, Namhyung Kim wrote:
> > > > > Below measures time and memory usage during the perf inject and
> > > > > report using ~190MB data file.
> > > > >
> > > > > Before:
> > > > > perf inject: 11.09 s, 382148 KB
> > > > > perf report: 8.05 s, 397440 KB
> > > > >
> > > > > After:
> > > > > perf inject: 16.24 s, 83376 KB
> > > > > perf report: 7.96 s, 216184 KB
> > > > >
> > > > > As you can see, it used 2x memory of the input size. I guess it's
> > > > > because it needs to keep the copy for the whole input. But I don't
> > > > > understand why processing time of perf inject increased..
> >
> > Measuring it with time shows:
> >
> > before after
> > real 11.309s 17.040s
> > user 8.084s 13.940s
> > sys 6.535s 6.732s
> >
> > So it's user space to make the difference. I've run perf record on
> > both (with cycles:U) and the dominant function is same: queue_event.
> > (46.98% vs 65.87%)
> >
> > It seems the flushing the queue makes more overhead on sorting.
>
> So I suspect the cache-miss ratio affects the performance. With
> flushing, data is processed in the middle and all the entries are
> reused after flush so it would invalidate all the cache lines
> occasionally.
>
> This is the perf stat result:
>
> * Before
>
> 7,167,414,019 L1-dcache-loads
> 337,471,761 L1-dcache-read-misses # 4.71% of all L1-dcache hits
>
> 11.011224671 seconds time elapsed
>
>
> * After
>
> 7,075,556,792 L1-dcache-loads
> 771,810,388 L1-dcache-read-misses # 10.91% of all L1-dcache hits
>
> 17.015901863 seconds time elapsed
>
>
> Hmm.. it's a memory & time trade-off then. Maybe we need a switch to
> select which one?
I'd keep the faster one ;-) so the one before
jirka
next prev parent reply other threads:[~2020-10-08 9:07 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-02 13:03 [PATCH] perf inject: Flush ordered events on FINISHED_ROUND Namhyung Kim
2020-10-04 19:52 ` Jiri Olsa
2020-10-06 1:28 ` Namhyung Kim
2020-10-06 2:39 ` namhyung
2020-10-06 5:40 ` namhyung
2020-10-08 9:06 ` Jiri Olsa [this message]
2020-10-12 5:04 ` Namhyung Kim
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20201008090654.GA656950@krava \
--to=jolsa@redhat.com \
--cc=a.p.zijlstra@chello.nl \
--cc=acme@kernel.org \
--cc=adrian.hunter@intel.com \
--cc=al.grant@foss.arm.com \
--cc=alexander.shishkin@linux.intel.com \
--cc=eranian@google.com \
--cc=irogers@google.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mark.rutland@arm.com \
--cc=mingo@kernel.org \
--cc=namhyung@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox