From: "H. Peter Anvin" <hpa@zytor.com>
To: ling.ma@intel.com
Cc: mingo@elte.hu, tglx@linutronix.de, linux-kernel@vger.kernel.org
Subject: Re: [PATCH RFC] [X86] performance improvement for memcpy_64.S by fast string.
Date: Fri, 06 Nov 2009 09:07:44 -0800 [thread overview]
Message-ID: <4AF457E0.4040107@zytor.com> (raw)
In-Reply-To: <1257500482-16182-1-git-send-email-ling.ma@intel.com>
On 11/06/2009 01:41 AM, ling.ma@intel.com wrote:
>
> Performance counter stats for './static_orig' (10 runs):
>
> 2835.650105 task-clock-msecs # 0.999 CPUs ( +- 0.051% )
> 3 context-switches # 0.000 M/sec ( +- 6.503% )
> 0 CPU-migrations # 0.000 M/sec ( +- nan% )
> 4429 page-faults # 0.002 M/sec ( +- 0.003% )
> 7941098692 cycles # 2800.451 M/sec ( +- 0.051% )
> 10848100323 instructions # 1.366 IPC ( +- 0.000% )
> 322808 cache-references # 0.114 M/sec ( +- 1.467% )
> 280716 cache-misses # 0.099 M/sec ( +- 0.618% )
>
> 2.838006377 seconds time elapsed ( +- 0.051% )
>
> 'perf stat --repeat 10 ./static_new' command get data after patch:
>
> Performance counter stats for './static_new' (10 runs):
>
> 7401.423466 task-clock-msecs # 0.999 CPUs ( +- 0.108% )
> 10 context-switches # 0.000 M/sec ( +- 2.797% )
> 0 CPU-migrations # 0.000 M/sec ( +- nan% )
> 4428 page-faults # 0.001 M/sec ( +- 0.003% )
> 20727280183 cycles # 2800.445 M/sec ( +- 0.107% )
> 1472673654 instructions # 0.071 IPC ( +- 0.013% )
> 1092221 cache-references # 0.148 M/sec ( +- 12.414% )
> 290550 cache-misses # 0.039 M/sec ( +- 1.577% )
>
> 7.407006046 seconds time elapsed ( +- 0.108% )
>
I assume these are backwards? If so, it's a dramatic performance
improvement.
Where did the 1024 byte threshold come from? It seems a bit high to me,
and is at the very best a CPU-specific tuning factor.
Andi is of course correct that older CPUs might suffer (sadly enough),
which is why we'd at the very least need some idea of what the
performance impact on those older CPUs would look like -- at that point
we can make a decision to just unconditionally do the rep movs or
consider some system where we point at different implementations for
different processors -- memcpy is probably one of the very few
operations for which something like that would make sense.
-hpa
--
H. Peter Anvin, Intel Open Source Technology Center
I work for Intel. I don't speak on their behalf.
next prev parent reply other threads:[~2009-11-06 17:08 UTC|newest]
Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-11-06 9:41 [PATCH RFC] [X86] performance improvement for memcpy_64.S by fast string ling.ma
2009-11-06 16:51 ` Andi Kleen
2009-11-08 10:18 ` Ingo Molnar
2009-11-06 17:07 ` H. Peter Anvin [this message]
2009-11-06 19:26 ` H. Peter Anvin
2009-11-09 7:24 ` Ma, Ling
2009-11-09 7:36 ` H. Peter Anvin
2009-11-09 8:08 ` Ingo Molnar
2009-11-11 7:05 ` Ma, Ling
2009-11-11 7:18 ` Ingo Molnar
2009-11-11 7:57 ` Ma, Ling
2009-11-11 23:21 ` H. Peter Anvin
2009-11-12 2:12 ` Ma, Ling
2009-11-11 20:34 ` Cyrill Gorcunov
2009-11-11 22:39 ` H. Peter Anvin
2009-11-12 4:28 ` Cyrill Gorcunov
2009-11-12 4:49 ` Ma, Ling
2009-11-12 5:26 ` H. Peter Anvin
2009-11-12 7:42 ` Ma, Ling
2009-11-12 9:54 ` Cyrill Gorcunov
2009-11-12 12:16 ` Pavel Machek
2009-11-13 7:33 ` Ingo Molnar
2009-11-13 8:04 ` H. Peter Anvin
2009-11-13 8:10 ` Ingo Molnar
2009-11-09 9:26 ` Andi Kleen
2009-11-09 16:41 ` H. Peter Anvin
2009-11-09 18:54 ` Andi Kleen
2009-11-09 22:36 ` H. Peter Anvin
2009-11-12 12:16 ` Pavel Machek
2009-11-13 5:33 ` Ma, Ling
2009-11-13 6:04 ` H. Peter Anvin
2009-11-13 7:23 ` Ma, Ling
2009-11-13 7:30 ` H. Peter Anvin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4AF457E0.4040107@zytor.com \
--to=hpa@zytor.com \
--cc=ling.ma@intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox