* cacheable_memcpy() versus memcpy() ==> 8% improvment on FTP throughput
@ 2015-02-03 16:35 leroy christophe
0 siblings, 0 replies; only message in thread
From: leroy christophe @ 2015-02-03 16:35 UTC (permalink / raw)
To: LinuxPPC-dev
In powerpc32 architecture we have a function called cacheable_memcpy()
which does same thing as memcpy() but using dcbz/dcbt instructions for
an optimised copy (just like __copy_tofrom_user())
What seems strange is that it is almost nowhere used (only used in
drivers/net/ethernet/ibm/emac/core.c)
I replaced all memcpy() in include/linux/skbuff.h and net/core/skbuff.c
by cacheable_memcpy() and I get around 8% improvement on FTP throughput
on MPC885.
What could be done to generalise the use of cacheable_memcpy() instead
of memcpy() whenever possible ?
Indeed, in order to use cacheable_memcpy(), we need
* The destination to be cacheable
* The source and destination to not overlap on the same cachelines
Could we check, when calling memcpy(), whether the destination is
cacheable or not, and if yes redirect the call to cacheable_memcpy() ?
How can we check that ?
Christophe
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2015-02-03 16:35 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-02-03 16:35 cacheable_memcpy() versus memcpy() ==> 8% improvment on FTP throughput leroy christophe
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.