From: catalin.marinas@arm.com (Catalin Marinas)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCHv2 1/6] arm64: lib: Implement optimized memcpy routine
Date: Fri, 9 May 2014 15:13:09 +0100 [thread overview]
Message-ID: <20140509141308.GE7950@arm.com> (raw)
In-Reply-To: <1398661895-5559-2-git-send-email-zhichang.yuan@linaro.org>
On Mon, Apr 28, 2014 at 06:11:29AM +0100, zhichang.yuan at linaro.org wrote:
> This patch, based on Linaro's Cortex Strings library, improves
> the performance of the assembly optimized memcpy() function.
[...]
> --- a/arch/arm64/lib/memcpy.S
> +++ b/arch/arm64/lib/memcpy.S
[...]
> ENTRY(memcpy)
[...]
> + mov dst, dstin
> + cmp count, #16
> + /*When memory length is less than 16, the accessed are not aligned.*/
> + b.lo .Ltiny15
> +
> + neg tmp2, src
> + ands tmp2, tmp2, #15/* Bytes to reach alignment. */
> + b.eq .LSrcAligned
> + sub count, count, tmp2
I started looking at this and comparing it to the original cortex
strings library. Is there any reason why at least the first part has
been rewritten? For example, the cortex strings starts with probably the
most likely case, comparing the count with 64.
--
Catalin
next prev parent reply other threads:[~2014-05-09 14:13 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-04-28 5:11 [PATCHv2 0/6] arm64:lib: the optimized string library routines for armv8 processors zhichang.yuan at linaro.org
2014-04-28 5:11 ` [PATCHv2 1/6] arm64: lib: Implement optimized memcpy routine zhichang.yuan at linaro.org
2014-05-09 14:13 ` Catalin Marinas [this message]
2014-05-13 13:33 ` zhichang.yuan
2014-04-28 5:11 ` [PATCHv2 2/6] arm64: lib: Implement optimized memmove routine zhichang.yuan at linaro.org
2014-04-28 5:11 ` [PATCHv2 3/6] arm64: lib: Implement optimized memset routine zhichang.yuan at linaro.org
2014-04-28 5:11 ` [PATCHv2 4/6] arm64: lib: Implement optimized memcmp routine zhichang.yuan at linaro.org
2014-04-28 5:11 ` [PATCHv2 5/6] arm64: lib: Implement optimized string compare routines zhichang.yuan at linaro.org
2014-04-28 5:11 ` [PATCHv2 6/6] arm64: lib: Implement optimized string length routines zhichang.yuan at linaro.org
2014-05-09 12:56 ` [PATCHv2 0/6] arm64:lib: the optimized string library routines for armv8 processors Catalin Marinas
2014-05-16 11:38 ` zhichang.yuan
2014-05-23 14:29 ` Catalin Marinas
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140509141308.GE7950@arm.com \
--to=catalin.marinas@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.