All of lore.kernel.org
 help / color / mirror / Atom feed
From: catalin.marinas@arm.com (Catalin Marinas)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCHv2 1/6] arm64: lib: Implement optimized memcpy routine
Date: Fri, 9 May 2014 15:13:09 +0100	[thread overview]
Message-ID: <20140509141308.GE7950@arm.com> (raw)
In-Reply-To: <1398661895-5559-2-git-send-email-zhichang.yuan@linaro.org>

On Mon, Apr 28, 2014 at 06:11:29AM +0100, zhichang.yuan at linaro.org wrote:
> This patch, based on Linaro's Cortex Strings library, improves
> the performance of the assembly optimized memcpy() function.
[...]
> --- a/arch/arm64/lib/memcpy.S
> +++ b/arch/arm64/lib/memcpy.S
[...]
>  ENTRY(memcpy)
[...]
> +	mov	dst, dstin
> +	cmp	count, #16
> +	/*When memory length is less than 16, the accessed are not aligned.*/
> +	b.lo	.Ltiny15
> +
> +	neg	tmp2, src
> +	ands	tmp2, tmp2, #15/* Bytes to reach alignment. */
> +	b.eq	.LSrcAligned
> +	sub	count, count, tmp2

I started looking at this and comparing it to the original cortex
strings library. Is there any reason why at least the first part has
been rewritten? For example, the cortex strings starts with probably the
most likely case, comparing the count with 64.

-- 
Catalin

  reply	other threads:[~2014-05-09 14:13 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-04-28  5:11 [PATCHv2 0/6] arm64:lib: the optimized string library routines for armv8 processors zhichang.yuan at linaro.org
2014-04-28  5:11 ` [PATCHv2 1/6] arm64: lib: Implement optimized memcpy routine zhichang.yuan at linaro.org
2014-05-09 14:13   ` Catalin Marinas [this message]
2014-05-13 13:33     ` zhichang.yuan
2014-04-28  5:11 ` [PATCHv2 2/6] arm64: lib: Implement optimized memmove routine zhichang.yuan at linaro.org
2014-04-28  5:11 ` [PATCHv2 3/6] arm64: lib: Implement optimized memset routine zhichang.yuan at linaro.org
2014-04-28  5:11 ` [PATCHv2 4/6] arm64: lib: Implement optimized memcmp routine zhichang.yuan at linaro.org
2014-04-28  5:11 ` [PATCHv2 5/6] arm64: lib: Implement optimized string compare routines zhichang.yuan at linaro.org
2014-04-28  5:11 ` [PATCHv2 6/6] arm64: lib: Implement optimized string length routines zhichang.yuan at linaro.org
2014-05-09 12:56 ` [PATCHv2 0/6] arm64:lib: the optimized string library routines for armv8 processors Catalin Marinas
2014-05-16 11:38   ` zhichang.yuan
2014-05-23 14:29     ` Catalin Marinas

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140509141308.GE7950@arm.com \
    --to=catalin.marinas@arm.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.