linux-arm-kernel.lists.infradead.org archive mirror
 help / color / mirror / Atom feed
* [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs
@ 2013-12-17 12:16 Will Deacon
  2013-12-17 12:16 ` [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU Will Deacon
                   ` (2 more replies)
  0 siblings, 3 replies; 7+ messages in thread
From: Will Deacon @ 2013-12-17 12:16 UTC (permalink / raw)
  To: linux-arm-kernel

With commit 11ec50caedb5 ("word-at-a-time: provide generic big-endian
zero_bytemask implementation"), the asm-generic word-at-a-time code now
provides a zero_bytemask implementation, allowing us to make use of
DCACHE_WORD_ACCESS on big-endian CPUs, providing our
load_unaligned_zeropad function is endianness-clean.

This patch reworks the load_unaligned_zeropad fixup code to work for
both big- and little-endian CPUs, then removes the !CPU_BIG_ENDIAN check
when selecting DCACHE_WORD_ACCESS.

Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm/Kconfig                      |  2 +-
 arch/arm/include/asm/word-at-a-time.h | 18 ++++++++++--------
 2 files changed, 11 insertions(+), 9 deletions(-)

diff --git a/arch/arm/Kconfig b/arch/arm/Kconfig
index 88f726d36a40..38ccfd0adbf9 100644
--- a/arch/arm/Kconfig
+++ b/arch/arm/Kconfig
@@ -11,7 +11,7 @@ config ARM
 	select BUILDTIME_EXTABLE_SORT if MMU
 	select CLONE_BACKWARDS
 	select CPU_PM if (SUSPEND || CPU_IDLE)
-	select DCACHE_WORD_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && !CPU_BIG_ENDIAN && MMU
+	select DCACHE_WORD_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && MMU
 	select GENERIC_ATOMIC64 if (CPU_V7M || CPU_V6 || !CPU_32v6K || !AEABI)
 	select GENERIC_CLOCKEVENTS_BROADCAST if SMP
 	select GENERIC_IDLE_POLL_SETUP
diff --git a/arch/arm/include/asm/word-at-a-time.h b/arch/arm/include/asm/word-at-a-time.h
index 4d52f92967a6..a6d0a29861e7 100644
--- a/arch/arm/include/asm/word-at-a-time.h
+++ b/arch/arm/include/asm/word-at-a-time.h
@@ -48,10 +48,14 @@ static inline unsigned long find_zero(unsigned long mask)
 	return ret;
 }
 
-#ifdef CONFIG_DCACHE_WORD_ACCESS
-
 #define zero_bytemask(mask) (mask)
 
+#else	/* __ARMEB__ */
+#include <asm-generic/word-at-a-time.h>
+#endif
+
+#ifdef CONFIG_DCACHE_WORD_ACCESS
+
 /*
  * Load an unaligned word from kernel space.
  *
@@ -73,7 +77,11 @@ static inline unsigned long load_unaligned_zeropad(const void *addr)
 	"	bic	%2, %2, #0x3\n"
 	"	ldr	%0, [%2]\n"
 	"	lsl	%1, %1, #0x3\n"
+#ifndef __ARMEB__
 	"	lsr	%0, %0, %1\n"
+#else
+	"	lsl	%0, %0, %1\n"
+#endif
 	"	b	2b\n"
 	"	.popsection\n"
 	"	.pushsection __ex_table,\"a\"\n"
@@ -86,11 +94,5 @@ static inline unsigned long load_unaligned_zeropad(const void *addr)
 	return ret;
 }
 
-
 #endif	/* DCACHE_WORD_ACCESS */
-
-#else	/* __ARMEB__ */
-#include <asm-generic/word-at-a-time.h>
-#endif
-
 #endif /* __ASM_ARM_WORD_AT_A_TIME_H */
-- 
1.8.2.2

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU
  2013-12-17 12:16 [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Will Deacon
@ 2013-12-17 12:16 ` Will Deacon
  2013-12-17 20:34   ` Nicolas Pitre
  2013-12-17 20:27 ` [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Nicolas Pitre
  2014-01-06 15:45 ` Ben Dooks
  2 siblings, 1 reply; 7+ messages in thread
From: Will Deacon @ 2013-12-17 12:16 UTC (permalink / raw)
  To: linux-arm-kernel

Modern ARM CPUs can perform efficient unaligned memory accesses in
hardware and this feature is relied up on by code such as the dcache
word-at-a-time name hashing.

This patch selects HAVE_EFFICIENT_UNALIGNED_ACCESS for these cores and
reworks the kconfig select logic for DCACHE_WORD_ACCESS to use the new
symbol.

Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm/Kconfig | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/arch/arm/Kconfig b/arch/arm/Kconfig
index 38ccfd0adbf9..f8c64af7c768 100644
--- a/arch/arm/Kconfig
+++ b/arch/arm/Kconfig
@@ -11,7 +11,7 @@ config ARM
 	select BUILDTIME_EXTABLE_SORT if MMU
 	select CLONE_BACKWARDS
 	select CPU_PM if (SUSPEND || CPU_IDLE)
-	select DCACHE_WORD_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && MMU
+	select DCACHE_WORD_ACCESS if HAVE_EFFICIENT_UNALIGNED_ACCESS
 	select GENERIC_ATOMIC64 if (CPU_V7M || CPU_V6 || !CPU_32v6K || !AEABI)
 	select GENERIC_CLOCKEVENTS_BROADCAST if SMP
 	select GENERIC_IDLE_POLL_SETUP
@@ -35,6 +35,7 @@ config ARM
 	select HAVE_DMA_ATTRS
 	select HAVE_DMA_CONTIGUOUS if MMU
 	select HAVE_DYNAMIC_FTRACE if (!XIP_KERNEL)
+	select HAVE_EFFICIENT_UNALIGNED_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && MMU
 	select HAVE_FTRACE_MCOUNT_RECORD if (!XIP_KERNEL)
 	select HAVE_FUNCTION_GRAPH_TRACER if (!THUMB2_KERNEL)
 	select HAVE_FUNCTION_TRACER if (!XIP_KERNEL)
-- 
1.8.2.2

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs
  2013-12-17 12:16 [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Will Deacon
  2013-12-17 12:16 ` [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU Will Deacon
@ 2013-12-17 20:27 ` Nicolas Pitre
  2014-01-06 15:45 ` Ben Dooks
  2 siblings, 0 replies; 7+ messages in thread
From: Nicolas Pitre @ 2013-12-17 20:27 UTC (permalink / raw)
  To: linux-arm-kernel

On Tue, 17 Dec 2013, Will Deacon wrote:

> With commit 11ec50caedb5 ("word-at-a-time: provide generic big-endian
> zero_bytemask implementation"), the asm-generic word-at-a-time code now
> provides a zero_bytemask implementation, allowing us to make use of
> DCACHE_WORD_ACCESS on big-endian CPUs, providing our
> load_unaligned_zeropad function is endianness-clean.
> 
> This patch reworks the load_unaligned_zeropad fixup code to work for
> both big- and little-endian CPUs, then removes the !CPU_BIG_ENDIAN check
> when selecting DCACHE_WORD_ACCESS.
> 
> Signed-off-by: Will Deacon <will.deacon@arm.com>

Small comment below.

> ---
>  arch/arm/Kconfig                      |  2 +-
>  arch/arm/include/asm/word-at-a-time.h | 18 ++++++++++--------
>  2 files changed, 11 insertions(+), 9 deletions(-)
> 
> diff --git a/arch/arm/Kconfig b/arch/arm/Kconfig
> index 88f726d36a40..38ccfd0adbf9 100644
> --- a/arch/arm/Kconfig
> +++ b/arch/arm/Kconfig
> @@ -11,7 +11,7 @@ config ARM
>  	select BUILDTIME_EXTABLE_SORT if MMU
>  	select CLONE_BACKWARDS
>  	select CPU_PM if (SUSPEND || CPU_IDLE)
> -	select DCACHE_WORD_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && !CPU_BIG_ENDIAN && MMU
> +	select DCACHE_WORD_ACCESS if (CPU_V6 || CPU_V6K || CPU_V7) && MMU
>  	select GENERIC_ATOMIC64 if (CPU_V7M || CPU_V6 || !CPU_32v6K || !AEABI)
>  	select GENERIC_CLOCKEVENTS_BROADCAST if SMP
>  	select GENERIC_IDLE_POLL_SETUP
> diff --git a/arch/arm/include/asm/word-at-a-time.h b/arch/arm/include/asm/word-at-a-time.h
> index 4d52f92967a6..a6d0a29861e7 100644
> --- a/arch/arm/include/asm/word-at-a-time.h
> +++ b/arch/arm/include/asm/word-at-a-time.h
> @@ -48,10 +48,14 @@ static inline unsigned long find_zero(unsigned long mask)
>  	return ret;
>  }
>  
> -#ifdef CONFIG_DCACHE_WORD_ACCESS
> -
>  #define zero_bytemask(mask) (mask)
>  
> +#else	/* __ARMEB__ */
> +#include <asm-generic/word-at-a-time.h>
> +#endif
> +
> +#ifdef CONFIG_DCACHE_WORD_ACCESS
> +
>  /*
>   * Load an unaligned word from kernel space.
>   *
> @@ -73,7 +77,11 @@ static inline unsigned long load_unaligned_zeropad(const void *addr)
>  	"	bic	%2, %2, #0x3\n"
>  	"	ldr	%0, [%2]\n"
>  	"	lsl	%1, %1, #0x3\n"
> +#ifndef __ARMEB__
>  	"	lsr	%0, %0, %1\n"
> +#else
> +	"	lsl	%0, %0, %1\n"
> +#endif

You could get rid of the #ifdef by using:

		pull	%0, %0, %1

Except this is not a pure assembly file and can't rely on 
<asm/assembler.h>.  Bah.  In that case...

Acked-by: Nicolas Pitre <nico@linaro.org>


Nicolas

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU
  2013-12-17 12:16 ` [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU Will Deacon
@ 2013-12-17 20:34   ` Nicolas Pitre
  2014-01-06 14:58     ` Will Deacon
  0 siblings, 1 reply; 7+ messages in thread
From: Nicolas Pitre @ 2013-12-17 20:34 UTC (permalink / raw)
  To: linux-arm-kernel

On Tue, 17 Dec 2013, Will Deacon wrote:

> Modern ARM CPUs can perform efficient unaligned memory accesses in
> hardware and this feature is relied up on by code such as the dcache
> word-at-a-time name hashing.
> 
> This patch selects HAVE_EFFICIENT_UNALIGNED_ACCESS for these cores and
> reworks the kconfig select logic for DCACHE_WORD_ACCESS to use the new
> symbol.
> 
> Signed-off-by: Will Deacon <will.deacon@arm.com>

This symbol could be used in some more places too.  For example this 
would make the code a bit clearer:

diff --git a/arch/arm/include/asm/uaccess.h b/arch/arm/include/asm/uaccess.h
index 72abdc541f..12c3a5decc 100644
--- a/arch/arm/include/asm/uaccess.h
+++ b/arch/arm/include/asm/uaccess.h
@@ -19,7 +19,7 @@
 #include <asm/unified.h>
 #include <asm/compiler.h>
 
-#if __LINUX_ARM_ARCH__ < 6
+#ifndef CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS
 #include <asm-generic/uaccess-unaligned.h>
 #else
 #define __get_user_unaligned __get_user


Nicolas

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU
  2013-12-17 20:34   ` Nicolas Pitre
@ 2014-01-06 14:58     ` Will Deacon
  2014-01-06 15:17       ` Nicolas Pitre
  0 siblings, 1 reply; 7+ messages in thread
From: Will Deacon @ 2014-01-06 14:58 UTC (permalink / raw)
  To: linux-arm-kernel

Hi Nicolas,

On Tue, Dec 17, 2013 at 08:34:18PM +0000, Nicolas Pitre wrote:
> On Tue, 17 Dec 2013, Will Deacon wrote:
> 
> > Modern ARM CPUs can perform efficient unaligned memory accesses in
> > hardware and this feature is relied up on by code such as the dcache
> > word-at-a-time name hashing.
> > 
> > This patch selects HAVE_EFFICIENT_UNALIGNED_ACCESS for these cores and
> > reworks the kconfig select logic for DCACHE_WORD_ACCESS to use the new
> > symbol.
> > 
> > Signed-off-by: Will Deacon <will.deacon@arm.com>
> 
> This symbol could be used in some more places too.  For example this 
> would make the code a bit clearer:

Well spotted.

> diff --git a/arch/arm/include/asm/uaccess.h b/arch/arm/include/asm/uaccess.h
> index 72abdc541f..12c3a5decc 100644
> --- a/arch/arm/include/asm/uaccess.h
> +++ b/arch/arm/include/asm/uaccess.h
> @@ -19,7 +19,7 @@
>  #include <asm/unified.h>
>  #include <asm/compiler.h>
>  
> -#if __LINUX_ARM_ARCH__ < 6
> +#ifndef CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS
>  #include <asm-generic/uaccess-unaligned.h>
>  #else
>  #define __get_user_unaligned __get_user

The original patch is now queued, so we can add this hunk as a follow-up
cleanup patch. I've got you as the author, but can I add your S-o-B please?

Cheers,

Will

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU
  2014-01-06 14:58     ` Will Deacon
@ 2014-01-06 15:17       ` Nicolas Pitre
  0 siblings, 0 replies; 7+ messages in thread
From: Nicolas Pitre @ 2014-01-06 15:17 UTC (permalink / raw)
  To: linux-arm-kernel

On Mon, 6 Jan 2014, Will Deacon wrote:

> Hi Nicolas,
> 
> On Tue, Dec 17, 2013 at 08:34:18PM +0000, Nicolas Pitre wrote:
> > On Tue, 17 Dec 2013, Will Deacon wrote:
> > 
> > > Modern ARM CPUs can perform efficient unaligned memory accesses in
> > > hardware and this feature is relied up on by code such as the dcache
> > > word-at-a-time name hashing.
> > > 
> > > This patch selects HAVE_EFFICIENT_UNALIGNED_ACCESS for these cores and
> > > reworks the kconfig select logic for DCACHE_WORD_ACCESS to use the new
> > > symbol.
> > > 
> > > Signed-off-by: Will Deacon <will.deacon@arm.com>
> > 
> > This symbol could be used in some more places too.  For example this 
> > would make the code a bit clearer:
> 
> Well spotted.
> 
> > diff --git a/arch/arm/include/asm/uaccess.h b/arch/arm/include/asm/uaccess.h
> > index 72abdc541f..12c3a5decc 100644
> > --- a/arch/arm/include/asm/uaccess.h
> > +++ b/arch/arm/include/asm/uaccess.h
> > @@ -19,7 +19,7 @@
> >  #include <asm/unified.h>
> >  #include <asm/compiler.h>
> >  
> > -#if __LINUX_ARM_ARCH__ < 6
> > +#ifndef CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS
> >  #include <asm-generic/uaccess-unaligned.h>
> >  #else
> >  #define __get_user_unaligned __get_user
> 
> The original patch is now queued, so we can add this hunk as a follow-up
> cleanup patch. I've got you as the author, but can I add your S-o-B please?

Signed-off-by: Nicolas Pitre <nico@linaro.org>



> 
> Cheers,
> 
> Will
> 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs
  2013-12-17 12:16 [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Will Deacon
  2013-12-17 12:16 ` [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU Will Deacon
  2013-12-17 20:27 ` [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Nicolas Pitre
@ 2014-01-06 15:45 ` Ben Dooks
  2 siblings, 0 replies; 7+ messages in thread
From: Ben Dooks @ 2014-01-06 15:45 UTC (permalink / raw)
  To: linux-arm-kernel

On 17/12/13 12:16, Will Deacon wrote:
> With commit 11ec50caedb5 ("word-at-a-time: provide generic big-endian
> zero_bytemask implementation"), the asm-generic word-at-a-time code now
> provides a zero_bytemask implementation, allowing us to make use of
> DCACHE_WORD_ACCESS on big-endian CPUs, providing our
> load_unaligned_zeropad function is endianness-clean.
>
> This patch reworks the load_unaligned_zeropad fixup code to work for
> both big- and little-endian CPUs, then removes the !CPU_BIG_ENDIAN check
> when selecting DCACHE_WORD_ACCESS.


> @@ -73,7 +77,11 @@ static inline unsigned long load_unaligned_zeropad(const void *addr)
>   	"	bic	%2, %2, #0x3\n"
>   	"	ldr	%0, [%2]\n"
>   	"	lsl	%1, %1, #0x3\n"
> +#ifndef __ARMEB__
>   	"	lsr	%0, %0, %1\n"
> +#else
> +	"	lsl	%0, %0, %1\n"
> +#endif

IIRC we have ARM_BE() and ARM_LE() helpers for this


-- 
Ben Dooks				http://www.codethink.co.uk/
Senior Engineer				Codethink - Providing Genius

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2014-01-06 15:45 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-12-17 12:16 [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Will Deacon
2013-12-17 12:16 ` [PATCH 2/2] ARM: kconfig: select HAVE_EFFICIENT_UNALIGNED_ACCESS for CPUv6+ && MMU Will Deacon
2013-12-17 20:34   ` Nicolas Pitre
2014-01-06 14:58     ` Will Deacon
2014-01-06 15:17       ` Nicolas Pitre
2013-12-17 20:27 ` [PATCH 1/2] ARM: dcache: select DCACHE_WORD_ACCESS for big-endian CPUs Nicolas Pitre
2014-01-06 15:45 ` Ben Dooks

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).