linuxppc-dev.lists.ozlabs.org archive mirror
* [PATCH 1/6] powerpc: Speed up clear_page by unrolling it
@ 2014-09-17  4:39 Anton Blanchard
From: Anton Blanchard @ 2014-09-17  4:39 UTC
  To: benh, paulus, mpe; +Cc: linuxppc-dev

Unroll clear_page 8 times. A simple microbenchmark which
allocates and frees a zeroed page:

for (i = 0; i < iterations; i++) {
	unsigned long p = __get_free_page(GFP_KERNEL | __GFP_ZERO);
	free_page(p);
}

improves by 20% on POWER8.

This assumes cacheline sizes won't grow beyond 512 bytes and
page sizes won't drop below 1kB. Neither change is likely, but we
could add a runtime check during early init if it makes people nervous.

Signed-off-by: Anton Blanchard <anton@samba.org>
---
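Editorial note, not part of the original patch: the assumption in the
commit message comes down to one invariant. Each loop iteration now
clears 8 cachelines, so a page must hold a non-zero whole number of
8-line chunks. A minimal userspace sketch of such a check follows.
The helper name clear_page_unroll_ok and the CLEAR_PAGE_UNROLL macro
are hypothetical, and where an early-init check would actually live in
the kernel is left open.

```c
#include <stdbool.h>

/* Unroll factor used by the patched loop (8 dcbz per iteration). */
#define CLEAR_PAGE_UNROLL 8UL

/*
 * Hypothetical helper: returns true when a page of page_size bytes can be
 * cleared by a loop that zeroes CLEAR_PAGE_UNROLL cachelines of line_size
 * bytes per iteration, i.e. the page is a non-zero whole number of
 * (CLEAR_PAGE_UNROLL * line_size)-byte chunks.
 */
static bool clear_page_unroll_ok(unsigned long page_size,
				 unsigned long line_size)
{
	unsigned long chunk;

	if (line_size == 0)
		return false;

	chunk = CLEAR_PAGE_UNROLL * line_size;

	/* lines/8 would be 0 (or truncate) if the page is not a multiple. */
	return page_size >= chunk && page_size % chunk == 0;
}
```

With 4kB pages the check passes for cachelines up to 512 bytes
(8 * 512 = 4096) and fails at 1024 bytes. With 128-byte cachelines it
passes for any page size that is a multiple of 1kB.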
 arch/powerpc/include/asm/page_64.h | 18 ++++++++++++++----
 1 file changed, 14 insertions(+), 4 deletions(-)

diff --git a/arch/powerpc/include/asm/page_64.h b/arch/powerpc/include/asm/page_64.h
index d0d6afb..b6e78cb 100644
--- a/arch/powerpc/include/asm/page_64.h
+++ b/arch/powerpc/include/asm/page_64.h
@@ -51,11 +51,21 @@ static __inline__ void clear_page(void *addr)
 
 	__asm__ __volatile__(
 	"mtctr	%1	# clear_page\n\
-1:      dcbz	0,%0\n\
-	add	%0,%0,%3\n\
+	.balign	16\n\
+1:	dcbz	0,%0\n\
+	dcbz	%3,%0\n\
+	dcbz	%4,%0\n\
+	dcbz	%5,%0\n\
+	dcbz	%6,%0\n\
+	dcbz	%7,%0\n\
+	dcbz	%8,%0\n\
+	dcbz	%9,%0\n\
+	add	%0,%0,%10\n\
 	bdnz+	1b"
-        : "=r" (addr)
-        : "r" (lines), "0" (addr), "r" (line_size)
+	: "=&r" (addr)
+	: "r" (lines/8), "0" (addr), "b" (line_size), "b" (line_size*2),
+		"b" (line_size*3), "b" (line_size*4), "b" (line_size*5),
+		"b" (line_size*6), "b" (line_size*7), "r" (line_size*8)
 	: "ctr", "memory");
 }
 
-- 
1.9.1


Thread overview: 6+ messages
2014-09-17  4:39 [PATCH 1/6] powerpc: Speed up clear_page by unrolling it Anton Blanchard
2014-09-17  4:39 ` [PATCH 2/6] powerpc: Use pr_fmt in module loader code Anton Blanchard
2014-09-17  4:39 ` [PATCH 3/6] powerpc: Remove powerpc specific cmd_line Anton Blanchard
2014-09-17  4:39 ` [PATCH 4/6] powerpc: Remove ppc_md.remove_memory Anton Blanchard
2014-09-17  4:39 ` [PATCH 5/6] powerpc: Add printk levels to powernv platform code Anton Blanchard
2014-09-17  4:39 ` [PATCH 6/6] powerpc: Add printk levels to powerpc code Anton Blanchard
