public inbox for linux-arm-kernel@lists.infradead.org
 help / color / mirror / Atom feed
* [RFC] Improving scalability of smp_mb__[before|after]_clear_bit
@ 2011-07-12  7:34 heechul Yun
  2011-07-12 10:07 ` Catalin Marinas
  2011-07-19 10:43 ` Russell King - ARM Linux
  0 siblings, 2 replies; 5+ messages in thread
From: heechul Yun @ 2011-07-12  7:34 UTC (permalink / raw)
  To: linux-arm-kernel

I think L2 cache sync operation, called by mb(), is not necessary for bitops.
This patch improves lat_pagefault of lmbench by up to 11% on a A9 SMP.
Higher proceesor
counts can benefit more.

---
diff --git a/arch/arm/include/asm/bitops.h b/arch/arm/include/asm/bitops.h
index b4892a0..f428059 100644
--- a/arch/arm/include/asm/bitops.h
+++ b/arch/arm/include/asm/bitops.h
@@ -26,8 +26,8 @@
 #include <linux/compiler.h>
 #include <asm/system.h>

-#define smp_mb__before_clear_bit()     mb()
-#define smp_mb__after_clear_bit()      mb()
+#define smp_mb__before_clear_bit()     smp_mb()
+#define smp_mb__after_clear_bit()      smp_mb()

 /*
  * These functions are the basis of our bit ops.

^ permalink raw reply related	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2011-07-19 12:52 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-07-12  7:34 [RFC] Improving scalability of smp_mb__[before|after]_clear_bit heechul Yun
2011-07-12 10:07 ` Catalin Marinas
2011-07-19 10:43 ` Russell King - ARM Linux
2011-07-19 10:59   ` Russell King - ARM Linux
2011-07-19 12:52     ` heechul Yun

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox