From: Tejun Heo <tj@kernel.org>
To: cl@linux-foundation.org
Cc: akpm@linux-foundation.org, linux-mm@kvack.org, mingo@elte.hu,
rusty@rustcorp.com.au, davem@davemloft.net
Subject: Re: [this_cpu_xx V2 10/19] this_cpu: X86 optimized this_cpu operations
Date: Thu, 18 Jun 2009 12:00:15 +0900 [thread overview]
Message-ID: <4A39ADBF.1000505@kernel.org> (raw)
In-Reply-To: <20090617203444.731295080@gentwo.org>
cl@linux-foundation.org wrote:
> Basically the existing percpu ops can be used. However, we do not pass a
> reference to a percpu variable in. Instead an address of a percpu variable
> is provided.
>
> Both preempt, the non preempt and the irqsafe operations generate the same code.
> Signed-off-by: Christoph Lameter <cl@linux-foundation.org>
I'm a bit confused why this patch is in the middle of patches which
convert macro users? Wouldn't it be better to put this one right
after the patch which introduces this_cpu_*()?
> ---
> arch/x86/include/asm/percpu.h | 22 ++++++++++++++++++++++
> 1 file changed, 22 insertions(+)
>
> Index: linux-2.6/arch/x86/include/asm/percpu.h
> ===================================================================
> --- linux-2.6.orig/arch/x86/include/asm/percpu.h 2009-06-04 13:38:01.000000000 -0500
> +++ linux-2.6/arch/x86/include/asm/percpu.h 2009-06-04 14:21:22.000000000 -0500
> @@ -140,6 +140,28 @@ do { \
> #define percpu_or(var, val) percpu_to_op("or", per_cpu__##var, val)
> #define percpu_xor(var, val) percpu_to_op("xor", per_cpu__##var, val)
>
> +#define __this_cpu_read(pcp) percpu_from_op("mov", pcp)
^^^^
missing parentheses
and maybe adding () around val is a good idea too?
Also, I'm not quite sure these macros would operate on the correct
address. Checking... yeap, the following function,
DEFINE_PER_CPU(int, my_pcpu_cnt);
void my_func(void)
{
int *ptr = &per_cpu__my_pcpu_cnt;
*(int *)this_cpu_ptr(ptr) = 0;
this_cpu_add(ptr, 1);
percpu_add(my_pcpu_cnt, 1);
}
ends up being assembled into the following.
mov $0xdd48,%rax # save offset of my_pcpu_cnt
mov %rax,-0x10(%rbp) # into local var ptr
mov %gs:0xb800,%rdx # fetch this_cpu_off
movl $0x0,(%rax,%rdx,1) # 0 -> *(this_cpu_off + my_pcpu_cnt)
addq $0x1,%gs:-0x10(%rbp) # add 1 to %gs:ptr !!!
addl $0x1,%gs:0xdd48 # add 1 to %gs:my_pcpu_cnt
So, this_cpu_add(ptr, 1) ends up accessing the wrong address. Also,
please note the use of 'addq' instead of 'addl' as the pointer
variable is being modified.
> +#define __this_cpu_write(pcp, val) percpu_to_op("mov", (pcp), val)
> +#define __this_cpu_add(pcp, val) percpu_to_op("add", (pcp), val)
> +#define __this_cpu_sub(pcp, val) percpu_to_op("sub", (pcp), val)
> +#define __this_cpu_and(pcp, val) percpu_to_op("and", (pcp), val)
> +#define __this_cpu_or(pcp, val) percpu_to_op("or", (pcp), val)
> +#define __this_cpu_xor(pcp, val) percpu_to_op("xor", (pcp), val)
> +
> +#define this_cpu_read(pcp) percpu_from_op("mov", (pcp))
> +#define this_cpu_write(pcp, val) percpu_to_op("mov", (pcp), val)
> +#define this_cpu_add(pcp, val) percpu_to_op("add", (pcp), val)
> +#define this_cpu_sub(pcp, val) percpu_to_op("sub", (pcp), val)
> +#define this_cpu_and(pcp, val) percpu_to_op("and", (pcp), val)
> +#define this_cpu_or(pcp, val) percpu_to_op("or", (pcp), val)
> +#define this_cpu_xor(pcp, val) percpu_to_op("xor", (pcp), val)
>
> +#define irqsafe_cpu_add(pcp, val) percpu_to_op("add", (pcp), val)
> +#define irqsafe_cpu_sub(pcp, val) percpu_to_op("sub", (pcp), val)
> +#define irqsafe_cpu_and(pcp, val) percpu_to_op("and", (pcp), val)
> +#define irqsafe_cpu_or(pcp, val) percpu_to_op("or", (pcp), val)
> +#define irqsafe_cpu_xor(pcp, val) percpu_to_op("xor", (pcp), val)
Wouldn't it be clearer / easier to define preempt and irqsafe versions
as aliases of __ prefixed ones?
Thanks.
--
tejun
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-06-18 2:58 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-06-17 20:33 [this_cpu_xx V2 00/19] Introduce this_cpu_xx operations cl
2009-06-17 20:33 ` [this_cpu_xx V2 01/19] Fix handling of pagesets for downed cpus cl
2009-06-17 20:33 ` [this_cpu_xx V2 02/19] Introduce this_cpu_ptr() and generic this_cpu_* operations cl
2009-06-18 1:50 ` Tejun Heo
2009-06-18 2:29 ` Tejun Heo
2009-06-18 13:54 ` Christoph Lameter
2009-06-18 14:49 ` Tejun Heo
2009-06-17 20:33 ` [this_cpu_xx V2 03/19] Use this_cpu operations for SNMP statistics cl
2009-06-18 1:55 ` Tejun Heo
2009-06-17 20:33 ` [this_cpu_xx V2 04/19] Use this_cpu operations for NFS statistics cl
2009-06-18 2:03 ` Tejun Heo
2009-06-17 20:33 ` [this_cpu_xx V2 05/19] use this_cpu ops for network statistics cl
2009-06-17 20:33 ` [this_cpu_xx V2 06/19] this_cpu_ptr: Straight transformations cl
2009-06-17 20:33 ` [this_cpu_xx V2 07/19] this_cpu_ptr: Elimninate get/put_cpu cl
2009-06-17 20:33 ` [this_cpu_xx V2 08/19] this_cpu_ptr: xfs_icsb_modify_counters does not need "cpu" variable cl
2009-06-17 20:33 ` [this_cpu_xx V2 09/19] Use this_cpu_ptr in crypto subsystem cl
2009-06-17 20:33 ` [this_cpu_xx V2 10/19] this_cpu: X86 optimized this_cpu operations cl
2009-06-18 3:00 ` Tejun Heo [this message]
2009-06-18 14:07 ` Christoph Lameter
2009-06-18 14:48 ` Tejun Heo
2009-06-18 15:39 ` Christoph Lameter
2009-06-18 16:06 ` Tejun Heo
2009-06-18 16:15 ` Tejun Heo
2009-06-18 17:05 ` Christoph Lameter
2009-06-19 5:41 ` Rusty Russell
2009-06-23 18:00 ` Christoph Lameter
2009-06-17 20:33 ` [this_cpu_xx V2 11/19] Use this_cpu ops for VM statistics cl
2009-06-18 3:05 ` Tejun Heo
2009-06-17 20:33 ` [this_cpu_xx V2 12/19] RCU: Use this_cpu operations cl
2009-06-17 20:33 ` [this_cpu_xx V2 13/19] Use this_cpu operations in slub cl
2009-06-18 6:20 ` Pekka Enberg
2009-06-18 6:25 ` Pekka Enberg
2009-06-18 13:59 ` Christoph Lameter
2009-06-25 7:12 ` Pekka Enberg
2009-06-18 6:49 ` Tejun Heo
2009-06-18 7:35 ` Pekka Enberg
2009-06-18 13:59 ` Christoph Lameter
2009-06-25 7:11 ` Pekka Enberg
2009-06-17 20:33 ` [this_cpu_xx V2 14/19] this_cpu: Remove slub kmem_cache fields cl
2009-06-17 20:33 ` [this_cpu_xx V2 15/19] Make slub statistics use this_cpu_inc cl
2009-06-17 20:33 ` [this_cpu_xx V2 16/19] this_cpu: slub aggressive use of this_cpu operations in the hotpaths cl
2009-06-18 6:33 ` Pekka Enberg
2009-06-18 11:59 ` Mathieu Desnoyers
2009-06-18 14:00 ` Christoph Lameter
2009-06-17 20:33 ` [this_cpu_xx V2 17/19] Move early initialization of pagesets out of zone_wait_table_init() cl
2009-06-18 3:13 ` Tejun Heo
2009-06-17 20:33 ` [this_cpu_xx V2 18/19] this_cpu_ops: page allocator conversion cl
2009-06-17 20:33 ` [this_cpu_xx V2 19/19] this_cpu ops: Remove pageset_notifier cl
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4A39ADBF.1000505@kernel.org \
--to=tj@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=cl@linux-foundation.org \
--cc=davem@davemloft.net \
--cc=linux-mm@kvack.org \
--cc=mingo@elte.hu \
--cc=rusty@rustcorp.com.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).