All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mathieu Desnoyers <mathieu.desnoyers@polymtl.ca>
To: Eduard - Gabriel Munteanu <eduard.munteanu@linux360.ro>
Cc: Pekka Enberg <penberg@cs.helsinki.fi>,
	Dipankar Sarma <dipankar@in.ibm.com>,
	Alexey Dobriyan <adobriyan@gmail.com>,
	Ingo Molnar <mingo@elte.hu>,
	linux-kernel@vger.kernel.org
Subject: Re: [PATCH 3/3] kmemtrace: Use tracepoints instead of markers.
Date: Mon, 5 Jan 2009 11:05:34 -0500	[thread overview]
Message-ID: <20090105160534.GA7708@Krystal> (raw)
In-Reply-To: <20090104041018.GB5198@localhost>

* Eduard - Gabriel Munteanu (eduard.munteanu@linux360.ro) wrote:
> On Fri, Jan 02, 2009 at 06:42:54PM -0500, Mathieu Desnoyers wrote:
> > Because whatever slab_buffer_size() does will be done on the fastpath of
> > the instrumented code *even when instrumentation is disabled*, and this
> > is something we need to avoid above all.
> 
> I had doubts about this, so I tried it myself. It seems that when using
> -O2 it generates optimal code, without computing a << b unnecessarily.
> It only precomputes it at -O0. Here's how I tested...
> 
> #include <stdio.h>
> 
> #define unlikely(x)	__builtin_expect(!!(x), 0)
> 
> static int do_something_enabled;
> 
> static void print_that(unsigned long num)
> {
> 	printf("input << 5 == %lu\n", num);
> }
> 
> static inline void do_something(unsigned long num)
> {
> 	if (unlikely(do_something_enabled)) /* Like DEFINE_TRACE does. */
> 		print_that(num);
> }
> 
> static void call_do_something(unsigned long in)
> {
> 	do_something(in << 5);
> }
> 
> int main(int argc, char **argv)
> {
> 	unsigned long in;
> 
> 	if (argc != 3) {
> 		printf("Wrong number of arguments!\n");
> 		return 0;
> 	}
> 
> 	sscanf(argv[1], "%d", &do_something_enabled);
> 	sscanf(argv[2], "%lu", &in);
> 
> 	call_do_something(in);
> 
> 	return 0;
> }
> 
> 
> Snippet of objdump output when using -O2:
> 
> static inline void do_something(unsigned long num)
> {
> 	if (unlikely(do_something_enabled))
>   400635:	85 c0                	test   %eax,%eax
>   400637:	74 be                	je     4005f7 <main+0x17>
> print_that():
> /home/edi/prj/src/inlineargs/inlineargs.c:9
> 
> static int do_something_enabled;
> 
> static void print_that(unsigned long num)
> {
> 	printf("input << 5 == %lu\n", num);
>   400639:	48 c1 e6 05          	shl    $0x5,%rsi
>   40063d:	bf 6e 07 40 00       	mov    $0x40076e,%edi
>   400642:	31 c0                	xor    %eax,%eax
>   400644:	e8 5f fe ff ff       	callq  4004a8 <printf@plt>
> 
> 
> Snippet of objdump output when using -O0:
> 
> static void call_do_something(unsigned long in)
> {
>   4005fd:	55                   	push   %rbp
>   4005fe:	48 89 e5             	mov    %rsp,%rbp
>   400601:	48 83 ec 10          	sub    $0x10,%rsp
>   400605:	48 89 7d f8          	mov    %rdi,-0x8(%rbp)
> /home/edi/prj/src/inlineargs/inlineargs.c:20
> 	do_something(in << 5);
>   400609:	48 8b 45 f8          	mov    -0x8(%rbp),%rax
>   40060d:	48 89 c7             	mov    %rax,%rdi
>   400610:	48 c1 e7 05          	shl    $0x5,%rdi
>   400614:	e8 02 00 00 00       	callq  40061b <do_something>
> 
> 
> Look at that shl, it indicates the left-shift (<< 5). In the first case it's
> deferred as much as possible. However, in the second case, it's done
> before calling that inline. Also confirmed with GCC using breakpoints on
> that shl.
> 
> Can we take this as general behaviour, i.e. fn(a()), where fn() is inlined
> and a() has no side-effects, will only compute a() when needed, at least on
> GCC and when -O2 is in effect?
> 
> It only seems natural to me GCC would do this on a regular basis.
> 

Hopefully it does, especially when there are no side-effects. Can you
also try with -Os ?

Mathieu


> 
> 	Cheers,
> 	Eduard
> 

-- 
Mathieu Desnoyers
OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F  BA06 3F25 A8FE 3BAE 9A68

  reply	other threads:[~2009-01-05 16:11 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-12-29  1:40 [PATCH 0/3] kmemtrace over tracepoints Eduard - Gabriel Munteanu
2008-12-29  1:40 ` [PATCH 1/3] RCU: Move some definitions to minimal headers Eduard - Gabriel Munteanu
2008-12-29  9:42   ` Pekka Enberg
2008-12-29 12:32     ` Ingo Molnar
2008-12-30  6:31     ` Paul E. McKenney
2008-12-29 14:56   ` Ingo Molnar
2008-12-29  1:40 ` [PATCH 2/3] tracepoints: Include only minimal RCU headers in linux/tracepoint.h Eduard - Gabriel Munteanu
2008-12-29  1:40 ` [PATCH 3/3] kmemtrace: Use tracepoints instead of markers Eduard - Gabriel Munteanu
2008-12-29  9:44   ` Pekka Enberg
2009-01-02 20:54     ` Mathieu Desnoyers
2009-01-02 23:03       ` Eduard - Gabriel Munteanu
2008-12-29 13:43   ` Pekka Enberg
2008-12-29 20:11     ` Eduard - Gabriel Munteanu
2009-01-02 20:53   ` Mathieu Desnoyers
2009-01-02 23:01     ` Eduard - Gabriel Munteanu
2009-01-02 23:42       ` Mathieu Desnoyers
2009-01-04  1:53         ` Eduard - Gabriel Munteanu
2009-01-04  4:10         ` Eduard - Gabriel Munteanu
2009-01-05 16:05           ` Mathieu Desnoyers [this message]
2009-01-05 17:57             ` Eduard - Gabriel Munteanu
2009-01-05 18:09               ` Mathieu Desnoyers
2008-12-29  9:41 ` [PATCH 0/3] kmemtrace over tracepoints Pekka Enberg

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090105160534.GA7708@Krystal \
    --to=mathieu.desnoyers@polymtl.ca \
    --cc=adobriyan@gmail.com \
    --cc=dipankar@in.ibm.com \
    --cc=eduard.munteanu@linux360.ro \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=penberg@cs.helsinki.fi \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.