From: Eduard - Gabriel Munteanu <eduard.munteanu@linux360.ro>
To: Mathieu Desnoyers <mathieu.desnoyers@polymtl.ca>
Cc: Pekka Enberg <penberg@cs.helsinki.fi>,
Dipankar Sarma <dipankar@in.ibm.com>,
Alexey Dobriyan <adobriyan@gmail.com>,
Ingo Molnar <mingo@elte.hu>,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH 3/3] kmemtrace: Use tracepoints instead of markers.
Date: Sun, 4 Jan 2009 06:10:18 +0200 [thread overview]
Message-ID: <20090104041018.GB5198@localhost> (raw)
In-Reply-To: <20090102234254.GB29623@Krystal>
On Fri, Jan 02, 2009 at 06:42:54PM -0500, Mathieu Desnoyers wrote:
> Because whatever slab_buffer_size() does will be done on the fastpath of
> the instrumented code *even when instrumentation is disabled*, and this
> is something we need to avoid above all.
I had doubts about this, so I tried it myself. It seems that when using
-O2 it generates optimal code, without computing a << b unnecessarily.
It only precomputes it at -O0. Here's how I tested...
#include <stdio.h>
#define unlikely(x) __builtin_expect(!!(x), 0)
static int do_something_enabled;
static void print_that(unsigned long num)
{
printf("input << 5 == %lu\n", num);
}
static inline void do_something(unsigned long num)
{
if (unlikely(do_something_enabled)) /* Like DEFINE_TRACE does. */
print_that(num);
}
static void call_do_something(unsigned long in)
{
do_something(in << 5);
}
int main(int argc, char **argv)
{
unsigned long in;
if (argc != 3) {
printf("Wrong number of arguments!\n");
return 0;
}
sscanf(argv[1], "%d", &do_something_enabled);
sscanf(argv[2], "%lu", &in);
call_do_something(in);
return 0;
}
Snippet of objdump output when using -O2:
static inline void do_something(unsigned long num)
{
if (unlikely(do_something_enabled))
400635: 85 c0 test %eax,%eax
400637: 74 be je 4005f7 <main+0x17>
print_that():
/home/edi/prj/src/inlineargs/inlineargs.c:9
static int do_something_enabled;
static void print_that(unsigned long num)
{
printf("input << 5 == %lu\n", num);
400639: 48 c1 e6 05 shl $0x5,%rsi
40063d: bf 6e 07 40 00 mov $0x40076e,%edi
400642: 31 c0 xor %eax,%eax
400644: e8 5f fe ff ff callq 4004a8 <printf@plt>
Snippet of objdump output when using -O0:
static void call_do_something(unsigned long in)
{
4005fd: 55 push %rbp
4005fe: 48 89 e5 mov %rsp,%rbp
400601: 48 83 ec 10 sub $0x10,%rsp
400605: 48 89 7d f8 mov %rdi,-0x8(%rbp)
/home/edi/prj/src/inlineargs/inlineargs.c:20
do_something(in << 5);
400609: 48 8b 45 f8 mov -0x8(%rbp),%rax
40060d: 48 89 c7 mov %rax,%rdi
400610: 48 c1 e7 05 shl $0x5,%rdi
400614: e8 02 00 00 00 callq 40061b <do_something>
Look at that shl, it indicates the left-shift (<< 5). In the first case it's
deferred as much as possible. However, in the second case, it's done
before calling that inline. Also confirmed with GCC using breakpoints on
that shl.
Can we take this as general behaviour, i.e. fn(a()), where fn() is inlined
and a() has no side-effects, will only compute a() when needed, at least on
GCC and when -O2 is in effect?
It only seems natural to me GCC would do this on a regular basis.
Cheers,
Eduard
next prev parent reply other threads:[~2009-01-04 4:10 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-12-29 1:40 [PATCH 0/3] kmemtrace over tracepoints Eduard - Gabriel Munteanu
2008-12-29 1:40 ` [PATCH 1/3] RCU: Move some definitions to minimal headers Eduard - Gabriel Munteanu
2008-12-29 9:42 ` Pekka Enberg
2008-12-29 12:32 ` Ingo Molnar
2008-12-30 6:31 ` Paul E. McKenney
2008-12-29 14:56 ` Ingo Molnar
2008-12-29 1:40 ` [PATCH 2/3] tracepoints: Include only minimal RCU headers in linux/tracepoint.h Eduard - Gabriel Munteanu
2008-12-29 1:40 ` [PATCH 3/3] kmemtrace: Use tracepoints instead of markers Eduard - Gabriel Munteanu
2008-12-29 9:44 ` Pekka Enberg
2009-01-02 20:54 ` Mathieu Desnoyers
2009-01-02 23:03 ` Eduard - Gabriel Munteanu
2008-12-29 13:43 ` Pekka Enberg
2008-12-29 20:11 ` Eduard - Gabriel Munteanu
2009-01-02 20:53 ` Mathieu Desnoyers
2009-01-02 23:01 ` Eduard - Gabriel Munteanu
2009-01-02 23:42 ` Mathieu Desnoyers
2009-01-04 1:53 ` Eduard - Gabriel Munteanu
2009-01-04 4:10 ` Eduard - Gabriel Munteanu [this message]
2009-01-05 16:05 ` Mathieu Desnoyers
2009-01-05 17:57 ` Eduard - Gabriel Munteanu
2009-01-05 18:09 ` Mathieu Desnoyers
2008-12-29 9:41 ` [PATCH 0/3] kmemtrace over tracepoints Pekka Enberg
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090104041018.GB5198@localhost \
--to=eduard.munteanu@linux360.ro \
--cc=adobriyan@gmail.com \
--cc=dipankar@in.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mathieu.desnoyers@polymtl.ca \
--cc=mingo@elte.hu \
--cc=penberg@cs.helsinki.fi \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox