From: Ingo Molnar <mingo@elte.hu>
To: Jason Baron <jbaron@redhat.com>, Tom Zanussi <tzanussi@gmail.com>
Cc: linux-kernel@vger.kernel.org, fweisbec@gmail.com,
laijs@cn.fujitsu.com, rostedt@goodmis.org, peterz@infradead.org,
mathieu.desnoyers@polymtl.ca, jiayingz@google.com,
mbligh@google.com, roland@redhat.com, fche@redhat.com
Subject: Re: [RFC] convert ftrace syscall tracer to TRACE_EVENT()
Date: Sat, 9 May 2009 10:37:37 +0200 [thread overview]
Message-ID: <20090509083737.GE3656@elte.hu> (raw)
In-Reply-To: <20090508210347.GA3121@redhat.com>
* Jason Baron <jbaron@redhat.com> wrote:
> Hi,
>
> I've been thinking about converting the current ftrace syscall
> tracer to the TRACE_EVENT() macros. There are a few issues with
> the current syscall tracer approach:
>
> 1) It has to be enabled for all processes and all syscalls. By
> moving to TRACE_EVENT(), it can be enabled/disabled per tracepoint
> and can also make use of the generic tracing filters, such as
> "trace all process for pid x"
>
> 2) Other tracers can not tie into it, since its not tracepoint
> based. TRACE_EVENT() fixes this.
>
> 3) data formatting. The syscall tracer I don't believe understands
> all the various types for output formatting. By moving to
> TRACE_EVENT(), we can print out a more readible syscall trace.
>
> 4) The ftrace syscall tracer needs a new arch specific code for
> each architecture. By converting to TRACE_EVENT() we don't need
> any architecutre specific code.
>
> Other issues to consider:
>
> * Maintainence. The current syscall tracer automatically picks up
> new syscalls. The TRACE_EVENT() will be harder to initially set
> up. But once its done, syscalls are obviously not added often. So
> I don't think this will be too bad.
>
> * Performance. The current syscall tracer adds a
> 'test_thread_flag()' to syscall entry/exit. The TRACE_EVENT()
> would add a per-syscall global to check. So they are going to have
> different cache profiles...however, the tracepoint infrastructure
> is hopefully moving to the 'immediate' value work, which will make
> this more highly optimized.
>
> I've also tested the patch shown below (which uses,
> DECLARE_TRACE(), as a preliminary proof of concept), using
> getpid() in a loop, and tbench, and saw very small performance
> differences. Obviously we would have to do more extensive testing
> before deciding.
>
> Patch is pretty rough, but should give a rough sense of what the
> DECLARE_TRACE() type patch might look like...
Yeah, i very much agree with the direction. (I've Cc:-ed Tom Zanussi
who also has expressed interest in this.)
I'm not sure about the implementation as you've posted it though:
Firstly, it adds two new tracepoints to every system call. That is
unnecessary - we already have the TIF flag based callbacks, and we
can use the existing syscall attributes table to get to tracepoints
- without slow down (or impacting) the fast path in any way.
Secondly, we should reuse the information we get in SYSCALL_DEFINE,
to construct the TRACE_EVENT tracepoints directly - without having
to list all syscalls again in a separate file.
Ingo
next prev parent reply other threads:[~2009-05-09 8:38 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-05-08 21:03 [RFC] convert ftrace syscall tracer to TRACE_EVENT() Jason Baron
2009-05-09 8:37 ` Ingo Molnar [this message]
2009-05-09 12:53 ` Frédéric Weisbecker
2009-05-09 13:33 ` Ingo Molnar
2009-05-09 13:50 ` Mathieu Desnoyers
2009-05-09 14:06 ` Frédéric Weisbecker
2009-05-09 14:15 ` Ingo Molnar
2009-05-09 14:29 ` Mathieu Desnoyers
2009-05-09 15:01 ` Frédéric Weisbecker
2009-05-09 15:24 ` Mathieu Desnoyers
2009-05-09 14:47 ` Frédéric Weisbecker
2009-05-09 17:44 ` David Wagner
2009-05-09 14:02 ` Frédéric Weisbecker
2009-05-09 14:07 ` Ingo Molnar
2009-05-09 14:12 ` Frédéric Weisbecker
2009-05-09 15:36 ` Frank Ch. Eigler
2009-05-09 15:57 ` Mathieu Desnoyers
2009-05-09 16:32 ` Mathieu Desnoyers
2009-05-10 6:59 ` Tom Zanussi
2009-05-11 22:16 ` Jason Baron
2009-05-12 2:44 ` Roland McGrath
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090509083737.GE3656@elte.hu \
--to=mingo@elte.hu \
--cc=fche@redhat.com \
--cc=fweisbec@gmail.com \
--cc=jbaron@redhat.com \
--cc=jiayingz@google.com \
--cc=laijs@cn.fujitsu.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mathieu.desnoyers@polymtl.ca \
--cc=mbligh@google.com \
--cc=peterz@infradead.org \
--cc=roland@redhat.com \
--cc=rostedt@goodmis.org \
--cc=tzanussi@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox