From: Anthony Liguori <anthony@codemonkey.ws>
To: Jan Kiszka <jan.kiszka@siemens.com>
Cc: Anthony Liguori <anthony@codemonkey.ws>,
Prerna Saxena <prerna@linux.vnet.ibm.com>,
Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>,
"kvm@vger.kernel.org" <kvm@vger.kernel.org>,
"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] [PATCH 1/2] trace: Add simple tracing support
Date: Fri, 21 May 2010 16:06:39 -0500 [thread overview]
Message-ID: <4BF6F5DF.2060703@codemonkey.ws> (raw)
In-Reply-To: <4BF6BA3D.9000200@siemens.com>
On 05/21/2010 11:52 AM, Jan Kiszka wrote:
> Anthony Liguori wrote:
>
>> On 05/21/2010 08:46 AM, Jan Kiszka wrote:
>>
>>> Anthony Liguori wrote:
>>>
>>>
>>>> On 05/21/2010 04:42 AM, Stefan Hajnoczi wrote:
>>>>
>>>>
>>>>> Trace events should be defined in trace.h. Events are written to
>>>>> /tmp/trace.log and can be formatted using trace.py. Remember to add
>>>>> events to trace.py for pretty-printing.
>>>>>
>>>>> Signed-off-by: Stefan Hajnoczi<stefanha@linux.vnet.ibm.com>
>>>>> ---
>>>>> Makefile.objs | 2 +-
>>>>> trace.c | 64
>>>>> +++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>> trace.h | 9 ++++++++
>>>>> trace.py | 30 ++++++++++++++++++++++++++
>>>>> 4 files changed, 104 insertions(+), 1 deletions(-)
>>>>> create mode 100644 trace.c
>>>>> create mode 100644 trace.h
>>>>> create mode 100755 trace.py
>>>>>
>>>>> diff --git a/Makefile.objs b/Makefile.objs
>>>>> index acbaf22..307e989 100644
>>>>> --- a/Makefile.objs
>>>>> +++ b/Makefile.objs
>>>>> @@ -8,7 +8,7 @@ qobject-obj-y += qerror.o
>>>>> # block-obj-y is code used by both qemu system emulation and qemu-img
>>>>>
>>>>> block-obj-y = cutils.o cache-utils.o qemu-malloc.o qemu-option.o
>>>>> module.o
>>>>> -block-obj-y += nbd.o block.o aio.o aes.o osdep.o qemu-config.o
>>>>> +block-obj-y += nbd.o block.o aio.o aes.o osdep.o qemu-config.o trace.o
>>>>> block-obj-$(CONFIG_POSIX) += posix-aio-compat.o
>>>>> block-obj-$(CONFIG_LINUX_AIO) += linux-aio.o
>>>>>
>>>>> diff --git a/trace.c b/trace.c
>>>>> new file mode 100644
>>>>> index 0000000..2fec4d3
>>>>> --- /dev/null
>>>>> +++ b/trace.c
>>>>> @@ -0,0 +1,64 @@
>>>>> +#include<stdlib.h>
>>>>> +#include<stdio.h>
>>>>> +#include "trace.h"
>>>>> +
>>>>> +typedef struct {
>>>>> + unsigned long event;
>>>>> + unsigned long x1;
>>>>> + unsigned long x2;
>>>>> + unsigned long x3;
>>>>> + unsigned long x4;
>>>>> + unsigned long x5;
>>>>> +} TraceRecord;
>>>>> +
>>>>> +enum {
>>>>> + TRACE_BUF_LEN = 64 * 1024 / sizeof(TraceRecord),
>>>>> +};
>>>>> +
>>>>> +static TraceRecord trace_buf[TRACE_BUF_LEN];
>>>>> +static unsigned int trace_idx;
>>>>> +static FILE *trace_fp;
>>>>> +
>>>>> +static void trace(TraceEvent event, unsigned long x1,
>>>>> + unsigned long x2, unsigned long x3,
>>>>> + unsigned long x4, unsigned long x5) {
>>>>> + TraceRecord *rec =&trace_buf[trace_idx];
>>>>> + rec->event = event;
>>>>> + rec->x1 = x1;
>>>>> + rec->x2 = x2;
>>>>> + rec->x3 = x3;
>>>>> + rec->x4 = x4;
>>>>> + rec->x5 = x5;
>>>>> +
>>>>> + if (++trace_idx == TRACE_BUF_LEN) {
>>>>> + trace_idx = 0;
>>>>> +
>>>>> + if (!trace_fp) {
>>>>> + trace_fp = fopen("/tmp/trace.log", "w");
>>>>> + }
>>>>> + if (trace_fp) {
>>>>> + size_t result = fwrite(trace_buf, sizeof trace_buf, 1,
>>>>> trace_fp);
>>>>> + result = result;
>>>>> + }
>>>>> + }
>>>>> +}
>>>>>
>>>>>
>>>>>
>>>> It is probably worth while to read trace points via the monitor or
>>>> through some other mechanism. My concern would be that writing even 64k
>>>> out to disk would introduce enough performance overhead mainly because
>>>> it runs lock-step with the guest's VCPU.
>>>>
>>>> Maybe it's worth adding a thread that syncs the ring to disk if we want
>>>> to write to disk?
>>>>
>>>>
>>> That's not what QEMU should worry about. If somehow possible, let's push
>>> this into the hands of a (user space) tracing framework, ideally one
>>> that is already designed for such requirements. E.g. there exists quite
>>> useful work in the context of LTTng (user space RCU for application
>>> tracing).
>>>
>>>
>> From what I understand, none of the current kernel approaches to
>> userspace tracing have much momentum at the moment.
>>
>>
>>> We may need simple stubs for the case that no such framework is (yet)
>>> available. But effort should focus on a QEMU infrastructure to add
>>> useful tracepoints to the code. Specifically when tracing over KVM, you
>>> usually need information about kernel states as well, so you depend on
>>> an integrated approach, not Yet Another Log File.
>>>
>>>
>> I think the simple code that Stefan pasted gives us 95% of what we need.
>>
> IMHO not 95%, but it is a start.
>
I'm not opposed to using a framework, but I'd rather have an equivalent
to kvm_stat tomorrow than wait 3 years for LTTng to not get merged.
So let's have a dirt-simple tracing mechanism and focus on adding useful
trace points. Then when we have a framework we can use, we can just
convert the tracepoints to the new framework.
Regards,
Anthony Liguori
> I would just like to avoid that too much efforts are spent on
> re-inventing smart trace buffers, trace daemons, or trace visualization
> tools. Then better pick up some semi-perfect approach (e.g. [1], it
> unfortunately still seems to lack kernel integration) and drive it
> according to our needs.
>
> Jan
>
> [1] http://lttng.org/ust
>
>
WARNING: multiple messages have this Message-ID (diff)
From: Anthony Liguori <anthony@codemonkey.ws>
To: Jan Kiszka <jan.kiszka@siemens.com>
Cc: "qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
"kvm@vger.kernel.org" <kvm@vger.kernel.org>,
Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>,
Prerna Saxena <prerna@linux.vnet.ibm.com>
Subject: Re: [Qemu-devel] [PATCH 1/2] trace: Add simple tracing support
Date: Fri, 21 May 2010 16:06:39 -0500 [thread overview]
Message-ID: <4BF6F5DF.2060703@codemonkey.ws> (raw)
In-Reply-To: <4BF6BA3D.9000200@siemens.com>
On 05/21/2010 11:52 AM, Jan Kiszka wrote:
> Anthony Liguori wrote:
>
>> On 05/21/2010 08:46 AM, Jan Kiszka wrote:
>>
>>> Anthony Liguori wrote:
>>>
>>>
>>>> On 05/21/2010 04:42 AM, Stefan Hajnoczi wrote:
>>>>
>>>>
>>>>> Trace events should be defined in trace.h. Events are written to
>>>>> /tmp/trace.log and can be formatted using trace.py. Remember to add
>>>>> events to trace.py for pretty-printing.
>>>>>
>>>>> Signed-off-by: Stefan Hajnoczi<stefanha@linux.vnet.ibm.com>
>>>>> ---
>>>>> Makefile.objs | 2 +-
>>>>> trace.c | 64
>>>>> +++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>> trace.h | 9 ++++++++
>>>>> trace.py | 30 ++++++++++++++++++++++++++
>>>>> 4 files changed, 104 insertions(+), 1 deletions(-)
>>>>> create mode 100644 trace.c
>>>>> create mode 100644 trace.h
>>>>> create mode 100755 trace.py
>>>>>
>>>>> diff --git a/Makefile.objs b/Makefile.objs
>>>>> index acbaf22..307e989 100644
>>>>> --- a/Makefile.objs
>>>>> +++ b/Makefile.objs
>>>>> @@ -8,7 +8,7 @@ qobject-obj-y += qerror.o
>>>>> # block-obj-y is code used by both qemu system emulation and qemu-img
>>>>>
>>>>> block-obj-y = cutils.o cache-utils.o qemu-malloc.o qemu-option.o
>>>>> module.o
>>>>> -block-obj-y += nbd.o block.o aio.o aes.o osdep.o qemu-config.o
>>>>> +block-obj-y += nbd.o block.o aio.o aes.o osdep.o qemu-config.o trace.o
>>>>> block-obj-$(CONFIG_POSIX) += posix-aio-compat.o
>>>>> block-obj-$(CONFIG_LINUX_AIO) += linux-aio.o
>>>>>
>>>>> diff --git a/trace.c b/trace.c
>>>>> new file mode 100644
>>>>> index 0000000..2fec4d3
>>>>> --- /dev/null
>>>>> +++ b/trace.c
>>>>> @@ -0,0 +1,64 @@
>>>>> +#include<stdlib.h>
>>>>> +#include<stdio.h>
>>>>> +#include "trace.h"
>>>>> +
>>>>> +typedef struct {
>>>>> + unsigned long event;
>>>>> + unsigned long x1;
>>>>> + unsigned long x2;
>>>>> + unsigned long x3;
>>>>> + unsigned long x4;
>>>>> + unsigned long x5;
>>>>> +} TraceRecord;
>>>>> +
>>>>> +enum {
>>>>> + TRACE_BUF_LEN = 64 * 1024 / sizeof(TraceRecord),
>>>>> +};
>>>>> +
>>>>> +static TraceRecord trace_buf[TRACE_BUF_LEN];
>>>>> +static unsigned int trace_idx;
>>>>> +static FILE *trace_fp;
>>>>> +
>>>>> +static void trace(TraceEvent event, unsigned long x1,
>>>>> + unsigned long x2, unsigned long x3,
>>>>> + unsigned long x4, unsigned long x5) {
>>>>> + TraceRecord *rec =&trace_buf[trace_idx];
>>>>> + rec->event = event;
>>>>> + rec->x1 = x1;
>>>>> + rec->x2 = x2;
>>>>> + rec->x3 = x3;
>>>>> + rec->x4 = x4;
>>>>> + rec->x5 = x5;
>>>>> +
>>>>> + if (++trace_idx == TRACE_BUF_LEN) {
>>>>> + trace_idx = 0;
>>>>> +
>>>>> + if (!trace_fp) {
>>>>> + trace_fp = fopen("/tmp/trace.log", "w");
>>>>> + }
>>>>> + if (trace_fp) {
>>>>> + size_t result = fwrite(trace_buf, sizeof trace_buf, 1,
>>>>> trace_fp);
>>>>> + result = result;
>>>>> + }
>>>>> + }
>>>>> +}
>>>>>
>>>>>
>>>>>
>>>> It is probably worth while to read trace points via the monitor or
>>>> through some other mechanism. My concern would be that writing even 64k
>>>> out to disk would introduce enough performance overhead mainly because
>>>> it runs lock-step with the guest's VCPU.
>>>>
>>>> Maybe it's worth adding a thread that syncs the ring to disk if we want
>>>> to write to disk?
>>>>
>>>>
>>> That's not what QEMU should worry about. If somehow possible, let's push
>>> this into the hands of a (user space) tracing framework, ideally one
>>> that is already designed for such requirements. E.g. there exists quite
>>> useful work in the context of LTTng (user space RCU for application
>>> tracing).
>>>
>>>
>> From what I understand, none of the current kernel approaches to
>> userspace tracing have much momentum at the moment.
>>
>>
>>> We may need simple stubs for the case that no such framework is (yet)
>>> available. But effort should focus on a QEMU infrastructure to add
>>> useful tracepoints to the code. Specifically when tracing over KVM, you
>>> usually need information about kernel states as well, so you depend on
>>> an integrated approach, not Yet Another Log File.
>>>
>>>
>> I think the simple code that Stefan pasted gives us 95% of what we need.
>>
> IMHO not 95%, but it is a start.
>
I'm not opposed to using a framework, but I'd rather have an equivalent
to kvm_stat tomorrow than wait 3 years for LTTng to not get merged.
So let's have a dirt-simple tracing mechanism and focus on adding useful
trace points. Then when we have a framework we can use, we can just
convert the tracepoints to the new framework.
Regards,
Anthony Liguori
> I would just like to avoid that too much efforts are spent on
> re-inventing smart trace buffers, trace daemons, or trace visualization
> tools. Then better pick up some semi-perfect approach (e.g. [1], it
> unfortunately still seems to lack kernel integration) and drive it
> according to our needs.
>
> Jan
>
> [1] http://lttng.org/ust
>
>
next prev parent reply other threads:[~2010-05-21 21:06 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-05-21 9:42 [RFC 0/2] Tracing Stefan Hajnoczi
2010-05-21 9:42 ` [Qemu-devel] " Stefan Hajnoczi
2010-05-21 9:42 ` [PATCH 1/2] trace: Add simple tracing support Stefan Hajnoczi
2010-05-21 9:42 ` [Qemu-devel] " Stefan Hajnoczi
2010-05-21 9:49 ` Stefan Hajnoczi
2010-05-21 9:49 ` [Qemu-devel] " Stefan Hajnoczi
2010-05-21 11:13 ` Jan Kiszka
2010-05-21 11:13 ` [Qemu-devel] " Jan Kiszka
2010-05-21 13:10 ` Stefan Hajnoczi
2010-05-21 13:10 ` [Qemu-devel] " Stefan Hajnoczi
2010-05-21 13:22 ` Jan Kiszka
2010-05-21 13:22 ` [Qemu-devel] " Jan Kiszka
2010-05-21 12:37 ` [Qemu-devel] " Anthony Liguori
2010-05-21 13:46 ` Jan Kiszka
2010-05-21 13:46 ` Jan Kiszka
2010-05-21 14:03 ` Anthony Liguori
2010-05-21 14:03 ` Anthony Liguori
2010-05-21 16:52 ` Jan Kiszka
2010-05-21 16:52 ` Jan Kiszka
2010-05-21 20:49 ` Stefan Hajnoczi
2010-05-21 20:49 ` Stefan Hajnoczi
2010-05-21 21:26 ` Christoph Hellwig
2010-05-21 21:26 ` Christoph Hellwig
2010-05-21 21:37 ` Jan Kiszka
2010-05-21 21:37 ` Jan Kiszka
2010-05-21 21:06 ` Anthony Liguori [this message]
2010-05-21 21:06 ` Anthony Liguori
2010-05-21 21:41 ` Jan Kiszka
2010-05-21 21:41 ` Jan Kiszka
2010-05-21 21:58 ` Anthony Liguori
2010-05-21 21:58 ` Anthony Liguori
2010-05-21 9:42 ` [PATCH 2/2] trace: Trace write requests in virtio-blk, multiwrite, and paio_submit Stefan Hajnoczi
2010-05-21 9:42 ` [Qemu-devel] " Stefan Hajnoczi
2010-05-21 11:27 ` [Qemu-devel] [RFC 0/2] Tracing Prerna Saxena
2010-05-21 11:27 ` Prerna Saxena
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BF6F5DF.2060703@codemonkey.ws \
--to=anthony@codemonkey.ws \
--cc=jan.kiszka@siemens.com \
--cc=kvm@vger.kernel.org \
--cc=prerna@linux.vnet.ibm.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.