From: Peter Zijlstra <peterz@infradead.org>
To: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>, LKML <linux-kernel@vger.kernel.org>,
Arnaldo Carvalho de Melo <acme@redhat.com>,
Mike Galbraith <efault@gmx.de>, Paul Mackerras <paulus@samba.org>
Subject: Re: [PATCH 4/3] perf_counter: Correct PERF_SAMPLE_RAW output
Date: Mon, 10 Aug 2009 15:06:25 +0200 [thread overview]
Message-ID: <1249909585.17467.135.camel@twins> (raw)
In-Reply-To: <20090810125731.GA5124@nowhere>
On Mon, 2009-08-10 at 14:57 +0200, Frederic Weisbecker wrote:
> > Index: linux-2.6/include/trace/ftrace.h
> > ===================================================================
> > --- linux-2.6.orig/include/trace/ftrace.h
> > +++ linux-2.6/include/trace/ftrace.h
> > @@ -687,7 +687,8 @@ static void ftrace_profile_##call(proto)
> > pc = preempt_count(); \
> > \
> > __data_size = ftrace_get_offsets_##call(&__data_offsets, args); \
> > - __entry_size = ALIGN(__data_size + sizeof(*entry), sizeof(u64));\
> > + __entry_size = ALIGN(__data_size + sizeof(*entry) + sizeof(u32),\
> > + sizeof(u64)); \
>
>
>
> Here you are reserving a room for the size of the buffer inside the buffer.
No, here I align so that __entry_size + the u32 ends up at a u64
boundary.
Oh gah, now I see.. I should have subtracted sizeof(u32) after the
alignment.
> > @@ -2717,9 +2716,15 @@ void perf_counter_output(struct perf_cou
> > }
> >
> > if (sample_type & PERF_SAMPLE_RAW) {
> > - raw = data->raw;
> > - if (raw)
> > - header.size += raw->size;
> > + int size = sizeof(u32);
> > +
> > + if (data->raw)
> > + size += data->raw->size;
>
>
>
> And here you reserve a size for the buffer once again?
This is the actual output buffer reservation code, that needs the u32
above and the data->raw size.
>
> > + else
> > + size += sizeof(u32);
> > +
> > + WARN_ON_ONCE(size & (sizeof(u64)-1));
Which when taken together should be u64 aligned.
So this will always fail with the above.
> > + header.size += size;
> > }
> >
> > ret = perf_output_begin(&handle, counter, header.size, nmi, 1);
> > @@ -2785,8 +2790,21 @@ void perf_counter_output(struct perf_cou
> > }
> > }
> >
> > - if ((sample_type & PERF_SAMPLE_RAW) && raw)
> > - perf_output_copy(&handle, raw->data, raw->size);
> > + if (sample_type & PERF_SAMPLE_RAW) {
> > + if (data->raw) {
> > + perf_output_put(&handle, data->raw->size);
> > + perf_output_copy(&handle, data->raw->data, data->raw->size);
>
> And actually you copy the buffer that has the size of the buffer
> plus the u32 reserved for the size, whereas the size has already been
> copied.
>
> When I look at a perf sample raw it gives me the following:
>
> .. 0000: 09 00 00 00 01 00 54 00 d0 c7 00 81 ff ff ff ff ......T........
> .. 0010: 69 01 00 00 69 01 00 00 e6 15 00 00 00 00 00 00 i...i..........
> .. 0020: 30 00 00 00 2b 00 01 02 69 01 00 00 69 01 00 00 0...+...i...i..
>
> ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ .................
> Buf size struct trace_entry
>
>
>
> .. 0030: 6b 6f 6e 64 65 6d 61 6e 64 2f 31 00 00 00 00 00 kondemand/1....
>
> ^ ^ ^ ^ ..................................
> Rest of struct ftrace_raw_<call>
>
> .. 0040: 69 01 00 00 70 82 46 81 ff ff ff ff 00 00 00 00 i...p.F........
>
> .................................. ^ ^ ^ ^
> The room you have
> reserved for the buffer size
> (zeroed from a tmp patch)
> .. 0050: 00 00 00 00
> ^ ^ ^ ^
> padding from alignment (ALIGN(__data_size + sizeof(*entry) + sizeof(u32),\
> sizeof(u64));
Right, one u32 too much :/
> I first thought the 0x4c bytes was a padding from gcc because
> sizeof(struct ftrace_raw_<call>) % 8 != 0
> But no, it is equal to 0: It's between 0x4c and 0x24 = 40 bytes.
>
> I'm preparing a patch to fix this. Just wanted to expose my idea here,
> because I'm perhaps wrong in the middle of this dump.
>
> I'll do the alignment right in the end, just before inserting the entry
> in the output. That will be more easy. I'll zeroe the rest in the same time.
OK.
next prev parent reply other threads:[~2009-08-10 13:06 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-08-08 2:26 [PATCH 1/4] perf tools: callchain: Warn only once in empty node detection Frederic Weisbecker
2009-08-08 2:26 ` [PATCH 1/3] perfcounter: Initialize tracepoint record before any use Frederic Weisbecker
2009-08-08 2:30 ` Frederic Weisbecker
2009-08-10 9:27 ` [PATCH 4/3] perf_counter: Correct PERF_SAMPLE_RAW output Peter Zijlstra
2009-08-10 9:36 ` [tip:perfcounters/urgent] " tip-bot for Peter Zijlstra
2009-08-10 9:48 ` [PATCH 4/3] " Frederic Weisbecker
2009-08-10 12:57 ` Frederic Weisbecker
2009-08-10 13:06 ` Peter Zijlstra [this message]
2009-08-10 14:11 ` [PATCH 6/3] perfcounter: Substract the buffer size field from the event record size Frederic Weisbecker
2009-08-10 14:13 ` Peter Zijlstra
2009-08-10 14:21 ` [tip:perfcounters/urgent] perf_counter: Subtract " tip-bot for Frederic Weisbecker
2009-08-10 9:27 ` [PATCH 5/3] perf_counter: Require CAP_SYS_ADMIN for raw tracepoint data Peter Zijlstra
2009-08-10 9:36 ` [tip:perfcounters/urgent] " tip-bot for Peter Zijlstra
2009-08-08 2:26 ` [PATCH 2/4] perf tools: callchain: Ignore empty callchains Frederic Weisbecker
2009-08-08 2:26 ` [PATCH 2/3] perfcounter: Generalize the tracepoint sampling to generic sampling Frederic Weisbecker
2009-08-08 11:52 ` [tip:perfcounters/core] perf_counter: Fix tracepoint sampling to be part of " tip-bot for Frederic Weisbecker
2009-08-09 11:10 ` tip-bot for Frederic Weisbecker
2009-08-08 2:26 ` [PATCH 3/4] perf tools: callchain: Default display callchain from report if recorded with -g Frederic Weisbecker
2009-08-08 2:26 ` [PATCH 3/3] perfcounter: Align ftrace events raw samples to 8 bytes Frederic Weisbecker
2009-08-08 11:52 ` [tip:perfcounters/core] perf_counter: Fix ftrace events raw samples to be aligned " tip-bot for Frederic Weisbecker
2009-08-10 7:13 ` [PATCH 3/3] perfcounter: Align ftrace events raw samples " Peter Zijlstra
2009-08-10 7:32 ` Frederic Weisbecker
2009-08-10 14:38 ` [PATCH 7/3] perfcounter: Zeroe dead bytes from ftrace raw samples size alignment Frederic Weisbecker
2009-08-10 18:01 ` [tip:perfcounters/urgent] perf_counter: Zero " tip-bot for Frederic Weisbecker
2009-08-08 2:26 ` [PATCH 4/4] perf tools: callchain: Display amount of ignored chains in fractal mode Frederic Weisbecker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1249909585.17467.135.camel@twins \
--to=peterz@infradead.org \
--cc=acme@redhat.com \
--cc=efault@gmx.de \
--cc=fweisbec@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=paulus@samba.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox