From: "Wangnan (F)" <wangnan0@huawei.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>,
Alexei Starovoitov <ast@kernel.org>,
Arnaldo Carvalho de Melo <acme@redhat.com>,
Jiri Olsa <jolsa@kernel.org>, Li Zefan <lizefan@huawei.com>,
<pi3orama@163.com>, <linux-kernel@vger.kernel.org>,
He Kuang <hekuang@huawei.com>,
Brendan Gregg <brendan.d.gregg@gmail.com>,
Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>,
Namhyung Kim <namhyung@kernel.org>
Subject: Re: [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer
Date: Thu, 3 Mar 2016 10:03:10 +0800
Message-ID: <56D79B5E.4030609@huawei.com>
In-Reply-To: <20160229153929.GD32719@kernel.org>
Hi Peter,
Patches 10/46 to 14/46 were sent to you separately and modified
following your suggestions. Do you have any further comments on them?
Thank you.
On 2016/2/29 23:39, Arnaldo Carvalho de Melo wrote:
> On Fri, Feb 26, 2016 at 09:31:58AM +0000, Wang Nan wrote:
>> Add new ioctl() to pause/resume ring-buffer output.
>>
>> In some situations we want to read from the ring buffer only once we
>> can ensure that nothing writes to it during the read. Without this
>> patch we have to turn off all events attached to the ring buffer to
>> achieve that.
>>
>> This patch prepares support for overwrite ring buffers. Following
>> commits will introduce new methods for reading from an overwrite ring
>> buffer. Before reading, the caller must ensure the ring buffer is
>> frozen, or the read is unreliable.
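
The pause/read/resume sequence described in the commit message could look
roughly like this from user space. This is only a minimal sketch: the ioctl
number is copied from this patch, while rb_set_paused() and rb_read_frozen()
are hypothetical helper names that exist neither in the patch nor in perf
tools, and perf_fd is assumed to be an event fd whose ring buffer has
already been mmap()ed.

```c
#include <errno.h>
#include <stdio.h>
#include <sys/ioctl.h>
#include <linux/types.h>
#include <linux/perf_event.h>

/* Fallback for headers that predate this patch; value copied from the patch. */
#ifndef PERF_EVENT_IOC_PAUSE_OUTPUT
#define PERF_EVENT_IOC_PAUSE_OUTPUT _IOW('$', 9, __u32)
#endif

/*
 * Hypothetical helper: pause (non-zero) or resume (zero) ring buffer output.
 * Returns 0 on success or a negative errno value.
 */
static int rb_set_paused(int perf_fd, int pause)
{
	/* The patch uses the argument by value ("!!arg"), not as a pointer. */
	if (ioctl(perf_fd, PERF_EVENT_IOC_PAUSE_OUTPUT, pause ? 1UL : 0UL) < 0)
		return -errno;
	return 0;
}

/* Hypothetical callback type: reads the mmap()ed buffer while it is frozen. */
typedef void (*rb_reader_fn)(void *ctx);

/*
 * Hypothetical helper: freeze the buffer, let the reader run, then resume.
 * The reader is not called if the buffer could not be paused.
 */
static int rb_read_frozen(int perf_fd, rb_reader_fn reader, void *ctx)
{
	int err = rb_set_paused(perf_fd, 1);

	if (err)
		return err;
	reader(ctx);
	return rb_set_paused(perf_fd, 0);
}
```

With this patch the ioctl fails with EINVAL when the event has no ring
buffer attached, so a caller would normally mmap() the buffer first.
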
> Peter, have you had the chance to look at this and the other kernel
> bits in this kit?
>
> - Arnaldo
>
>> Signed-off-by: Wang Nan <wangnan0@huawei.com>
>> Cc: He Kuang <hekuang@huawei.com>
>> Cc: Alexei Starovoitov <ast@kernel.org>
>> Cc: Arnaldo Carvalho de Melo <acme@redhat.com>
>> Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
>> Cc: Jiri Olsa <jolsa@kernel.org>
>> Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
>> Cc: Namhyung Kim <namhyung@kernel.org>
>> Cc: Peter Zijlstra <peterz@infradead.org>
>> Cc: Zefan Li <lizefan@huawei.com>
>> Cc: pi3orama@163.com
>> ---
>> include/uapi/linux/perf_event.h | 1 +
>> kernel/events/core.c | 13 +++++++++++++
>> kernel/events/internal.h | 11 +++++++++++
>> kernel/events/ring_buffer.c | 7 ++++++-
>> 4 files changed, 31 insertions(+), 1 deletion(-)
>>
>> diff --git a/include/uapi/linux/perf_event.h b/include/uapi/linux/perf_event.h
>> index 1afe962..a3c1903 100644
>> --- a/include/uapi/linux/perf_event.h
>> +++ b/include/uapi/linux/perf_event.h
>> @@ -401,6 +401,7 @@ struct perf_event_attr {
>> #define PERF_EVENT_IOC_SET_FILTER _IOW('$', 6, char *)
>> #define PERF_EVENT_IOC_ID _IOR('$', 7, __u64 *)
>> #define PERF_EVENT_IOC_SET_BPF _IOW('$', 8, __u32)
>> +#define PERF_EVENT_IOC_PAUSE_OUTPUT _IOW('$', 9, __u32)
>>
>> enum perf_event_ioc_flags {
>> PERF_IOC_FLAG_GROUP = 1U << 0,
>> diff --git a/kernel/events/core.c b/kernel/events/core.c
>> index 94c47e3..a7075ae 100644
>> --- a/kernel/events/core.c
>> +++ b/kernel/events/core.c
>> @@ -4231,6 +4231,19 @@ static long _perf_ioctl(struct perf_event *event, unsigned int cmd, unsigned lon
>>  	case PERF_EVENT_IOC_SET_BPF:
>>  		return perf_event_set_bpf_prog(event, arg);
>>
>> +	case PERF_EVENT_IOC_PAUSE_OUTPUT: {
>> +		struct ring_buffer *rb;
>> +
>> +		rcu_read_lock();
>> +		rb = rcu_dereference(event->rb);
>> +		if (!rb) {
>> +			rcu_read_unlock();
>> +			return -EINVAL;
>> +		}
>> +		rb_toggle_paused(rb, !!arg);
>> +		rcu_read_unlock();
>> +		return 0;
>> +	}
>>  	default:
>>  		return -ENOTTY;
>>  	}
>> diff --git a/kernel/events/internal.h b/kernel/events/internal.h
>> index 2bbad9c..6a93d1b 100644
>> --- a/kernel/events/internal.h
>> +++ b/kernel/events/internal.h
>> @@ -18,6 +18,7 @@ struct ring_buffer {
>> #endif
>> int nr_pages; /* nr of data pages */
>> int overwrite; /* can overwrite itself */
>> + int paused; /* non-zero: writes into ring buffer are blocked */
>>
>> atomic_t poll; /* POLL_ for wakeups */
>>
>> @@ -65,6 +66,16 @@ static inline void rb_free_rcu(struct rcu_head *rcu_head)
>>  	rb_free(rb);
>>  }
>>
>> +static inline void
>> +rb_toggle_paused(struct ring_buffer *rb,
>> +		 bool pause)
>> +{
>> +	if (!pause && rb->nr_pages)
>> +		rb->paused = 0;
>> +	else
>> +		rb->paused = 1;
>> +}
>> +
>> extern struct ring_buffer *
>> rb_alloc(int nr_pages, long watermark, int cpu, int flags);
>> extern void perf_event_wakeup(struct perf_event *event);
>> diff --git a/kernel/events/ring_buffer.c b/kernel/events/ring_buffer.c
>> index 1faad2c..22e1a47 100644
>> --- a/kernel/events/ring_buffer.c
>> +++ b/kernel/events/ring_buffer.c
>> @@ -125,8 +125,11 @@ int perf_output_begin(struct perf_output_handle *handle,
>>  	if (unlikely(!rb))
>>  		goto out;
>>
>> -	if (unlikely(!rb->nr_pages))
>> +	if (unlikely(rb->paused)) {
>> +		if (rb->nr_pages)
>> +			local_inc(&rb->lost);
>>  		goto out;
>> +	}
>>
>>  	handle->rb = rb;
>>  	handle->event = event;
>> @@ -244,6 +247,8 @@ ring_buffer_init(struct ring_buffer *rb, long watermark, int flags)
>>  	INIT_LIST_HEAD(&rb->event_list);
>>  	spin_lock_init(&rb->event_lock);
>>  	init_irq_work(&rb->irq_work, rb_irq_work);
>> +
>> +	rb->paused = rb->nr_pages ? 0 : 1;
>>  }
>>
>> static void ring_buffer_put_async(struct ring_buffer *rb)
>> --
>> 1.8.3.4
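
To make the interaction between rb->paused, rb->nr_pages and rb->lost easier
to follow, here is a small user-space model of the logic this patch adds.
It is a sketch only: struct rb_model and the rb_model_*() functions are
hypothetical names for illustration, a plain counter stands in for
local_inc(), and no locking or RCU is modelled.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the three ring_buffer fields the patch uses. */
struct rb_model {
	int nr_pages;		/* 0 means no data pages are allocated */
	int paused;		/* non-zero: writers must not produce output */
	unsigned long lost;	/* records dropped while paused */
};

/* Mirrors ring_buffer_init(): a buffer without data pages starts paused. */
static void rb_model_init(struct rb_model *rb, int nr_pages)
{
	rb->nr_pages = nr_pages;
	rb->paused = nr_pages ? 0 : 1;
	rb->lost = 0;
}

/* Mirrors rb_toggle_paused(): resuming only takes effect with data pages. */
static void rb_model_toggle_paused(struct rb_model *rb, bool pause)
{
	if (!pause && rb->nr_pages)
		rb->paused = 0;
	else
		rb->paused = 1;
}

/*
 * Mirrors the new check in perf_output_begin(): returns true if the write
 * may proceed. A paused buffer that has data pages counts the record as
 * lost; a buffer without data pages drops it silently, as before.
 */
static bool rb_model_output_begin(struct rb_model *rb)
{
	if (rb->paused) {
		if (rb->nr_pages)
			rb->lost++;	/* kernel code: local_inc(&rb->lost) */
		return false;
	}
	return true;
}
```

In this model, a buffer initialised with nr_pages == 0 stays paused even
after a resume request, which matches the old !rb->nr_pages early exit.
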
Thread overview: 60+ messages
2016-02-26 9:31 [PATCH 00/46] perf tools: Fix and improvements (bpf and overwrite) Wang Nan
2016-02-26 9:31 ` [PATCH 01/46] perf tools: Record text offset in dso to calculate objdump address Wang Nan
2016-03-24 7:37 ` [tip:perf/urgent] perf symbols: " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 02/46] perf tools: Adjust symbol for shared objects Wang Nan
2016-02-26 9:31 ` [PATCH 03/46] perf config: Bring perf_default_config to the very beginning at main() Wang Nan
2016-02-27 9:44 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 04/46] perf trace: Improve error message when receive non-tracepoint events Wang Nan
2016-02-26 9:31 ` [PATCH 05/46] perf tools: Only set filter for tracepoints events Wang Nan
2016-02-27 9:45 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 06/46] perf trace: Call bpf__apply_obj_config in 'perf trace' Wang Nan
2016-02-27 9:45 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 07/46] perf trace: Print content of bpf-output event Wang Nan
2016-02-27 9:45 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 08/46] perf data: Support converting data from bpf_perf_event_output() Wang Nan
2016-03-05 8:15 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 09/46] perf data: Explicitly set byte order for integer types Wang Nan
2016-03-05 8:15 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:31 ` [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer Wang Nan
2016-02-29 15:39 ` Arnaldo Carvalho de Melo
2016-03-03 2:03 ` Wangnan (F) [this message]
2016-02-26 9:31 ` [PATCH 11/46] perf core: Set event's default overflow_handler Wang Nan
2016-02-26 9:32 ` [PATCH 12/46] perf core: Prepare writing into ring buffer from end Wang Nan
2016-02-26 9:32 ` [PATCH 13/46] perf core: Add backward attribute to perf event Wang Nan
2016-02-26 9:32 ` [PATCH 14/46] perf core: Reduce perf event output overhead by new overflow handler Wang Nan
2016-02-26 9:32 ` [PATCH 15/46] perf tools: Only validate is_pos for tracking evsels Wang Nan
2016-02-26 9:32 ` [PATCH 16/46] perf tools: Print write_backward value in perf_event_attr__fprintf Wang Nan
2016-02-26 9:32 ` [PATCH 17/46] perf tools: Make ordered_events reusable Wang Nan
2016-02-26 9:32 ` [PATCH 18/46] perf record: Use WARN_ONCE to replace 'if' condition Wang Nan
2016-03-05 8:15 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:32 ` [PATCH 19/46] perf record: Extract synthesize code to record__synthesize() Wang Nan
2016-03-05 8:16 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:32 ` [PATCH 20/46] perf tools: Add perf_data_file__switch() helper Wang Nan
2016-02-26 9:32 ` [PATCH 21/46] perf record: Turns auxtrace_snapshot_enable into 3 states Wang Nan
2016-02-26 9:32 ` [PATCH 22/46] perf record: Introduce record__finish_output() to finish a perf.data Wang Nan
2016-03-05 8:16 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:32 ` [PATCH 23/46] perf record: Add '--timestamp-filename' option to append timestamp to output filename Wang Nan
2016-02-26 9:32 ` [PATCH 24/46] perf record: Split output into multiple files via '--switch-output' Wang Nan
2016-02-26 9:32 ` [PATCH 25/46] perf record: Force enable --timestamp-filename when --switch-output is provided Wang Nan
2016-02-26 9:32 ` [PATCH 26/46] perf record: Disable buildid cache options by default in switch output mode Wang Nan
2016-02-26 9:32 ` [PATCH 27/46] perf record: Re-synthesize tracking events after output switching Wang Nan
2016-02-26 9:32 ` [PATCH 28/46] perf record: Generate tracking events for process forked by perf Wang Nan
2016-02-26 9:32 ` [PATCH 29/46] perf record: Ensure return non-zero rc when mmap fail Wang Nan
2016-03-05 8:17 ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26 9:32 ` [PATCH 30/46] perf record: Prevent reading invalid data in record__mmap_read Wang Nan
2016-02-26 9:32 ` [PATCH 31/46] perf tools: Add evlist channel helpers Wang Nan
2016-02-26 9:32 ` [PATCH 32/46] perf tools: Automatically add new channel according to evlist Wang Nan
2016-02-26 9:32 ` [PATCH 33/46] perf tools: Operate multiple channels Wang Nan
2016-02-26 9:32 ` [PATCH 34/46] perf tools: Squash overwrite setting into channel Wang Nan
2016-02-26 9:32 ` [PATCH 35/46] perf record: Don't read from and poll overwrite channel Wang Nan
2016-02-26 9:32 ` [PATCH 36/46] perf record: Don't poll on " Wang Nan
2016-02-26 9:32 ` [PATCH 37/46] perf tools: Detect avalibility of write_backward Wang Nan
2016-02-26 9:32 ` [PATCH 38/46] perf tools: Enable overwrite settings Wang Nan
2016-02-26 9:32 ` [PATCH 39/46] perf tools: Set write_backward attribut bit for overwrite events Wang Nan
2016-02-26 9:32 ` [PATCH 40/46] perf tools: Record fd into perf_mmap Wang Nan
2016-02-26 9:32 ` [PATCH 41/46] perf tools: Add API to pause a channel Wang Nan
2016-02-26 9:32 ` [PATCH 42/46] perf record: Toggle overwrite ring buffer for reading Wang Nan
2016-02-26 9:32 ` [PATCH 43/46] perf record: Rename variable to make code clear Wang Nan
2016-02-26 9:32 ` [PATCH 44/46] perf record: Read from backward ring buffer Wang Nan
2016-02-26 9:32 ` [PATCH 45/46] perf record: Allow generate tracking events at the end of output Wang Nan
2016-02-26 9:32 ` [PATCH 46/46] perf tools: Don't warn about out of order event if write_backward is used Wang Nan