All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Wangnan (F)" <wangnan0@huawei.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>,
	Alexei Starovoitov <ast@kernel.org>,
	Arnaldo Carvalho de Melo <acme@redhat.com>,
	Jiri Olsa <jolsa@kernel.org>, Li Zefan <lizefan@huawei.com>,
	<pi3orama@163.com>, <linux-kernel@vger.kernel.org>,
	He Kuang <hekuang@huawei.com>,
	Brendan Gregg <brendan.d.gregg@gmail.com>,
	Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>,
	Namhyung Kim <namhyung@kernel.org>
Subject: Re: [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer
Date: Thu, 3 Mar 2016 10:03:10 +0800	[thread overview]
Message-ID: <56D79B5E.4030609@huawei.com> (raw)
In-Reply-To: <20160229153929.GD32719@kernel.org>

Hi Peter,

  Patch 10/46 to 14/46 were sent separately to you and modified
follow your suggestion. Do you have further comment on it?

Thank you.

On 2016/2/29 23:39, Arnaldo Carvalho de Melo wrote:
> Em Fri, Feb 26, 2016 at 09:31:58AM +0000, Wang Nan escreveu:
>> Add new ioctl() to pause/resume ring-buffer output.
>>
>> In some situations we want to read from ring buffer only when we
>> ensure nothing can write to the ring buffer during reading. Without
>> this patch we have to turn off all events attached to this ring buffer
>> to achieve this.
>>
>> This patch is for supporting overwrite ring buffer. Following
>> commits will introduce new methods support reading from overwrite ring
>> buffer. Before reading caller must ensure the ring buffer is frozen, or
>> the reading is unreliable.
> Peter, have you have the chance too look at this and the other kernel
> bits in this kit?
>
> - Arnaldo
>   
>> Signed-off-by: Wang Nan <wangnan0@huawei.com>
>> Cc: He Kuang <hekuang@huawei.com>
>> Cc: Alexei Starovoitov <ast@kernel.org>
>> Cc: Arnaldo Carvalho de Melo <acme@redhat.com>
>> Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
>> Cc: Jiri Olsa <jolsa@kernel.org>
>> Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
>> Cc: Namhyung Kim <namhyung@kernel.org>
>> Cc: Peter Zijlstra <peterz@infradead.org>
>> Cc: Zefan Li <lizefan@huawei.com>
>> Cc: pi3orama@163.com
>> ---
>>   include/uapi/linux/perf_event.h |  1 +
>>   kernel/events/core.c            | 13 +++++++++++++
>>   kernel/events/internal.h        | 11 +++++++++++
>>   kernel/events/ring_buffer.c     |  7 ++++++-
>>   4 files changed, 31 insertions(+), 1 deletion(-)
>>
>> diff --git a/include/uapi/linux/perf_event.h b/include/uapi/linux/perf_event.h
>> index 1afe962..a3c1903 100644
>> --- a/include/uapi/linux/perf_event.h
>> +++ b/include/uapi/linux/perf_event.h
>> @@ -401,6 +401,7 @@ struct perf_event_attr {
>>   #define PERF_EVENT_IOC_SET_FILTER	_IOW('$', 6, char *)
>>   #define PERF_EVENT_IOC_ID		_IOR('$', 7, __u64 *)
>>   #define PERF_EVENT_IOC_SET_BPF		_IOW('$', 8, __u32)
>> +#define PERF_EVENT_IOC_PAUSE_OUTPUT	_IOW('$', 9, __u32)
>>   
>>   enum perf_event_ioc_flags {
>>   	PERF_IOC_FLAG_GROUP		= 1U << 0,
>> diff --git a/kernel/events/core.c b/kernel/events/core.c
>> index 94c47e3..a7075ae 100644
>> --- a/kernel/events/core.c
>> +++ b/kernel/events/core.c
>> @@ -4231,6 +4231,19 @@ static long _perf_ioctl(struct perf_event *event, unsigned int cmd, unsigned lon
>>   	case PERF_EVENT_IOC_SET_BPF:
>>   		return perf_event_set_bpf_prog(event, arg);
>>   
>> +	case PERF_EVENT_IOC_PAUSE_OUTPUT: {
>> +		struct ring_buffer *rb;
>> +
>> +		rcu_read_lock();
>> +		rb = rcu_dereference(event->rb);
>> +		if (!event->rb) {
>> +			rcu_read_unlock();
>> +			return -EINVAL;
>> +		}
>> +		rb_toggle_paused(rb, !!arg);
>> +		rcu_read_unlock();
>> +		return 0;
>> +	}
>>   	default:
>>   		return -ENOTTY;
>>   	}
>> diff --git a/kernel/events/internal.h b/kernel/events/internal.h
>> index 2bbad9c..6a93d1b 100644
>> --- a/kernel/events/internal.h
>> +++ b/kernel/events/internal.h
>> @@ -18,6 +18,7 @@ struct ring_buffer {
>>   #endif
>>   	int				nr_pages;	/* nr of data pages  */
>>   	int				overwrite;	/* can overwrite itself */
>> +	int				paused;		/* can write into ring buffer */
>>   
>>   	atomic_t			poll;		/* POLL_ for wakeups */
>>   
>> @@ -65,6 +66,16 @@ static inline void rb_free_rcu(struct rcu_head *rcu_head)
>>   	rb_free(rb);
>>   }
>>   
>> +static inline void
>> +rb_toggle_paused(struct ring_buffer *rb,
>> +		 bool pause)
>> +{
>> +	if (!pause && rb->nr_pages)
>> +		rb->paused = 0;
>> +	else
>> +		rb->paused = 1;
>> +}
>> +
>>   extern struct ring_buffer *
>>   rb_alloc(int nr_pages, long watermark, int cpu, int flags);
>>   extern void perf_event_wakeup(struct perf_event *event);
>> diff --git a/kernel/events/ring_buffer.c b/kernel/events/ring_buffer.c
>> index 1faad2c..22e1a47 100644
>> --- a/kernel/events/ring_buffer.c
>> +++ b/kernel/events/ring_buffer.c
>> @@ -125,8 +125,11 @@ int perf_output_begin(struct perf_output_handle *handle,
>>   	if (unlikely(!rb))
>>   		goto out;
>>   
>> -	if (unlikely(!rb->nr_pages))
>> +	if (unlikely(rb->paused)) {
>> +		if (rb->nr_pages)
>> +			local_inc(&rb->lost);
>>   		goto out;
>> +	}
>>   
>>   	handle->rb    = rb;
>>   	handle->event = event;
>> @@ -244,6 +247,8 @@ ring_buffer_init(struct ring_buffer *rb, long watermark, int flags)
>>   	INIT_LIST_HEAD(&rb->event_list);
>>   	spin_lock_init(&rb->event_lock);
>>   	init_irq_work(&rb->irq_work, rb_irq_work);
>> +
>> +	rb->paused = rb->nr_pages ? 0 : 1;
>>   }
>>   
>>   static void ring_buffer_put_async(struct ring_buffer *rb)
>> -- 
>> 1.8.3.4

  reply	other threads:[~2016-03-03  2:14 UTC|newest]

Thread overview: 60+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-26  9:31 [PATCH 00/46] perf tools: Fix and improvements (bpf and overwrite) Wang Nan
2016-02-26  9:31 ` [PATCH 01/46] perf tools: Record text offset in dso to calculate objdump address Wang Nan
2016-03-24  7:37   ` [tip:perf/urgent] perf symbols: " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 02/46] perf tools: Adjust symbol for shared objects Wang Nan
2016-02-26  9:31 ` [PATCH 03/46] perf config: Bring perf_default_config to the very beginning at main() Wang Nan
2016-02-27  9:44   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 04/46] perf trace: Improve error message when receive non-tracepoint events Wang Nan
2016-02-26  9:31 ` [PATCH 05/46] perf tools: Only set filter for tracepoints events Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 06/46] perf trace: Call bpf__apply_obj_config in 'perf trace' Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 07/46] perf trace: Print content of bpf-output event Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 08/46] perf data: Support converting data from bpf_perf_event_output() Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 09/46] perf data: Explicitly set byte order for integer types Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer Wang Nan
2016-02-29 15:39   ` Arnaldo Carvalho de Melo
2016-03-03  2:03     ` Wangnan (F) [this message]
2016-02-26  9:31 ` [PATCH 11/46] perf core: Set event's default overflow_handler Wang Nan
2016-02-26  9:32 ` [PATCH 12/46] perf core: Prepare writing into ring buffer from end Wang Nan
2016-02-26  9:32 ` [PATCH 13/46] perf core: Add backward attribute to perf event Wang Nan
2016-02-26  9:32 ` [PATCH 14/46] perf core: Reduce perf event output overhead by new overflow handler Wang Nan
2016-02-26  9:32 ` [PATCH 15/46] perf tools: Only validate is_pos for tracking evsels Wang Nan
2016-02-26  9:32 ` [PATCH 16/46] perf tools: Print write_backward value in perf_event_attr__fprintf Wang Nan
2016-02-26  9:32 ` [PATCH 17/46] perf tools: Make ordered_events reusable Wang Nan
2016-02-26  9:32 ` [PATCH 18/46] perf record: Use WARN_ONCE to replace 'if' condition Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 19/46] perf record: Extract synthesize code to record__synthesize() Wang Nan
2016-03-05  8:16   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 20/46] perf tools: Add perf_data_file__switch() helper Wang Nan
2016-02-26  9:32 ` [PATCH 21/46] perf record: Turns auxtrace_snapshot_enable into 3 states Wang Nan
2016-02-26  9:32 ` [PATCH 22/46] perf record: Introduce record__finish_output() to finish a perf.data Wang Nan
2016-03-05  8:16   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 23/46] perf record: Add '--timestamp-filename' option to append timestamp to output filename Wang Nan
2016-02-26  9:32 ` [PATCH 24/46] perf record: Split output into multiple files via '--switch-output' Wang Nan
2016-02-26  9:32 ` [PATCH 25/46] perf record: Force enable --timestamp-filename when --switch-output is provided Wang Nan
2016-02-26  9:32 ` [PATCH 26/46] perf record: Disable buildid cache options by default in switch output mode Wang Nan
2016-02-26  9:32 ` [PATCH 27/46] perf record: Re-synthesize tracking events after output switching Wang Nan
2016-02-26  9:32 ` [PATCH 28/46] perf record: Generate tracking events for process forked by perf Wang Nan
2016-02-26  9:32 ` [PATCH 29/46] perf record: Ensure return non-zero rc when mmap fail Wang Nan
2016-03-05  8:17   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 30/46] perf record: Prevent reading invalid data in record__mmap_read Wang Nan
2016-02-26  9:32 ` [PATCH 31/46] perf tools: Add evlist channel helpers Wang Nan
2016-02-26  9:32 ` [PATCH 32/46] perf tools: Automatically add new channel according to evlist Wang Nan
2016-02-26  9:32 ` [PATCH 33/46] perf tools: Operate multiple channels Wang Nan
2016-02-26  9:32 ` [PATCH 34/46] perf tools: Squash overwrite setting into channel Wang Nan
2016-02-26  9:32 ` [PATCH 35/46] perf record: Don't read from and poll overwrite channel Wang Nan
2016-02-26  9:32 ` [PATCH 36/46] perf record: Don't poll on " Wang Nan
2016-02-26  9:32 ` [PATCH 37/46] perf tools: Detect avalibility of write_backward Wang Nan
2016-02-26  9:32 ` [PATCH 38/46] perf tools: Enable overwrite settings Wang Nan
2016-02-26  9:32 ` [PATCH 39/46] perf tools: Set write_backward attribut bit for overwrite events Wang Nan
2016-02-26  9:32 ` [PATCH 40/46] perf tools: Record fd into perf_mmap Wang Nan
2016-02-26  9:32 ` [PATCH 41/46] perf tools: Add API to pause a channel Wang Nan
2016-02-26  9:32 ` [PATCH 42/46] perf record: Toggle overwrite ring buffer for reading Wang Nan
2016-02-26  9:32 ` [PATCH 43/46] perf record: Rename variable to make code clear Wang Nan
2016-02-26  9:32 ` [PATCH 44/46] perf record: Read from backward ring buffer Wang Nan
2016-02-26  9:32 ` [PATCH 45/46] perf record: Allow generate tracking events at the end of output Wang Nan
2016-02-26  9:32 ` [PATCH 46/46] perf tools: Don't warn about out of order event if write_backward is used Wang Nan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=56D79B5E.4030609@huawei.com \
    --to=wangnan0@huawei.com \
    --cc=acme@kernel.org \
    --cc=acme@redhat.com \
    --cc=ast@kernel.org \
    --cc=brendan.d.gregg@gmail.com \
    --cc=hekuang@huawei.com \
    --cc=jolsa@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=lizefan@huawei.com \
    --cc=masami.hiramatsu.pt@hitachi.com \
    --cc=namhyung@kernel.org \
    --cc=peterz@infradead.org \
    --cc=pi3orama@163.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.