public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Arnaldo Carvalho de Melo <acme@kernel.org>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Wang Nan <wangnan0@huawei.com>,
	Alexei Starovoitov <ast@kernel.org>,
	Arnaldo Carvalho de Melo <acme@redhat.com>,
	Jiri Olsa <jolsa@kernel.org>, Li Zefan <lizefan@huawei.com>,
	pi3orama@163.com, linux-kernel@vger.kernel.org,
	He Kuang <hekuang@huawei.com>,
	Brendan Gregg <brendan.d.gregg@gmail.com>,
	Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>,
	Namhyung Kim <namhyung@kernel.org>
Subject: Re: [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer
Date: Mon, 29 Feb 2016 12:39:29 -0300	[thread overview]
Message-ID: <20160229153929.GD32719@kernel.org> (raw)
In-Reply-To: <1456479154-136027-11-git-send-email-wangnan0@huawei.com>

Em Fri, Feb 26, 2016 at 09:31:58AM +0000, Wang Nan escreveu:
> Add new ioctl() to pause/resume ring-buffer output.
> 
> In some situations we want to read from ring buffer only when we
> ensure nothing can write to the ring buffer during reading. Without
> this patch we have to turn off all events attached to this ring buffer
> to achieve this.
> 
> This patch is for supporting overwrite ring buffer. Following
> commits will introduce new methods support reading from overwrite ring
> buffer. Before reading caller must ensure the ring buffer is frozen, or
> the reading is unreliable.

Peter, have you have the chance too look at this and the other kernel
bits in this kit?

- Arnaldo
 
> Signed-off-by: Wang Nan <wangnan0@huawei.com>
> Cc: He Kuang <hekuang@huawei.com>
> Cc: Alexei Starovoitov <ast@kernel.org>
> Cc: Arnaldo Carvalho de Melo <acme@redhat.com>
> Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
> Cc: Jiri Olsa <jolsa@kernel.org>
> Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
> Cc: Namhyung Kim <namhyung@kernel.org>
> Cc: Peter Zijlstra <peterz@infradead.org>
> Cc: Zefan Li <lizefan@huawei.com>
> Cc: pi3orama@163.com
> ---
>  include/uapi/linux/perf_event.h |  1 +
>  kernel/events/core.c            | 13 +++++++++++++
>  kernel/events/internal.h        | 11 +++++++++++
>  kernel/events/ring_buffer.c     |  7 ++++++-
>  4 files changed, 31 insertions(+), 1 deletion(-)
> 
> diff --git a/include/uapi/linux/perf_event.h b/include/uapi/linux/perf_event.h
> index 1afe962..a3c1903 100644
> --- a/include/uapi/linux/perf_event.h
> +++ b/include/uapi/linux/perf_event.h
> @@ -401,6 +401,7 @@ struct perf_event_attr {
>  #define PERF_EVENT_IOC_SET_FILTER	_IOW('$', 6, char *)
>  #define PERF_EVENT_IOC_ID		_IOR('$', 7, __u64 *)
>  #define PERF_EVENT_IOC_SET_BPF		_IOW('$', 8, __u32)
> +#define PERF_EVENT_IOC_PAUSE_OUTPUT	_IOW('$', 9, __u32)
>  
>  enum perf_event_ioc_flags {
>  	PERF_IOC_FLAG_GROUP		= 1U << 0,
> diff --git a/kernel/events/core.c b/kernel/events/core.c
> index 94c47e3..a7075ae 100644
> --- a/kernel/events/core.c
> +++ b/kernel/events/core.c
> @@ -4231,6 +4231,19 @@ static long _perf_ioctl(struct perf_event *event, unsigned int cmd, unsigned lon
>  	case PERF_EVENT_IOC_SET_BPF:
>  		return perf_event_set_bpf_prog(event, arg);
>  
> +	case PERF_EVENT_IOC_PAUSE_OUTPUT: {
> +		struct ring_buffer *rb;
> +
> +		rcu_read_lock();
> +		rb = rcu_dereference(event->rb);
> +		if (!event->rb) {
> +			rcu_read_unlock();
> +			return -EINVAL;
> +		}
> +		rb_toggle_paused(rb, !!arg);
> +		rcu_read_unlock();
> +		return 0;
> +	}
>  	default:
>  		return -ENOTTY;
>  	}
> diff --git a/kernel/events/internal.h b/kernel/events/internal.h
> index 2bbad9c..6a93d1b 100644
> --- a/kernel/events/internal.h
> +++ b/kernel/events/internal.h
> @@ -18,6 +18,7 @@ struct ring_buffer {
>  #endif
>  	int				nr_pages;	/* nr of data pages  */
>  	int				overwrite;	/* can overwrite itself */
> +	int				paused;		/* can write into ring buffer */
>  
>  	atomic_t			poll;		/* POLL_ for wakeups */
>  
> @@ -65,6 +66,16 @@ static inline void rb_free_rcu(struct rcu_head *rcu_head)
>  	rb_free(rb);
>  }
>  
> +static inline void
> +rb_toggle_paused(struct ring_buffer *rb,
> +		 bool pause)
> +{
> +	if (!pause && rb->nr_pages)
> +		rb->paused = 0;
> +	else
> +		rb->paused = 1;
> +}
> +
>  extern struct ring_buffer *
>  rb_alloc(int nr_pages, long watermark, int cpu, int flags);
>  extern void perf_event_wakeup(struct perf_event *event);
> diff --git a/kernel/events/ring_buffer.c b/kernel/events/ring_buffer.c
> index 1faad2c..22e1a47 100644
> --- a/kernel/events/ring_buffer.c
> +++ b/kernel/events/ring_buffer.c
> @@ -125,8 +125,11 @@ int perf_output_begin(struct perf_output_handle *handle,
>  	if (unlikely(!rb))
>  		goto out;
>  
> -	if (unlikely(!rb->nr_pages))
> +	if (unlikely(rb->paused)) {
> +		if (rb->nr_pages)
> +			local_inc(&rb->lost);
>  		goto out;
> +	}
>  
>  	handle->rb    = rb;
>  	handle->event = event;
> @@ -244,6 +247,8 @@ ring_buffer_init(struct ring_buffer *rb, long watermark, int flags)
>  	INIT_LIST_HEAD(&rb->event_list);
>  	spin_lock_init(&rb->event_lock);
>  	init_irq_work(&rb->irq_work, rb_irq_work);
> +
> +	rb->paused = rb->nr_pages ? 0 : 1;
>  }
>  
>  static void ring_buffer_put_async(struct ring_buffer *rb)
> -- 
> 1.8.3.4

  reply	other threads:[~2016-02-29 15:39 UTC|newest]

Thread overview: 60+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-26  9:31 [PATCH 00/46] perf tools: Fix and improvements (bpf and overwrite) Wang Nan
2016-02-26  9:31 ` [PATCH 01/46] perf tools: Record text offset in dso to calculate objdump address Wang Nan
2016-03-24  7:37   ` [tip:perf/urgent] perf symbols: " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 02/46] perf tools: Adjust symbol for shared objects Wang Nan
2016-02-26  9:31 ` [PATCH 03/46] perf config: Bring perf_default_config to the very beginning at main() Wang Nan
2016-02-27  9:44   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 04/46] perf trace: Improve error message when receive non-tracepoint events Wang Nan
2016-02-26  9:31 ` [PATCH 05/46] perf tools: Only set filter for tracepoints events Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 06/46] perf trace: Call bpf__apply_obj_config in 'perf trace' Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 07/46] perf trace: Print content of bpf-output event Wang Nan
2016-02-27  9:45   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 08/46] perf data: Support converting data from bpf_perf_event_output() Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 09/46] perf data: Explicitly set byte order for integer types Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:31 ` [PATCH 10/46] perf core: Introduce new ioctl options to pause and resume ring buffer Wang Nan
2016-02-29 15:39   ` Arnaldo Carvalho de Melo [this message]
2016-03-03  2:03     ` Wangnan (F)
2016-02-26  9:31 ` [PATCH 11/46] perf core: Set event's default overflow_handler Wang Nan
2016-02-26  9:32 ` [PATCH 12/46] perf core: Prepare writing into ring buffer from end Wang Nan
2016-02-26  9:32 ` [PATCH 13/46] perf core: Add backward attribute to perf event Wang Nan
2016-02-26  9:32 ` [PATCH 14/46] perf core: Reduce perf event output overhead by new overflow handler Wang Nan
2016-02-26  9:32 ` [PATCH 15/46] perf tools: Only validate is_pos for tracking evsels Wang Nan
2016-02-26  9:32 ` [PATCH 16/46] perf tools: Print write_backward value in perf_event_attr__fprintf Wang Nan
2016-02-26  9:32 ` [PATCH 17/46] perf tools: Make ordered_events reusable Wang Nan
2016-02-26  9:32 ` [PATCH 18/46] perf record: Use WARN_ONCE to replace 'if' condition Wang Nan
2016-03-05  8:15   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 19/46] perf record: Extract synthesize code to record__synthesize() Wang Nan
2016-03-05  8:16   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 20/46] perf tools: Add perf_data_file__switch() helper Wang Nan
2016-02-26  9:32 ` [PATCH 21/46] perf record: Turns auxtrace_snapshot_enable into 3 states Wang Nan
2016-02-26  9:32 ` [PATCH 22/46] perf record: Introduce record__finish_output() to finish a perf.data Wang Nan
2016-03-05  8:16   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 23/46] perf record: Add '--timestamp-filename' option to append timestamp to output filename Wang Nan
2016-02-26  9:32 ` [PATCH 24/46] perf record: Split output into multiple files via '--switch-output' Wang Nan
2016-02-26  9:32 ` [PATCH 25/46] perf record: Force enable --timestamp-filename when --switch-output is provided Wang Nan
2016-02-26  9:32 ` [PATCH 26/46] perf record: Disable buildid cache options by default in switch output mode Wang Nan
2016-02-26  9:32 ` [PATCH 27/46] perf record: Re-synthesize tracking events after output switching Wang Nan
2016-02-26  9:32 ` [PATCH 28/46] perf record: Generate tracking events for process forked by perf Wang Nan
2016-02-26  9:32 ` [PATCH 29/46] perf record: Ensure return non-zero rc when mmap fail Wang Nan
2016-03-05  8:17   ` [tip:perf/core] " tip-bot for Wang Nan
2016-02-26  9:32 ` [PATCH 30/46] perf record: Prevent reading invalid data in record__mmap_read Wang Nan
2016-02-26  9:32 ` [PATCH 31/46] perf tools: Add evlist channel helpers Wang Nan
2016-02-26  9:32 ` [PATCH 32/46] perf tools: Automatically add new channel according to evlist Wang Nan
2016-02-26  9:32 ` [PATCH 33/46] perf tools: Operate multiple channels Wang Nan
2016-02-26  9:32 ` [PATCH 34/46] perf tools: Squash overwrite setting into channel Wang Nan
2016-02-26  9:32 ` [PATCH 35/46] perf record: Don't read from and poll overwrite channel Wang Nan
2016-02-26  9:32 ` [PATCH 36/46] perf record: Don't poll on " Wang Nan
2016-02-26  9:32 ` [PATCH 37/46] perf tools: Detect avalibility of write_backward Wang Nan
2016-02-26  9:32 ` [PATCH 38/46] perf tools: Enable overwrite settings Wang Nan
2016-02-26  9:32 ` [PATCH 39/46] perf tools: Set write_backward attribut bit for overwrite events Wang Nan
2016-02-26  9:32 ` [PATCH 40/46] perf tools: Record fd into perf_mmap Wang Nan
2016-02-26  9:32 ` [PATCH 41/46] perf tools: Add API to pause a channel Wang Nan
2016-02-26  9:32 ` [PATCH 42/46] perf record: Toggle overwrite ring buffer for reading Wang Nan
2016-02-26  9:32 ` [PATCH 43/46] perf record: Rename variable to make code clear Wang Nan
2016-02-26  9:32 ` [PATCH 44/46] perf record: Read from backward ring buffer Wang Nan
2016-02-26  9:32 ` [PATCH 45/46] perf record: Allow generate tracking events at the end of output Wang Nan
2016-02-26  9:32 ` [PATCH 46/46] perf tools: Don't warn about out of order event if write_backward is used Wang Nan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160229153929.GD32719@kernel.org \
    --to=acme@kernel.org \
    --cc=acme@redhat.com \
    --cc=ast@kernel.org \
    --cc=brendan.d.gregg@gmail.com \
    --cc=hekuang@huawei.com \
    --cc=jolsa@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=lizefan@huawei.com \
    --cc=masami.hiramatsu.pt@hitachi.com \
    --cc=namhyung@kernel.org \
    --cc=peterz@infradead.org \
    --cc=pi3orama@163.com \
    --cc=wangnan0@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox