* perf: allow command to attach local data to thread/evsel structs @ 2012-02-07 18:11 David Ahern 2012-02-07 20:10 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 5+ messages in thread From: David Ahern @ 2012-02-07 18:11 UTC (permalink / raw) To: Arnaldo Carvalho de Melo, Ingo Molnar, Peter Zijlstra, Frederic Weisbecker, Stephane Eranian Cc: LKML This is an API I have been using for some 'local' commands that process perf events. It allows the commands to attach data to events and threads and avoid local caching and lookups. diff --git a/tools/perf/util/evsel.h b/tools/perf/util/evsel.h index 326b8e4..866de40 100644 --- a/tools/perf/util/evsel.h +++ b/tools/perf/util/evsel.h @@ -66,6 +66,12 @@ struct perf_evsel { void *data; } handler; bool supported; + + /* + * can be used by commands that process samples + * for storing local, event-based data + */ + void *private; }; struct cpu_map; diff --git a/tools/perf/util/thread.h b/tools/perf/util/thread.h index 70c2c13..9f62859 100644 --- a/tools/perf/util/thread.h +++ b/tools/perf/util/thread.h @@ -16,6 +16,12 @@ struct thread { bool comm_set; char *comm; int comm_len; + + /* + * can be used by commands that process samples + * for storing local, thread-based data + */ + void *private; }; struct machine; One wrinkle is that for the thread-based one the thread__delete(), thread__fork() and thread__set_comm() functions would free the allocations if set -- or a handler is needed to free private structs to handle layered mallocs. Any objections to pushing this kind of open-ended API into perf? David ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: perf: allow command to attach local data to thread/evsel structs 2012-02-07 18:11 perf: allow command to attach local data to thread/evsel structs David Ahern @ 2012-02-07 20:10 ` Arnaldo Carvalho de Melo 2012-02-07 21:18 ` David Ahern 0 siblings, 1 reply; 5+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-07 20:10 UTC (permalink / raw) To: David Ahern Cc: Ingo Molnar, Peter Zijlstra, Frederic Weisbecker, Stephane Eranian, LKML Em Tue, Feb 07, 2012 at 11:11:48AM -0700, David Ahern escreveu: > This is an API I have been using for some 'local' commands that process > perf events. It allows the commands to attach data to events and threads > and avoid local caching and lookups. In the kernel proper we try to get away with this pattern using container_of where possible. Here tho the structures are created in library functions. The symbol library has this symbol_conf.priv_size + symbol__priv() to do what you want while avoiding two allocs + one pointer in a core structure. I think we either use {thread_conf,evsel_conf} for global configuration options for these two core data structures or we just provide some optional, per perf_tool allocator. Yeah, that sounds extreme, but hey, this is a profiler, we ought to eat our own dog food, right? - Arnaldo P.S.: Are your tools too specific or are they upstreamable? > > diff --git a/tools/perf/util/evsel.h b/tools/perf/util/evsel.h > index 326b8e4..866de40 100644 > --- a/tools/perf/util/evsel.h > +++ b/tools/perf/util/evsel.h > @@ -66,6 +66,12 @@ struct perf_evsel { > void *data; > } handler; > bool supported; > + > + /* > + * can be used by commands that process samples > + * for storing local, event-based data > + */ > + void *private; > }; > > struct cpu_map; > diff --git a/tools/perf/util/thread.h b/tools/perf/util/thread.h > index 70c2c13..9f62859 100644 > --- a/tools/perf/util/thread.h > +++ b/tools/perf/util/thread.h > @@ -16,6 +16,12 @@ struct thread { > bool comm_set; > char *comm; > int comm_len; > + > + /* > + * can be used by commands that process samples > + * for storing local, thread-based data > + */ > + void *private; > }; > > struct machine; > > One wrinkle is that for the thread-based one the thread__delete(), > thread__fork() and thread__set_comm() functions would free the > allocations if set -- or a handler is needed to free private structs to > handle layered mallocs. > > Any objections to pushing this kind of open-ended API into perf? > > David ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: perf: allow command to attach local data to thread/evsel structs 2012-02-07 20:10 ` Arnaldo Carvalho de Melo @ 2012-02-07 21:18 ` David Ahern 2012-02-07 21:33 ` Arnaldo Carvalho de Melo 0 siblings, 1 reply; 5+ messages in thread From: David Ahern @ 2012-02-07 21:18 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: Ingo Molnar, Peter Zijlstra, Frederic Weisbecker, Stephane Eranian, LKML On 02/07/2012 01:10 PM, Arnaldo Carvalho de Melo wrote: > Em Tue, Feb 07, 2012 at 11:11:48AM -0700, David Ahern escreveu: >> This is an API I have been using for some 'local' commands that process >> perf events. It allows the commands to attach data to events and threads >> and avoid local caching and lookups. > > In the kernel proper we try to get away with this pattern using > container_of where possible. > > Here tho the structures are created in library functions. exactly. > > The symbol library has this symbol_conf.priv_size + symbol__priv() to do > what you want while avoiding two allocs + one pointer in a core > structure. Ok. Interesting. Somewhat of a hidden allocation. Symbols can't come and go like threads. > > I think we either use {thread_conf,evsel_conf} for global configuration > options for these two core data structures or we just provide some > optional, per perf_tool allocator. Meaning conf's that parallel symbol_conf -- allocated and 'owned' by the perf library but exported for the user to set values. In this case handlers are needed for allocation and free as instances come and go? e.g., thread__new invokes thread_conf.allocator if defined and thread__priv returns a pointer to private data. > > Yeah, that sounds extreme, but hey, this is a profiler, we ought to eat > our own dog food, right? um, as a library to be used by other commands libperf.so is an event collector and processor; profiler is just one use. > > - Arnaldo > > P.S.: Are your tools too specific or are they upstreamable? Too specific. One uses context-switch SW events to dump a time history of what task is running on what processor, how long it ran and how long between schedulings. The other uses a custom SW event to track futex's. In both cases the solutions are constrained by an older kernel with limited trace support - and too much effort to backport it. Key points are that in processing events I want to track data that is task specific (e.g., last scheduled out) and event specific (e.g., time last seen by cpu). David ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: perf: allow command to attach local data to thread/evsel structs 2012-02-07 21:18 ` David Ahern @ 2012-02-07 21:33 ` Arnaldo Carvalho de Melo 2012-02-07 21:55 ` David Ahern 0 siblings, 1 reply; 5+ messages in thread From: Arnaldo Carvalho de Melo @ 2012-02-07 21:33 UTC (permalink / raw) To: David Ahern Cc: Ingo Molnar, Peter Zijlstra, Frederic Weisbecker, Stephane Eranian, LKML Em Tue, Feb 07, 2012 at 02:18:12PM -0700, David Ahern escreveu: > > > On 02/07/2012 01:10 PM, Arnaldo Carvalho de Melo wrote: > > Em Tue, Feb 07, 2012 at 11:11:48AM -0700, David Ahern escreveu: > >> This is an API I have been using for some 'local' commands that process > >> perf events. It allows the commands to attach data to events and threads > >> and avoid local caching and lookups. > > > > In the kernel proper we try to get away with this pattern using > > container_of where possible. > > > > Here tho the structures are created in library functions. > > exactly. > > > > > The symbol library has this symbol_conf.priv_size + symbol__priv() to do > > what you want while avoiding two allocs + one pointer in a core > > structure. > > Ok. Interesting. Somewhat of a hidden allocation. Symbols can't come and > go like threads. Yes, standardizing a way to attach extra info to these structs seems the way to go. The experience with symbol_conf is a data point, check if it fits your needs. > > I think we either use {thread_conf,evsel_conf} for global configuration > > options for these two core data structures or we just provide some > > optional, per perf_tool allocator. > > Meaning conf's that parallel symbol_conf -- allocated and 'owned' by the > perf library but exported for the user to set values. In this case > handlers are needed for allocation and free as instances come and go? > e.g., thread__new invokes thread_conf.allocator if defined and > thread__priv returns a pointer to private data. Well, the symbol way of doing things is to just allocate as many bytes as the tool being used asks for and then at free time nothing special has to be done. Problem is when multiple things needs extra space. In the kernel we have the skb->cb way of doing things, where we have a scratch pad that is per layer, but limited in size, difficult to find the right way to do it. > > Yeah, that sounds extreme, but hey, this is a profiler, we ought to eat > > our own dog food, right? > > um, as a library to be used by other commands libperf.so is an event > collector and processor; profiler is just one use. Right, but in any case the faster and less memory intensive the better, even if not strictly needed to use lots of memory or cpu. > > > > - Arnaldo > > > > P.S.: Are your tools too specific or are they upstreamable? > > Too specific. > > One uses context-switch SW events to dump a time history of what task is > running on what processor, how long it ran and how long between > schedulings. The other uses a custom SW event to track futex's. In both > cases the solutions are constrained by an older kernel with limited > trace support - and too much effort to backport it. > > Key points are that in processing events I want to track data that is > task specific (e.g., last scheduled out) and event specific (e.g., time > last seen by cpu). Please take a look at the suggestions and propose a way to address your needs, if a void private pointer ends up the way to go... well, we'll have it :-) > David ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: perf: allow command to attach local data to thread/evsel structs 2012-02-07 21:33 ` Arnaldo Carvalho de Melo @ 2012-02-07 21:55 ` David Ahern 0 siblings, 0 replies; 5+ messages in thread From: David Ahern @ 2012-02-07 21:55 UTC (permalink / raw) To: Arnaldo Carvalho de Melo Cc: Ingo Molnar, Peter Zijlstra, Frederic Weisbecker, Stephane Eranian, LKML On 02/07/2012 02:33 PM, Arnaldo Carvalho de Melo wrote: >>> I think we either use {thread_conf,evsel_conf} for global configuration >>> options for these two core data structures or we just provide some >>> optional, per perf_tool allocator. >> >> Meaning conf's that parallel symbol_conf -- allocated and 'owned' by the >> perf library but exported for the user to set values. In this case >> handlers are needed for allocation and free as instances come and go? >> e.g., thread__new invokes thread_conf.allocator if defined and >> thread__priv returns a pointer to private data. > > Well, the symbol way of doing things is to just allocate as many bytes > as the tool being used asks for and then at free time nothing special > has to be done. Consider the case of events and you want to track time last seen by cpu. You need an array: u64 last_seen[] with CPU as an index, but do not want to assume some value of max_cpus. So one option is: struct priv { u64 *last_seen; /* time this event was last seen */ u32 max_cpu; /* highest cpu slot allocated */ }; last_seen is malloc'ed first time the event is processed and realloc'ed as max_cpu increases. Attaching that struct to an event means an embedded allocation that does not follow the static model of symbol_conf. David ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2012-02-07 21:55 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2012-02-07 18:11 perf: allow command to attach local data to thread/evsel structs David Ahern 2012-02-07 20:10 ` Arnaldo Carvalho de Melo 2012-02-07 21:18 ` David Ahern 2012-02-07 21:33 ` Arnaldo Carvalho de Melo 2012-02-07 21:55 ` David Ahern
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox