From: Namhyung Kim <namhyung@kernel.org>
To: David Ahern <dsahern@gmail.com>
Cc: acme@ghostprotocols.net, linux-kernel@vger.kernel.org,
Ingo Molnar <mingo@kernel.org>,
Frederic Weisbecker <fweisbec@gmail.com>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Mike Galbraith <efault@gmx.de>, Jiri Olsa <jolsa@redhat.com>,
Stephane Eranian <eranian@google.com>,
Pekka Enberg <penberg@kernel.org>
Subject: Re: [PATCH 6/8] perf sched: Introduce timehist command
Date: Thu, 28 Nov 2013 18:50:24 +0900 [thread overview]
Message-ID: <87siugq2hr.fsf@sejong.aot.lge.com> (raw)
In-Reply-To: <1384806771-2945-7-git-send-email-dsahern@gmail.com> (David Ahern's message of "Mon, 18 Nov 2013 13:32:49 -0700")
On Mon, 18 Nov 2013 13:32:49 -0700, David Ahern wrote:
> 'perf sched timehist' provides an analysis of scheduling events.
>
> Example usage:
> perf sched record -- sleep 1
> perf sched timehist
>
> By default it shows the individual schedule events, including the time between
> sched-in events for the task, the task scheduling delay (time between wakeup
> and actually running) and run time for the task:
>
> time cpu task name[tid/pid] b/n time sch delay run time
> --------------- ---- -------------------- --------- --------- ---------
> 79371.874569 [11] gcc[31949] 0.000014 0.000000 0.001148
> 79371.874591 [10] gcc[31951] 0.000000 0.000000 0.000024
> 79371.874603 [10] migration/10[59] 0.003350 0.000004 0.000011
> 79371.874604 [11] <idle> 0.001148 0.000000 0.000035
> 79371.874723 [05] <idle> 0.000016 0.000000 0.001383
> 79371.874746 [05] gcc[31949] 0.000153 0.000078 0.000022
> ...
>
> If callchains were recorded they are appended to the line with a default stack depth of 5:
>
> 79371.874569 [11] gcc[31949] 0.000014 0.000000 0.001148 wait_for_completion_killable do_fork sys_vfork stub_vfork __vfork
> 79371.874591 [10] gcc[31951] 0.000000 0.000000 0.000024 __cond_resched _cond_resched wait_for_completion stop_one_cpu sched_exec
> 79371.874603 [10] migration/10[59] 0.003350 0.000004 0.000011 smpboot_thread_fn kthread ret_from_fork
> 79371.874604 [11] <idle> 0.001148 0.000000 0.000035 cpu_startup_entry start_secondary
> 79371.874723 [05] <idle> 0.000016 0.000000 0.001383 cpu_startup_entry start_secondary
> 79371.874746 [05] gcc[31949] 0.000153 0.000078 0.000022 do_wait sys_wait4 system_call_fastpath __GI___waitpid
>
> --no-call-graph can be used to not show the callchains. --max-stack is used
> to control the number of frames shown (default of 5). -x/--excl options can
> be used to collapse redundant callchains to get more relevant data on screen.
>
> Similar to perf-trace -s and -S can be used to dump a statistical summary
> without or with events (respectively). Statistics include min run time,
> average run time and max run time. Stats are also shown for run time by
> cpu.
>
> The cpu-visual option provides a visual aid for sched switches by cpu:
> ...
> 79371.874569 [11] s gcc[31949] 0.000014 0.000000 0.001148
> 79371.874591 [10] s gcc[31951] 0.000000 0.000000 0.000024
> 79371.874603 [10] s migration/10[59] 0.003350 0.000004 0.000011
> 79371.874604 [11] i <idle> 0.001148 0.000000 0.000035
> 79371.874723 [05] i <idle> 0.000016 0.000000 0.001383
> 79371.874746 [05] s gcc[31949] 0.000153 0.000078 0.000022
> ...
>
> Additional options:
> - -w option can be used to include a dump of wakeup events
> - -c comm to only display events and stats for the given comm (csv list)
> - -p/-t to only display events and stats for given pid/tid (csv list)
Looks like a very nice feature. I haven't read the whole patch yet but
just want some comments. More will come later. :)
[SNIP]
> +static inline void printf_nsecs(unsigned long long nsecs, int width_sec)
> +{
> + unsigned long secs;
> + unsigned long usecs;
> +
> + secs = nsecs / NSECS_PER_SEC;
> + nsecs -= secs * NSECS_PER_SEC;
> + usecs = nsecs / NSECS_PER_USEC;
> + printf("%*lu.%06lu ", width_sec, secs, usecs);
> +}
It seems very similar to what timehist_time_str() does. Better to
factor out?
[SNIP]
> +static unsigned int comm_width = 20;
> +
> +static char *timehist_get_commstr(struct thread *thread)
> +{
> + static char str[32];
> + const char *comm = thread__comm_str(thread);
> + pid_t tid = thread->tid;
> + pid_t pid = thread->pid_;
> + unsigned int n;
> +
> + if (pid == 0)
> + scnprintf(str, sizeof(str), "%s", comm);
> +
> + else if (tid != pid)
> + scnprintf(str, sizeof(str), "%s[%d/%d]", comm, tid, pid);
> +
> + else
> + scnprintf(str, sizeof(str), "%s[%d]", comm, tid);
> +
> + n = strlen(str);
Why not just using return value of scnprintf()?
> + if (n > comm_width)
> + comm_width = n;
> +
> + return str;
> +}
> +
> +static void timehist_header(struct perf_sched *sched)
> +{
> + u32 max_cpus = sched->max_cpu;
> + u32 i, j;
> +
> + printf("%15s %-4s", "time", "cpu");
> +
> + if (sched->show_cpu_visual && max_cpus) {
> + printf(" ");
> + for (i = 0, j = 0; i < max_cpus; ++i) {
> + printf("%x", j++);
> + if (j > 15)
> + j = 0;
> + }
> + printf(" ");
> + }
> +
> + printf(" %-20s %9s %9s %9s",
> + "task name[tid/pid]", "b/n time", "sch delay", "run time");
> +
> + if (sched->show_wakeups)
> + printf(" %-20s", "wakeup");
> +
> + printf("\n");
> +
> + printf("%15s %4s", "---------------", "----");
You might want to use "%.15s" and graph_dotted_line.
> +
> + if (sched->show_cpu_visual && max_cpus) {
> + printf(" ");
> + for (i = 0; i < max_cpus; ++i)
> + printf("-");
> + printf(" ");
> + }
> +
> + printf(" %20s %9s %9s %9s",
> + "--------------------", "---------", "---------", "---------");
Ditto.
> +
> + if (sched->show_wakeups)
> + printf(" %20s", "--------------------");
Ditto.
> +
> + printf("\n");
> +}
> +
> +static char *timehist_time_str(char *tstr, int len, u64 t)
> +{
> + unsigned long secs, usecs;
> + unsigned long long nsecs;
> +
> + nsecs = t;
> + secs = nsecs / NSECS_PER_SEC;
> + nsecs -= secs * NSECS_PER_SEC;
> + usecs = nsecs / NSECS_PER_USEC;
> + snprintf(tstr, len, "%5lu.%06lu", secs, usecs);
> +
> + return tstr;
> +}
See above.
> +
> +static void timehist_print_sample(struct perf_sched *sched,
> + union perf_event *event,
> + struct perf_evsel *evsel,
> + struct perf_sample *sample,
> + struct thread *thread,
> + struct machine *machine)
> +{
> + struct thread_runtime *tr = thread__priv(thread);
> + char tstr[64];
> + u32 max_cpus = sched->max_cpu;
> +
> + printf("%15s ", timehist_time_str(tstr, sizeof(tstr), sample->time));
> +
> + printf("[%02d] ", sample->cpu);
> +
> + if (sched->show_cpu_visual && max_cpus) {
> + u32 i;
> + char c;
> + for (i = 0; i < max_cpus; ++i) {
> + if (i == sample->cpu)
> + c = (thread->tid == 0) ? 'i' : 's';
It'd better explaining what the 'i' and 's' mean..
> + else
> + c = ' ';
> + printf("%c", c);
> + }
> + printf(" ");
> + }
> +
> + printf("%-*s ", comm_width, timehist_get_commstr(thread));
> +
> + printf_nsecs(tr->dt_between, 2);
> + printf_nsecs(tr->dt_delay, 2);
> + printf_nsecs(tr->dt_run, 2);
> +
> + if (sched->show_wakeups)
> + printf(" %-*s ", comm_width, "");
> +
> + perf_evsel__print_ip(evsel, event, sample, machine,
> + PRINT_IP_OPT_SYM | PRINT_IP_OPT_ONELINE,
> + sched->max_stack);
> +
> + printf("\n");
> +}
[SNIP]
> +static void free_idle_threads(void)
> +{
> + int i;
> +
> + if (idle_threads == NULL)
> + return;
> +
> + for (i = 0; i <= idle_max_cpu; ++i)
> + free(idle_threads[i]);
Doesn't it need to be thread__delete(idle_threads[i]) ?
Thanks,
Namhyung
> +
> + free(idle_threads);
> +}
next prev parent reply other threads:[~2013-11-28 9:50 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-11-18 20:32 [PATCH 0/8] perf: sched timehist command David Ahern
2013-11-18 20:32 ` [PATCH 1/8] perf tool: Skip ignored symbols while printing callchain David Ahern
2013-11-30 12:52 ` [tip:perf/core] perf evsel: " tip-bot for David Ahern
2013-11-18 20:32 ` [PATCH 2/8] perf symbols: Move idle syms check from top to generic function David Ahern
2013-11-28 8:36 ` Namhyung Kim
2013-11-30 15:58 ` David Ahern
2013-11-30 12:52 ` [tip:perf/core] " tip-bot for David Ahern
2013-11-18 20:32 ` [PATCH 3/8] perf symbol: Save vmlinux or kallsyms path loaded David Ahern
2013-11-22 18:44 ` Arnaldo Carvalho de Melo
2013-11-22 19:13 ` David Ahern
2013-11-22 19:40 ` Arnaldo Carvalho de Melo
2013-11-22 20:09 ` David Ahern
2013-11-18 20:32 ` [PATCH 4/8] perf thread: Move comm_list check into function David Ahern
2013-11-30 12:52 ` [tip:perf/core] " tip-bot for David Ahern
2013-11-18 20:32 ` [PATCH 5/8] perf tool: export setup_list David Ahern
2013-11-28 8:45 ` Namhyung Kim
2013-11-30 12:52 ` [tip:perf/core] perf tools: Export setup_list tip-bot for David Ahern
2013-11-18 20:32 ` [PATCH 6/8] perf sched: Introduce timehist command David Ahern
2013-11-28 9:50 ` Namhyung Kim [this message]
2013-11-28 16:01 ` David Ahern
2013-11-28 15:38 ` Namhyung Kim
2013-11-28 16:01 ` David Ahern
2013-11-29 0:48 ` Namhyung Kim
2013-11-29 1:58 ` David Ahern
2013-12-04 4:15 ` David Ahern
2013-11-18 20:32 ` [PATCH 7/8] perf sched timehist: Add support for context-switch event David Ahern
2013-11-18 20:32 ` [PATCH 8/8] perf sched : Add documentation for timehist options David Ahern
2013-11-28 15:42 ` Namhyung Kim
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87siugq2hr.fsf@sejong.aot.lge.com \
--to=namhyung@kernel.org \
--cc=a.p.zijlstra@chello.nl \
--cc=acme@ghostprotocols.net \
--cc=dsahern@gmail.com \
--cc=efault@gmx.de \
--cc=eranian@google.com \
--cc=fweisbec@gmail.com \
--cc=jolsa@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@kernel.org \
--cc=penberg@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.