All of lore.kernel.org
 help / color / mirror / Atom feed
From: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
To: Lin Ming <ming.m.lin@intel.com>
Cc: Arnaldo Carvalho de Melo <acme@infradead.org>,
	Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Ingo Molnar <mingo@elte.hu>,
	linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v2 -tip] perf probe: Add fastpath to do lookup by function name
Date: Fri, 25 Mar 2011 10:14:25 +0900	[thread overview]
Message-ID: <4D8BEC71.1040404@hitachi.com> (raw)
In-Reply-To: <1300975753.2283.20.camel@localhost>

(2011/03/24 23:09), Lin Ming wrote:
> v2 -> v1:
> - Don't compare file names with cu_find_realpath(...), instead, compare them
>   with the name returned by dwarf_decl_file(sp_die) 
> 
> The vmlinux file may have thousands of CUs.
> We can lookup function name from .debug_pubnames section
> to avoid the slow loop on CUs.
> 
> ./perf stat -r 10 -- ./perf probe -k /home/mlin/vmlinux \
>         -s /home/mlin/linux-2.6 \
> 	--line csum_partial_copy_to_user > tmp.log
> 
> before patch applied
> =====================
>         364.535892 task-clock-msecs         #      0.997 CPUs
>                  0 context-switches         #      0.000 M/sec
>                  0 CPU-migrations           #      0.000 M/sec
>             29,993 page-faults              #      0.082 M/sec
>        865,862,109 cycles                   #   2375.245 M/sec
>      1,255,259,630 instructions             #      1.450 IPC
>        252,400,884 branches                 #    692.390 M/sec
>          3,429,376 branch-misses            #      1.359 %
>          1,386,990 cache-references         #      3.805 M/sec
>            687,188 cache-misses             #      1.885 M/sec
> 
>         0.365792170  seconds time elapsed
> 
> after patch applied
> =====================
>          89.896405 task-clock-msecs         #      0.991 CPUs
>                  1 context-switches         #      0.000 M/sec
>                  0 CPU-migrations           #      0.000 M/sec
>             10,145 page-faults              #      0.113 M/sec
>        214,553,875 cycles                   #   2386.679 M/sec
>        226,915,559 instructions             #      1.058 IPC
>         44,536,614 branches                 #    495.422 M/sec
>            613,074 branch-misses            #      1.377 %
>            860,787 cache-references         #      9.575 M/sec
>            442,380 cache-misses             #      4.921 M/sec
> 
>         0.090716032  seconds time elapsed

Thanks! Looks very good :)

Acked-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>

> 
> Signed-off-by: Lin Ming <ming.m.lin@intel.com>
> ---
>  tools/perf/util/probe-finder.c |   39 +++++++++++++++++++++++++++++++++++++++
>  tools/perf/util/probe-finder.h |    1 +
>  2 files changed, 40 insertions(+), 0 deletions(-)
> 
> diff --git a/tools/perf/util/probe-finder.c b/tools/perf/util/probe-finder.c
> index 194f9e2..5cf044c 100644
> --- a/tools/perf/util/probe-finder.c
> +++ b/tools/perf/util/probe-finder.c
> @@ -1876,6 +1876,31 @@ static int find_line_range_by_func(struct line_finder *lf)
>  	return param.retval;
>  }
>  
> +static int pubname_search_cb(Dwarf *dbg, Dwarf_Global *gl, void *data)
> +{
> +	struct line_finder *lf = data;
> +	struct line_range *lr = lf->lr;
> +
> +	if (dwarf_offdie(dbg, gl->die_offset, &lf->sp_die)) {
> +		if (dwarf_tag(&lf->sp_die) != DW_TAG_subprogram)
> +			return DWARF_CB_OK;
> +
> +		if (die_compare_name(&lf->sp_die, lr->function)) {
> +			if (!dwarf_offdie(dbg, gl->cu_offset, &lf->cu_die))
> +				return DWARF_CB_OK;
> +
> +			if (lr->file &&
> +			    strtailcmp(lr->file, dwarf_decl_file(&lf->sp_die)))
> +				return DWARF_CB_OK;
> +
> +			lf->found = 1;
> +			return DWARF_CB_ABORT;
> +		}
> +	}
> +
> +	return DWARF_CB_OK;
> +}
> +
>  int find_line_range(int fd, struct line_range *lr)
>  {
>  	struct line_finder lf = {.lr = lr, .found = 0};
> @@ -1895,6 +1920,19 @@ int find_line_range(int fd, struct line_range *lr)
>  		return -EBADF;
>  	}
>  
> +	/* Fastpath: lookup by function name from .debug_pubnames section */
> +	if (lr->function) {
> +		struct dwarf_callback_param param = {.data = (void *)&lf, .retval = 0};
> +
> +		dwarf_getpubnames(dbg, pubname_search_cb, &lf, 0);
> +		if (lf.found) {
> +			lf.found = 0;
> +			line_range_search_cb(&lf.sp_die, &param);
> +			if (lf.found)
> +				goto found;
> +		}
> +	}
> +
>  	/* Loop on CUs (Compilation Unit) */
>  	while (!lf.found && ret >= 0) {
>  		if (dwarf_nextcu(dbg, off, &noff, &cuhl, NULL, NULL, NULL) != 0)
> @@ -1923,6 +1961,7 @@ int find_line_range(int fd, struct line_range *lr)
>  		off = noff;
>  	}
>  
> +found:
>  	/* Store comp_dir */
>  	if (lf.found) {
>  		comp_dir = cu_get_comp_dir(&lf.cu_die);
> diff --git a/tools/perf/util/probe-finder.h b/tools/perf/util/probe-finder.h
> index beaefc3..4bc56a4 100644
> --- a/tools/perf/util/probe-finder.h
> +++ b/tools/perf/util/probe-finder.h
> @@ -83,6 +83,7 @@ struct line_finder {
>  	int			lno_s;		/* Start line number */
>  	int			lno_e;		/* End line number */
>  	Dwarf_Die		cu_die;		/* Current CU */
> +	Dwarf_Die		sp_die;
>  	int			found;
>  };
>  


-- 
Masami HIRAMATSU
2nd Dept. Linux Technology Center
Hitachi, Ltd., Systems Development Laboratory
E-mail: masami.hiramatsu.pt@hitachi.com

  reply	other threads:[~2011-03-25  1:14 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-03-24 15:38 [PATCH] perf probe: Add fastpath to do lookup by function name Lin Ming
2011-03-24  7:58 ` Ingo Molnar
2011-03-24  8:38   ` Lin Ming
2011-03-24  8:47     ` Ingo Molnar
2011-03-24  9:08 ` Masami Hiramatsu
2011-03-24 13:47   ` Lin Ming
2011-03-24 14:09     ` [PATCH v2 -tip] " Lin Ming
2011-03-25  1:14       ` Masami Hiramatsu [this message]
2011-03-25  2:57         ` Arnaldo Carvalho de Melo
2011-03-25  6:33           ` Lin Ming
2011-03-25  8:30             ` Lin Ming

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4D8BEC71.1040404@hitachi.com \
    --to=masami.hiramatsu.pt@hitachi.com \
    --cc=a.p.zijlstra@chello.nl \
    --cc=acme@infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=ming.m.lin@intel.com \
    --cc=mingo@elte.hu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.