linux-perf-users.vger.kernel.org archive mirror
* [PATCH 1/2] perf report: Use map_symbol__copy() when copying callchains
@ 2025-03-06  7:51 Namhyung Kim
  2025-03-06  7:51 ` [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode Namhyung Kim
  0 siblings, 1 reply; 8+ messages in thread
From: Namhyung Kim @ 2025-03-06  7:51 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo, Ian Rogers, Kan Liang
  Cc: Jiri Olsa, Adrian Hunter, Peter Zijlstra, Ingo Molnar, LKML,
	linux-perf-users

It seems some places miss updating the refcounts of maps when copying a
map_symbol.  Use the map_symbol__copy() helper to copy them properly,
with the refcounts updated.

Signed-off-by: Namhyung Kim <namhyung@kernel.org>
---
 tools/perf/util/callchain.c | 9 +++------
 1 file changed, 3 insertions(+), 6 deletions(-)

diff --git a/tools/perf/util/callchain.c b/tools/perf/util/callchain.c
index 0c7564747a14e539..4fe7a10de33b6de8 100644
--- a/tools/perf/util/callchain.c
+++ b/tools/perf/util/callchain.c
@@ -590,8 +590,7 @@ fill_node(struct callchain_node *node, struct callchain_cursor *cursor)
 		}
 		call->ip = cursor_node->ip;
 		call->ms = cursor_node->ms;
-		call->ms.map = map__get(call->ms.map);
-		call->ms.maps = maps__get(call->ms.maps);
+		map_symbol__copy(&call->ms, &cursor_node->ms);
 		call->srcline = cursor_node->srcline;
 
 		if (cursor_node->branch) {
@@ -1094,9 +1093,7 @@ int callchain_cursor_append(struct callchain_cursor *cursor,
 
 	node->ip = ip;
 	map_symbol__exit(&node->ms);
-	node->ms = *ms;
-	node->ms.maps = maps__get(ms->maps);
-	node->ms.map = map__get(ms->map);
+	map_symbol__copy(&node->ms, ms);
 	node->branch = branch;
 	node->nr_loop_iter = nr_loop_iter;
 	node->iter_cycles = iter_cycles;
@@ -1564,7 +1561,7 @@ int callchain_node__make_parent_list(struct callchain_node *node)
 				goto out;
 			*new = *chain;
 			new->has_children = false;
-			new->ms.map = map__get(new->ms.map);
+			map_symbol__copy(&new->ms, &chain->ms);
 			list_add_tail(&new->list, &head);
 		}
 		parent = parent->parent;
-- 
2.48.1.711.g2feabab25a-goog
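
To illustrate the refcount issue, here is a minimal standalone sketch.
The toy_* types and helpers are hypothetical stand-ins, not perf's real
struct map, struct maps, or map_symbol__copy().  The sketch only assumes
that the helper takes a reference on both map and maps while copying, as
the removed lines in the second hunk did by hand.  The old code in the
third hunk took a reference on map only, which is the kind of miss the
helper avoids.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for a refcounted object such as a map. */
struct toy_obj {
	int refcnt;
};

static struct toy_obj *toy_new(void)
{
	struct toy_obj *o = calloc(1, sizeof(*o));

	if (o)
		o->refcnt = 1;
	return o;
}

/* Take a reference; NULL is passed through unchanged. */
static struct toy_obj *toy_get(struct toy_obj *o)
{
	if (o)
		o->refcnt++;
	return o;
}

/* Drop a reference and free the object when the last one goes away. */
static void toy_put(struct toy_obj *o)
{
	if (o && --o->refcnt == 0)
		free(o);
}

struct toy_map_symbol {
	struct toy_obj *maps;
	struct toy_obj *map;
	const char *sym;
};

/* Copy every field and take a reference on both refcounted members. */
static void toy_map_symbol__copy(struct toy_map_symbol *dst,
				 const struct toy_map_symbol *src)
{
	dst->maps = toy_get(src->maps);
	dst->map = toy_get(src->map);
	dst->sym = src->sym;
}

/* Release both references held by a toy_map_symbol. */
static void toy_map_symbol__exit(struct toy_map_symbol *ms)
{
	toy_put(ms->maps);
	toy_put(ms->map);
	ms->maps = NULL;
	ms->map = NULL;
}

/* Returns the refcount of 'maps' right after the copy, or -1 on allocation failure. */
static int toy_demo_copy(void)
{
	struct toy_map_symbol a = { toy_new(), toy_new(), "main" };
	struct toy_map_symbol b;
	int after_copy;

	if (!a.maps || !a.map) {
		toy_map_symbol__exit(&a);
		return -1;
	}

	toy_map_symbol__copy(&b, &a);
	after_copy = a.maps->refcnt;	/* both copies hold a reference */

	toy_map_symbol__exit(&b);
	assert(a.maps->refcnt == 1 && a.map->refcnt == 1);
	toy_map_symbol__exit(&a);
	return after_copy;
}
```

Calling toy_demo_copy() from a small main() exercises the copy and both
exits.  With a plain struct assignment instead of the copy helper, the
first exit would already drop the only reference.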



* [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-06  7:51 [PATCH 1/2] perf report: Use map_symbol__copy() when copying callchains Namhyung Kim
@ 2025-03-06  7:51 ` Namhyung Kim
  2025-03-06 16:55   ` Ian Rogers
  0 siblings, 1 reply; 8+ messages in thread
From: Namhyung Kim @ 2025-03-06  7:51 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo, Ian Rogers, Kan Liang
  Cc: Jiri Olsa, Adrian Hunter, Peter Zijlstra, Ingo Molnar, LKML,
	linux-perf-users

Ian told me that there are many memory leaks in the hierarchy mode.  I
can easily reproduce it with the following commands.

  $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak

  $ perf record --latency -g -- ./perf test -w thloop

  $ perf report -H --stdio
  ...
  Indirect leak of 168 byte(s) in 21 object(s) allocated from:
      #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
      #1 0x55ed3602346e in map__get util/map.h:189
      #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
      #3 0x55ed36025208 in hist_entry__new util/hist.c:588
      #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
      #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
      #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
      #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
      #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
      #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
      #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
      #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
      #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
      #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
      #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
      #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
  ...

  $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
  93

I found that hist_entry__delete() failed to release child entries in the
hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
and call hist_entry__delete() recursively.

After this change:

  $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
  0

Reported-by: Ian Rogers <irogers@google.com>
Signed-off-by: Namhyung Kim <namhyung@kernel.org>
---
 tools/perf/util/hist.c | 9 +++++++++
 1 file changed, 9 insertions(+)

diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
--- a/tools/perf/util/hist.c
+++ b/tools/perf/util/hist.c
@@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
 {
 	struct hist_entry_ops *ops = he->ops;
 
+	while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
+		struct rb_node *node = rb_first(&he->hroot_out.rb_root);
+		struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
+
+		rb_erase_init(node, &he->hroot_out.rb_root);
+
+		hist_entry__delete(child);
+	}
+
 	thread__zput(he->thread);
 	map_symbol__exit(&he->ms);
 
-- 
2.48.1.711.g2feabab25a-goog
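
The drain loop in the patch can be modeled outside perf.  This is a
minimal sketch, assuming a plain unbalanced binary search tree in place
of the kernel rbtree; every toy_* name is hypothetical.  Each entry owns
a tree of children, and deleting an entry first pops and deletes each
child, so nothing below it is leaked.

```c
#include <assert.h>
#include <stdlib.h>

struct toy_entry {
	int key;
	struct toy_entry *left, *right;	/* links inside the parent's tree */
	struct toy_entry *children;	/* root of this entry's own child tree */
};

static int toy_freed;	/* number of entries released so far */

static struct toy_entry *toy_entry_new(int key)
{
	struct toy_entry *e = calloc(1, sizeof(*e));

	if (e)
		e->key = key;
	return e;
}

/* Plain binary-search-tree insert, no rebalancing. */
static void toy_insert(struct toy_entry **root, struct toy_entry *e)
{
	while (*root)
		root = e->key < (*root)->key ? &(*root)->left : &(*root)->right;
	*root = e;
}

/* Unlink and return the leftmost (first) entry of a non-empty tree. */
static struct toy_entry *toy_pop_first(struct toy_entry **root)
{
	struct toy_entry *first;

	assert(root && *root);
	while ((*root)->left)
		root = &(*root)->left;
	first = *root;
	*root = first->right;	/* the first entry has no left child */
	first->right = NULL;
	return first;
}

/* Delete an entry after recursively deleting everything below it. */
static void toy_entry_delete(struct toy_entry *e)
{
	while (e->children)
		toy_entry_delete(toy_pop_first(&e->children));
	free(e);
	toy_freed++;
}

/* Builds 1 top entry, 3 children and 2 grandchildren per child. */
static int toy_demo_hierarchy(void)
{
	struct toy_entry *top = toy_entry_new(0);
	int i, j;

	if (!top)
		return -1;
	for (i = 0; i < 3; i++) {
		struct toy_entry *mid = toy_entry_new(i);

		if (!mid)
			break;
		toy_insert(&top->children, mid);
		for (j = 0; j < 2; j++) {
			struct toy_entry *leaf = toy_entry_new(j);

			if (leaf)
				toy_insert(&mid->children, leaf);
		}
	}
	toy_freed = 0;
	toy_entry_delete(top);
	return toy_freed;
}
```

toy_demo_hierarchy() returns the number of entries freed.  Without the
while loop in toy_entry_delete(), only the top entry would be released
and the rest would show up as indirect leaks.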



* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-06  7:51 ` [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode Namhyung Kim
@ 2025-03-06 16:55   ` Ian Rogers
  2025-03-06 17:07     ` Namhyung Kim
  0 siblings, 1 reply; 8+ messages in thread
From: Ian Rogers @ 2025-03-06 16:55 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
>
> Ian told me that there are many memory leaks in the hierarchy mode.  I
> can easily reproduce it with the following commands.
>
>   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
>
>   $ perf record --latency -g -- ./perf test -w thloop
>
>   $ perf report -H --stdio
>   ...
>   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
>       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
>       #1 0x55ed3602346e in map__get util/map.h:189
>       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
>       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
>       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
>       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
>       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
>       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
>       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
>       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
>       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
>       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
>       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
>       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
>       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
>       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
>   ...
>
>   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
>   93
>
> I found that hist_entry__delete() failed to release child entries in the
> hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
> and call hist_entry__delete() recursively.
>
> After this change:
>
>   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
>   0
>
> Reported-by: Ian Rogers <irogers@google.com>
> Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> ---
>  tools/perf/util/hist.c | 9 +++++++++
>  1 file changed, 9 insertions(+)
>
> diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> --- a/tools/perf/util/hist.c
> +++ b/tools/perf/util/hist.c
> @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
>  {
>         struct hist_entry_ops *ops = he->ops;
>
> +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> +
> +               rb_erase_init(node, &he->hroot_out.rb_root);
> +
> +               hist_entry__delete(child);
> +       }

Thanks for the fix! A nit, iterating the rbtree of N nodes and calling
erase on the first entry, an O(log N) operation, means this is O(N *
log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
```
 * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
 * given type allowing the backing memory of @pos to be invalidated
```
which is O(N). I think this code would be better as something like:
```
struct hist_entry *pos, *tmp;
rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
        hist_entry__delete(pos);
```

Thanks!
Ian

> +
>         thread__zput(he->thread);
>         map_symbol__exit(&he->ms);
>
> --
> 2.48.1.711.g2feabab25a-goog
>


* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-06 16:55   ` Ian Rogers
@ 2025-03-06 17:07     ` Namhyung Kim
  2025-03-07  2:01       ` Namhyung Kim
  0 siblings, 1 reply; 8+ messages in thread
From: Namhyung Kim @ 2025-03-06 17:07 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

Hi Ian,

On Thu, Mar 06, 2025 at 08:55:05AM -0800, Ian Rogers wrote:
> On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > Ian told me that there are many memory leaks in the hierarchy mode.  I
> > can easily reproduce it with the following commands.
> >
> >   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
> >
> >   $ perf record --latency -g -- ./perf test -w thloop
> >
> >   $ perf report -H --stdio
> >   ...
> >   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
> >       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
> >       #1 0x55ed3602346e in map__get util/map.h:189
> >       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
> >       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
> >       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
> >       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
> >       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
> >       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
> >       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
> >       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
> >       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
> >       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
> >       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
> >       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
> >       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
> >       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
> >   ...
> >
> >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> >   93
> >
> > I found that hist_entry__delete() failed to release child entries in the
> > hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
> > and call hist_entry__delete() recursively.
> >
> > After this change:
> >
> >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> >   0
> >
> > Reported-by: Ian Rogers <irogers@google.com>
> > Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> > ---
> >  tools/perf/util/hist.c | 9 +++++++++
> >  1 file changed, 9 insertions(+)
> >
> > diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> > index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> > --- a/tools/perf/util/hist.c
> > +++ b/tools/perf/util/hist.c
> > @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
> >  {
> >         struct hist_entry_ops *ops = he->ops;
> >
> > +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> > +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> > +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> > +
> > +               rb_erase_init(node, &he->hroot_out.rb_root);
> > +
> > +               hist_entry__delete(child);
> > +       }
> 
> Thanks for the fix! A nit, iterating the rbtree of N nodes and calling
> erase on the first entry, an O(log N) operation, means this is O(N *
> log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
> https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
> ```
>  * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
>  * given type allowing the backing memory of @pos to be invalidated
> ```
> which is O(N). I think this code would be better as something like:
> ```
> struct hist_entry *pos, *tmp;
> rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
>         hist_entry__delete(pos);
> ```

Thanks for your review!  I was wondering if there's something like this.
Will update with that.

Thanks,
Namhyung



* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-06 17:07     ` Namhyung Kim
@ 2025-03-07  2:01       ` Namhyung Kim
  2025-03-07  4:50         ` Ian Rogers
  0 siblings, 1 reply; 8+ messages in thread
From: Namhyung Kim @ 2025-03-07  2:01 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

On Thu, Mar 06, 2025 at 09:07:00AM -0800, Namhyung Kim wrote:
> Hi Ian,
> 
> On Thu, Mar 06, 2025 at 08:55:05AM -0800, Ian Rogers wrote:
> > On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > >
> > > Ian told me that there are many memory leaks in the hierarchy mode.  I
> > > can easily reproduce it with the following commands.
> > >
> > >   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
> > >
> > >   $ perf record --latency -g -- ./perf test -w thloop
> > >
> > >   $ perf report -H --stdio
> > >   ...
> > >   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
> > >       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
> > >       #1 0x55ed3602346e in map__get util/map.h:189
> > >       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
> > >       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
> > >       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
> > >       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
> > >       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
> > >       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
> > >       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
> > >       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
> > >       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
> > >       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
> > >       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
> > >       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
> > >       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
> > >       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
> > >   ...
> > >
> > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > >   93
> > >
> > > I found that hist_entry__delete() failed to release child entries in the
> > > hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
> > > and call hist_entry__delete() recursively.
> > >
> > > After this change:
> > >
> > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > >   0
> > >
> > > Reported-by: Ian Rogers <irogers@google.com>
> > > Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> > > ---
> > >  tools/perf/util/hist.c | 9 +++++++++
> > >  1 file changed, 9 insertions(+)
> > >
> > > diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> > > index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> > > --- a/tools/perf/util/hist.c
> > > +++ b/tools/perf/util/hist.c
> > > @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
> > >  {
> > >         struct hist_entry_ops *ops = he->ops;
> > >
> > > +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> > > +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> > > +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> > > +
> > > +               rb_erase_init(node, &he->hroot_out.rb_root);
> > > +
> > > +               hist_entry__delete(child);
> > > +       }
> > 
> > Thanks for the fix! A nit, iterating the rbtree of N nodes and calling
> > erase on the first entry, an O(log N) operation, means this is O(N *
> > log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
> > https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
> > ```
> >  * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
> >  * given type allowing the backing memory of @pos to be invalidated
> > ```
> > which is O(N). I think this code would be better as something like:
> > ```
> > struct hist_entry *pos, *tmp;
> > rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
> >         hist_entry__delete(pos);
> > ```
> 
> Thanks for your review!  I was wondering if there's something like this.
> Will update with that.

I found the following comment:

 * Note, however, that it cannot handle other modifications that re-order the
 * rbtree it is iterating over. This includes calling rb_erase() on @pos, as
 * rb_erase() may rebalance the tree, causing us to miss some nodes.

So I cannot use this here.

Thanks,
Namhyung



* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-07  2:01       ` Namhyung Kim
@ 2025-03-07  4:50         ` Ian Rogers
  2025-03-07  5:19           ` Namhyung Kim
  0 siblings, 1 reply; 8+ messages in thread
From: Ian Rogers @ 2025-03-07  4:50 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

On Thu, Mar 6, 2025 at 6:01 PM Namhyung Kim <namhyung@kernel.org> wrote:
>
> On Thu, Mar 06, 2025 at 09:07:00AM -0800, Namhyung Kim wrote:
> > Hi Ian,
> >
> > On Thu, Mar 06, 2025 at 08:55:05AM -0800, Ian Rogers wrote:
> > > On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > > >
> > > > Ian told me that there are many memory leaks in the hierarchy mode.  I
> > > > can easily reproduce it with the following commands.
> > > >
> > > >   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
> > > >
> > > >   $ perf record --latency -g -- ./perf test -w thloop
> > > >
> > > >   $ perf report -H --stdio
> > > >   ...
> > > >   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
> > > >       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
> > > >       #1 0x55ed3602346e in map__get util/map.h:189
> > > >       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
> > > >       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
> > > >       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
> > > >       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
> > > >       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
> > > >       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
> > > >       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
> > > >       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
> > > >       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
> > > >       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
> > > >       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
> > > >       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
> > > >       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
> > > >       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
> > > >   ...
> > > >
> > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > >   93
> > > >
> > > > I found that hist_entry__delete() failed to release child entries in the
> > > > hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
> > > > and call hist_entry__delete() recursively.
> > > >
> > > > After this change:
> > > >
> > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > >   0
> > > >
> > > > Reported-by: Ian Rogers <irogers@google.com>
> > > > Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> > > > ---
> > > >  tools/perf/util/hist.c | 9 +++++++++
> > > >  1 file changed, 9 insertions(+)
> > > >
> > > > diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> > > > index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> > > > --- a/tools/perf/util/hist.c
> > > > +++ b/tools/perf/util/hist.c
> > > > @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
> > > >  {
> > > >         struct hist_entry_ops *ops = he->ops;
> > > >
> > > > +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> > > > +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> > > > +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> > > > +
> > > > +               rb_erase_init(node, &he->hroot_out.rb_root);
> > > > +
> > > > +               hist_entry__delete(child);
> > > > +       }
> > >
> > > Thanks for the fix! A nit, iterating the rbtree of N nodes and calling
> > > erase on the first entry, an O(log N) operation, means this is O(N *
> > > log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
> > > https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
> > > ```
> > >  * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
> > >  * given type allowing the backing memory of @pos to be invalidated
> > > ```
> > > which is O(N). I think this code would be better as something like:
> > > ```
> > > struct hist_entry *pos, *tmp;
> > > rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
> > >         hist_entry__delete(pos);
> > > ```
> >
> > Thanks for your review!  I was wondering if there's something like this.
> > Will update with that.
>
> I found the following comment:
>
>  * Note, however, that it cannot handle other modifications that re-order the
>  * rbtree it is iterating over. This includes calling rb_erase() on @pos, as
>  * rb_erase() may rebalance the tree, causing us to miss some nodes.
>
> So I cannot use this here.

I thought the tree is no longer in use after the hist_entry__delete,
so you don't need to maintain the child/parent pointers as everything
is going to get freed. Here is what I think is a similar use:
https://web.git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/trace/trace_stat.c#n51

Thanks,
Ian
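
The teardown pattern described above can be sketched in isolation.  This
is a toy, not the kernel rbtree: an unbalanced binary tree with parent
pointers, and toy_* names that are all hypothetical.  The next node is
looked up before the current one is freed and nothing is ever erased or
rebalanced, so each node is visited exactly once, which makes the whole
teardown O(N).

```c
#include <assert.h>
#include <stdlib.h>

struct toy_node {
	int key;
	struct toy_node *parent, *left, *right;
};

/* Plain binary-search-tree insert that also records the parent. */
static void toy_insert(struct toy_node **root, struct toy_node *n)
{
	struct toy_node *parent = NULL;

	while (*root) {
		parent = *root;
		root = n->key < parent->key ? &parent->left : &parent->right;
	}
	n->parent = parent;
	*root = n;
}

/* Descend to the leaf that post-order visits first within a subtree. */
static struct toy_node *toy_left_deepest(struct toy_node *n)
{
	assert(n);
	for (;;) {
		if (n->left)
			n = n->left;
		else if (n->right)
			n = n->right;
		else
			return n;
	}
}

static struct toy_node *toy_first_postorder(struct toy_node *root)
{
	return root ? toy_left_deepest(root) : NULL;
}

/*
 * Only the parent and its right link are read here.  Both are still
 * valid because post-order frees a parent after all of its children.
 */
static struct toy_node *toy_next_postorder(const struct toy_node *n)
{
	struct toy_node *parent = n->parent;

	if (parent && n != parent->right && parent->right)
		return toy_left_deepest(parent->right);
	return parent;
}

/* Free every node without erasing any of them; returns the count. */
static int toy_free_all(struct toy_node **root)
{
	struct toy_node *pos = toy_first_postorder(*root);
	struct toy_node *next;
	int freed = 0;

	while (pos) {
		next = toy_next_postorder(pos);	/* look ahead before freeing */
		free(pos);
		freed++;
		pos = next;
	}
	*root = NULL;	/* the tree is gone, reset the root */
	return freed;
}

/* Builds a tree of up to 'count' nodes and tears it down again. */
static int toy_demo_postorder(int count)
{
	struct toy_node *root = NULL;
	int i, made = 0;

	for (i = 0; i < count; i++) {
		struct toy_node *n = calloc(1, sizeof(*n));

		if (!n)
			break;
		n->key = (i * 7) % count;
		toy_insert(&root, n);
		made++;
	}
	return toy_free_all(&root) == made ? made : -1;
}
```

toy_demo_postorder() returns the number of nodes built and freed, or -1
if the two counts differ.  Resetting the root afterwards matters because
the links of the freed nodes are never cleaned up individually.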


* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-07  4:50         ` Ian Rogers
@ 2025-03-07  5:19           ` Namhyung Kim
  2025-03-07  6:03             ` Namhyung Kim
  0 siblings, 1 reply; 8+ messages in thread
From: Namhyung Kim @ 2025-03-07  5:19 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

On Thu, Mar 06, 2025 at 08:50:46PM -0800, Ian Rogers wrote:
> On Thu, Mar 6, 2025 at 6:01 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > On Thu, Mar 06, 2025 at 09:07:00AM -0800, Namhyung Kim wrote:
> > > Hi Ian,
> > >
> > > On Thu, Mar 06, 2025 at 08:55:05AM -0800, Ian Rogers wrote:
> > > > On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > > > >
> > > > > Ian told me that there are many memory leaks in the hierarchy mode.  I
> > > > > can easily reproduce it with the following commands.
> > > > >
> > > > >   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
> > > > >
> > > > >   $ perf record --latency -g -- ./perf test -w thloop
> > > > >
> > > > >   $ perf report -H --stdio
> > > > >   ...
> > > > >   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
> > > > >       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
> > > > >       #1 0x55ed3602346e in map__get util/map.h:189
> > > > >       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
> > > > >       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
> > > > >       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
> > > > >       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
> > > > >       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
> > > > >       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
> > > > >       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
> > > > >       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
> > > > >       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
> > > > >       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
> > > > >       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
> > > > >       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
> > > > >       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
> > > > >       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
> > > > >   ...
> > > > >
> > > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > > >   93
> > > > >
> > > > > I found that hist_entry__delete() failed to release child entries in the
> > > > > hierarchy tree (hroot_{in,out}).  It needs to iterate over the child entries
> > > > > and call hist_entry__delete() recursively.
> > > > >
> > > > > After this change:
> > > > >
> > > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > > >   0
> > > > >
> > > > > Reported-by: Ian Rogers <irogers@google.com>
> > > > > Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> > > > > ---
> > > > >  tools/perf/util/hist.c | 9 +++++++++
> > > > >  1 file changed, 9 insertions(+)
> > > > >
> > > > > diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> > > > > index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> > > > > --- a/tools/perf/util/hist.c
> > > > > +++ b/tools/perf/util/hist.c
> > > > > @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
> > > > >  {
> > > > >         struct hist_entry_ops *ops = he->ops;
> > > > >
> > > > > +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> > > > > +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> > > > > +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> > > > > +
> > > > > +               rb_erase_init(node, &he->hroot_out.rb_root);
> > > > > +
> > > > > +               hist_entry__delete(child);
> > > > > +       }
> > > >
> > > > Thanks for the fix! A nit, iterating the rbtree of N nodes and calling
> > > > erase on the first entry, an O(log N) operation, means this is O(N *
> > > > log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
> > > > https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
> > > > ```
> > > >  * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
> > > >  * given type allowing the backing memory of @pos to be invalidated
> > > > ```
> > > > which is O(N). I think this code would be better as something like:
> > > > ```
> > > > struct hist_entry *pos, *tmp;
> > > > rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
> > > >         hist_entry__delete(pos);
> > > > ```
> > >
> > > Thanks for your review!  I was wondering if there's something like this.
> > > Will update with that.
> >
> > I found the following comment:
> >
> >  * Note, however, that it cannot handle other modifications that re-order the
> >  * rbtree it is iterating over. This includes calling rb_erase() on @pos, as
> >  * rb_erase() may rebalance the tree, causing us to miss some nodes.
> >
> > So I cannot use this here.
> 
> I thought the tree is no longer in use after the hist_entry__delete,
> so you don't need to maintain the child/parent pointers as everything
> is going to get freed. Here is what I think is a similar use:
> https://web.git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/trace/trace_stat.c#n51

Oh, I see.  So it should not call rb_erase().  Will check again.

Thanks,
Namhyung



* Re: [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode
  2025-03-07  5:19           ` Namhyung Kim
@ 2025-03-07  6:03             ` Namhyung Kim
  0 siblings, 0 replies; 8+ messages in thread
From: Namhyung Kim @ 2025-03-07  6:03 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Arnaldo Carvalho de Melo, Kan Liang, Jiri Olsa, Adrian Hunter,
	Peter Zijlstra, Ingo Molnar, LKML, linux-perf-users

On Thu, Mar 06, 2025 at 09:19:09PM -0800, Namhyung Kim wrote:
> On Thu, Mar 06, 2025 at 08:50:46PM -0800, Ian Rogers wrote:
> > On Thu, Mar 6, 2025 at 6:01 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > >
> > > On Thu, Mar 06, 2025 at 09:07:00AM -0800, Namhyung Kim wrote:
> > > > Hi Ian,
> > > >
> > > > On Thu, Mar 06, 2025 at 08:55:05AM -0800, Ian Rogers wrote:
> > > > > On Wed, Mar 5, 2025 at 11:51 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > > > > >
> > > > > > Ian told me that there are many memory leaks in the hierarchy mode.  I
> > > > > > can easily reproduce it with the following commands.
> > > > > >
> > > > > >   $ make DEBUG=1 EXTRA_CFLAGS=-fsanitize=leak
> > > > > >
> > > > > >   $ perf record --latency -g -- ./perf test -w thloop
> > > > > >
> > > > > >   $ perf report -H --stdio
> > > > > >   ...
> > > > > >   Indirect leak of 168 byte(s) in 21 object(s) allocated from:
> > > > > >       #0 0x7f3414c16c65 in malloc ../../../../src/libsanitizer/lsan/lsan_interceptors.cpp:75
> > > > > >       #1 0x55ed3602346e in map__get util/map.h:189
> > > > > >       #2 0x55ed36024cc4 in hist_entry__init util/hist.c:476
> > > > > >       #3 0x55ed36025208 in hist_entry__new util/hist.c:588
> > > > > >       #4 0x55ed36027c05 in hierarchy_insert_entry util/hist.c:1587
> > > > > >       #5 0x55ed36027e2e in hists__hierarchy_insert_entry util/hist.c:1638
> > > > > >       #6 0x55ed36027fa4 in hists__collapse_insert_entry util/hist.c:1685
> > > > > >       #7 0x55ed360283e8 in hists__collapse_resort util/hist.c:1776
> > > > > >       #8 0x55ed35de0323 in report__collapse_hists /home/namhyung/project/linux/tools/perf/builtin-report.c:735
> > > > > >       #9 0x55ed35de15b4 in __cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1119
> > > > > >       #10 0x55ed35de43dc in cmd_report /home/namhyung/project/linux/tools/perf/builtin-report.c:1867
> > > > > >       #11 0x55ed35e66767 in run_builtin /home/namhyung/project/linux/tools/perf/perf.c:351
> > > > > >       #12 0x55ed35e66a0e in handle_internal_command /home/namhyung/project/linux/tools/perf/perf.c:404
> > > > > >       #13 0x55ed35e66b67 in run_argv /home/namhyung/project/linux/tools/perf/perf.c:448
> > > > > >       #14 0x55ed35e66eb0 in main /home/namhyung/project/linux/tools/perf/perf.c:556
> > > > > >       #15 0x7f340ac33d67 in __libc_start_call_main ../sysdeps/nptl/libc_start_call_main.h:58
> > > > > >   ...
> > > > > >
> > > > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > > > >   93
> > > > > >
> > > > > > > I found that hist_entry__delete() failed to release child entries in the
> > > > > > hierarchy tree (hroot_{in,out}).  It needs to iterate the child entries
> > > > > > and call hist_entry__delete() recursively.
> > > > > >
> > > > > > After this change:
> > > > > >
> > > > > >   $ perf report -H --stdio 2>&1 | grep -c '^Indirect leak'
> > > > > >   0
> > > > > >
> > > > > > Reported-by: Ian Rogers <irogers@google.com>
> > > > > > Signed-off-by: Namhyung Kim <namhyung@kernel.org>
> > > > > > ---
> > > > > >  tools/perf/util/hist.c | 9 +++++++++
> > > > > >  1 file changed, 9 insertions(+)
> > > > > >
> > > > > > diff --git a/tools/perf/util/hist.c b/tools/perf/util/hist.c
> > > > > > index fbf131aeae7ffe9b..bbc6a299b5106c3b 100644
> > > > > > --- a/tools/perf/util/hist.c
> > > > > > +++ b/tools/perf/util/hist.c
> > > > > > @@ -1385,6 +1385,15 @@ void hist_entry__delete(struct hist_entry *he)
> > > > > >  {
> > > > > >         struct hist_entry_ops *ops = he->ops;
> > > > > >
> > > > > > +       while (!RB_EMPTY_ROOT(&he->hroot_out.rb_root)) {
> > > > > > +               struct rb_node *node = rb_first(&he->hroot_out.rb_root);
> > > > > > +               struct hist_entry *child = rb_entry(node, struct hist_entry, rb_node);
> > > > > > +
> > > > > > +               rb_erase_init(node, &he->hroot_out.rb_root);
> > > > > > +
> > > > > > +               hist_entry__delete(child);
> > > > > > +       }
> > > > >
> > > > > > Thanks for the fix! A nit: iterating the rbtree of N nodes and calling
> > > > > > erase on the first entry, an O(log N) operation, makes this loop O(N *
> > > > > > log N). rbtree.h has rbtree_postorder_for_each_entry_safe:
> > > > > https://web.git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/include/linux/rbtree.h?h=perf-tools-next#n81
> > > > > ```
> > > > >  * rbtree_postorder_for_each_entry_safe - iterate in post-order over rb_root of
> > > > >  * given type allowing the backing memory of @pos to be invalidated
> > > > > ```
> > > > > > which is O(N). I think this code would be better as something like:
> > > > > ```
> > > > > struct hist_entry *pos, *tmp;
> > > > > > rbtree_postorder_for_each_entry_safe(pos, tmp, &he->hroot_out.rb_root, rb_node)
> > > > >         hist_entry__delete(pos);
> > > > > ```
> > > >
> > > > Thanks for your review!  I was wondering if there's something like this.
> > > > Will update with that.
> > >
> > > I found the following comment:
> > >
> > >  * Note, however, that it cannot handle other modifications that re-order the
> > >  * rbtree it is iterating over. This includes calling rb_erase() on @pos, as
> > >  * rb_erase() may rebalance the tree, causing us to miss some nodes.
> > >
> > > So I cannot use this here.
> > 
> > I thought the tree is no longer in use after the hist_entry__delete,
> > so you don't need to maintain the child/parent pointers as everything
> > is going to get freed. Here is what I think is a similar use:
> > https://web.git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/kernel/trace/trace_stat.c#n51
> 
> Oh, I see.  So it should not call rb_erase().  Will check again.

It works!  Will send v2 soon.

Thanks,
Namhyung
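
For readers following the thread, here is a minimal standalone sketch of
why the post-order walk can free every node in O(N) without rb_erase().
It is not the perf code and not the kernel rbtree: struct node,
tree_insert(), first_postorder(), next_postorder() and tree_free_all()
are hypothetical names on a plain unbalanced binary tree with parent
pointers. The idea it illustrates is the one discussed above: in
post-order, children are visited before their parent, so the successor
can be fetched first and the current node freed without touching the
tree structure.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a tree-linked entry (not struct hist_entry). */
struct node {
	struct node *parent, *left, *right;
	int key;
};

/* Plain unbalanced BST insert; returns the (possibly new) root. */
static struct node *tree_insert(struct node *root, int key)
{
	struct node *n = calloc(1, sizeof(*n));
	struct node **link = &root, *parent = NULL;

	if (!n)
		return root;	/* allocation failed: tree is left unchanged */
	n->key = key;
	while (*link) {
		parent = *link;
		link = key < parent->key ? &parent->left : &parent->right;
	}
	n->parent = parent;
	*link = n;
	return root;
}

/* Descend to the deepest node, preferring the left child at each step. */
static struct node *left_deepest(struct node *n)
{
	for (;;) {
		if (n->left)
			n = n->left;
		else if (n->right)
			n = n->right;
		else
			return n;
	}
}

static struct node *first_postorder(struct node *root)
{
	return root ? left_deepest(root) : NULL;
}

/*
 * Post-order successor.  Only the parent, which has not been visited
 * yet, is dereferenced; already visited nodes are never read.
 */
static struct node *next_postorder(struct node *n)
{
	struct node *parent = n->parent;

	/* n is the left child and a right sibling subtree exists */
	if (parent && parent->right && n != parent->right)
		return left_deepest(parent->right);
	return parent;
}

/* Free every node without erasing or rebalancing; returns the count. */
static size_t tree_free_all(struct node *root)
{
	struct node *pos = first_postorder(root), *next;
	size_t freed = 0;

	while (pos) {
		next = next_postorder(pos);	/* fetch before freeing pos */
		free(pos);
		freed++;
		pos = next;
	}
	return freed;
}
```

After such a walk the root pointer is stale, so the caller should reset
it to an empty tree instead of reusing it.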



end of thread, other threads:[~2025-03-07  6:03 UTC | newest]

Thread overview: 8+ messages
2025-03-06  7:51 [PATCH 1/2] perf report: Use map_symbol__copy() when copying callchians Namhyung Kim
2025-03-06  7:51 ` [PATCH 2/2] perf report: Fix memory leaks in the hierarchy mode Namhyung Kim
2025-03-06 16:55   ` Ian Rogers
2025-03-06 17:07     ` Namhyung Kim
2025-03-07  2:01       ` Namhyung Kim
2025-03-07  4:50         ` Ian Rogers
2025-03-07  5:19           ` Namhyung Kim
2025-03-07  6:03             ` Namhyung Kim
