From: Leo Yan <leo.yan@arm.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Yeoreum Yun <yeoreum.yun@arm.com>,
mingo@redhat.com, mingo@kernel.org, acme@kernel.org,
namhyung@kernel.org, mark.rutland@arm.com,
alexander.shishkin@linux.intel.com, jolsa@kernel.org,
irogers@google.com, adrian.hunter@intel.com,
kan.liang@linux.intel.com, linux-perf-users@vger.kernel.org,
linux-kernel@vger.kernel.org, David Wang <00107082@163.com>
Subject: Re: [PATCH 1/1] perf/core: fix dangling cgroup pointer in cpuctx
Date: Thu, 5 Jun 2025 18:21:26 +0100
Message-ID: <20250605172126.GG8020@e132581.arm.com>
In-Reply-To: <20250605123343.GD35970@noisy.programming.kicks-ass.net>
On Thu, Jun 05, 2025 at 02:33:43PM +0200, Peter Zijlstra wrote:
> On Thu, Jun 05, 2025 at 01:29:21PM +0200, Peter Zijlstra wrote:
>
> > But yes, slightly confusing. Let me see if I can make a less confusing
> > patch, and if not, sprinkle comments.
>
> I've settled on the below.
>
> ---
> Subject: perf: Fix cgroup state vs ERROR
> From: Peter Zijlstra <peterz@infradead.org>
> Date: Thu Jun 5 12:37:11 CEST 2025
>
> While chasing down a missing perf_cgroup_event_disable() elsewhere,
> Leo Yan found that both perf_put_aux_event() and
> perf_remove_sibling_event() were also missing one.
>
> Specifically, the rule is that events that switch to OFF,ERROR need to
> call perf_cgroup_event_disable().
>
> Unify the disable paths to ensure this.
>
> Fixes: ab43762ef010 ("perf: Allow normal events to output AUX data")
> Fixes: 9f0c4fa111dc ("perf/core: Add a new PERF_EV_CAP_SIBLING event capability")
> Reported-by: Leo Yan <leo.yan@arm.com>
> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
> ---
> kernel/events/core.c | 51 ++++++++++++++++++++++++++++++---------------------
> 1 file changed, 30 insertions(+), 21 deletions(-)
>
> --- a/kernel/events/core.c
> +++ b/kernel/events/core.c
> @@ -2149,8 +2149,9 @@ perf_aux_output_match(struct perf_event
> }
>
> static void put_event(struct perf_event *event);
> -static void event_sched_out(struct perf_event *event,
> - struct perf_event_context *ctx);
> +static void __event_disable(struct perf_event *event,
> + struct perf_event_context *ctx,
> + enum perf_event_state state);
>
> static void perf_put_aux_event(struct perf_event *event)
> {
> @@ -2183,8 +2184,7 @@ static void perf_put_aux_event(struct pe
> * state so that we don't try to schedule it again. Note
> * that perf_event_enable() will clear the ERROR status.
> */
> - event_sched_out(iter, ctx);
> - perf_event_set_state(event, PERF_EVENT_STATE_ERROR);
> + __event_disable(iter, ctx, PERF_EVENT_STATE_ERROR);
> }
> }
>
> @@ -2242,18 +2242,6 @@ static inline struct list_head *get_even
> &event->pmu_ctx->flexible_active;
> }
>
> -/*
> - * Events that have PERF_EV_CAP_SIBLING require being part of a group and
> - * cannot exist on their own, schedule them out and move them into the ERROR
> - * state. Also see _perf_event_enable(), it will not be able to recover
> - * this ERROR state.
> - */
> -static inline void perf_remove_sibling_event(struct perf_event *event)
> -{
> - event_sched_out(event, event->ctx);
> - perf_event_set_state(event, PERF_EVENT_STATE_ERROR);
> -}
> -
> static void perf_group_detach(struct perf_event *event)
> {
> struct perf_event *leader = event->group_leader;
> @@ -2289,8 +2277,15 @@ static void perf_group_detach(struct per
> */
> list_for_each_entry_safe(sibling, tmp, &event->sibling_list, sibling_list) {
>
> + /*
> + * Events that have PERF_EV_CAP_SIBLING require being part of
> + * a group and cannot exist on their own, schedule them out
> + * and move them into the ERROR state. Also see
> + * _perf_event_enable(), it will not be able to recover this
> + * ERROR state.
> + */
> if (sibling->event_caps & PERF_EV_CAP_SIBLING)
> - perf_remove_sibling_event(sibling);
> + __event_disable(sibling, ctx, PERF_EVENT_STATE_ERROR);
>
> sibling->group_leader = sibling;
> list_del_init(&sibling->sibling_list);
> @@ -2562,6 +2557,15 @@ static void perf_remove_from_context(str
> event_function_call(event, __perf_remove_from_context, (void *)flags);
> }
>
> +static void __event_disable(struct perf_event *event,
> + struct perf_event_context *ctx,
> + enum perf_event_state state)
> +{
> + event_sched_out(event, ctx);
> + perf_cgroup_event_disable(event, ctx);
> + perf_event_set_state(event, state);
> +}
> +
> /*
> * Cross CPU call to disable a performance event
> */
> @@ -2576,13 +2580,18 @@ static void __perf_event_disable(struct
> perf_pmu_disable(event->pmu_ctx->pmu);
> ctx_time_update_event(ctx, event);
>
> + /*
> + * When disabling a group leader, the whole group becomes ineligible
> + * to run, so schedule out the full group.
> + */
> if (event == event->group_leader)
> group_sched_out(event, ctx);
> - else
> - event_sched_out(event, ctx);
>
> - perf_event_set_state(event, PERF_EVENT_STATE_OFF);
> - perf_cgroup_event_disable(event, ctx);
> + /*
> + * But only mark the leader OFF; the siblings will remain
> + * INACTIVE.
> + */
> + __event_disable(event, ctx, PERF_EVENT_STATE_OFF);

Here, a group leader will invoke event_sched_out() twice: once in
group_sched_out() (above) and once in __event_disable(). This is fine,
as the second call to event_sched_out() bails out immediately due to
the following condition:

	if (event->state != PERF_EVENT_STATE_ACTIVE)
		return;

I think you have already noticed this minor redundancy.
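
To illustrate the point, here is a minimal user-space sketch of the leader
path. It is not kernel code: the model_* names, the simplified struct and
the counters are made up for illustration, and the cgroup bookkeeping is
reduced to a call counter. It only shows that the second sched-out is a
no-op, so the leader's sched-out work runs once and the event still ends
up OFF with the cgroup disable hook called once.

```c
#include <assert.h>

/* Hypothetical stand-ins for the perf event states; not the kernel enum. */
enum model_state {
	MODEL_ERROR = -2,
	MODEL_OFF = -1,
	MODEL_INACTIVE = 0,
	MODEL_ACTIVE = 1,
};

/* Hypothetical, heavily simplified event; counters exist only for the demo. */
struct model_event {
	enum model_state state;
	int sched_out_work;		/* times the real sched-out work ran */
	int cgroup_disable_calls;	/* times the cgroup hook was reached */
};

/* Models the early bail-out: only an ACTIVE event is scheduled out. */
void model_event_sched_out(struct model_event *event)
{
	if (event->state != MODEL_ACTIVE)
		return;

	event->sched_out_work++;
	event->state = MODEL_INACTIVE;
}

/* Models the unified helper: sched out, cgroup disable, then set state. */
void model_event_disable(struct model_event *event, enum model_state state)
{
	model_event_sched_out(event);
	event->cgroup_disable_calls++;
	event->state = state;
}

/*
 * Models disabling a group leader: the group sched-out reaches the leader
 * first, then the unified helper calls sched-out again.
 */
void model_disable_leader(struct model_event *leader)
{
	model_event_sched_out(leader);	/* stand-in for the group sched-out */
	model_event_disable(leader, MODEL_OFF);
}

/* Runs the leader path once and returns how often the real work ran. */
int model_check_leader_disable(void)
{
	struct model_event leader = { MODEL_ACTIVE, 0, 0 };

	model_disable_leader(&leader);
	assert(leader.state == MODEL_OFF);
	assert(leader.cgroup_disable_calls == 1);
	return leader.sched_out_work;
}
```

In this model, model_check_leader_disable() returns 1 even though the
sched-out function is entered twice.
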
Reviewed-by: Leo Yan <leo.yan@arm.com>
And thanks for the explanation in your other reply, it makes sense to
me.
Leo
> perf_pmu_enable(event->pmu_ctx->pmu);
> }