public inbox for linux-doc@vger.kernel.org
 help / color / mirror / Atom feed
From: Ben Horgan <ben.horgan@arm.com>
To: Zeng Heng <zengheng4@huawei.com>
Cc: amitsinght@marvell.com, baisheng.gao@unisoc.com,
	baolin.wang@linux.alibaba.com, carl@os.amperecomputing.com,
	dave.martin@arm.com, david@kernel.org, dfustini@baylibre.com,
	fenghuay@nvidia.com, gshan@redhat.com, james.morse@arm.com,
	jonathan.cameron@huawei.com, kobak@nvidia.com,
	lcherian@marvell.com, linux-arm-kernel@lists.infradead.org,
	linux-kernel@vger.kernel.org, peternewman@google.com,
	punit.agrawal@oss.qualcomm.com, quic_jiles@quicinc.com,
	reinette.chatre@intel.com, rohit.mathew@arm.com,
	scott@os.amperecomputing.com, sdonthineni@nvidia.com,
	tan.shaopeng@fujitsu.com, xhao@linux.alibaba.com,
	catalin.marinas@arm.com, will@kernel.org, corbet@lwn.net,
	maz@kernel.org, oupton@kernel.org, joey.gouly@arm.com,
	suzuki.poulose@arm.com, kvmarm@lists.linux.dev,
	linux-doc@vger.kernel.org
Subject: Re: [PATCH v5 26/41] arm_mpam: resctrl: Add monitor initialisation and domain boilerplate
Date: Thu, 26 Feb 2026 10:26:25 +0000	[thread overview]
Message-ID: <fd0d0075-a23d-4e15-9a9f-7a476f3296fb@arm.com> (raw)
In-Reply-To: <d233d198-9c68-b32e-4a0f-73a9314bdedd@huawei.com>

Hi Zeng,

On 2/26/26 03:47, Zeng Heng wrote:
> Hi Ben,
> 
> On 2026/2/25 1:57, Ben Horgan wrote:
>> Add the boilerplate that tells resctrl about the mpam monitors that are
>> available. resctrl expects all (non-telemetry) monitors to be on the
>> L3 and
>> so advertise them there and invent an L3 resctrl resource if required.
>> The
>> L3 cache itself has to exist as the cache ids are used as the domain
>> ids.
>>
>> Bring the resctrl monitor domains online and offline based on the cpus
>> they contain.
>>
>> Support for specific monitor types is left to later.
>>
>> Signed-off-by: Ben Horgan <ben.horgan@arm.com>
>> ---
>> New patch but mostly moved from the existing patches to
>> separate the monitors from the controls and the boilerplate
>> from the specific counters.
>> Use l3->mon_capable in resctrl_arch_mon_capable() as
>> resctrl_enable_mon_event() now returns a bool.
>> ---
>>   drivers/resctrl/mpam_internal.h |   7 ++
>>   drivers/resctrl/mpam_resctrl.c  | 142 +++++++++++++++++++++++++++++---
>>   2 files changed, 139 insertions(+), 10 deletions(-)
>>
> 
> [...]
> 
>> @@ -922,6 +1000,20 @@ mpam_resctrl_alloc_domain(unsigned int cpu,
>> struct mpam_resctrl_res *res)
>>       } else {
>>           pr_debug("Skipped control domain online - no controls\n");
>>       }
>> +
>> +    if (resctrl_arch_mon_capable()) {
>> +        mon_d = &dom->resctrl_mon_dom;
>> +        mpam_resctrl_domain_hdr_init(cpu, any_mon_comp, r->rid,
>> &mon_d->hdr);
>> +        mon_d->hdr.type = RESCTRL_MON_DOMAIN;
>> +        err = resctrl_online_mon_domain(r, &mon_d->hdr);
>> +        if (err)
>> +            goto offline_ctrl_domain;
>> +
>> +        mpam_resctrl_domain_insert(&r->mon_domains, &mon_d->hdr);
>> +    } else {
>> +        pr_debug("Skipped monitor domain online - no monitors\n");
>> +    }
>> +
>>       return dom;
>>   
> 
> I noticed that resctrl_arch_mon_capable() only performs checks for L3
> monitoring functionality. This leads to an issue on platforms that
> include L2 monitoring capabilities, where the code incorrectly enters
> this branch and triggers the following warning by
> mpam_resctrl_domain_insert():
> 
> [   22.867070] ------------[ cut here ]------------
> [   22.867073] WARNING: drivers/resctrl/mpam_resctrl.c:1495 at
> mpam_resctrl_domain_insert+0x74/0x80, CPU#2: cpuhp/2/25
> [   29.376035] Modules linked in:
> [   29.379080] CPU: 2 UID: 0 PID: 25 Comm: cpuhp/2 Not tainted 7.0.0-
> rc1-g4288ec146462 #30 PREEMPT
> [   29.387853] Hardware name: To Be Filled By O.E.M. 183.0/To Be Filled
> By O.E.M., BIOS 183.0 02/12/2026
> [   29.397058] pstate: 61400009 (nZCv daif +PAN -UAO -TCO +DIT -SSBS
> BTYPE=--)
> [   29.404007] pc : mpam_resctrl_domain_insert+0x74/0x80
> [   29.409048] lr : mpam_resctrl_domain_insert+0x34/0x80
> [   29.414088] sp : ffff8000876abc60
>  ...
> [   29.488625] Call trace:
> [   29.491060]  mpam_resctrl_domain_insert+0x74/0x80 (P)
> [   29.496100]  mpam_resctrl_online_cpu+0x2b4/0x428
> [   29.500706]  mpam_cpu_online+0x274/0x298
> [   29.504618]  cpuhp_invoke_callback+0x104/0x20c
> [   29.509052]  cpuhp_thread_fun+0xa4/0x17c
> [   29.512963]  smpboot_thread_fn+0x220/0x24c
> [   29.517048]  kthread+0x120/0x12c
> [   29.520265]  ret_from_fork+0x10/0x20
> [   29.523830] ---[ end trace 0000000000000000 ]---

Thanks for reporting this bug. It looks to be because resctrl_arch_mon_capable() is telling us if
there is any mon capable resource when really what we want to know is if this resource is mon capable.
The pattern occurs in a few places. Does this diff help?

diff --git a/drivers/resctrl/mpam_resctrl.c b/drivers/resctrl/mpam_resctrl.c
index 694ea8548a05..19b306017845 100644
--- a/drivers/resctrl/mpam_resctrl.c
+++ b/drivers/resctrl/mpam_resctrl.c
@@ -1543,7 +1543,7 @@ mpam_resctrl_alloc_domain(unsigned int cpu, struct mpam_resctrl_res *res)
 	if (!dom)
 		return ERR_PTR(-ENOMEM);
 
-	if (resctrl_arch_alloc_capable()) {
+	if (r->alloc_capable) {
 		dom->ctrl_comp = ctrl_comp;
 
 		ctrl_d = &dom->resctrl_ctrl_dom;
@@ -1558,7 +1558,7 @@ mpam_resctrl_alloc_domain(unsigned int cpu, struct mpam_resctrl_res *res)
 		pr_debug("Skipped control domain online - no controls\n");
 	}
 
-	if (resctrl_arch_mon_capable()) {
+	if (r->mon_capable) {
 		struct mpam_component *any_mon_comp;
 		struct mpam_resctrl_mon *mon;
 		enum resctrl_event_id eventid;
@@ -1603,7 +1603,7 @@ mpam_resctrl_alloc_domain(unsigned int cpu, struct mpam_resctrl_res *res)
 	return dom;
 
 offline_ctrl_domain:
-	if (resctrl_arch_alloc_capable()) {
+	if (r->alloc_capable) {
 		mpam_resctrl_offline_domain_hdr(cpu, &ctrl_d->hdr);
 		resctrl_offline_ctrl_domain(r, ctrl_d);
 	}
@@ -1671,6 +1671,7 @@ int mpam_resctrl_online_cpu(unsigned int cpu)
 	guard(mutex)(&domain_list_lock);
 	for_each_mpam_resctrl_control(res, rid) {
 		struct mpam_resctrl_dom *dom;
+		struct rdt_resource *r = &res->resctrl_res;
 
 		if (!res->class)
 			continue;	// dummy_resource;
@@ -1679,12 +1680,12 @@ int mpam_resctrl_online_cpu(unsigned int cpu)
 		if (!dom) {
 			dom = mpam_resctrl_alloc_domain(cpu, res);
 		} else {
-			if (resctrl_arch_alloc_capable()) {
+			if (r->alloc_capable) {
 				struct rdt_ctrl_domain *ctrl_d = &dom->resctrl_ctrl_dom;
 
 				mpam_resctrl_online_domain_hdr(cpu, &ctrl_d->hdr);
 			}
-			if (resctrl_arch_mon_capable()) {
+			if (r->mon_capable) {
 				struct rdt_l3_mon_domain *mon_d = &dom->resctrl_mon_dom;
 
 				mpam_resctrl_online_domain_hdr(cpu, &mon_d->hdr);
@@ -1712,6 +1713,7 @@ void mpam_resctrl_offline_cpu(unsigned int cpu)
 		struct rdt_l3_mon_domain *mon_d;
 		struct rdt_ctrl_domain *ctrl_d;
 		bool ctrl_dom_empty, mon_dom_empty;
+		struct rdt_resource *r = &res->resctrl_res;
 
 		if (!res->class)
 			continue;	// dummy resource
@@ -1720,7 +1722,7 @@ void mpam_resctrl_offline_cpu(unsigned int cpu)
 		if (WARN_ON_ONCE(!dom))
 			continue;
 
-		if (resctrl_arch_alloc_capable()) {
+		if (r->alloc_capable) {
 			ctrl_d = &dom->resctrl_ctrl_dom;
 			ctrl_dom_empty = mpam_resctrl_offline_domain_hdr(cpu, &ctrl_d->hdr);
 			if (ctrl_dom_empty)
@@ -1729,7 +1731,7 @@ void mpam_resctrl_offline_cpu(unsigned int cpu)
 			ctrl_dom_empty = true;
 		}
 
-		if (resctrl_arch_mon_capable()) {
+		if (r->mon_capable) {
 			mon_d = &dom->resctrl_mon_dom;
 			mon_dom_empty = mpam_resctrl_offline_domain_hdr(cpu, &mon_d->hdr);
 			if (mon_dom_empty)


> 
> 
> To preserve the existing public interface of resctrl_arch_mon_capable(),
> please consider the following approach:
> 
> diff --git a/drivers/resctrl/mpam_resctrl.c b/drivers/resctrl/
> mpam_resctrl.c
> index 694ea8548a05..b06a89494ff0 100644
> --- a/drivers/resctrl/mpam_resctrl.c
> +++ b/drivers/resctrl/mpam_resctrl.c
> @@ -1563,6 +1563,10 @@ mpam_resctrl_alloc_domain(unsigned int cpu,
> struct mpam_resctrl_res *res)
>         if (resctrl_arch_mon_capable()) {
>                 struct mpam_component *any_mon_comp;
>                 struct mpam_resctrl_mon *mon;
>                 enum resctrl_event_id eventid;
> 
> +               /* TODO: Only supports L3 monitor type currently. */
> +               if (r->rid != RDT_RESOURCE_L3)
> +                       return dom;
> 
> 
> 
> Best regards,
> Zeng Heng

 
Thanks,

Ben


  reply	other threads:[~2026-02-26 10:26 UTC|newest]

Thread overview: 73+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-24 17:56 [PATCH v5 00/41] arm_mpam: Add KVM/arm64 and resctrl glue code Ben Horgan
2026-02-24 17:56 ` [PATCH v5 01/41] arm64/sysreg: Add MPAMSM_EL1 register Ben Horgan
2026-02-24 17:56 ` [PATCH v5 02/41] KVM: arm64: Preserve host MPAM configuration when changing traps Ben Horgan
2026-03-02 17:52   ` Marc Zyngier
2026-02-24 17:56 ` [PATCH v5 03/41] KVM: arm64: Make MPAMSM_EL1 accesses UNDEF Ben Horgan
2026-03-02 17:54   ` Marc Zyngier
2026-02-24 17:56 ` [PATCH v5 04/41] arm64: mpam: Context switch the MPAM registers Ben Horgan
2026-02-24 17:56 ` [PATCH v5 05/41] arm64: mpam: Re-initialise MPAM regs when CPU comes online Ben Horgan
2026-02-24 17:56 ` [PATCH v5 06/41] arm64: mpam: Drop the CONFIG_EXPERT restriction Ben Horgan
2026-03-09  6:42   ` Gavin Shan
2026-02-24 17:56 ` [PATCH v5 07/41] arm64: mpam: Advertise the CPUs MPAM limits to the driver Ben Horgan
2026-03-09  6:43   ` Gavin Shan
2026-02-24 17:56 ` [PATCH v5 08/41] arm64: mpam: Add cpu_pm notifier to restore MPAM sysregs Ben Horgan
2026-02-24 17:56 ` [PATCH v5 09/41] arm64: mpam: Initialise and context switch the MPAMSM_EL1 register Ben Horgan
2026-02-24 17:56 ` [PATCH v5 10/41] arm64: mpam: Add helpers to change a task or cpu's MPAM PARTID/PMG values Ben Horgan
2026-03-09  6:44   ` Gavin Shan
2026-02-24 17:56 ` [PATCH v5 11/41] KVM: arm64: Force guest EL1 to use user-space's partid configuration Ben Horgan
2026-03-02 17:58   ` Marc Zyngier
2026-03-09  6:45   ` Gavin Shan
2026-02-24 17:56 ` [PATCH v5 12/41] KVM: arm64: Use kernel-space partid configuration for hypercalls Ben Horgan
2026-03-02 18:15   ` Marc Zyngier
2026-03-03 16:33     ` Ben Horgan
2026-03-13  9:43       ` Ben Horgan
2026-02-24 17:56 ` [PATCH v5 13/41] arm_mpam: resctrl: Add boilerplate cpuhp and domain allocation Ben Horgan
2026-03-10  6:17   ` Gavin Shan
2026-03-10 10:34     ` Ben Horgan
2026-02-24 17:56 ` [PATCH v5 14/41] arm_mpam: resctrl: Pick the caches we will use as resctrl resources Ben Horgan
2026-02-24 17:56 ` [PATCH v5 15/41] arm_mpam: resctrl: Implement resctrl_arch_reset_all_ctrls() Ben Horgan
2026-02-25 11:03   ` Jonathan Cameron
2026-02-24 17:56 ` [PATCH v5 16/41] arm_mpam: resctrl: Add resctrl_arch_get_config() Ben Horgan
2026-02-24 17:56 ` [PATCH v5 17/41] arm_mpam: resctrl: Implement helpers to update configuration Ben Horgan
2026-02-24 17:56 ` [PATCH v5 18/41] arm_mpam: resctrl: Add plumbing against arm64 task and cpu hooks Ben Horgan
2026-02-24 17:56 ` [PATCH v5 19/41] arm_mpam: resctrl: Add CDP emulation Ben Horgan
2026-02-25  6:25   ` Zeng Heng
2026-02-24 17:56 ` [PATCH v5 20/41] arm_mpam: resctrl: Convert to/from MPAMs fixed-point formats Ben Horgan
2026-02-24 17:57 ` [PATCH v5 21/41] arm_mpam: resctrl: Add kunit test for control format conversions Ben Horgan
2026-02-24 17:57 ` [PATCH v5 22/41] arm_mpam: resctrl: Add rmid index helpers Ben Horgan
2026-02-24 17:57 ` [PATCH v5 23/41] arm_mpam: resctrl: Add kunit test for rmid idx conversions Ben Horgan
2026-02-24 17:57 ` [PATCH v5 24/41] arm_mpam: resctrl: Wait for cacheinfo to be ready Ben Horgan
2026-02-24 17:57 ` [PATCH v5 25/41] arm_mpam: resctrl: Add support for 'MB' resource Ben Horgan
2026-02-24 17:57 ` [PATCH v5 26/41] arm_mpam: resctrl: Add monitor initialisation and domain boilerplate Ben Horgan
2026-02-25 11:14   ` Jonathan Cameron
2026-02-26  3:47   ` Zeng Heng
2026-02-26 10:26     ` Ben Horgan [this message]
2026-02-27  3:01       ` Zeng Heng
2026-02-24 17:57 ` [PATCH v5 27/41] arm_mpam: resctrl: Add support for csu counters Ben Horgan
2026-02-24 17:57 ` [PATCH v5 28/41] arm_mpam: resctrl: Pick classes for use as mbm counters Ben Horgan
2026-02-24 17:57 ` [PATCH v5 29/41] arm_mpam: resctrl: Pre-allocate free running monitors Ben Horgan
2026-02-24 17:57 ` [PATCH v5 30/41] arm_mpam: resctrl: Allow resctrl to allocate monitors Ben Horgan
2026-02-24 17:57 ` [PATCH v5 31/41] arm_mpam: resctrl: Add resctrl_arch_rmid_read() and resctrl_arch_reset_rmid() Ben Horgan
2026-03-07  9:29   ` Zeng Heng
2026-03-09 16:30     ` Ben Horgan
2026-03-10  3:23       ` Zeng Heng
2026-02-24 17:57 ` [PATCH v5 32/41] arm_mpam: resctrl: Update the rmid reallocation limit Ben Horgan
2026-02-24 17:57 ` [PATCH v5 33/41] arm_mpam: resctrl: Add empty definitions for assorted resctrl functions Ben Horgan
2026-02-24 17:57 ` [PATCH v5 34/41] arm64: mpam: Select ARCH_HAS_CPU_RESCTRL Ben Horgan
2026-02-24 17:57 ` [PATCH v5 35/41] arm_mpam: resctrl: Call resctrl_init() on platforms that can support resctrl Ben Horgan
2026-02-24 17:57 ` [PATCH v5 36/41] arm_mpam: Add quirk framework Ben Horgan
2026-02-24 17:57 ` [PATCH v5 37/41] arm_mpam: Add workaround for T241-MPAM-1 Ben Horgan
2026-02-24 17:57 ` [PATCH v5 38/41] arm_mpam: Add workaround for T241-MPAM-4 Ben Horgan
2026-03-01 17:28   ` Fenghua Yu
2026-03-02 17:11     ` Ben Horgan
2026-03-09 17:39       ` Fenghua Yu
2026-03-10 11:26         ` Ben Horgan
2026-02-24 17:57 ` [PATCH v5 39/41] arm_mpam: Add workaround for T241-MPAM-6 Ben Horgan
2026-02-24 17:57 ` [PATCH v5 40/41] arm_mpam: Quirk CMN-650's CSU NRDY behaviour Ben Horgan
2026-02-24 17:57 ` [PATCH v5 41/41] arm64: mpam: Add initial MPAM documentation Ben Horgan
2026-02-25 11:01   ` Jonathan Cameron
2026-02-25 21:10 ` [PATCH v5 00/41] arm_mpam: Add KVM/arm64 and resctrl glue code Ben Horgan
2026-02-27 17:04   ` Catalin Marinas
2026-02-26  7:34 ` Zeng Heng
2026-03-03 20:18 ` Punit Agrawal
2026-03-04  9:42   ` Ben Horgan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=fd0d0075-a23d-4e15-9a9f-7a476f3296fb@arm.com \
    --to=ben.horgan@arm.com \
    --cc=amitsinght@marvell.com \
    --cc=baisheng.gao@unisoc.com \
    --cc=baolin.wang@linux.alibaba.com \
    --cc=carl@os.amperecomputing.com \
    --cc=catalin.marinas@arm.com \
    --cc=corbet@lwn.net \
    --cc=dave.martin@arm.com \
    --cc=david@kernel.org \
    --cc=dfustini@baylibre.com \
    --cc=fenghuay@nvidia.com \
    --cc=gshan@redhat.com \
    --cc=james.morse@arm.com \
    --cc=joey.gouly@arm.com \
    --cc=jonathan.cameron@huawei.com \
    --cc=kobak@nvidia.com \
    --cc=kvmarm@lists.linux.dev \
    --cc=lcherian@marvell.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=maz@kernel.org \
    --cc=oupton@kernel.org \
    --cc=peternewman@google.com \
    --cc=punit.agrawal@oss.qualcomm.com \
    --cc=quic_jiles@quicinc.com \
    --cc=reinette.chatre@intel.com \
    --cc=rohit.mathew@arm.com \
    --cc=scott@os.amperecomputing.com \
    --cc=sdonthineni@nvidia.com \
    --cc=suzuki.poulose@arm.com \
    --cc=tan.shaopeng@fujitsu.com \
    --cc=will@kernel.org \
    --cc=xhao@linux.alibaba.com \
    --cc=zengheng4@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox