public inbox for linux-arm-kernel@lists.infradead.org
 help / color / mirror / Atom feed
From: Reinette Chatre <reinette.chatre@intel.com>
To: Peter Newman <peternewman@google.com>, Ben Horgan <ben.horgan@arm.com>
Cc: <amitsinght@marvell.com>, <baisheng.gao@unisoc.com>,
	<baolin.wang@linux.alibaba.com>, <carl@os.amperecomputing.com>,
	<dave.martin@arm.com>, <david@kernel.org>,
	<dfustini@baylibre.com>, <fenghuay@nvidia.com>,
	<gshan@redhat.com>, <james.morse@arm.com>,
	<jonathan.cameron@huawei.com>, <kobak@nvidia.com>,
	<lcherian@marvell.com>, <linux-arm-kernel@lists.infradead.org>,
	<linux-kernel@vger.kernel.org>, <punit.agrawal@oss.qualcomm.com>,
	<quic_jiles@quicinc.com>, <rohit.mathew@arm.com>,
	<scott@os.amperecomputing.com>, <sdonthineni@nvidia.com>,
	<tan.shaopeng@fujitsu.com>, <xhao@linux.alibaba.com>,
	<catalin.marinas@arm.com>, <will@kernel.org>, <corbet@lwn.net>,
	<maz@kernel.org>, <oupton@kernel.org>, <joey.gouly@arm.com>,
	<suzuki.poulose@arm.com>, <kvmarm@lists.linux.dev>
Subject: Re: [PATCH v3 28/47] arm_mpam: resctrl: Add support for csu counters
Date: Wed, 21 Jan 2026 09:58:45 -0800	[thread overview]
Message-ID: <8858c7ee-1af1-4bca-8ea5-ec62c78dcde5@intel.com> (raw)
In-Reply-To: <CALPaoCjK=VoYriKCGUrtTs=xKxkU34nnwxHKjjy6QVbmD99yhw@mail.gmail.com>

Hi Ben and Peter,

On 1/20/26 7:28 AM, Peter Newman wrote:
> Hi Ben,
> 
> On Fri, Jan 16, 2026 at 11:29 AM Ben Horgan <ben.horgan@arm.com> wrote:
>>
>> Hi Reinette, Peter,
>>
>> On 1/15/26 18:54, Reinette Chatre wrote:
>>> Hi Ben,
>>>
>>> On 1/15/26 7:43 AM, Ben Horgan wrote:
>>>> On 1/13/26 23:14, Reinette Chatre wrote:
>>>>> On 1/12/26 8:58 AM, Ben Horgan wrote:
>>> ...
>>>>>> +
>>>>>> +          /*
>>>>>> +           * Unfortunately, num_rmid doesn't mean anything for
>>>>>> +           * mpam, and its exposed to user-space!
>>>>>> +           *
>>>>>
>>>>> The idea of adding a per MON group "num_mon_groups" file has been floated a couple of
>>>>> times now. I have not heard any objections against doing something like this.
>>>>> https://lore.kernel.org/all/cbe665c2-fe83-e446-1696-7115c0f9fd76@arm.com/
>>>>> https://lore.kernel.org/lkml/46767ca7-1f1b-48e8-8ce6-be4b00d129f9@intel.com/
>>>>
>>>> Hmm, I see now that 'num_rmid' is documented as an upper bound and so
>>>> neither 1 or mpam_pmg_max + 1 agree with the documentation.
>>>>
>>>> "
>>>> "num_rmids":
>>>>              The number of RMIDs available. This is the
>>>>              upper bound for how many "CTRL_MON" + "MON"
>>>>              groups can be created.
>>>> "
>>>
>>> Please note that this documentation has been refactored (without changing its
>>> meaning). The above quoted text is specific to L3 monitoring and with the
>>> addition of telemetry monitoring the relevant text now reads:
>>>       The upper bound for how many "CTRL_MON" + "MON" can be created
>>>       is the smaller of the L3_MON and PERF_PKG_MON "num_rmids" values.
>>>
>>>>
>>>> So, if I understand correctly you're proposing setting
>>>> num_rmids = num_pmg * num_partids on arm platforms and that in the
>>>> interim this can then be used to calculate the num_pmg by calculating
>>>> num_closid/num_rmid but that a per CTRL_MON num_mon_groups should be
>>>> added to make this consistent across architectures?
>>>
>>> Yes for num_rmids = num_pmg * num_partids.
>>
>> Ok, I don't really see another option.
>>
>> The motivation for this is that to me
>>> this looks like the value that best matches the num_rmids documentation. I understand
>>> the RMID vs PMG is difficult so my proposal is certainly not set in stone and I would like to
>>> hear motivation for different interpretations. "calculating num_pmg" is not obvious
>>> though. I interpret "num_pmg" here as number of monitor groups per control group and on
>>> an Arm system this is indeed num_closid/num_rmids (if num_rmids = num_pmg * num_partids)
>>> but on x86 it is just num_rmids. Having user space depend on such computation to determine how
>>> many monitor groups per control group would thus require that user space knows whether the
>>> underlying system is Arm or x86 and would go against goal of having resctrl as a generic interface.
>>>
>>> The way forward may be to deprecate (somehow) num_rmids and transition to something
>>> like "num_mon_groups" but it is currently vague how "num_mon_groups" may look like. That thread
>>> (https://lore.kernel.org/lkml/46767ca7-1f1b-48e8-8ce6-be4b00d129f9@intel.com/) fizzled
>>> out after raising a few options how it may look.
>>>
>>> Another proposal was to add a "mon_id_includes_control_id" to use as another "guide" to
>>> determine how many monitoring groups can be created but at the time it seemed an intermediary
>>> step for user to determine the number of monitor groups that resctrl can also provide.
>>> https://lore.kernel.org/lkml/CALPaoChad6=xqz+BQQd=dB915xhj1gusmcrS9ya+T2GyhTQc5Q@mail.gmail.com/
>>
>> Just thinking about it now but the "mon_id_includes_control_id" option
>> seems the best to me as it is a single bit option that along with
>> "num_rmids" let's you know which monitor groups you can create and if
>> it's sensible to move monitor groups between CTRL MON groups.
>>
>> The "num_mon_groups" per CTRL MON group would also need to be
>> interpreted together with "num_rmid" to know if it is a global or per
>> CTRL MON upper bound. This option also uses multiple files to give the
>> same bit of information.
>>
>>>
>>> Making this consistent across architectures is the goal since resctrl aims to be
>>> a generic interface. Users should not need to do things like infer which system they
>>> are running on by looking at output of resctrl files as mentioned.
>>>
>>> fwiw ...  there seems to be a usage by Google to compare num_rmids to num_closids to determine
>>> how to interact with resctrl:
>>> https://lore.kernel.org/lkml/CALPaoCgSO7HzK9BjyM8yL50oPyq9kBj64Nkgyo1WEJrWy5uHUg@mail.gmail.com/
>>
>> Unfortunately, it looks like we're about to break this heuristic :( At
>> least, until a way to get this information generically in resctrl is
>> decided upon.
> 
> We actually ended up going with the "mon_id_includes_control_id" approach.

Thank you for confirming. I was hoping we could deprecate num_rmids after introducing a
per resource group file but this does not seem to support all the use cases as highlighted by
Ben. 

As I see it, a name like "mon_id_includes_control_id" also implies that "num_rmids", perhaps
linked to a new "num_mon_ids" as Peter suggested in [2], should contain num_pmg * num_partids.

One concern from earlier was that "mon_id_includes_control_id" may be used as a
heuristic for whether monitor groups can be moved or not. Instead I seem to remember that
there was a plan for MPAM to support moving monitor groups, with the caveat that
counters will reset for which resctrl may need another flag.

> The property it represents is rather fundamental to what a monitoring
> group actually is and is a low-level implementation detail that is
> difficult to hide. Google generally needs support for as many
> monitoring IDs as jobs it expects to be able to run on a machine, so
> the number of monitoring groups will be routinely maxed out (and there
> will be some jobs that are forever stuck in the default group because
> no RMIDs were free at the time it started[1])
> 
> Thanks,
> -Peter
> 
> [1] https://lore.kernel.org/lkml/CALPaoCjTwySGX9i7uAtCWLKQpmELKP55xDLJhHmUve8ptsfFTw@mail.gmail.com/

Reinette

[2] https://lore.kernel.org/lkml/CALPaoChad6=xqz+BQQd=dB915xhj1gusmcrS9ya+T2GyhTQc5Q@mail.gmail.com/


  reply	other threads:[~2026-01-21 17:59 UTC|newest]

Thread overview: 160+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-01-12 16:58 [PATCH v3 00/47] arm_mpam: Add KVM/arm64 and resctrl glue code Ben Horgan
2026-01-12 16:58 ` [PATCH v3 01/47] arm_mpam: Remove duplicate linux/srcu.h header Ben Horgan
2026-01-12 17:13   ` Fenghua Yu
2026-01-15  2:12   ` Gavin Shan
2026-01-12 16:58 ` [PATCH v3 02/47] arm_mpam: Use non-atomic bitops when modifying feature bitmap Ben Horgan
2026-01-15  2:14   ` Gavin Shan
2026-01-16 11:57   ` Catalin Marinas
2026-01-16 12:02     ` Ben Horgan
2026-01-16 12:12       ` Ben Horgan
2026-01-16 15:51         ` Catalin Marinas
2026-01-12 16:58 ` [PATCH v3 03/47] arm64/sysreg: Add MPAMSM_EL1 register Ben Horgan
2026-01-15  2:16   ` Gavin Shan
2026-01-15 17:59   ` Catalin Marinas
2026-01-12 16:58 ` [PATCH v3 04/47] KVM: arm64: Preserve host MPAM configuration when changing traps Ben Horgan
2026-01-15  2:33   ` Gavin Shan
2026-01-12 16:58 ` [PATCH v3 05/47] KVM: arm64: Make MPAMSM_EL1 accesses UNDEF Ben Horgan
2026-01-15  2:34   ` Gavin Shan
2026-01-12 16:58 ` [PATCH v3 06/47] arm64: mpam: Context switch the MPAM registers Ben Horgan
2026-01-15  6:47   ` Gavin Shan
2026-01-15 12:09     ` Jonathan Cameron
2026-01-19 14:00       ` Ben Horgan
2026-01-20  1:42         ` Gavin Shan
2026-01-15 17:58   ` Catalin Marinas
2026-01-19 12:23     ` Ben Horgan
2026-01-23 14:29       ` Catalin Marinas
2026-01-26 14:30         ` Ben Horgan
2026-01-26 14:50           ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 07/47] arm64: mpam: Re-initialise MPAM regs when CPU comes online Ben Horgan
2026-01-15  6:50   ` Gavin Shan
2026-01-15 18:14   ` Catalin Marinas
2026-01-19 13:38     ` Ben Horgan
2026-01-19 14:22       ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 08/47] arm64: mpam: Advertise the CPUs MPAM limits to the driver Ben Horgan
2026-01-15 18:16   ` Catalin Marinas
2026-01-19  6:37   ` Gavin Shan
2026-01-19 14:49     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 09/47] arm64: mpam: Add cpu_pm notifier to restore MPAM sysregs Ben Horgan
2026-01-15 18:20   ` Catalin Marinas
2026-01-19  6:40   ` Gavin Shan
2026-01-19  6:50   ` Gavin Shan
2026-01-19 15:08     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 10/47] arm64: mpam: Initialise and context switch the MPAMSM_EL1 register Ben Horgan
2026-01-15 19:08   ` Catalin Marinas
2026-01-19 13:40     ` Ben Horgan
2026-01-19  6:51   ` Gavin Shan
2026-01-19 15:31     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 11/47] arm64: mpam: Add helpers to change a task or cpu's MPAM PARTID/PMG values Ben Horgan
2026-01-15 19:13   ` Catalin Marinas
2026-01-19  6:56     ` Gavin Shan
2026-01-19 15:47     ` Ben Horgan
2026-01-19  7:01   ` Gavin Shan
2026-01-19 15:49     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 12/47] KVM: arm64: Force guest EL1 to use user-space's partid configuration Ben Horgan
2026-01-13 14:19   ` Jonathan Cameron
2026-01-14 12:06   ` Marc Zyngier
2026-01-14 14:50     ` Ben Horgan
2026-01-15  9:05       ` Marc Zyngier
2026-01-15 11:14         ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 13/47] KVM: arm64: Use kernel-space partid configuration for hypercalls Ben Horgan
2026-01-13 14:21   ` Jonathan Cameron
2026-01-13 14:35     ` Ben Horgan
2026-01-14 12:09   ` Marc Zyngier
2026-01-14 14:39     ` Ben Horgan
2026-01-14 16:50       ` Ben Horgan
2026-01-14 17:50         ` Marc Zyngier
2026-01-12 16:58 ` [PATCH v3 14/47] arm_mpam: resctrl: Add boilerplate cpuhp and domain allocation Ben Horgan
2026-01-13 16:49   ` Reinette Chatre
2026-01-19 17:20     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 15/47] arm_mpam: resctrl: Sort the order of the domain lists Ben Horgan
2026-01-12 16:58 ` [PATCH v3 16/47] arm_mpam: resctrl: Pick the caches we will use as resctrl resources Ben Horgan
2026-01-12 16:58 ` [PATCH v3 17/47] arm_mpam: resctrl: Implement resctrl_arch_reset_all_ctrls() Ben Horgan
2026-01-13 14:46   ` Jonathan Cameron
2026-01-13 14:58     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 18/47] arm_mpam: resctrl: Add resctrl_arch_get_config() Ben Horgan
2026-01-12 16:58 ` [PATCH v3 19/47] arm_mpam: resctrl: Implement helpers to update configuration Ben Horgan
2026-01-12 16:58 ` [PATCH v3 20/47] arm_mpam: resctrl: Add plumbing against arm64 task and cpu hooks Ben Horgan
2026-01-12 16:58 ` [PATCH v3 21/47] arm_mpam: resctrl: Add CDP emulation Ben Horgan
2026-01-12 16:58 ` [PATCH v3 22/47] arm_mpam: resctrl: Convert to/from MPAMs fixed-point formats Ben Horgan
2026-01-12 16:58 ` [PATCH v3 23/47] arm_mpam: resctrl: Add kunit test for control format conversions Ben Horgan
2026-01-12 16:58 ` [PATCH v3 24/47] arm_mpam: resctrl: Add rmid index helpers Ben Horgan
2026-01-13 14:55   ` Jonathan Cameron
2026-01-12 16:58 ` [PATCH v3 25/47] arm_mpam: resctrl: Add kunit test for rmid idx conversions Ben Horgan
2026-01-13 14:59   ` Jonathan Cameron
2026-01-12 16:58 ` [PATCH v3 26/47] arm_mpam: resctrl: Wait for cacheinfo to be ready Ben Horgan
2026-01-13 15:01   ` Jonathan Cameron
2026-01-13 15:15     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 27/47] arm_mpam: resctrl: Add support for 'MB' resource Ben Horgan
2026-01-13 15:06   ` Jonathan Cameron
2026-01-13 22:18   ` Reinette Chatre
2026-01-19 11:53   ` Gavin Shan
2026-01-19 13:53     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 28/47] arm_mpam: resctrl: Add support for csu counters Ben Horgan
2026-01-13 23:14   ` Reinette Chatre
2026-01-15 15:43     ` Ben Horgan
2026-01-15 18:54       ` Reinette Chatre
2026-01-16 10:29         ` Ben Horgan
2026-01-20 15:28           ` Peter Newman
2026-01-21 17:58             ` Reinette Chatre [this message]
2026-01-30 11:07               ` Ben Horgan
2026-01-30 11:19   ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 29/47] arm_mpam: resctrl: Pick classes for use as mbm counters Ben Horgan
2026-01-15 15:49   ` Peter Newman
2026-01-19 12:04     ` James Morse
2026-01-19 12:47       ` Peter Newman
2026-01-26 16:00         ` Ben Horgan
2026-01-30 13:04           ` Peter Newman
2026-01-30 14:38             ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 30/47] arm_mpam: resctrl: Pre-allocate free running monitors Ben Horgan
2026-01-13 15:10   ` Jonathan Cameron
2026-01-19 11:57   ` Gavin Shan
2026-01-19 20:27     ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 31/47] arm_mpam: resctrl: Pre-allocate assignable monitors Ben Horgan
2026-01-16 10:34   ` Shaopeng Tan (Fujitsu)
2026-01-16 11:04     ` Ben Horgan
2026-01-19 20:34       ` Ben Horgan
2026-01-12 16:58 ` [PATCH v3 32/47] arm_mpam: resctrl: Add kunit test for ABMC/CDP interactions Ben Horgan
2026-01-13 15:26   ` Jonathan Cameron
2026-01-12 16:59 ` [PATCH v3 33/47] arm_mpam: resctrl: Add resctrl_arch_config_cntr() for ABMC use Ben Horgan
2026-01-12 16:59 ` [PATCH v3 34/47] arm_mpam: resctrl: Allow resctrl to allocate monitors Ben Horgan
2026-01-12 16:59 ` [PATCH v3 35/47] arm_mpam: resctrl: Add resctrl_arch_rmid_read() and resctrl_arch_reset_rmid() Ben Horgan
2026-01-12 16:59 ` [PATCH v3 36/47] arm_mpam: resctrl: Add resctrl_arch_cntr_read() & resctrl_arch_reset_cntr() Ben Horgan
2026-01-12 16:59 ` [PATCH v3 37/47] arm_mpam: resctrl: Update the rmid reallocation limit Ben Horgan
2026-01-15 10:05   ` Shaopeng Tan (Fujitsu)
2026-01-15 16:02     ` Ben Horgan
2026-01-12 16:59 ` [PATCH v3 38/47] arm_mpam: resctrl: Add empty definitions for assorted resctrl functions Ben Horgan
2026-01-12 16:59 ` [PATCH v3 39/47] arm64: mpam: Select ARCH_HAS_CPU_RESCTRL Ben Horgan
2026-01-15 19:16   ` Catalin Marinas
2026-01-12 16:59 ` [PATCH v3 40/47] arm_mpam: resctrl: Call resctrl_init() on platforms that can support resctrl Ben Horgan
2026-01-12 16:59 ` [PATCH v3 41/47] arm_mpam: Generate a configuration for min controls Ben Horgan
2026-01-13 15:39   ` Jonathan Cameron
2026-01-30 14:17     ` Ben Horgan
2026-01-31  2:30       ` Shanker Donthineni
2026-02-02 10:21         ` Ben Horgan
2026-02-02 16:34           ` Shanker Donthineni
2026-02-03  9:33             ` Ben Horgan
2026-01-12 16:59 ` [PATCH v3 42/47] arm_mpam: resctrl: Add kunit test for mbw min control generation Ben Horgan
2026-01-13 15:43   ` Jonathan Cameron
2026-01-12 16:59 ` [PATCH v3 43/47] arm_mpam: Add quirk framework Ben Horgan
2026-01-19 12:14   ` Gavin Shan
2026-01-19 20:48     ` Ben Horgan
2026-01-12 16:59 ` [PATCH v3 44/47] arm_mpam: Add workaround for T241-MPAM-1 Ben Horgan
2026-01-19 12:16   ` Gavin Shan
2026-01-19 20:54     ` Ben Horgan
2026-01-12 16:59 ` [PATCH v3 45/47] arm_mpam: Add workaround for T241-MPAM-4 Ben Horgan
2026-01-15 23:20   ` Fenghua Yu
2026-01-19 20:56     ` Ben Horgan
2026-01-29 22:14       ` Fenghua Yu
2026-01-30 12:21         ` Ben Horgan
2026-01-12 16:59 ` [PATCH v3 46/47] arm_mpam: Add workaround for T241-MPAM-6 Ben Horgan
2026-01-12 16:59 ` [PATCH v3 47/47] arm_mpam: Quirk CMN-650's CSU NRDY behaviour Ben Horgan
2026-01-19 12:18   ` Gavin Shan
2026-01-19 20:58     ` Ben Horgan
2026-01-14  6:51 ` [PATCH RESEND v2 0/45] arm_mpam: Add KVM/arm64 and resctrl glue code Zeng Heng
2026-01-15 14:37   ` Ben Horgan
2026-01-15 11:14 ` [PATCH v3 00/47] " Peter Newman
2026-01-15 11:36   ` Ben Horgan
2026-01-16 10:47 ` Shaopeng Tan (Fujitsu)
2026-01-16 11:05   ` Ben Horgan
2026-01-16 15:47 ` (subset) " Catalin Marinas
2026-01-19  1:30 ` Gavin Shan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8858c7ee-1af1-4bca-8ea5-ec62c78dcde5@intel.com \
    --to=reinette.chatre@intel.com \
    --cc=amitsinght@marvell.com \
    --cc=baisheng.gao@unisoc.com \
    --cc=baolin.wang@linux.alibaba.com \
    --cc=ben.horgan@arm.com \
    --cc=carl@os.amperecomputing.com \
    --cc=catalin.marinas@arm.com \
    --cc=corbet@lwn.net \
    --cc=dave.martin@arm.com \
    --cc=david@kernel.org \
    --cc=dfustini@baylibre.com \
    --cc=fenghuay@nvidia.com \
    --cc=gshan@redhat.com \
    --cc=james.morse@arm.com \
    --cc=joey.gouly@arm.com \
    --cc=jonathan.cameron@huawei.com \
    --cc=kobak@nvidia.com \
    --cc=kvmarm@lists.linux.dev \
    --cc=lcherian@marvell.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=maz@kernel.org \
    --cc=oupton@kernel.org \
    --cc=peternewman@google.com \
    --cc=punit.agrawal@oss.qualcomm.com \
    --cc=quic_jiles@quicinc.com \
    --cc=rohit.mathew@arm.com \
    --cc=scott@os.amperecomputing.com \
    --cc=sdonthineni@nvidia.com \
    --cc=suzuki.poulose@arm.com \
    --cc=tan.shaopeng@fujitsu.com \
    --cc=will@kernel.org \
    --cc=xhao@linux.alibaba.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox