public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Reinette Chatre <reinette.chatre@intel.com>
To: Ben Horgan <ben.horgan@arm.com>, <linux-kernel@vger.kernel.org>
Cc: <tony.luck@intel.com>, <Dave.Martin@arm.com>,
	<james.morse@arm.com>, <babu.moger@amd.com>, <tglx@kernel.org>,
	<mingo@redhat.com>, <bp@alien8.de>, <dave.hansen@linux.intel.com>,
	<x86@kernel.org>, <hpa@zytor.com>, <fenghuay@nvidia.com>,
	<tan.shaopeng@fujitsu.com>
Subject: Re: [PATCH v4 5/7] fs/resctrl: Continue counter allocation after failure
Date: Wed, 15 Apr 2026 10:28:54 -0700	[thread overview]
Message-ID: <62be0037-abba-4cd6-9058-45d784479893@intel.com> (raw)
In-Reply-To: <4f3df929-1f5b-47c3-93ca-bc70ca26542d@arm.com>

Hi Ben,

On 4/15/26 9:31 AM, Ben Horgan wrote:
> On 4/15/26 16:38, Reinette Chatre wrote:
>> On 4/15/26 7:46 AM, Ben Horgan wrote:
>>> On 4/15/26 15:27, Reinette Chatre wrote:
>>>> On 4/14/26 7:42 AM, Ben Horgan wrote:
>>>>> On 3/27/26 16:21, Reinette Chatre wrote:
>>>>
>>>>>> Consider a changelog like below that just focuses on problem being solved
>>>>>> (but please correct me if you find I am missing the point):
>>>>>>
>>>>>> 	In mbm_event mode, with mbm_assign_on_mkdir set to 1, when a user
>>>>>> 	creates a new CTRL_MON or MON group resctrl attempts to allocate
>>>>>> 	counters for each of the supported MBM events on each resctrl
>>>>>> 	domain. As counters are limited, such allocation may fail and
>>>>>> 	when it does counter allocations for the remaining domains are
>>>>>> 	skipped even if the domains have available counters.                                                                       
>>>>>>                                                                                 
>>>>>> 	Since a counter allocation failure may result in counter allocation
>>>>>> 	skipped on other domains the user needs to view the resource group's
>>>>>
>>>>> skipped -> being skipped
>>>>>
>>>>>> 	mbm_L3_assignments files to get an accurate view of counter assignment
>>>>>> 	in a new resource group and then manually create counters in the skipped
>>>>>> 	domains	with available counters.                                             
>>>>>>                                                                                 
>>>>>> 	Writes to mbm_L3_assignments using the wildcard format, <event>:*=e,            
>>>>>> 	also skip counter allocation in other domains after a counter allocation      
>>>>>> 	failure.                                                                        
>>>>>>                                                                                 
>>>>>> 	When handling a request to create counters in all domains it is unnecessary
>>>>>> 	for a counter allocation in one domain to prevent counter allocation in
>>>>>> 	other domains. Always attempt to allocate all the counters requested.     
>>>>>
>>>>> I can use this but how about if I add,
>>>>>
>>>>> Skipping counter allocation in subsequent domains after failure makes predicting which
>>>>> counters will be allocated harder for the user as they need to know the ordering of the
>>>>> domains as well as the expected failures.
>>>>
>>>> I do not see why the user needs to make any predictions with the current implementation.
>>>> mbm_L3_assignments will always contain accurate information regarding counter assignment, no?
>>>
>>> They can see the result with mbm_L3_assignments. In general, if the user is doing any operation it helps for them to
>>> know what they can expect from that operation before doing it.
>>
>> I totally agree. I seem to be missing the goal here. Are you saying that currently it is not clear
>> what the user can expect when running these commands? I believe that is clarified with the documentation
>> update in patch #6?
> 
> With this patch and the documentation in patch #6 the user should be clear on what to expect. Without this patch the

Agree with "With this patch ...".

> expectation is to iterate through the domains allocating counters but not consider subsequent domains after a failure.

I hear what you are trying to say with the "Without this patch ..." part but I am hesitant to
equate current undocumented kernel behavior as established user expectations. Instead I think
when user requests counter allocation in all domains then expectation would/should be that
counters will be allocated in all domains (that have available counters). I thus see this work
more as a fix that brings behavior closer to match what user may expect when running
impacted commands.

One possible expectation that user space may develop based on experience with existing implementation
is that requests to allocate counters may fail even if counters are available in some domains. If user
space does have such expectation then they still need to consider mbm_L3_assignments to know
what was actually allocated and I still do not see how they need to make any predictions to
use resctrl.

We thus seem to disagree on user expectations? Since this is quite subjective that may be ok since
I believe this work addresses all aspects?

> This means that the order of the domains are considered effects the outcome of command.

Right.

> 
> With this patch: each counter is attempted to be allocated
> Without this patch: Domains are considered in order of ascending id and each counter in the domain is attempted to be
> allocated but if a counter allocation fails no subsequent domains are considered.

Understood.

> 
> I think the behaviour with this patch is easier to understand and what would be most useful to the user.

I agree.

> 
> Suppose you have two domains, domain 0 and 1 and one counter remaining on one of the domains and create a new group
> (with mbm_assign_on_mkdir set to 1). Without this patch the available counter is only allocated if that domain is
> considered first (lower id) but with this patch it allocated regardless of which domain it's in.

Understood.

>>> Happy to drop the extra sentence if you don't think it adds anything. Making all allocations of multiple counters best
>>> effort is the main point.
>>
>> I do not object adding extra sentences but the proposal mentions how user space needs to
>> predict behaviors and know about kernel internals which I do not believe is required now nor
>> with planned changes. I really seem to be missing something here so would appreciate if you
>> could elaborate the goals here.
> 
> It seems that we agree on the patch but I've somehow confused you on the motivation. In essence, it's making a better
> effort at best effort counter allocation. Does this help clarify things?

I understand motivation and I agree this patch improves current behavior. You need not 
motivate this patch. My comment was about the additional information that you propose
adding to the changelog where I still do not see user needing to ever make any predictions to
use the existing interfaces.

Reinette

  reply	other threads:[~2026-04-15 17:29 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-03-26 17:25 [PATCH v4 0/7] x86,fs/resctrl: Pave the way for MPAM counter assignment Ben Horgan
2026-03-26 17:25 ` [PATCH v4 1/7] fs/resctrl: Tidy up the error path in resctrl_mkdir_event_configs() Ben Horgan
2026-03-27 16:06   ` Reinette Chatre
2026-03-26 17:25 ` [PATCH v4 2/7] x86,fs/resctrl: Make 'event_filter' files read only if they're not configurable Ben Horgan
2026-03-27 16:13   ` Reinette Chatre
2026-03-27 17:15     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 3/7] fs/resctrl: Disallow the software controller when MBM counters are assignable Ben Horgan
2026-03-27 16:14   ` Reinette Chatre
2026-03-27 17:24     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 4/7] x86,fs/resctrl: Add monitor property 'mbm_cntr_assign_fixed' Ben Horgan
2026-03-27 16:15   ` Reinette Chatre
2026-03-26 17:25 ` [PATCH v4 5/7] fs/resctrl: Continue counter allocation after failure Ben Horgan
2026-03-27 16:21   ` Reinette Chatre
2026-04-14 14:42     ` Ben Horgan
2026-04-15 14:27       ` Reinette Chatre
2026-04-15 14:46         ` Ben Horgan
2026-04-15 15:38           ` Reinette Chatre
2026-04-15 16:31             ` Ben Horgan
2026-04-15 17:28               ` Reinette Chatre [this message]
2026-04-16  8:32                 ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 6/7] fs/resctrl: Document that automatic counter assignment is best effort Ben Horgan
2026-03-27 16:24   ` Reinette Chatre
2026-04-14 14:55     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 7/7] fs/resctrl: Document tasks file behaviour for task id 0 and idle tasks Ben Horgan
2026-03-27 16:40   ` Reinette Chatre

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=62be0037-abba-4cd6-9058-45d784479893@intel.com \
    --to=reinette.chatre@intel.com \
    --cc=Dave.Martin@arm.com \
    --cc=babu.moger@amd.com \
    --cc=ben.horgan@arm.com \
    --cc=bp@alien8.de \
    --cc=dave.hansen@linux.intel.com \
    --cc=fenghuay@nvidia.com \
    --cc=hpa@zytor.com \
    --cc=james.morse@arm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@redhat.com \
    --cc=tan.shaopeng@fujitsu.com \
    --cc=tglx@kernel.org \
    --cc=tony.luck@intel.com \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox