public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Ben Horgan <ben.horgan@arm.com>
To: Reinette Chatre <reinette.chatre@intel.com>,
	linux-kernel@vger.kernel.org
Cc: tony.luck@intel.com, Dave.Martin@arm.com, james.morse@arm.com,
	babu.moger@amd.com, tglx@kernel.org, mingo@redhat.com,
	bp@alien8.de, dave.hansen@linux.intel.com, x86@kernel.org,
	hpa@zytor.com, fenghuay@nvidia.com, tan.shaopeng@fujitsu.com
Subject: Re: [PATCH v4 5/7] fs/resctrl: Continue counter allocation after failure
Date: Tue, 14 Apr 2026 15:42:27 +0100	[thread overview]
Message-ID: <82ff4c2a-88fb-4e87-a30e-3c00fe8b3025@arm.com> (raw)
In-Reply-To: <ca4890b0-f4ec-4aa7-bca8-e1fef34d67fb@intel.com>

Hi Reinette,

On 3/27/26 16:21, Reinette Chatre wrote:
> Hi Ben,
> 
> On 3/26/26 10:25 AM, Ben Horgan wrote:
>> In mbm_event mode, with mbm_assign_on_mkdir set to 1, when a user creates a
>> new CTRL_MON or MON group resctrl will attempt to allocate counters for
> 
> "will attempt" -> "attempts"
> 
>> each of the supported mbm events on each resctrl domain. As counters are
> 
> "mbm" -> "MBM"
> 
>> limited, these allocations may fail. If an mbm_total_event counter
> 
> This switches from high level description to example, back to high level description.
> 
>> allocation fails then the mbm_total_event counter allocations for the
>> remaining domains are skipped and then the mbm_local_event counter
>> allocations are made. These failures don't cause the group allocation to
> 
> "are made -> "are attempted"
> 
>> fail but the user should still be aware of them. A message for each
>> attempted allocation is reported in last_cmd_status but in order to fully
>> interpret that information the user needs to know what was skipped.  This
>> is knowable as the domain list is sorted but it is undesirable to rely on
>> such implementation details.
> 
> User can always get most accurate counter assignment state from
> mbm_L3_assignments. There is no need for any guessing or needing to know
> implementation details.
> 
>>
>> Writes to mbm_L3_assignments using the wildcard format, <event>:*=e, will
>> also skip counter allocation after any counter allocation failure.  Leading
>> once again to counters that are allocated but have no corresponding message
>> in last_cmd_status to indicate that.
> 
> User should not rely on last_cmd_status for state and we should not move
> towards making it part of ABI.
> 
>>
>> When a new group is created always attempt to allocate all the counters
>> requested. Similarly, when a a wildcard assign operation is written to
> 
> "a a wildcard" 
> 
>> mbm_L3_assignments, attempt to allocate all counters requested by that
>> particular operation.
>>
>> For mbm_L3_assignments, continue to return an error on counter allocation
>> failure and for a write specifying multiple assign operations continue to
>> abort after the first failing assign operation.
> 
> I support the change to attempt counter creation in all domains but I am
> concerned about the motivation here - the goal should not be to document
> all failed domains in last_cmd_status and pointing user to it to learn which

Seems sensible to me.

> allocations failed. Instead user should use mbm_L3_assignments for most
> accurate state.
> 
> Consider a changelog like below that just focuses on problem being solved
> (but please correct me if you find I am missing the point):
> 
> 	In mbm_event mode, with mbm_assign_on_mkdir set to 1, when a user
> 	creates a new CTRL_MON or MON group resctrl attempts to allocate
> 	counters for each of the supported MBM events on each resctrl
> 	domain. As counters are limited, such allocation may fail and
> 	when it does counter allocations for the remaining domains are
> 	skipped even if the domains have available counters.                                                                       
>                                                                                 
> 	Since a counter allocation failure may result in counter allocation
> 	skipped on other domains the user needs to view the resource group's

skipped -> being skipped

> 	mbm_L3_assignments files to get an accurate view of counter assignment
> 	in a new resource group and then manually create counters in the skipped
> 	domains	with available counters.                                             
>                                                                                 
> 	Writes to mbm_L3_assignments using the wildcard format, <event>:*=e,            
> 	also skip counter allocation in other domains after a counter allocation      
> 	failure.                                                                        
>                                                                                 
> 	When handling a request to create counters in all domains it is unnecessary
> 	for a counter allocation in one domain to prevent counter allocation in
> 	other domains. Always attempt to allocate all the counters requested.     

I can use this but how about if I add,

Skipping counter allocation in subsequent domains after failure makes predicting which
counters will be allocated harder for the user as they need to know the ordering of the
domains as well as the expected failures.

before the final sentence to make it clear that this change is an improvement not just
a change in policy.

Thanks,

Ben


  reply	other threads:[~2026-04-14 14:42 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-03-26 17:25 [PATCH v4 0/7] x86,fs/resctrl: Pave the way for MPAM counter assignment Ben Horgan
2026-03-26 17:25 ` [PATCH v4 1/7] fs/resctrl: Tidy up the error path in resctrl_mkdir_event_configs() Ben Horgan
2026-03-27 16:06   ` Reinette Chatre
2026-03-26 17:25 ` [PATCH v4 2/7] x86,fs/resctrl: Make 'event_filter' files read only if they're not configurable Ben Horgan
2026-03-27 16:13   ` Reinette Chatre
2026-03-27 17:15     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 3/7] fs/resctrl: Disallow the software controller when MBM counters are assignable Ben Horgan
2026-03-27 16:14   ` Reinette Chatre
2026-03-27 17:24     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 4/7] x86,fs/resctrl: Add monitor property 'mbm_cntr_assign_fixed' Ben Horgan
2026-03-27 16:15   ` Reinette Chatre
2026-03-26 17:25 ` [PATCH v4 5/7] fs/resctrl: Continue counter allocation after failure Ben Horgan
2026-03-27 16:21   ` Reinette Chatre
2026-04-14 14:42     ` Ben Horgan [this message]
2026-04-15 14:27       ` Reinette Chatre
2026-04-15 14:46         ` Ben Horgan
2026-04-15 15:38           ` Reinette Chatre
2026-04-15 16:31             ` Ben Horgan
2026-04-15 17:28               ` Reinette Chatre
2026-04-16  8:32                 ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 6/7] fs/resctrl: Document that automatic counter assignment is best effort Ben Horgan
2026-03-27 16:24   ` Reinette Chatre
2026-04-14 14:55     ` Ben Horgan
2026-03-26 17:25 ` [PATCH v4 7/7] fs/resctrl: Document tasks file behaviour for task id 0 and idle tasks Ben Horgan
2026-03-27 16:40   ` Reinette Chatre

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=82ff4c2a-88fb-4e87-a30e-3c00fe8b3025@arm.com \
    --to=ben.horgan@arm.com \
    --cc=Dave.Martin@arm.com \
    --cc=babu.moger@amd.com \
    --cc=bp@alien8.de \
    --cc=dave.hansen@linux.intel.com \
    --cc=fenghuay@nvidia.com \
    --cc=hpa@zytor.com \
    --cc=james.morse@arm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@redhat.com \
    --cc=reinette.chatre@intel.com \
    --cc=tan.shaopeng@fujitsu.com \
    --cc=tglx@kernel.org \
    --cc=tony.luck@intel.com \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox