public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: "Chen, Yu C" <yu.c.chen@intel.com>
To: Valentin Schneider <vschneid@redhat.com>
Cc: Dietmar Eggemann <dietmar.eggemann@arm.com>,
	Steven Rostedt <rostedt@goodmis.org>,
	Ben Segall <bsegall@google.com>, Mel Gorman <mgorman@suse.de>,
	Shrikanth Hegde <sshegde@linux.ibm.com>,
	"Gautham R. Shenoy" <gautham.shenoy@amd.com>,
	K Prateek Nayak <kprateek.nayak@amd.com>,
	Ingo Molnar <mingo@redhat.com>,
	Juri Lelli <juri.lelli@redhat.com>,
	"Vincent Guittot" <vincent.guittot@linaro.org>,
	Peter Zijlstra <peterz@infradead.org>,
	<linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v3 3/8] sched/topology: Switch to assigning "sd->shared" from s_data
Date: Sat, 14 Feb 2026 11:04:08 +0800	[thread overview]
Message-ID: <cc74535f-5794-4228-958e-ebe579b950cb@intel.com> (raw)
In-Reply-To: <xhsmh343e43fd.mognet@vschneid-thinkpadt14sgen2i.remote.csb>

On 2/6/2026 5:38 PM, Valentin Schneider wrote:
> On 06/02/26 10:50, K Prateek Nayak wrote:
>> Hello Valentin,
>>
>> On 2/5/2026 10:23 PM, Valentin Schneider wrote:
>>> On 20/01/26 11:32, K Prateek Nayak wrote:

[ ... ]

> 
> AIUI we currently write sd->imb_numa_nr for all SD's, but it's only useful
> for the SD_NUMA ones... How about the lightly tested:
> ---
> diff --git a/kernel/sched/topology.c b/kernel/sched/topology.c
> index afb2c26efb4e5..03db45658f6bd 100644
> --- a/kernel/sched/topology.c
> +++ b/kernel/sched/topology.c
> @@ -2575,6 +2575,51 @@ static bool topology_span_sane(const struct cpumask *cpu_map)
>   	return true;
>   }
>   
> +static inline void adjust_numa_imbalance(struct sched_domain *sd_llc,
> +					 unsigned int *imb, unsigned int *imb_span)
> +{
> +	/*
> +	 * For a single LLC per node, allow an
> +	 * imbalance up to 12.5% of the node. This is
> +	 * arbitrary cutoff based two factors -- SMT and
> +	 * memory channels. For SMT-2, the intent is to
> +	 * avoid premature sharing of HT resources but
> +	 * SMT-4 or SMT-8 *may* benefit from a different
> +	 * cutoff. For memory channels, this is a very
> +	 * rough estimate of how many channels may be
> +	 * active and is based on recent CPUs with
> +	 * many cores.
> +	 *
> +	 * For multiple LLCs, allow an imbalance
> +	 * until multiple tasks would share an LLC
> +	 * on one node while LLCs on another node
> +	 * remain idle. This assumes that there are
> +	 * enough logical CPUs per LLC to avoid SMT
> +	 * factors and that there is a correlation
> +	 * between LLCs and memory channels.
> +	 */
> +	struct sched_domain *top_p;
> +	unsigned int nr_llcs;
> +
> +	WARN_ON(!(sd_llc->flags & SD_SHARE_LLC));
> +	WARN_ON(!sd_llc->parent);
> +
> +	nr_llcs = sd_llc->parent->span_weight / sd_llc->span_weight;
> +	if (nr_llcs == 1)
> +		*imb = sd_llc->parent->span_weight >> 3;
> +	else
> +		*imb = nr_llcs;
> +	*imb = max(1U, *imb);
> +	sd_llc->parent->imb_numa_nr = *imb;
> +
> +	/* Set span based on the first NUMA domain. */
> +	top_p = sd_llc->parent->parent;
> +	while (top_p && !(top_p->flags & SD_NUMA)) {
> +		top_p = top_p->parent;
> +	}
> +	*imb_span = top_p ? top_p->span_weight : sd_llc->parent->span_weight;
> +}
> +
>   /*
>    * Build sched domains for a given set of CPUs and attach the sched domains
>    * to the individual CPUs
> @@ -2640,63 +2685,30 @@ build_sched_domains(const struct cpumask *cpu_map, struct sched_domain_attr *att
>   		unsigned int imb = 0;
>   		unsigned int imb_span = 1;
>   
> -		for (sd = *per_cpu_ptr(d.sd, i); sd; sd = sd->parent) {
> -			struct sched_domain *parent = sd->parent;
> -
> -			/* Topmost SD_SHARE_LLC domain. */
> -			if ((sd->flags & SD_SHARE_LLC) &&
> -			    (!parent || !(parent->flags & SD_SHARE_LLC))) {
> -				int sd_id = cpumask_first(sched_domain_span(sd));
> -				struct sched_domain *top_p;
> -				unsigned int nr_llcs;
> -
> -				sd->shared = *per_cpu_ptr(d.sds, sd_id);
> -				atomic_set(&sd->shared->nr_busy_cpus, sd->span_weight);
> -				atomic_inc(&sd->shared->ref);
> -
> -				/* No SD_NUMA domains. */
> -				if (!parent)
> -					break;
> -
> -				/*
> -				 * For a single LLC per node, allow an
> -				 * imbalance up to 12.5% of the node. This is
> -				 * arbitrary cutoff based two factors -- SMT and
> -				 * memory channels. For SMT-2, the intent is to
> -				 * avoid premature sharing of HT resources but
> -				 * SMT-4 or SMT-8 *may* benefit from a different
> -				 * cutoff. For memory channels, this is a very
> -				 * rough estimate of how many channels may be
> -				 * active and is based on recent CPUs with
> -				 * many cores.
> -				 *
> -				 * For multiple LLCs, allow an imbalance
> -				 * until multiple tasks would share an LLC
> -				 * on one node while LLCs on another node
> -				 * remain idle. This assumes that there are
> -				 * enough logical CPUs per LLC to avoid SMT
> -				 * factors and that there is a correlation
> -				 * between LLCs and memory channels.
> -				 */
> -				nr_llcs = parent->span_weight / sd->span_weight;
> -				if (nr_llcs == 1)
> -					imb = sd->span_weight >> 3;
> -				else
> -					imb = nr_llcs;
> -				imb = max(1U, imb);
> -				sd->imb_numa_nr = imb;
> -
> -				/* Set span based on the first NUMA domain. */
> -				top_p = parent;
> -				while (top_p && !(top_p->flags & SD_NUMA)) {
> -					top_p = top_p->parent;
> -				}
> -				imb_span = top_p ? top_p->span_weight : parent->span_weight;
> -			} else {
> -				int factor = max(1U, (sd->span_weight / imb_span));
> +		sd = *per_cpu_ptr(d.sd, i);

might be
		if (!sd)
			continue;
otherwise sd->flags below might cause NULL pointer exception.

thanks,
Chenyu


  reply	other threads:[~2026-02-14  3:04 UTC|newest]

Thread overview: 36+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-01-20 11:32 [PATCH v3 0/8] sched/topology: Optimize sd->shared allocation K Prateek Nayak
2026-01-20 11:32 ` [PATCH v3 1/8] sched/topology: Compute sd_weight considering cpuset partitions K Prateek Nayak
2026-01-21 14:45   ` Chen, Yu C
2026-01-21 15:42   ` Shrikanth Hegde
2026-01-22  2:51     ` K Prateek Nayak
2026-02-05 16:53   ` Valentin Schneider
2026-01-20 11:32 ` [PATCH v3 2/8] sched/topology: Allocate per-CPU sched_domain_shared in s_data K Prateek Nayak
2026-01-21 15:17   ` Chen, Yu C
2026-02-05 16:53   ` Valentin Schneider
2026-01-20 11:32 ` [PATCH v3 3/8] sched/topology: Switch to assigning "sd->shared" from s_data K Prateek Nayak
2026-01-21 15:26   ` Chen, Yu C
2026-01-22  2:49     ` K Prateek Nayak
2026-01-22  8:12   ` Shrikanth Hegde
2026-01-22  8:36     ` K Prateek Nayak
2026-01-23  4:08       ` Shrikanth Hegde
2026-01-23  4:53         ` K Prateek Nayak
2026-02-05 16:53   ` Valentin Schneider
2026-02-06  5:20     ` K Prateek Nayak
2026-02-06  9:38       ` Valentin Schneider
2026-02-14  3:04         ` Chen, Yu C [this message]
2026-02-16  3:50           ` K Prateek Nayak
2026-02-14  2:59       ` Chen, Yu C
2026-01-20 11:32 ` [PATCH v3 4/8] sched/topology: Remove sched_domain_shared allocation with sd_data K Prateek Nayak
2026-02-05 16:53   ` Valentin Schneider
2026-01-20 11:32 ` [PATCH v3 5/8] sched/core: Check for rcu_read_lock_any_held() in idle_get_state() K Prateek Nayak
2026-01-20 11:32 ` [PATCH v3 6/8] sched/fair: Remove superfluous rcu_read_lock() in the wakeup path K Prateek Nayak
2026-01-20 11:32 ` [PATCH v3 7/8] sched/fair: Simplify the entry condition for update_idle_cpu_scan() K Prateek Nayak
2026-02-14 15:41   ` Chen, Yu C
2026-01-20 11:32 ` [PATCH v3 8/8] sched/fair: Simplify SIS_UTIL handling in select_idle_cpu() K Prateek Nayak
2026-01-23  6:06   ` Shrikanth Hegde
2026-01-23  6:27     ` K Prateek Nayak
2026-01-23  7:14       ` Shrikanth Hegde
2026-02-14 15:56   ` Chen, Yu C
2026-01-21 16:16 ` [PATCH v3 0/8] sched/topology: Optimize sd->shared allocation Peter Zijlstra
2026-01-22  2:56   ` K Prateek Nayak
2026-01-23  9:54     ` Peter Zijlstra

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=cc74535f-5794-4228-958e-ebe579b950cb@intel.com \
    --to=yu.c.chen@intel.com \
    --cc=bsegall@google.com \
    --cc=dietmar.eggemann@arm.com \
    --cc=gautham.shenoy@amd.com \
    --cc=juri.lelli@redhat.com \
    --cc=kprateek.nayak@amd.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mgorman@suse.de \
    --cc=mingo@redhat.com \
    --cc=peterz@infradead.org \
    --cc=rostedt@goodmis.org \
    --cc=sshegde@linux.ibm.com \
    --cc=vincent.guittot@linaro.org \
    --cc=vschneid@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox