public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH 0/2 RESEND] sched/topology: Optimize topology_span_sane()
@ 2024-04-09 15:52 Kyle Meyer
  2024-04-09 15:52 ` [PATCH 1/2 RESEND] cpumask: Add for_each_cpu_from() Kyle Meyer
                   ` (2 more replies)
  0 siblings, 3 replies; 9+ messages in thread
From: Kyle Meyer @ 2024-04-09 15:52 UTC (permalink / raw)
  To: linux-kernel, yury.norov, andriy.shevchenko, linux, mingo, peterz,
	juri.lelli, vincent.guittot, dietmar.eggemann, rostedt, bsegall,
	mgorman, bristot, vschneid
  Cc: russ.anderson, dimitri.sivanich, steve.wahl, Kyle Meyer

A soft lockup is being detected in build_sched_domains() on 32 socket
Sapphire Rapids systems with 3840 processors.

topology_span_sane(), called by build_sched_domains(), checks that each
processor's non-NUMA scheduling domains are completely equal or
completely disjoint. If a non-NUMA scheduling domain partially overlaps
another, scheduling groups can break.

This series adds for_each_cpu_from() as a generic cpumask macro to
optimize topology_span_sane() by removing duplicate comparisons. The
total number of comparisons is reduced from N * (N - 1) to
N * (N - 1) / 2 (per non-NUMA scheduling domain level), decreasing the
boot time by approximately 20 seconds and preventing the soft lockup on
the mentioned systems.

RESEND because Valentin Schneider reported that PATCH 2/2 wasn't
delivered to all recipients.

Kyle Meyer (2):
  cpumask: Add for_each_cpu_from()
  sched/topology: Optimize topology_span_sane()

 include/linux/cpumask.h | 10 ++++++++++
 kernel/sched/topology.c |  6 ++----
 2 files changed, 12 insertions(+), 4 deletions(-)

-- 
2.44.0


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2024-04-10 13:47 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-04-09 15:52 [PATCH 0/2 RESEND] sched/topology: Optimize topology_span_sane() Kyle Meyer
2024-04-09 15:52 ` [PATCH 1/2 RESEND] cpumask: Add for_each_cpu_from() Kyle Meyer
2024-04-10  7:26   ` Vincent Guittot
2024-04-09 15:52 ` [PATCH 2/2 RESEND] sched/topology: Optimize topology_span_sane() Kyle Meyer
2024-04-09 16:25   ` Andy Shevchenko
2024-04-09 19:29     ` Kyle Meyer
2024-04-10 13:47       ` Andy Shevchenko
2024-04-10  7:34   ` Vincent Guittot
2024-04-10 13:27 ` [PATCH 0/2 " Yury Norov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox