* [tip:sched/core] [sched/fair] d206fbad93: hackbench.throughput 6.5% regression
@ 2025-11-28 6:56 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2025-11-28 6:56 UTC (permalink / raw)
To: Peter Zijlstra
Cc: oe-lkp, lkp, linux-kernel, x86, Joseph Salisbury, Adam Li,
Dietmar Eggemann, Hazem Mohamed Abuelfotoh, Chris Mason,
aubrey.li, yu.c.chen, oliver.sang
Hello,
for the change reverted by this commit, we reported several regressions before
[1][2][3] majorly for unixbench tests.
[1] https://lore.kernel.org/all/202507150846.538fc133-lkp@intel.com/
[2] https://lore.kernel.org/all/202507281046.e71b853a-lkp@intel.com/
[3] https://lore.kernel.org/all/202508150416.d7153a48-lkp@intel.com/
but now, for d206fbad93, we also capture a regression in hackbench tests.
below report is just FYI what's the possible performance impact of this revert
commit.
kernel test robot noticed a 6.5% regression of hackbench.throughput on:
commit: d206fbad9328ddb68ebabd7cf7413392acd38081 ("sched/fair: Revert max_newidle_lb_cost bump")
https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git sched/core
[still regression on linux-next/master ef68bf704646690aba5e81c2f7be8d6ef13d7ad8]
testcase: hackbench
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:
nr_threads: 50%
iterations: 4
mode: threads
ipc: pipe
cpufreq_governor: performance
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202511281428.bfcbd5a8-lkp@intel.com
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20251128/202511281428.bfcbd5a8-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/ipc/iterations/kconfig/mode/nr_threads/rootfs/tbox_group/testcase:
gcc-14/performance/pipe/4/x86_64-rhel-9.4/threads/50%/debian-13-x86_64-20250902.cgz/lkp-icl-2sp7/hackbench
commit:
e837456fdc ("sched/fair: Reimplement NEXT_BUDDY to align with EEVDF goals")
d206fbad93 ("sched/fair: Revert max_newidle_lb_cost bump")
e837456fdca81899 d206fbad9328ddb68ebabd7cf74
---------------- ---------------------------
%stddev %change %stddev
\ | \
2.603e+08 ± 6% -33.6% 1.73e+08 ± 4% cpuidle..time
5881942 ± 3% -62.9% 2180816 ± 13% cpuidle..usage
97892 ± 5% +17.9% 115437 ± 2% perf-c2c.HITM.local
98482 ± 5% +17.6% 115850 ± 2% perf-c2c.HITM.total
3.99 ± 8% -1.4 2.62 ± 3% mpstat.cpu.all.idle%
0.71 ± 2% -0.1 0.64 ± 2% mpstat.cpu.all.irq%
0.05 -0.0 0.03 ± 2% mpstat.cpu.all.soft%
1305956 ± 8% +19.7% 1562588 ± 11% numa-meminfo.node1.Active
1305948 ± 8% +19.7% 1562580 ± 11% numa-meminfo.node1.Active(anon)
863383 +20.4% 1039636 ± 11% numa-meminfo.node1.Shmem
326485 ± 8% +19.7% 390946 ± 10% numa-vmstat.node1.nr_active_anon
215818 +20.6% 260214 ± 11% numa-vmstat.node1.nr_shmem
326485 ± 8% +19.7% 390945 ± 10% numa-vmstat.node1.nr_zone_active_anon
1631510 +10.4% 1801907 ± 6% meminfo.Active
1631494 +10.4% 1801890 ± 6% meminfo.Active(anon)
1772671 +9.5% 1940770 ± 5% meminfo.Committed_AS
909474 +19.7% 1088760 ± 10% meminfo.Shmem
5.36 ±107% -3.2 2.11 ±223% perf-profile.calltrace.cycles-pp.process_simple.reader__read_event.perf_session__process_events.record__finish_output.cmd_record
6.59 ±100% -2.2 4.41 ±141% perf-profile.calltrace.cycles-pp.__ordered_events__flush.perf_session__process_user_event.reader__read_event.perf_session__process_events.record__finish_output
6.59 ±100% -2.2 4.41 ±141% perf-profile.calltrace.cycles-pp.perf_session__process_user_event.reader__read_event.perf_session__process_events.record__finish_output.cmd_record
5.36 ±107% -3.2 2.11 ±223% perf-profile.children.cycles-pp.process_simple
0.34 ± 4% -0.2 0.11 ± 14% turbostat.C1%
1.10 -0.8 0.28 ± 16% turbostat.C1E%
2.96 ± 8% -0.6 2.38 ± 2% turbostat.C6%
1.25 ± 5% -15.5% 1.05 ± 4% turbostat.CPU%c1
795.01 ± 4% -10.1% 714.37 ± 2% sched_debug.cfs_rq:/.util_est.avg
4969 ± 9% +122.0% 11030 ± 47% sched_debug.cpu.avg_idle.min
1716 ±141% +111.4% 3628 ± 61% sched_debug.cpu.curr->pid.min
1063335 ± 3% -47.4% 559053 ± 23% sched_debug.cpu.max_idle_balance_cost.avg
712039 ± 2% -29.8% 500000 sched_debug.cpu.max_idle_balance_cost.min
407716 +10.5% 450503 ± 6% proc-vmstat.nr_active_anon
1139851 +3.9% 1184811 ± 2% proc-vmstat.nr_file_pages
33054 +0.8% 33323 proc-vmstat.nr_kernel_stack
88427 ± 2% +6.0% 93718 ± 2% proc-vmstat.nr_mapped
227232 +19.8% 272194 ± 10% proc-vmstat.nr_shmem
407716 +10.5% 450503 ± 6% proc-vmstat.nr_zone_active_anon
44509495 -2.8% 43284126 proc-vmstat.numa_hit
44442837 -2.8% 43217032 proc-vmstat.numa_local
44571379 -2.7% 43348289 proc-vmstat.pgalloc_normal
43993792 -2.9% 42727500 proc-vmstat.pgfree
342580 -6.5% 320227 hackbench.throughput
334468 -6.2% 313595 hackbench.throughput_avg
342580 -6.5% 320227 hackbench.throughput_best
319665 -6.4% 299259 hackbench.throughput_worst
90.24 +6.6% 96.22 hackbench.time.elapsed_time
90.24 +6.6% 96.22 hackbench.time.elapsed_time.max
1.47e+08 +10.0% 1.617e+08 hackbench.time.involuntary_context_switches
37736 ± 3% +8.1% 40787 hackbench.time.minor_page_faults
6097 +1.8% 6206 hackbench.time.percent_of_cpu_this_job_got
5041 +9.1% 5501 hackbench.time.system_time
461.18 +2.2% 471.26 hackbench.time.user_time
4.238e+08 +4.5% 4.429e+08 hackbench.time.voluntary_context_switches
2.525e+10 -4.4% 2.415e+10 perf-stat.i.branch-instructions
5.10 -0.6 4.55 ± 4% perf-stat.i.cache-miss-rate%
3.459e+08 +12.9% 3.906e+08 perf-stat.i.cache-references
1.83 +5.5% 1.93 perf-stat.i.cpi
1.92e+11 +1.3% 1.946e+11 perf-stat.i.cpu-cycles
423018 +19.6% 506120 perf-stat.i.cpu-migrations
1.074e+11 -4.1% 1.03e+11 perf-stat.i.instructions
0.56 -5.4% 0.53 perf-stat.i.ipc
4.79 ± 2% -0.7 4.14 ± 5% perf-stat.overall.cache-miss-rate%
1.79 +5.7% 1.89 perf-stat.overall.cpi
0.56 -5.4% 0.53 perf-stat.overall.ipc
2.499e+10 -4.3% 2.392e+10 perf-stat.ps.branch-instructions
3.42e+08 +13.0% 3.865e+08 perf-stat.ps.cache-references
1.9e+11 +1.4% 1.926e+11 perf-stat.ps.cpu-cycles
417843 +19.8% 500456 perf-stat.ps.cpu-migrations
1.063e+11 -4.1% 1.02e+11 perf-stat.ps.instructions
9.709e+12 +2.4% 9.941e+12 perf-stat.total.instructions
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2025-11-28 6:56 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-11-28 6:56 [tip:sched/core] [sched/fair] d206fbad93: hackbench.throughput 6.5% regression kernel test robot
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.