From: kernel test robot <oliver.sang@intel.com>
To: Zecheng Li <zecheng@google.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
<linux-kernel@vger.kernel.org>,
Peter Zijlstra <peterz@infradead.org>,
Zecheng Li <zli94@ncsu.edu>,
"K Prateek Nayak" <kprateek.nayak@amd.com>,
Josh Don <joshdon@google.com>, <aubrey.li@linux.intel.com>,
<yu.c.chen@intel.com>, <oliver.sang@intel.com>
Subject: [linus:master] [sched/fair] b8fea7af0e: stress-ng.session.ops_per_sec 91.5% improvement
Date: Fri, 26 Jun 2026 17:11:58 +0800 [thread overview]
Message-ID: <202606261633.32eca12c-lkp@intel.com> (raw)
Hello,
kernel test robot noticed a 91.5% improvement of stress-ng.session.ops_per_sec on:
commit: b8fea7af0e40feb6d9cbbd60b66ff0ec265e868f ("sched/fair: Allocate cfs_tg_state with percpu allocator")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory
parameters:
nr_threads: 100%
testtime: 60s
test: session
cpufreq_governor: performance
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260626/202606261633.32eca12c-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-14/performance/x86_64-rhel-9.4/100%/debian-13-x86_64-20250902.cgz/lkp-srf-2sp3/session/stress-ng/60s
commit:
89e1f67186 ("sched/fair: Remove task_group->se pointer array")
b8fea7af0e ("sched/fair: Allocate cfs_tg_state with percpu allocator")
89e1f67186baca35 b8fea7af0e40feb6d9cbbd60b66
---------------- ---------------------------
%stddev %change %stddev
\ | \
610356 +91.5% 1168547 ± 2% stress-ng.session.ops
10187 +91.5% 19511 ± 2% stress-ng.session.ops_per_sec
682208 +23.9% 845283 ± 2% stress-ng.time.involuntary_context_switches
9315 ± 36% +196.0% 27570 ± 46% stress-ng.time.major_page_faults
74894407 ± 3% +91.4% 1.433e+08 ± 4% stress-ng.time.minor_page_faults
13490 -2.5% 13152 stress-ng.time.percent_of_cpu_this_job_got
8076 -3.1% 7829 stress-ng.time.system_time
33.20 ± 2% +130.4% 76.48 ± 3% stress-ng.time.user_time
3019455 +147.5% 7471743 ± 2% stress-ng.time.voluntary_context_switches
2.619e+09 ± 6% +41.1% 3.697e+09 ± 3% cpuidle..time
4789881 +106.5% 9891495 ± 2% cpuidle..usage
22.08 ± 4% +44.9% 31.98 ± 3% vmstat.cpu.id
4955056 +9.8% 5439814 vmstat.memory.cache
169.44 ± 2% -13.4% 146.67 ± 2% vmstat.procs.r
124532 +120.7% 274835 ± 2% vmstat.system.cs
455602 +21.4% 553218 vmstat.system.in
19.73 ± 5% +10.0 29.75 ± 4% mpstat.cpu.all.idle%
0.03 ± 8% -0.0 0.00 ± 81% mpstat.cpu.all.iowait%
0.37 +0.2 0.56 ± 3% mpstat.cpu.all.irq%
11.52 -9.3 2.22 ± 2% mpstat.cpu.all.soft%
0.89 +0.3 1.16 mpstat.cpu.all.usr%
87.80 -6.3% 82.23 mpstat.max_utilization_pct
23158413 ± 2% +91.2% 44277233 ± 3% numa-numastat.node0.local_node
23346017 ± 2% +95.5% 45639044 ± 3% numa-numastat.node0.numa_hit
187646 ± 17% +625.7% 1361768 ± 6% numa-numastat.node0.other_node
21382055 ± 2% +87.6% 40123301 ± 6% numa-numastat.node1.local_node
21639867 ± 2% +92.2% 41600375 ± 6% numa-numastat.node1.numa_hit
257307 ± 11% +474.1% 1477073 ± 6% numa-numastat.node1.other_node
2540 -12.4% 2225 turbostat.Avg_MHz
79.44 -9.9 69.53 turbostat.Busy%
22.02 ± 5% +9.2 31.19 ± 3% turbostat.C1%
20.36 ± 5% +48.8% 30.30 ± 3% turbostat.CPU%c1
0.15 +60.0% 0.24 turbostat.IPC
30697679 +24.4% 38174002 turbostat.IRQ
313.32 +2.6% 321.34 turbostat.PkgWatt
24.18 +18.1% 28.56 turbostat.RAMWatt
1537143 +32.6% 2038945 meminfo.Active
1537110 +32.6% 2038912 meminfo.Active(anon)
4808401 +9.8% 5281769 meminfo.Cached
5073192 +8.4% 5498934 meminfo.Committed_AS
391992 -15.3% 332050 meminfo.Mapped
93966 ± 4% +21.0% 113702 ± 5% meminfo.PageTables
172086 ± 7% +325.6% 732317 meminfo.Percpu
1095287 -56.6% 475325 meminfo.SUnreclaim
716648 +66.1% 1190009 meminfo.Shmem
1237725 -50.0% 618879 meminfo.Slab
0.23 -66.5% 0.08 perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
56.29 ± 20% -37.3% 35.30 ± 13% perf-sched.sch_delay.max.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
0.23 -66.5% 0.08 perf-sched.total_sch_delay.average.ms
56.29 ± 20% -37.3% 35.30 ± 13% perf-sched.total_sch_delay.max.ms
9.88 -51.7% 4.78 perf-sched.total_wait_and_delay.average.ms
399971 +106.1% 824497 perf-sched.total_wait_and_delay.count.ms
9.65 -51.3% 4.70 perf-sched.total_wait_time.average.ms
9.88 -51.7% 4.78 perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
399971 +106.1% 824497 perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
9.65 -51.3% 4.70 perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
133243 +116.4% 288278 ± 3% perf-stat.i.context-switches
194336 -1.1% 192149 perf-stat.i.cpu-clock
27884 +91.6% 53428 ± 3% perf-stat.i.cpu-migrations
158.09 ± 37% +185.5% 451.31 ± 46% perf-stat.i.major-faults
1256057 ± 3% +88.1% 2362437 ± 4% perf-stat.i.minor-faults
1256215 ± 3% +88.1% 2362888 ± 4% perf-stat.i.page-faults
194336 -1.1% 192149 perf-stat.i.task-clock
128658 +118.2% 280754 ± 3% perf-stat.ps.context-switches
26831 +93.3% 51879 ± 3% perf-stat.ps.cpu-migrations
150.27 ± 37% +193.4% 440.89 ± 46% perf-stat.ps.major-faults
1212558 ± 3% +89.3% 2295418 ± 3% perf-stat.ps.minor-faults
1212708 ± 3% +89.3% 2295859 ± 3% perf-stat.ps.page-faults
167364 ± 66% +787.8% 1485853 ± 18% numa-meminfo.node0.Active
167345 ± 66% +787.9% 1485838 ± 18% numa-meminfo.node0.Active(anon)
49987 ± 92% +324.2% 212032 ± 19% numa-meminfo.node0.Mapped
619973 ± 6% -58.1% 259551 ± 26% numa-meminfo.node0.SUnreclaim
29716 ± 78% +3008.8% 923832 ± 4% numa-meminfo.node0.Shmem
705775 ± 7% -54.1% 323802 ± 30% numa-meminfo.node0.Slab
1370362 ± 8% -59.5% 555194 ± 51% numa-meminfo.node1.Active
1370348 ± 8% -59.5% 555175 ± 51% numa-meminfo.node1.Active(anon)
362187 ± 10% -74.2% 93542 ±159% numa-meminfo.node1.AnonHugePages
341339 ± 12% -64.6% 120912 ± 35% numa-meminfo.node1.Mapped
45901 ± 16% +23.8% 56827 ± 4% numa-meminfo.node1.PageTables
476900 ± 10% -54.7% 215830 ± 31% numa-meminfo.node1.SUnreclaim
686948 ± 3% -61.1% 267149 ± 15% numa-meminfo.node1.Shmem
533533 ± 10% -44.7% 295138 ± 33% numa-meminfo.node1.Slab
383948 +32.8% 509815 proc-vmstat.nr_active_anon
206513 +3.4% 213464 proc-vmstat.nr_anon_pages
1201861 +9.9% 1320471 proc-vmstat.nr_file_pages
97614 ± 2% -14.9% 83031 proc-vmstat.nr_mapped
23581 ± 3% +20.7% 28463 ± 4% proc-vmstat.nr_page_table_pages
178922 +66.3% 297531 proc-vmstat.nr_shmem
274192 -56.7% 118831 proc-vmstat.nr_slab_unreclaimable
80759 +3.2% 83318 proc-vmstat.nr_vmalloc
383948 +32.8% 509815 proc-vmstat.nr_zone_active_anon
44989050 ± 2% +93.9% 87242239 ± 2% proc-vmstat.numa_hit
44543635 ± 2% +89.5% 84403353 ± 2% proc-vmstat.numa_local
444953 ± 3% +538.0% 2838842 proc-vmstat.numa_other
48904767 ± 2% +85.6% 90766237 ± 2% proc-vmstat.pgalloc_normal
75424210 ± 3% +90.8% 1.439e+08 ± 4% proc-vmstat.pgfault
48116497 ± 2% +86.5% 89739194 ± 2% proc-vmstat.pgfree
4261223 ± 10% +96.1% 8358094 ± 6% proc-vmstat.pgreuse
41819 ± 66% +788.2% 371430 ± 18% numa-vmstat.node0.nr_active_anon
12503 ± 92% +324.2% 53034 ± 19% numa-vmstat.node0.nr_mapped
7429 ± 78% +3008.7% 230968 ± 4% numa-vmstat.node0.nr_shmem
155024 ± 6% -58.1% 64888 ± 26% numa-vmstat.node0.nr_slab_unreclaimable
41818 ± 66% +788.2% 371428 ± 18% numa-vmstat.node0.nr_zone_active_anon
23345970 ± 2% +95.5% 45639379 ± 3% numa-vmstat.node0.numa_hit
23158366 ± 2% +91.2% 44277567 ± 3% numa-vmstat.node0.numa_local
187646 ± 17% +625.7% 1361768 ± 6% numa-vmstat.node0.numa_other
342699 ± 8% -59.5% 138864 ± 51% numa-vmstat.node1.nr_active_anon
176.85 ± 10% -74.2% 45.68 ±159% numa-vmstat.node1.nr_anon_transparent_hugepages
85411 ± 12% -64.6% 30246 ± 35% numa-vmstat.node1.nr_mapped
11500 ± 16% +23.8% 14240 ± 4% numa-vmstat.node1.nr_page_table_pages
171755 ± 3% -61.1% 66798 ± 15% numa-vmstat.node1.nr_shmem
119206 ± 10% -54.7% 53962 ± 31% numa-vmstat.node1.nr_slab_unreclaimable
342698 ± 8% -59.5% 138863 ± 51% numa-vmstat.node1.nr_zone_active_anon
21639864 ± 2% +92.2% 41600300 ± 6% numa-vmstat.node1.numa_hit
21382052 ± 2% +87.6% 40123226 ± 6% numa-vmstat.node1.numa_local
257307 ± 11% +474.1% 1477073 ± 6% numa-vmstat.node1.numa_other
0.50 ± 10% -16.4% 0.42 ± 8% sched_debug.cfs_rq:/.h_nr_queued.avg
0.49 ± 11% -15.5% 0.41 ± 8% sched_debug.cfs_rq:/.h_nr_runnable.avg
32477 ± 11% -20.0% 25967 ± 15% sched_debug.cfs_rq:/.left_deadline.max
32462 ± 11% -20.1% 25950 ± 15% sched_debug.cfs_rq:/.left_vruntime.max
495516 ± 11% -18.3% 404915 ± 9% sched_debug.cfs_rq:/.load.avg
1675 ± 12% -22.9% 1291 ± 24% sched_debug.cfs_rq:/.load_avg.avg
17.83 ± 37% +73.3% 30.90 ± 29% sched_debug.cfs_rq:/.load_avg.min
0.50 ± 10% -16.5% 0.42 ± 8% sched_debug.cfs_rq:/.nr_queued.avg
32462 ± 11% -20.1% 25950 ± 15% sched_debug.cfs_rq:/.right_vruntime.max
8.253e+11 ± 46% -68.5% 2.6e+11 ± 44% sched_debug.cfs_rq:/.sum_w_vruntime.max
-1.142e+12 -79.2% -2.372e+11 sched_debug.cfs_rq:/.sum_w_vruntime.min
1.641e+11 ± 14% -80.5% 3.196e+10 ± 30% sched_debug.cfs_rq:/.sum_w_vruntime.stddev
562.81 ± 4% -6.3% 527.61 ± 6% sched_debug.cpu.clock_task.stddev
408623 ± 13% +44.3% 589658 ± 7% sched_debug.cpu.curr->pid.avg
609584 +90.0% 1157918 ± 2% sched_debug.cpu.curr->pid.max
278101 ± 6% +106.4% 574007 ± 2% sched_debug.cpu.curr->pid.stddev
0.50 ± 10% -16.5% 0.42 ± 8% sched_debug.cpu.nr_running.avg
22206 +111.3% 46925 ± 3% sched_debug.cpu.nr_switches.avg
59463 ± 30% +66.6% 99041 ± 11% sched_debug.cpu.nr_switches.max
16171 ± 11% +62.8% 26329 ± 21% sched_debug.cpu.nr_switches.min
-115.25 +180.9% -323.70 sched_debug.cpu.nr_uninterruptible.min
47.53 ± 13% +134.1% 111.26 ± 21% sched_debug.cpu.nr_uninterruptible.stddev
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
reply other threads:[~2026-06-26 9:12 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202606261633.32eca12c-lkp@intel.com \
--to=oliver.sang@intel.com \
--cc=aubrey.li@linux.intel.com \
--cc=joshdon@google.com \
--cc=kprateek.nayak@amd.com \
--cc=linux-kernel@vger.kernel.org \
--cc=lkp@intel.com \
--cc=oe-lkp@lists.linux.dev \
--cc=peterz@infradead.org \
--cc=yu.c.chen@intel.com \
--cc=zecheng@google.com \
--cc=zli94@ncsu.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.