From: kernel test robot <oliver.sang@intel.com>
To: Zecheng Li <zecheng@google.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
	<linux-kernel@vger.kernel.org>,
	Peter Zijlstra <peterz@infradead.org>,
	Zecheng Li <zli94@ncsu.edu>,
	"K Prateek Nayak" <kprateek.nayak@amd.com>,
	Josh Don <joshdon@google.com>, <aubrey.li@linux.intel.com>,
	<yu.c.chen@intel.com>, <oliver.sang@intel.com>
Subject: [linus:master] [sched/fair]  b8fea7af0e: stress-ng.session.ops_per_sec 91.5% improvement
Date: Fri, 26 Jun 2026 17:11:58 +0800
Message-ID: <202606261633.32eca12c-lkp@intel.com>



Hello,

kernel test robot noticed a 91.5% improvement of stress-ng.session.ops_per_sec on:


commit: b8fea7af0e40feb6d9cbbd60b66ff0ec265e868f ("sched/fair: Allocate cfs_tg_state with percpu allocator")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 192 threads 2 sockets Intel(R) Xeon(R) 6740E  CPU @ 2.4GHz (Sierra Forest) with 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: session
	cpufreq_governor: performance


Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260626/202606261633.32eca12c-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-14/performance/x86_64-rhel-9.4/100%/debian-13-x86_64-20250902.cgz/lkp-srf-2sp3/session/stress-ng/60s

commit: 
  89e1f67186 ("sched/fair: Remove task_group->se pointer array")
  b8fea7af0e ("sched/fair: Allocate cfs_tg_state with percpu allocator")

89e1f67186baca35 b8fea7af0e40feb6d9cbbd60b66 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    610356           +91.5%    1168547 ±  2%  stress-ng.session.ops
     10187           +91.5%      19511 ±  2%  stress-ng.session.ops_per_sec
    682208           +23.9%     845283 ±  2%  stress-ng.time.involuntary_context_switches
      9315 ± 36%    +196.0%      27570 ± 46%  stress-ng.time.major_page_faults
  74894407 ±  3%     +91.4%  1.433e+08 ±  4%  stress-ng.time.minor_page_faults
     13490            -2.5%      13152        stress-ng.time.percent_of_cpu_this_job_got
      8076            -3.1%       7829        stress-ng.time.system_time
     33.20 ±  2%    +130.4%      76.48 ±  3%  stress-ng.time.user_time
   3019455          +147.5%    7471743 ±  2%  stress-ng.time.voluntary_context_switches
 2.619e+09 ±  6%     +41.1%  3.697e+09 ±  3%  cpuidle..time
   4789881          +106.5%    9891495 ±  2%  cpuidle..usage
     22.08 ±  4%     +44.9%      31.98 ±  3%  vmstat.cpu.id
   4955056            +9.8%    5439814        vmstat.memory.cache
    169.44 ±  2%     -13.4%     146.67 ±  2%  vmstat.procs.r
    124532          +120.7%     274835 ±  2%  vmstat.system.cs
    455602           +21.4%     553218        vmstat.system.in
     19.73 ±  5%     +10.0       29.75 ±  4%  mpstat.cpu.all.idle%
      0.03 ±  8%      -0.0        0.00 ± 81%  mpstat.cpu.all.iowait%
      0.37            +0.2        0.56 ±  3%  mpstat.cpu.all.irq%
     11.52            -9.3        2.22 ±  2%  mpstat.cpu.all.soft%
      0.89            +0.3        1.16        mpstat.cpu.all.usr%
     87.80            -6.3%      82.23        mpstat.max_utilization_pct
  23158413 ±  2%     +91.2%   44277233 ±  3%  numa-numastat.node0.local_node
  23346017 ±  2%     +95.5%   45639044 ±  3%  numa-numastat.node0.numa_hit
    187646 ± 17%    +625.7%    1361768 ±  6%  numa-numastat.node0.other_node
  21382055 ±  2%     +87.6%   40123301 ±  6%  numa-numastat.node1.local_node
  21639867 ±  2%     +92.2%   41600375 ±  6%  numa-numastat.node1.numa_hit
    257307 ± 11%    +474.1%    1477073 ±  6%  numa-numastat.node1.other_node
      2540           -12.4%       2225        turbostat.Avg_MHz
     79.44            -9.9       69.53        turbostat.Busy%
     22.02 ±  5%      +9.2       31.19 ±  3%  turbostat.C1%
     20.36 ±  5%     +48.8%      30.30 ±  3%  turbostat.CPU%c1
      0.15           +60.0%       0.24        turbostat.IPC
  30697679           +24.4%   38174002        turbostat.IRQ
    313.32            +2.6%     321.34        turbostat.PkgWatt
     24.18           +18.1%      28.56        turbostat.RAMWatt
   1537143           +32.6%    2038945        meminfo.Active
   1537110           +32.6%    2038912        meminfo.Active(anon)
   4808401            +9.8%    5281769        meminfo.Cached
   5073192            +8.4%    5498934        meminfo.Committed_AS
    391992           -15.3%     332050        meminfo.Mapped
     93966 ±  4%     +21.0%     113702 ±  5%  meminfo.PageTables
    172086 ±  7%    +325.6%     732317        meminfo.Percpu
   1095287           -56.6%     475325        meminfo.SUnreclaim
    716648           +66.1%    1190009        meminfo.Shmem
   1237725           -50.0%     618879        meminfo.Slab
      0.23           -66.5%       0.08        perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
     56.29 ± 20%     -37.3%      35.30 ± 13%  perf-sched.sch_delay.max.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      0.23           -66.5%       0.08        perf-sched.total_sch_delay.average.ms
     56.29 ± 20%     -37.3%      35.30 ± 13%  perf-sched.total_sch_delay.max.ms
      9.88           -51.7%       4.78        perf-sched.total_wait_and_delay.average.ms
    399971          +106.1%     824497        perf-sched.total_wait_and_delay.count.ms
      9.65           -51.3%       4.70        perf-sched.total_wait_time.average.ms
      9.88           -51.7%       4.78        perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
    399971          +106.1%     824497        perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
      9.65           -51.3%       4.70        perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
    133243          +116.4%     288278 ±  3%  perf-stat.i.context-switches
    194336            -1.1%     192149        perf-stat.i.cpu-clock
     27884           +91.6%      53428 ±  3%  perf-stat.i.cpu-migrations
    158.09 ± 37%    +185.5%     451.31 ± 46%  perf-stat.i.major-faults
   1256057 ±  3%     +88.1%    2362437 ±  4%  perf-stat.i.minor-faults
   1256215 ±  3%     +88.1%    2362888 ±  4%  perf-stat.i.page-faults
    194336            -1.1%     192149        perf-stat.i.task-clock
    128658          +118.2%     280754 ±  3%  perf-stat.ps.context-switches
     26831           +93.3%      51879 ±  3%  perf-stat.ps.cpu-migrations
    150.27 ± 37%    +193.4%     440.89 ± 46%  perf-stat.ps.major-faults
   1212558 ±  3%     +89.3%    2295418 ±  3%  perf-stat.ps.minor-faults
   1212708 ±  3%     +89.3%    2295859 ±  3%  perf-stat.ps.page-faults
    167364 ± 66%    +787.8%    1485853 ± 18%  numa-meminfo.node0.Active
    167345 ± 66%    +787.9%    1485838 ± 18%  numa-meminfo.node0.Active(anon)
     49987 ± 92%    +324.2%     212032 ± 19%  numa-meminfo.node0.Mapped
    619973 ±  6%     -58.1%     259551 ± 26%  numa-meminfo.node0.SUnreclaim
     29716 ± 78%   +3008.8%     923832 ±  4%  numa-meminfo.node0.Shmem
    705775 ±  7%     -54.1%     323802 ± 30%  numa-meminfo.node0.Slab
   1370362 ±  8%     -59.5%     555194 ± 51%  numa-meminfo.node1.Active
   1370348 ±  8%     -59.5%     555175 ± 51%  numa-meminfo.node1.Active(anon)
    362187 ± 10%     -74.2%      93542 ±159%  numa-meminfo.node1.AnonHugePages
    341339 ± 12%     -64.6%     120912 ± 35%  numa-meminfo.node1.Mapped
     45901 ± 16%     +23.8%      56827 ±  4%  numa-meminfo.node1.PageTables
    476900 ± 10%     -54.7%     215830 ± 31%  numa-meminfo.node1.SUnreclaim
    686948 ±  3%     -61.1%     267149 ± 15%  numa-meminfo.node1.Shmem
    533533 ± 10%     -44.7%     295138 ± 33%  numa-meminfo.node1.Slab
    383948           +32.8%     509815        proc-vmstat.nr_active_anon
    206513            +3.4%     213464        proc-vmstat.nr_anon_pages
   1201861            +9.9%    1320471        proc-vmstat.nr_file_pages
     97614 ±  2%     -14.9%      83031        proc-vmstat.nr_mapped
     23581 ±  3%     +20.7%      28463 ±  4%  proc-vmstat.nr_page_table_pages
    178922           +66.3%     297531        proc-vmstat.nr_shmem
    274192           -56.7%     118831        proc-vmstat.nr_slab_unreclaimable
     80759            +3.2%      83318        proc-vmstat.nr_vmalloc
    383948           +32.8%     509815        proc-vmstat.nr_zone_active_anon
  44989050 ±  2%     +93.9%   87242239 ±  2%  proc-vmstat.numa_hit
  44543635 ±  2%     +89.5%   84403353 ±  2%  proc-vmstat.numa_local
    444953 ±  3%    +538.0%    2838842        proc-vmstat.numa_other
  48904767 ±  2%     +85.6%   90766237 ±  2%  proc-vmstat.pgalloc_normal
  75424210 ±  3%     +90.8%  1.439e+08 ±  4%  proc-vmstat.pgfault
  48116497 ±  2%     +86.5%   89739194 ±  2%  proc-vmstat.pgfree
   4261223 ± 10%     +96.1%    8358094 ±  6%  proc-vmstat.pgreuse
     41819 ± 66%    +788.2%     371430 ± 18%  numa-vmstat.node0.nr_active_anon
     12503 ± 92%    +324.2%      53034 ± 19%  numa-vmstat.node0.nr_mapped
      7429 ± 78%   +3008.7%     230968 ±  4%  numa-vmstat.node0.nr_shmem
    155024 ±  6%     -58.1%      64888 ± 26%  numa-vmstat.node0.nr_slab_unreclaimable
     41818 ± 66%    +788.2%     371428 ± 18%  numa-vmstat.node0.nr_zone_active_anon
  23345970 ±  2%     +95.5%   45639379 ±  3%  numa-vmstat.node0.numa_hit
  23158366 ±  2%     +91.2%   44277567 ±  3%  numa-vmstat.node0.numa_local
    187646 ± 17%    +625.7%    1361768 ±  6%  numa-vmstat.node0.numa_other
    342699 ±  8%     -59.5%     138864 ± 51%  numa-vmstat.node1.nr_active_anon
    176.85 ± 10%     -74.2%      45.68 ±159%  numa-vmstat.node1.nr_anon_transparent_hugepages
     85411 ± 12%     -64.6%      30246 ± 35%  numa-vmstat.node1.nr_mapped
     11500 ± 16%     +23.8%      14240 ±  4%  numa-vmstat.node1.nr_page_table_pages
    171755 ±  3%     -61.1%      66798 ± 15%  numa-vmstat.node1.nr_shmem
    119206 ± 10%     -54.7%      53962 ± 31%  numa-vmstat.node1.nr_slab_unreclaimable
    342698 ±  8%     -59.5%     138863 ± 51%  numa-vmstat.node1.nr_zone_active_anon
  21639864 ±  2%     +92.2%   41600300 ±  6%  numa-vmstat.node1.numa_hit
  21382052 ±  2%     +87.6%   40123226 ±  6%  numa-vmstat.node1.numa_local
    257307 ± 11%    +474.1%    1477073 ±  6%  numa-vmstat.node1.numa_other
      0.50 ± 10%     -16.4%       0.42 ±  8%  sched_debug.cfs_rq:/.h_nr_queued.avg
      0.49 ± 11%     -15.5%       0.41 ±  8%  sched_debug.cfs_rq:/.h_nr_runnable.avg
     32477 ± 11%     -20.0%      25967 ± 15%  sched_debug.cfs_rq:/.left_deadline.max
     32462 ± 11%     -20.1%      25950 ± 15%  sched_debug.cfs_rq:/.left_vruntime.max
    495516 ± 11%     -18.3%     404915 ±  9%  sched_debug.cfs_rq:/.load.avg
      1675 ± 12%     -22.9%       1291 ± 24%  sched_debug.cfs_rq:/.load_avg.avg
     17.83 ± 37%     +73.3%      30.90 ± 29%  sched_debug.cfs_rq:/.load_avg.min
      0.50 ± 10%     -16.5%       0.42 ±  8%  sched_debug.cfs_rq:/.nr_queued.avg
     32462 ± 11%     -20.1%      25950 ± 15%  sched_debug.cfs_rq:/.right_vruntime.max
 8.253e+11 ± 46%     -68.5%    2.6e+11 ± 44%  sched_debug.cfs_rq:/.sum_w_vruntime.max
-1.142e+12           -79.2% -2.372e+11        sched_debug.cfs_rq:/.sum_w_vruntime.min
 1.641e+11 ± 14%     -80.5%  3.196e+10 ± 30%  sched_debug.cfs_rq:/.sum_w_vruntime.stddev
    562.81 ±  4%      -6.3%     527.61 ±  6%  sched_debug.cpu.clock_task.stddev
    408623 ± 13%     +44.3%     589658 ±  7%  sched_debug.cpu.curr->pid.avg
    609584           +90.0%    1157918 ±  2%  sched_debug.cpu.curr->pid.max
    278101 ±  6%    +106.4%     574007 ±  2%  sched_debug.cpu.curr->pid.stddev
      0.50 ± 10%     -16.5%       0.42 ±  8%  sched_debug.cpu.nr_running.avg
     22206          +111.3%      46925 ±  3%  sched_debug.cpu.nr_switches.avg
     59463 ± 30%     +66.6%      99041 ± 11%  sched_debug.cpu.nr_switches.max
     16171 ± 11%     +62.8%      26329 ± 21%  sched_debug.cpu.nr_switches.min
   -115.25          +180.9%    -323.70        sched_debug.cpu.nr_uninterruptible.min
     47.53 ± 13%    +134.1%     111.26 ± 21%  sched_debug.cpu.nr_uninterruptible.stddev
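
For readers checking the comparison table above by hand, the sketch below recomputes a few %change entries from the two mean columns. It assumes that %change is the relative difference between the base-commit mean (left) and the tested-commit mean (right), and that rows whose metric name ends in "%" (for example mpstat.cpu.all.idle%) report an absolute difference in percentage points instead. Both assumptions are inferred from the numbers in the table, and the helper names are hypothetical; they are not part of lkp-tests.

```python
def pct_change(base: float, new: float) -> float:
    """Relative change in percent between the base and tested means."""
    return (new - base) / base * 100.0


def point_change(base: float, new: float) -> float:
    """Absolute difference, as appears to be used for metrics already in percent."""
    return new - base


# (base mean, tested mean) copied from the table above.
relative_rows = {
    "stress-ng.session.ops": (610356, 1168547),
    "stress-ng.session.ops_per_sec": (10187, 19511),
    "meminfo.SUnreclaim": (1095287, 475325),
}
point_rows = {
    "mpstat.cpu.all.idle%": (19.73, 29.75),
    "mpstat.cpu.all.soft%": (11.52, 2.22),
}

for name, (base, new) in relative_rows.items():
    print(f"{name}: {pct_change(base, new):+.1f}%")

for name, (base, new) in point_rows.items():
    print(f"{name}: {point_change(base, new):+.1f}")
```

Because the table lists rounded means, a recomputed value can differ from the reported one in the last digit.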




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

