* [tip:core/rseq] [rseq]  9f6ffd4ceb:  stress-ng.timer.ops_per_sec 5.5% improvement
@ 2025-11-11  7:11 kernel test robot
From: kernel test robot @ 2025-11-11  7:11 UTC (permalink / raw)
  To: Thomas Gleixner
  Cc: oe-lkp, lkp, linux-kernel, x86, Ingo Molnar, Peter Zijlstra,
	Mathieu Desnoyers, oliver.sang



Hello,

kernel test robot noticed a 5.5% improvement of stress-ng.timer.ops_per_sec on:


commit: 9f6ffd4cebda86841700775de3213f22bb0ea22d ("rseq: Separate the signal delivery path")
https://git.kernel.org/cgit/linux/kernel/git/tip/tip.git core/rseq


testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 192 threads, 2 sockets, Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest), 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: timer
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20251111/202511111524.d2e6b63d-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-14/performance/x86_64-rhel-9.4/100%/debian-13-x86_64-20250902.cgz/lkp-srf-2sp3/timer/stress-ng/60s

commit: 
  0f085b4188 ("rseq: Provide and use rseq_set_ids()")
  9f6ffd4ceb ("rseq: Separate the signal delivery path")

0f085b41880e3140 9f6ffd4cebda86841700775de32 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    147208 ±  3%     +15.8%     170452 ±  4%  meminfo.Mapped
     50.63           +34.3       84.92        mpstat.cpu.all.sys%
     36.46           -34.8        1.67 ± 13%  mpstat.cpu.all.usr%
     14932 ± 35%     +67.9%      25072 ± 15%  numa-vmstat.node0.nr_slab_reclaimable
     22290 ± 24%     -45.5%      12149 ± 32%  numa-vmstat.node1.nr_slab_reclaimable
     10.17 ± 44%    +375.4%      48.33 ±  9%  perf-c2c.DRAM.local
    162.67 ± 31%    +558.7%       1071 ± 12%  perf-c2c.DRAM.remote
    141.00 ± 29%    +578.6%     956.83 ± 12%  perf-c2c.HITM.local
    154.50 ± 32%    +476.2%     890.17 ± 13%  perf-c2c.HITM.remote
     37215 ±  3%     +15.2%      42879 ±  5%  proc-vmstat.nr_mapped
     66948            +8.7%      72780        proc-vmstat.nr_shmem
    818668            +2.7%     840430        proc-vmstat.numa_hit
    620609            +3.5%     642526        proc-vmstat.numa_local
     59728 ± 35%     +67.9%     100287 ± 15%  numa-meminfo.node0.KReclaimable
     59728 ± 35%     +67.9%     100287 ± 15%  numa-meminfo.node0.SReclaimable
    304140 ±  8%     +16.4%     353946 ±  7%  numa-meminfo.node0.Slab
     89141 ± 24%     -45.5%      48551 ± 32%  numa-meminfo.node1.KReclaimable
     89141 ± 24%     -45.5%      48551 ± 32%  numa-meminfo.node1.SReclaimable
     47698 ±  2%     +16.6%      55634        stress-ng.time.involuntary_context_switches
      5931           +67.8%       9952        stress-ng.time.system_time
      4259           -95.8%     179.67 ± 14%  stress-ng.time.user_time
 5.054e+09            +5.5%  5.335e+09        stress-ng.timer.ops
  84269084            +5.5%   88943393        stress-ng.timer.ops_per_sec
      0.05 ±  4%     -14.9%       0.04 ±  2%  perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      0.05 ±  4%     -14.9%       0.04 ±  2%  perf-sched.total_sch_delay.average.ms
    159.90 ±  2%     -15.6%     135.02        perf-sched.total_wait_and_delay.average.ms
      7663 ±  2%     +18.3%       9069 ±  2%  perf-sched.total_wait_and_delay.count.ms
    159.86 ±  2%     -15.6%     134.98        perf-sched.total_wait_time.average.ms
    159.90 ±  2%     -15.6%     135.02        perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      7663 ±  2%     +18.3%       9069 ±  2%  perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
    159.86 ±  2%     -15.6%     134.98        perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
  6.55e+10            +3.2%  6.757e+10        perf-stat.i.branch-instructions
  92397918            +6.4%   98318472 ±  2%  perf-stat.i.branch-misses
   6980157            +3.0%    7189854        perf-stat.i.cache-references
      1977 ±  2%     +12.2%       2219        perf-stat.i.context-switches
      1.76            -2.6%       1.71        perf-stat.i.cpi
    301.54            -2.7%     293.44        perf-stat.i.cpu-migrations
 3.435e+11            +2.6%  3.526e+11        perf-stat.i.instructions
      0.57            +2.7%       0.58        perf-stat.i.ipc
      1.76            -2.6%       1.71        perf-stat.overall.cpi
      0.57            +2.7%       0.58        perf-stat.overall.ipc
 6.443e+10            +3.2%  6.647e+10        perf-stat.ps.branch-instructions
  90890135            +6.4%   96717754 ±  2%  perf-stat.ps.branch-misses
   6864424            +3.0%    7069354        perf-stat.ps.cache-references
      1945 ±  2%     +12.2%       2183        perf-stat.ps.context-switches
    296.64            -2.7%     288.77        perf-stat.ps.cpu-migrations
 3.379e+11            +2.7%  3.469e+11        perf-stat.ps.instructions
 2.077e+13            +2.7%  2.133e+13        perf-stat.total.instructions
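
A note for readers cross-checking the table above: judging from the numbers alone, the %change column seems to follow two conventions. Rows whose change carries a % sign (for example stress-ng.timer.ops_per_sec) look like relative changes against the base commit, while rows without one (for example mpstat.cpu.all.sys%) look like plain differences in percentage points. This is an inference from the reported values, not something the report states. A minimal Python sketch, with hypothetical helper names, that recomputes a few rows under that assumption:

```python
# Sketch only: recompute selected %change entries from the comparison table.
# The two helper names are hypothetical and not part of lkp-tests.

def relative_change(base: float, new: float) -> float:
    """Relative change in percent, as assumed for rows shown with a % sign."""
    return (new - base) / base * 100.0


def point_change(base: float, new: float) -> float:
    """Absolute difference, as assumed for rows that are already percentages."""
    return new - base


# Values copied from the table (base = 0f085b4188, new = 9f6ffd4ceb).
ops_per_sec = (84269084, 88943393)   # stress-ng.timer.ops_per_sec
system_time = (5931, 9952)           # stress-ng.time.system_time
sys_pct = (50.63, 84.92)             # mpstat.cpu.all.sys%
usr_pct = (36.46, 1.67)              # mpstat.cpu.all.usr%

print(f"ops_per_sec: {relative_change(*ops_per_sec):+.1f}%")  # +5.5%
print(f"system_time: {relative_change(*system_time):+.1f}%")  # +67.8%
print(f"sys%:        {point_change(*sys_pct):+.1f}")          # +34.3
print(f"usr%:        {point_change(*usr_pct):+.1f}")          # -34.8
```

The table prints rounded averages, so recomputed figures can differ from the reported ones in the last digit for other rows.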




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

