From: kernel test robot <oliver.sang@intel.com>
To: Thomas Gleixner <tglx@linutronix.de>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>, <oliver.sang@intel.com>
Subject: [tglx-devel:locking/futex] [x86/vdso]  698a86a85c: stress-ng.priv-instr.ops_per_sec 5.8% improvement
Date: Wed, 27 May 2026 13:03:24 +0800
Message-ID: <202605271217.16dfca4d-lkp@intel.com>



Hello,

kernel test robot noticed a 5.8% improvement of stress-ng.priv-instr.ops_per_sec on:


commit: 698a86a85c4db4b800faa8a2890810ff9cec2f79 ("x86/vdso: Implement __vdso_futex_robust_try_unlock()")
https://git.kernel.org/cgit/linux/kernel/git/tglx/devel.git locking/futex

testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 192 threads 2 sockets Intel(R) Xeon(R) 6740E  CPU @ 2.4GHz (Sierra Forest) with 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: priv-instr
	cpufreq_governor: performance


We noticed the following config diff between the two commits, which seems reasonable:

--- /pkg/linux/x86_64-rhel-9.4/gcc-14/35b728574c5b70b8223b53787bdbc40e55ca8ac4/.config  2026-05-25 13:01:14.615107724 +0800
+++ /pkg/linux/x86_64-rhel-9.4/gcc-14/698a86a85c4db4b800faa8a2890810ff9cec2f79/.config  2026-05-25 11:39:09.283994568 +0800
@@ -281,6 +281,8 @@ CONFIG_FUTEX=y
 CONFIG_FUTEX_PI=y
 CONFIG_FUTEX_PRIVATE_HASH=y
 CONFIG_FUTEX_MPOL=y
+CONFIG_HAVE_FUTEX_ROBUST_UNLOCK=y
+CONFIG_FUTEX_ROBUST_UNLOCK=y
 CONFIG_EPOLL=y
 CONFIG_SIGNALFD=y
 CONFIG_TIMERFD=y
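
For readers who want to confirm the same difference on their own builds, the following is a minimal sketch of comparing the enabled options of two .config files. The helper name enabled_options and the inline sample strings are hypothetical and only mirror the hunk above; this is not how the test robot produces its diff.

```python
def enabled_options(config_text):
    """Return the set of CONFIG_* symbols set to 'y' in .config-style text."""
    opts = set()
    for line in config_text.splitlines():
        line = line.strip()
        # Skip comments such as "# CONFIG_FOO is not set" and blank lines.
        if not line.startswith("CONFIG_") or "=" not in line:
            continue
        name, value = line.split("=", 1)
        if value == "y":
            opts.add(name)
    return opts


# Sample text copied from the diff hunk above (parent commit vs. tested commit).
base_config = """\
CONFIG_FUTEX=y
CONFIG_FUTEX_PI=y
CONFIG_FUTEX_PRIVATE_HASH=y
CONFIG_FUTEX_MPOL=y
CONFIG_EPOLL=y
"""
new_config = """\
CONFIG_FUTEX=y
CONFIG_FUTEX_PI=y
CONFIG_FUTEX_PRIVATE_HASH=y
CONFIG_FUTEX_MPOL=y
CONFIG_HAVE_FUTEX_ROBUST_UNLOCK=y
CONFIG_FUTEX_ROBUST_UNLOCK=y
CONFIG_EPOLL=y
"""

# Options enabled only in the tested commit's config.
added = sorted(enabled_options(new_config) - enabled_options(base_config))
print(added)
```

In practice the two strings would be read from the .config files of the respective builds.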



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260527/202605271217.16dfca4d-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-14/performance/x86_64-rhel-9.4/100%/debian-13-x86_64-20250902.cgz/lkp-srf-2sp3/priv-instr/stress-ng/60s

commit: 
  35b728574c ("x86/vdso: Prepare for robust futex unlock support")
  698a86a85c ("x86/vdso: Implement __vdso_futex_robust_try_unlock()")

35b728574c5b70b8 698a86a85c4db4b800faa8a2890 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     28468            -5.6%      26878        stress-ng.priv-instr.nanosecs_per_privileged_op_trap
 3.983e+08            +5.8%  4.214e+08        stress-ng.priv-instr.ops
   6643685            +5.8%    7028732        stress-ng.priv-instr.ops_per_sec
     60.10            +0.0%      60.10        stress-ng.time.elapsed_time
     60.10            +0.0%      60.10        stress-ng.time.elapsed_time.max
     47894            -0.0%      47889        stress-ng.time.involuntary_context_switches
      7847            +0.0%       7848        stress-ng.time.maximum_resident_set_size
     18927 ±  2%      +0.7%      19061 ±  2%  stress-ng.time.minor_page_faults
      4096            +0.0%       4096        stress-ng.time.page_size
     19086            +0.0%      19087        stress-ng.time.percent_of_cpu_this_job_got
     11320            -0.1%      11312        stress-ng.time.system_time
    150.74            +5.9%     159.68        stress-ng.time.user_time
    671.60            +1.0%     678.17 ±  3%  stress-ng.time.voluntary_context_switches
      0.00         +3e+102%       3.00 ±107%  perf-c2c.DRAM.local
      0.00       +8.5e+103%      85.33 ±111%  perf-c2c.DRAM.remote
      0.00       +6.6e+104%     660.17 ±100%  perf-c2c.HITM.local
      0.00       +8.2e+103%      81.83 ±110%  perf-c2c.HITM.remote
      0.00       +7.4e+104%     742.00 ±101%  perf-c2c.HITM.total
    247011 ± 28%      -8.9%     225103 ± 35%  numa-numastat.node0.local_node
    373383 ± 15%      -1.0%     369611 ± 13%  numa-numastat.node0.numa_hit
    126371 ± 47%     +14.4%     144508 ± 30%  numa-numastat.node0.other_node
    367783 ± 18%      +5.5%     387890 ± 20%  numa-numastat.node1.local_node
    440260 ± 12%      +0.5%     442288 ± 11%  numa-numastat.node1.numa_hit
     72476 ± 81%     -24.9%      54397 ± 80%  numa-numastat.node1.other_node
      3.27            -0.0        3.26        mpstat.cpu.all.idle%
      0.09            +0.0        0.09 ±  3%  mpstat.cpu.all.irq%
      0.00 ± 52%      +0.0        0.00 ± 31%  mpstat.cpu.all.soft%
     95.15            -0.1       95.08        mpstat.cpu.all.sys%
      1.49            +0.1        1.56        mpstat.cpu.all.usr%
      3.00            +0.0%       3.00        mpstat.max_utilization.seconds
    100.00            +0.0%     100.00        mpstat.max_utilization_pct
     60.10            +0.0%      60.10        time.elapsed_time
     60.10            +0.0%      60.10        time.elapsed_time.max
     47894            -0.0%      47889        time.involuntary_context_switches
      7847            +0.0%       7848        time.maximum_resident_set_size
     18927 ±  2%      +0.7%      19061 ±  2%  time.minor_page_faults
      4096            +0.0%       4096        time.page_size
     19086            +0.0%      19087        time.percent_of_cpu_this_job_got
     11320            -0.1%      11312        time.system_time
    150.74            +5.9%     159.68        time.user_time
    671.60            +1.0%     678.17 ±  3%  time.voluntary_context_switches
      6.08 ±  9%      -7.8%       5.61 ± 13%  vmstat.cpu.id
     92.50            +0.5%      92.93        vmstat.cpu.sy
      1.42            +2.0%       1.45 ±  2%  vmstat.cpu.us
      0.33            +0.5%       0.33        vmstat.io.bi
      4.00            +0.0%       4.00        vmstat.memory.buff
   4549331            -0.0%    4547648        vmstat.memory.cache
 2.514e+08            -0.0%  2.514e+08        vmstat.memory.free
    181.12            +0.9%     182.71        vmstat.procs.r
      2505 ±  2%      -3.0%       2430 ±  2%  vmstat.system.cs
    443254            +0.4%     445179        vmstat.system.in
      0.07 ±  5%      -3.7%       0.07 ±  4%  perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
     10.80 ± 42%      -6.8%      10.06 ± 38%  perf-sched.sch_delay.max.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      0.07 ±  5%      -3.7%       0.07 ±  4%  perf-sched.total_sch_delay.average.ms
     10.80 ± 42%      -6.8%      10.06 ± 38%  perf-sched.total_sch_delay.max.ms
    176.52 ±  2%      -0.6%     175.50 ±  5%  perf-sched.total_wait_and_delay.average.ms
      7255 ±  2%      +1.9%       7392 ±  6%  perf-sched.total_wait_and_delay.count.ms
      4979            +0.0%       4981        perf-sched.total_wait_and_delay.max.ms
    176.45 ±  2%      -0.6%     175.43 ±  5%  perf-sched.total_wait_time.average.ms
      4979            +0.0%       4981        perf-sched.total_wait_time.max.ms
    176.52 ±  2%      -0.6%     175.50 ±  5%  perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      7255 ±  2%      +1.9%       7392 ±  6%  perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
      4979            +0.0%       4981        perf-sched.wait_and_delay.max.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
    176.45 ±  2%      -0.6%     175.43 ±  5%  perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      4979            +0.0%       4981        perf-sched.wait_time.max.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      2004            -0.3%       1999        perf-stat.i.context-switches
    192058            -0.0%     192055        perf-stat.i.cpu-clock
    311.00            -0.7%     308.78        perf-stat.i.cpu-migrations
      0.02 ± 99%     +23.7%       0.02 ± 93%  perf-stat.i.major-faults
      3937 ±  3%      +2.8%       4049 ±  4%  perf-stat.i.minor-faults
      3937 ±  3%      +2.8%       4049 ±  4%  perf-stat.i.page-faults
    192058            -0.0%     192055        perf-stat.i.task-clock
      1972            -0.2%       1968        perf-stat.ps.context-switches
    188859            -0.0%     188847        perf-stat.ps.cpu-clock
    305.80            -0.7%     303.61        perf-stat.ps.cpu-migrations
      0.02 ± 96%     +25.7%       0.02 ± 92%  perf-stat.ps.major-faults
      3872 ±  3%      +2.8%       3979 ±  4%  perf-stat.ps.minor-faults
      3872 ±  3%      +2.8%       3979 ±  4%  perf-stat.ps.page-faults
    188859            -0.0%     188847        perf-stat.ps.task-clock
      1852 ± 81%     +66.9%       3093        turbostat.Avg_MHz
     57.87 ± 81%     +38.8       96.62        turbostat.Busy%
      1920 ± 81%     +66.7%       3201        turbostat.Bzy_MHz
      1.99 ± 82%      +1.2        3.15 ±  5%  turbostat.C1%
      1.99 ± 82%     +58.0%       3.14 ±  5%  turbostat.CPU%c1
     30.20 ± 81%     +68.3%      50.83        turbostat.CoreTmp
      0.03 ± 81%     +66.7%       0.05        turbostat.IPC
  16893263 ± 81%     +66.7%   28163412        turbostat.IRQ
   4937639 ± 81%     +66.8%    8234664        turbostat.NMI
     30.60 ± 81%     +66.1%      50.83        turbostat.PkgTmp
    171.65 ± 81%     +67.0%     286.58        turbostat.PkgWatt
      9.03 ± 81%     +66.6%      15.05        turbostat.RAMWatt
      0.01 ± 81%     +66.7%       0.02        turbostat.SysWatt
      1440 ± 81%     +66.7%       2400        turbostat.TSC_MHz
      1584 ±  6%      -8.7%       1447        sched_debug.cfs_rq:/.runnable_avg.max
    162.28 ±  5%      -8.7%     148.14 ±  4%  sched_debug.cfs_rq:/.runnable_avg.stddev
      1025 ±  4%      +7.6%       1103 ±  4%  sched_debug.cfs_rq:/.util_avg.max
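
As a reading aid for the table above, the %change column for the headline rows can be reproduced from the two value columns. This is a minimal sketch that assumes %change is the relative difference of the second column against the first; the function name pct_change is hypothetical, and rows such as mpstat.cpu.all.* appear to report an absolute difference in percentage points instead.

```python
def pct_change(base, new):
    """Relative change of `new` against `base`, in percent."""
    return (new - base) / base * 100.0


# (base, new) pairs copied from the table above.
rows = {
    "stress-ng.priv-instr.nanosecs_per_privileged_op_trap": (28468, 26878),
    "stress-ng.priv-instr.ops_per_sec": (6643685, 7028732),
    "stress-ng.time.user_time": (150.74, 159.68),
}

for name, (base, new) in rows.items():
    print(f"{pct_change(base, new):+.1f}%  {name}")
```

The rounded results agree with the -5.6%, +5.8% and +5.9% entries reported for these rows.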




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

