All of lore.kernel.org
 help / color / mirror / Atom feed
From: kernel test robot <oliver.sang@intel.com>
To: "Michal Koutný" <mkoutny@suse.com>
Cc: oe-lkp@lists.linux.dev, lkp@intel.com,
	linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org,
	linux-trace-kernel@vger.kernel.org,
	"Christian Brauner" <brauner@kernel.org>,
	"Alexander Mikhalitsyn" <alexander@mihalicyn.com>,
	linux-doc@vger.kernel.org, "Jonathan Corbet" <corbet@lwn.net>,
	"Kees Cook" <kees@kernel.org>,
	"Andrew Morton" <akpm@linux-foundation.org>,
	"Eric W . Biederman" <ebiederm@xmission.com>,
	"Michal Koutný" <mkoutny@suse.com>,
	"Oleg Nesterov" <oleg@redhat.com>,
	oliver.sang@intel.com
Subject: Re: [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"
Date: Mon, 10 Mar 2025 15:32:11 +0800	[thread overview]
Message-ID: <202503101532.348576bb-lkp@intel.com> (raw)
In-Reply-To: <20250221170249.890014-2-mkoutny@suse.com>



Hello,

kernel test robot noticed a 23.4% improvement of stress-ng.sigxfsz.ops_per_sec on:


commit: ee2a5c3e36093d0ff5709bc8f21d3793cf55f746 ("[PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"")
url: https://github.com/intel-lab-lkp/linux/commits/Michal-Koutn/Revert-pid-allow-pid_max-to-be-set-per-pid-namespace/20250222-010942
patch link: https://lore.kernel.org/all/20250221170249.890014-2-mkoutny@suse.com/
patch subject: [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace"

testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: sigxfsz
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.mprotect.ops_per_sec 4.5% improvement                                |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=mprotect                                                                             |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.sigrt.ops_per_sec 15.7% improvement                                  |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=sigrt                                                                                |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.sigbus.ops_per_sec 20.6% improvement                                 |
| test machine     | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory |
| test parameters  | cpufreq_governor=performance                                                              |
|                  | nr_threads=100%                                                                           |
|                  | test=sigbus                                                                               |
|                  | testtime=60s                                                                              |
+------------------+-------------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250310/202503101532.348576bb-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigxfsz/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      5.11            +1.3        6.43        mpstat.cpu.all.usr%
      3737 ±  6%     -38.8%       2286 ± 42%  proc-vmstat.numa_hint_faults_local
   1212920 ±  4%     -10.4%    1086901 ±  5%  sched_debug.cpu.avg_idle.max
     35.50 ± 16%     -30.0%      24.83 ± 20%  perf-c2c.DRAM.local
      1517 ±  4%     -46.5%     812.17 ±  3%  perf-c2c.DRAM.remote
      1808 ±  2%     +57.0%       2840        perf-c2c.HITM.local
      1360 ±  5%     -49.9%     680.83 ±  2%  perf-c2c.HITM.remote
      5.22 ±  3%     +19.8%       6.26 ±  7%  perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
     53.33 ± 15%     +25.0%      66.67 ± 15%  perf-sched.wait_and_delay.count.__cond_resched.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
    953.83 ±  3%     -16.5%     796.33 ±  7%  perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      5.21 ±  3%     +20.0%       6.25 ±  7%  perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
    163515           +27.8%     208915        stress-ng.sigxfsz.SIGXFSZ_signals_per_sec
 6.668e+08           +23.4%   8.23e+08        stress-ng.sigxfsz.ops
  11113966           +23.4%   13716156        stress-ng.sigxfsz.ops_per_sec
      3623            -1.4%       3573        stress-ng.time.system_time
    163.26           +31.7%     214.98        stress-ng.time.user_time
      0.25           -54.7%       0.12 ±  2%  perf-stat.i.MPKI
 1.125e+10           +22.1%  1.373e+10        perf-stat.i.branch-instructions
      0.54            -0.0        0.50        perf-stat.i.branch-miss-rate%
  59748239           +10.9%   66264440        perf-stat.i.branch-misses
     33.30           -17.9       15.38 ±  2%  perf-stat.i.cache-miss-rate%
  13040640           -45.8%    7066419 ±  2%  perf-stat.i.cache-misses
  39047103           +15.5%   45098530        perf-stat.i.cache-references
      4.39           -18.2%       3.59        perf-stat.i.cpi
     17823           +97.0%      35113        perf-stat.i.cycles-between-cache-misses
 5.144e+10           +22.0%  6.275e+10        perf-stat.i.instructions
      0.23           +21.3%       0.28        perf-stat.i.ipc
      0.25           -55.6%       0.11 ±  2%  perf-stat.overall.MPKI
      0.53            -0.0        0.48        perf-stat.overall.branch-miss-rate%
     33.40           -17.7       15.67 ±  2%  perf-stat.overall.cache-miss-rate%
      4.40           -18.0%       3.60        perf-stat.overall.cpi
     17350           +84.6%      32027 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.23           +22.0%       0.28        perf-stat.overall.ipc
 1.106e+10           +22.1%   1.35e+10        perf-stat.ps.branch-instructions
  58763534           +10.9%   65180843        perf-stat.ps.branch-misses
  12827760           -45.8%    6951883 ±  2%  perf-stat.ps.cache-misses
  38411225           +15.5%   44365626        perf-stat.ps.cache-references
  5.06e+10           +22.0%  6.172e+10        perf-stat.ps.instructions
 3.106e+12           +21.9%  3.787e+12        perf-stat.total.instructions


***************************************************************************************************
lkp-icl-2sp7: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/mprotect/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     10205 ± 25%     +33.5%      13621 ± 16%  numa-meminfo.node0.KernelStack
      0.02 ± 37%     -37.8%       0.01 ± 13%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.82 ± 32%     -37.7%       0.51 ±  7%  perf-sched.sch_delay.max.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
    807.17 ±  5%      -8.5%     738.67 ±  5%  perf-sched.wait_and_delay.count.__cond_resched.down_write.vma_prepare.__split_vma.vma_modify
    433709            +4.9%     454923 ±  5%  proc-vmstat.nr_active_anon
     61940 ±  3%     +31.3%      81315 ± 35%  proc-vmstat.nr_shmem
    433709            +4.9%     454923 ±  5%  proc-vmstat.nr_zone_active_anon
 4.903e+08            +4.5%  5.124e+08        stress-ng.mprotect.ops
   8163833            +4.5%    8533021        stress-ng.mprotect.ops_per_sec
    239.55            +4.7%     250.91        stress-ng.time.user_time
   3960356 ±  7%     -16.0%    3325457        numa-numastat.node0.local_node
   3990670 ±  7%     -16.1%    3348370        numa-numastat.node0.numa_hit
   2608139 ±  6%     +34.5%    3507199 ±  4%  numa-numastat.node1.local_node
   2644058 ±  6%     +34.3%    3550893 ±  4%  numa-numastat.node1.numa_hit
   3986137 ±  7%     -16.0%    3349506        numa-vmstat.node0.numa_hit
   3955823 ±  7%     -15.9%    3326594        numa-vmstat.node0.numa_local
   2639425 ±  6%     +34.6%    3552253 ±  4%  numa-vmstat.node1.numa_hit
   2603506 ±  6%     +34.8%    3508559 ±  4%  numa-vmstat.node1.numa_local
      1.11 ± 20%     -38.9%       0.68 ± 31%  sched_debug.cfs_rq:/.h_nr_queued.stddev
      1.11 ± 19%     -38.6%       0.68 ± 31%  sched_debug.cfs_rq:/.h_nr_runnable.stddev
      5890 ±  6%     -10.7%       5262        sched_debug.cfs_rq:/.runnable_avg.max
      1064 ± 20%     -41.1%     626.67 ± 33%  sched_debug.cfs_rq:/.runnable_avg.stddev
      1151           -12.2%       1010        sched_debug.cpu.clock_task.stddev
      1.11 ± 20%     -39.1%       0.68 ± 32%  sched_debug.cpu.nr_running.stddev
 1.861e+10            +4.5%  1.945e+10        perf-stat.i.branch-instructions
 1.264e+08            +4.1%  1.316e+08        perf-stat.i.branch-misses
  1.45e+08            +5.3%  1.526e+08        perf-stat.i.cache-references
      2.28            -4.3%       2.18        perf-stat.i.cpi
 8.533e+10            +4.5%   8.92e+10        perf-stat.i.instructions
      0.44            +4.5%       0.46        perf-stat.i.ipc
     63.03            +4.5%      65.90        perf-stat.i.metric.K/sec
   4035009            +4.5%    4218051        perf-stat.i.page-faults
      2.29            -4.4%       2.19        perf-stat.overall.cpi
      0.44            +4.6%       0.46        perf-stat.overall.ipc
 1.829e+10            +4.5%  1.912e+10        perf-stat.ps.branch-instructions
 1.242e+08            +4.1%  1.293e+08        perf-stat.ps.branch-misses
 1.424e+08            +5.3%  1.499e+08        perf-stat.ps.cache-references
 8.385e+10            +4.6%  8.767e+10        perf-stat.ps.instructions
   3966080            +4.6%    4146673        perf-stat.ps.page-faults
 5.154e+12            +4.6%  5.389e+12        perf-stat.total.instructions
     36.24            -1.9       34.36 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_mprotect_mem
     38.30            -1.7       36.58 ±  2%  perf-profile.calltrace.cycles-pp.stress_mprotect_mem
     14.45 ±  2%      -1.7       12.80 ±  2%  perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     17.12            -1.5       15.58 ±  2%  perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     17.06            -1.5       15.54 ±  2%  perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
     12.44 ±  2%      -1.5       10.92 ±  2%  perf-profile.calltrace.cycles-pp.do_dec_rlimit_put_ucounts.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode
     12.46 ±  2%      -1.5       10.94 ±  2%  perf-profile.calltrace.cycles-pp.__sigqueue_free.get_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.54 ±  2%      -0.1        0.43 ± 44%  perf-profile.calltrace.cycles-pp.up_read.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault
      0.84            -0.1        0.75 ±  4%  perf-profile.calltrace.cycles-pp.down_write.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup
      1.60            -0.1        1.51 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.stress_sig_handler
      1.59            -0.1        1.51 ±  2%  perf-profile.calltrace.cycles-pp.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler
      0.82 ±  3%      -0.1        0.74 ±  2%  perf-profile.calltrace.cycles-pp.sigprocmask.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      1.44            -0.1        1.37 ±  2%  perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_sig_handler
      1.03 ±  2%      -0.1        0.98        perf-profile.calltrace.cycles-pp.__x64_sys_rt_sigprocmask.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      1.29 ±  2%      -0.1        1.23        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.pthread_sigmask
      0.68 ±  3%      -0.0        0.64 ±  2%  perf-profile.calltrace.cycles-pp.up_write.vma_complete.__split_vma.vma_modify.vma_modify_flags
      0.58 ±  2%      -0.0        0.54 ±  3%  perf-profile.calltrace.cycles-pp.__bad_area.bad_area_access_error.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem
      0.58 ±  2%      -0.0        0.56        perf-profile.calltrace.cycles-pp.fpu__clear_user_states.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.62 ±  3%      +0.1        0.67 ±  2%  perf-profile.calltrace.cycles-pp.mas_prev_slot.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.01            +0.1        1.07        perf-profile.calltrace.cycles-pp.copy_fpstate_to_sigframe.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart
      1.23            +0.1        1.30 ±  2%  perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.stress_mprotect_mem
      0.84 ±  3%      +0.1        0.91 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.commit_merge.vma_merge_existing_range.vma_modify
      0.84 ±  2%      +0.1        0.91        perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.vma_modify.vma_modify_flags.mprotect_fixup
      1.75 ±  2%      +0.1        1.83        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mprotect
      0.59 ±  2%      +0.1        0.67 ±  2%  perf-profile.calltrace.cycles-pp.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup.do_mprotect_pkey
      2.41 ±  2%      +0.1        2.50        perf-profile.calltrace.cycles-pp.clear_bhb_loop.__mprotect
      1.77            +0.1        1.88        perf-profile.calltrace.cycles-pp.get_sigframe.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode
      2.02            +0.1        2.14        perf-profile.calltrace.cycles-pp.x64_setup_rt_frame.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault
      0.98 ± 18%      +0.1        1.10        perf-profile.calltrace.cycles-pp.change_protection_range.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
      2.57            +0.1        2.70        perf-profile.calltrace.cycles-pp.handle_signal.arch_do_signal_or_restart.irqentry_exit_to_user_mode.asm_exc_page_fault.stress_mprotect_mem
      3.13 ±  3%      +0.2        3.34 ±  2%  perf-profile.calltrace.cycles-pp.asm_exc_page_fault.__mprotect
      0.00            +0.6        0.55 ±  2%  perf-profile.calltrace.cycles-pp.prepend_copy.simple_dname.perf_event_mmap_event.perf_event_mmap.mprotect_fixup
     34.00            +1.1       35.12 ±  2%  perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
     46.05            +1.1       47.19        perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     46.28            +1.2       47.43        perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     48.43            +1.2       49.61        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mprotect
     48.86            +1.2       50.06        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mprotect
     55.84            +1.6       57.41        perf-profile.calltrace.cycles-pp.__mprotect
     39.48            -1.9       37.62 ±  2%  perf-profile.children.cycles-pp.asm_exc_page_fault
     14.48 ±  2%      -1.6       12.83 ±  2%  perf-profile.children.cycles-pp.get_signal
     18.72            -1.6       17.11        perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
     39.92            -1.6       38.32 ±  2%  perf-profile.children.cycles-pp.stress_mprotect_mem
     18.52            -1.6       16.92        perf-profile.children.cycles-pp.arch_do_signal_or_restart
     12.47 ±  2%      -1.5       10.94 ±  2%  perf-profile.children.cycles-pp.__sigqueue_free
     12.44 ±  2%      -1.5       10.92 ±  2%  perf-profile.children.cycles-pp.do_dec_rlimit_put_ucounts
      5.00            -0.2        4.83 ±  2%  perf-profile.children.cycles-pp.up_write
      0.47 ± 10%      -0.1        0.34 ±  7%  perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
      0.47 ± 10%      -0.1        0.34 ±  7%  perf-profile.children.cycles-pp.hrtimer_interrupt
      1.16 ±  3%      -0.1        1.05        perf-profile.children.cycles-pp.recalc_sigpending
      0.35 ±  7%      -0.1        0.24 ±  6%  perf-profile.children.cycles-pp.__hrtimer_run_queues
      0.89 ±  6%      -0.1        0.79 ±  5%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.34 ±  8%      -0.1        0.24 ±  6%  perf-profile.children.cycles-pp.tick_nohz_handler
      0.86 ±  2%      -0.1        0.78        perf-profile.children.cycles-pp.sigprocmask
      0.28 ± 10%      -0.1        0.21 ±  6%  perf-profile.children.cycles-pp.update_process_times
      1.05 ±  2%      -0.1        0.98        perf-profile.children.cycles-pp.__x64_sys_rt_sigprocmask
      0.30 ±  3%      -0.0        0.26 ±  3%  perf-profile.children.cycles-pp.fpregs_mark_activate
      0.17 ± 10%      -0.0        0.13 ±  6%  perf-profile.children.cycles-pp.sched_tick
      0.47 ±  3%      -0.0        0.43 ±  3%  perf-profile.children.cycles-pp.complete_signal
      0.54 ±  2%      -0.0        0.51 ±  2%  perf-profile.children.cycles-pp.up_read
      0.58 ±  2%      -0.0        0.55 ±  2%  perf-profile.children.cycles-pp.__bad_area
      0.61            -0.0        0.58        perf-profile.children.cycles-pp.fpu__clear_user_states
      0.12 ±  5%      +0.0        0.14 ±  4%  perf-profile.children.cycles-pp.__get_user_nocheck_4
      0.13 ±  3%      +0.0        0.14 ±  3%  perf-profile.children.cycles-pp.ima_file_mprotect
      0.22 ±  5%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.security_file_mprotect
      0.25 ±  3%      +0.0        0.28 ±  4%  perf-profile.children.cycles-pp.stress_mwc16
      0.18 ±  5%      +0.0        0.20 ±  6%  perf-profile.children.cycles-pp.stress_mwc16modn
      0.34 ±  3%      +0.0        0.37 ±  3%  perf-profile.children.cycles-pp.mas_ascend
      0.12 ±  4%      +0.0        0.15 ±  5%  perf-profile.children.cycles-pp.copy_from_kernel_nofault_allowed
      0.30 ±  8%      +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.rcu_all_qs
      0.26 ±  4%      +0.0        0.29 ±  6%  perf-profile.children.cycles-pp.mas_pop_node
      0.44 ±  2%      +0.0        0.47        perf-profile.children.cycles-pp.vma_set_page_prot
      0.49 ±  3%      +0.0        0.53 ±  3%  perf-profile.children.cycles-pp.save_xstate_epilog
      0.66 ±  2%      +0.0        0.71 ±  2%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.02 ± 99%      +0.1        0.08 ± 11%  perf-profile.children.cycles-pp.anon_vma_clone
      1.27            +0.1        1.33        perf-profile.children.cycles-pp.do_user_addr_fault
      0.84            +0.1        0.90        perf-profile.children.cycles-pp.mas_prev_slot
      1.04            +0.1        1.11        perf-profile.children.cycles-pp.copy_fpstate_to_sigframe
      0.73 ±  7%      +0.1        0.79 ±  2%  perf-profile.children.cycles-pp.__cond_resched
      0.46 ±  3%      +0.1        0.53 ±  2%  perf-profile.children.cycles-pp.copy_from_kernel_nofault
      1.30 ±  2%      +0.1        1.37        perf-profile.children.cycles-pp.entry_SYSCALL_64
      0.50 ±  2%      +0.1        0.58 ±  2%  perf-profile.children.cycles-pp.prepend_copy
      1.68            +0.1        1.75        perf-profile.children.cycles-pp.mas_preallocate
      0.61 ±  3%      +0.1        0.70 ±  3%  perf-profile.children.cycles-pp.simple_dname
      2.77 ±  2%      +0.1        2.87        perf-profile.children.cycles-pp.clear_bhb_loop
      3.27            +0.1        3.37        perf-profile.children.cycles-pp.handle_signal
      1.78            +0.1        1.89        perf-profile.children.cycles-pp.get_sigframe
      2.05            +0.1        2.16        perf-profile.children.cycles-pp.x64_setup_rt_frame
      0.99 ± 18%      +0.1        1.11        perf-profile.children.cycles-pp.change_protection_range
      7.00            +0.2        7.24 ±  2%  perf-profile.children.cycles-pp.vma_prepare
     34.09            +1.1       35.22 ±  2%  perf-profile.children.cycles-pp.mprotect_fixup
     50.17            +1.1       51.31        perf-profile.children.cycles-pp.do_syscall_64
     46.24            +1.2       47.39        perf-profile.children.cycles-pp.do_mprotect_pkey
     46.33            +1.2       47.49        perf-profile.children.cycles-pp.__x64_sys_mprotect
     50.61            +1.2       51.78        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     55.94            +1.6       57.52        perf-profile.children.cycles-pp.__mprotect
     12.44 ±  2%      -1.5       10.91 ±  2%  perf-profile.self.cycles-pp.do_dec_rlimit_put_ucounts
      4.36            -0.1        4.22 ±  2%  perf-profile.self.cycles-pp.up_write
      1.14 ±  3%      -0.1        1.03        perf-profile.self.cycles-pp.recalc_sigpending
      0.87 ±  6%      -0.1        0.78 ±  5%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      2.83            -0.1        2.75        perf-profile.self.cycles-pp.down_write
      0.28 ±  5%      -0.0        0.23 ±  5%  perf-profile.self.cycles-pp.fpregs_mark_activate
      0.19 ± 10%      -0.0        0.14 ± 12%  perf-profile.self.cycles-pp.__perf_event_header__init_id
      0.40 ±  3%      -0.0        0.36 ±  5%  perf-profile.self.cycles-pp.complete_signal
      0.52 ±  2%      -0.0        0.48 ±  2%  perf-profile.self.cycles-pp.up_read
      0.15 ±  2%      -0.0        0.14 ±  3%  perf-profile.self.cycles-pp.__send_signal_locked
      0.10 ±  4%      -0.0        0.09 ±  4%  perf-profile.self.cycles-pp.__bad_area_nosemaphore
      0.30 ±  3%      +0.0        0.33 ±  4%  perf-profile.self.cycles-pp.mas_ascend
      0.10 ±  5%      +0.0        0.12 ±  5%  perf-profile.self.cycles-pp.do_user_addr_fault
      0.10 ±  4%      +0.0        0.12 ±  3%  perf-profile.self.cycles-pp.copy_from_kernel_nofault_allowed
      0.21 ±  6%      +0.0        0.24 ±  4%  perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.40            +0.0        0.43 ±  2%  perf-profile.self.cycles-pp.change_protection_range
      0.44            +0.0        0.47        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.24 ±  3%      +0.0        0.27 ±  6%  perf-profile.self.cycles-pp.mas_pop_node
      0.34 ±  2%      +0.0        0.38 ±  3%  perf-profile.self.cycles-pp.mas_preallocate
      0.37 ±  8%      +0.0        0.41 ±  3%  perf-profile.self.cycles-pp.__cond_resched
      0.72            +0.0        0.76 ±  2%  perf-profile.self.cycles-pp.copy_fpstate_to_sigframe
      0.41            +0.0        0.45 ±  3%  perf-profile.self.cycles-pp.mas_prev_slot
      0.66 ±  2%      +0.0        0.71 ±  2%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.30 ±  4%      +0.0        0.35 ±  2%  perf-profile.self.cycles-pp.copy_from_kernel_nofault
      0.02 ±141%      +0.1        0.08 ± 11%  perf-profile.self.cycles-pp.anon_vma_clone
      1.21 ±  2%      +0.1        1.30 ±  2%  perf-profile.self.cycles-pp.__mprotect
      2.73 ±  2%      +0.1        2.83        perf-profile.self.cycles-pp.clear_bhb_loop
      2.76            +0.1        2.88        perf-profile.self.cycles-pp.do_mprotect_pkey
      3.48 ±  3%      +0.3        3.74 ±  2%  perf-profile.self.cycles-pp.stress_mprotect_mem



***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigrt/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      1345 ±  9%     -15.8%       1132 ±  5%  perf-c2c.HITM.remote
   5328778           +18.0%    6289475        vmstat.system.cs
    197362            +2.0%     201296        vmstat.system.in
     45.97 ±118%     -85.4%       6.71 ± 55%  perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
    582.79 ± 39%     -39.2%     354.28 ± 31%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
      1260 ± 46%     -43.7%     709.74 ± 31%  perf-sched.wait_and_delay.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
     45.97 ±118%     -85.4%       6.71 ± 55%  perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
    705.59 ± 50%     -48.9%     360.90 ± 32%  perf-sched.wait_time.max.ms.schedule_hrtimeout_range.do_sigtimedwait.isra.0.__x64_sys_rt_sigtimedwait
     83250           -16.0%      69935        stress-ng.sigrt.nanosecs_between_sigqueue_and_sigwaitinfo_completion
 3.362e+08           +15.7%   3.89e+08        stress-ng.sigrt.ops
   5601334           +15.7%    6480915        stress-ng.sigrt.ops_per_sec
  65582158           +17.7%   77176472        stress-ng.time.involuntary_context_switches
      3423            -1.4%       3375        stress-ng.time.system_time
    335.13 ±  2%     +14.5%     383.80 ±  2%  stress-ng.time.user_time
 2.714e+08           +17.4%  3.185e+08        stress-ng.time.voluntary_context_switches
   4202907 ± 15%     -24.2%    3184715 ± 12%  sched_debug.cfs_rq:/.avg_vruntime.max
     82.07 ± 12%    +391.9%     403.68 ± 94%  sched_debug.cfs_rq:/.load_avg.avg
    169.48 ±  8%   +1182.4%       2173 ±115%  sched_debug.cfs_rq:/.load_avg.stddev
   4202907 ± 15%     -24.2%    3184715 ± 12%  sched_debug.cfs_rq:/.min_vruntime.max
      1239 ±  8%     +14.2%       1415 ± 12%  sched_debug.cfs_rq:/.util_avg.max
   2593172           +17.4%    3044316        sched_debug.cpu.nr_switches.avg
   1526897 ±  3%     +66.4%    2540867 ±  2%  sched_debug.cpu.nr_switches.min
    606805           -67.2%     198918 ±  9%  sched_debug.cpu.nr_switches.stddev
 1.902e+10           +14.8%  2.184e+10        perf-stat.i.branch-instructions
  1.42e+08 ±  3%     +16.2%   1.65e+08        perf-stat.i.branch-misses
      6.65 ±  4%      -0.9        5.77 ±  7%  perf-stat.i.cache-miss-rate%
 3.931e+08 ±  9%     +17.1%  4.605e+08 ±  6%  perf-stat.i.cache-references
   5534190           +17.4%    6498045        perf-stat.i.context-switches
      2.71           -14.3%       2.33        perf-stat.i.cpi
 8.694e+10           +14.8%  9.976e+10        perf-stat.i.instructions
      0.39           +14.2%       0.45        perf-stat.i.ipc
     86.53           +17.4%     101.60        perf-stat.i.metric.K/sec
      6.82 ±  5%      -0.9        5.91 ±  9%  perf-stat.overall.cache-miss-rate%
      2.59           -12.9%       2.26        perf-stat.overall.cpi
      0.39           +14.7%       0.44        perf-stat.overall.ipc
 1.871e+10           +14.8%  2.149e+10        perf-stat.ps.branch-instructions
 1.396e+08 ±  3%     +16.2%  1.622e+08        perf-stat.ps.branch-misses
 3.868e+08 ±  9%     +17.1%   4.53e+08 ±  6%  perf-stat.ps.cache-references
   5443676           +17.4%    6391319        perf-stat.ps.context-switches
 8.552e+10           +14.8%  9.813e+10        perf-stat.ps.instructions
 5.251e+12           +14.3%      6e+12        perf-stat.total.instructions



***************************************************************************************************
lkp-icl-2sp8: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp8/sigbus/stress-ng/60s

commit: 
  3344260945 ("Merge tag 'for-v6.14-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply")
  ee2a5c3e36 ("Revert "pid: allow pid_max to be set per pid namespace"")

334426094588f817 ee2a5c3e36093d0ff5709bc8f21 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      7.64            +1.7        9.30        mpstat.cpu.all.usr%
     36.50 ± 16%     -42.9%      20.83 ± 31%  perf-c2c.DRAM.local
      2312 ±  6%     -68.7%     723.17 ±  4%  perf-c2c.DRAM.remote
      3690 ±  3%     +44.9%       5347 ±  6%  perf-c2c.HITM.local
      2155 ±  6%     -71.8%     608.17 ±  4%  perf-c2c.HITM.remote
      4477 ± 69%     -70.3%       1328 ± 35%  proc-vmstat.numa_hint_faults
      2459 ± 11%     -64.8%     866.33 ± 47%  proc-vmstat.numa_hint_faults_local
    140611 ± 21%     -33.6%      93302 ± 45%  proc-vmstat.numa_pte_updates
 7.197e+08           +20.7%  8.685e+08        proc-vmstat.pgfault
 7.201e+08           +20.6%  8.682e+08        stress-ng.sigbus.ops
  12001759           +20.6%   14469786        stress-ng.sigbus.ops_per_sec
      3526            -1.8%       3461        stress-ng.time.system_time
    261.31           +25.4%     327.64        stress-ng.time.user_time
      0.03 ± 55%     -64.6%       0.01 ± 17%  perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.86 ±150%     -90.1%       0.09 ±201%  perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      0.02 ± 50%     -58.7%       0.01 ± 14%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1.08 ± 18%     -34.1%       0.71 ± 14%  perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      0.31 ± 72%     -65.9%       0.11 ± 71%  perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      0.02 ± 10%     -23.4%       0.01 ± 15%  perf-sched.sch_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
      1.91 ±218%     -99.2%       0.02 ± 11%  perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      4.00 ± 49%     -71.6%       1.14 ± 56%  perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    261.25 ± 37%    +199.1%     781.43 ± 15%  perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     81.02 ± 59%    +274.1%     303.13 ± 50%  perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.60 ±  2%     +16.9%       7.71 ±  3%  perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
    108.83 ± 63%     -81.2%      20.50 ±113%  perf-sched.wait_and_delay.count.devkmsg_read.vfs_read.ksys_read.do_syscall_64
      3107 ±  3%     -12.6%       2714 ±  5%  perf-sched.wait_and_delay.count.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown]
    124.17 ± 63%     -70.1%      37.17 ± 60%  perf-sched.wait_and_delay.count.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
    751.00 ±  2%     -17.0%     623.50 ±  2%  perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      1550 ± 31%    +119.7%       3406 ± 19%  perf-sched.wait_and_delay.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    261.24 ± 37%    +199.1%     781.42 ± 15%  perf-sched.wait_time.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     80.16 ± 60%    +278.0%     303.05 ± 50%  perf-sched.wait_time.avg.ms.schedule_hrtimeout_range.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
      6.59 ±  2%     +17.0%       7.71 ±  3%  perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
      1550 ± 31%    +119.7%       3406 ± 19%  perf-sched.wait_time.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.18           -49.0%       0.09 ±  3%  perf-stat.i.MPKI
  1.59e+10           +19.7%  1.903e+10        perf-stat.i.branch-instructions
      0.28            -0.0        0.25        perf-stat.i.branch-miss-rate%
  40989724            +5.3%   43173098 ±  2%  perf-stat.i.branch-misses
     32.63           -15.8       16.81 ±  2%  perf-stat.i.cache-miss-rate%
  12733301 ±  2%     -40.3%    7597041 ±  3%  perf-stat.i.cache-misses
  38933806           +14.5%   44591128        perf-stat.i.cache-references
      3.17           -16.4%       2.65        perf-stat.i.cpi
     18224           +75.2%      31921        perf-stat.i.cycles-between-cache-misses
 7.098e+10           +19.6%  8.489e+10        perf-stat.i.instructions
      0.32           +19.0%       0.38        perf-stat.i.ipc
    184.67           +20.6%     222.65        perf-stat.i.metric.K/sec
  11819123           +20.6%   14249011        perf-stat.i.page-faults
      0.18           -50.1%       0.09 ±  3%  perf-stat.overall.MPKI
      0.26            -0.0        0.23        perf-stat.overall.branch-miss-rate%
     32.70           -15.7       17.04 ±  3%  perf-stat.overall.cache-miss-rate%
      3.19           -16.4%       2.66        perf-stat.overall.cpi
     17772 ±  2%     +67.6%      29795 ±  2%  perf-stat.overall.cycles-between-cache-misses
      0.31           +19.6%       0.38        perf-stat.overall.ipc
 1.564e+10           +19.7%  1.871e+10        perf-stat.ps.branch-instructions
  40314687            +5.4%   42478375 ±  2%  perf-stat.ps.branch-misses
  12525837 ±  2%     -40.3%    7473864 ±  3%  perf-stat.ps.cache-misses
  38300912           +14.5%   43866104        perf-stat.ps.cache-references
 6.982e+10           +19.6%   8.35e+10        perf-stat.ps.instructions
  11626044           +20.6%   14016280        perf-stat.ps.page-faults
 4.284e+12           +19.5%  5.117e+12        perf-stat.total.instructions





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


  parent reply	other threads:[~2025-03-10  7:32 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-02-21 17:02 [PATCH 0/2] Alternative "pid_max" for 32-bit userspace Michal Koutný
2025-02-21 17:02 ` [PATCH 1/2] Revert "pid: allow pid_max to be set per pid namespace" Michal Koutný
2025-02-25 17:36   ` Alexander Mikhalitsyn
2025-03-10  7:32   ` kernel test robot [this message]
2025-02-21 17:02 ` [PATCH 2/2] pid: Optional first-fit pid allocation Michal Koutný
2025-02-22  0:18   ` Andrew Morton
2025-02-22  9:02     ` David Laight
2025-03-05 15:01       ` Michal Koutný
2025-03-05 15:04     ` Michal Koutný
2025-02-25 17:30   ` Alexander Mikhalitsyn
2025-03-06  8:59   ` Christian Brauner
2025-03-06  9:09     ` Michal Koutný

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=202503101532.348576bb-lkp@intel.com \
    --to=oliver.sang@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=alexander@mihalicyn.com \
    --cc=brauner@kernel.org \
    --cc=corbet@lwn.net \
    --cc=ebiederm@xmission.com \
    --cc=kees@kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-trace-kernel@vger.kernel.org \
    --cc=lkp@intel.com \
    --cc=mkoutny@suse.com \
    --cc=oe-lkp@lists.linux.dev \
    --cc=oleg@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.