All of lore.kernel.org
 help / color / mirror / Atom feed
* [linus:master] [mm/mmap]  6852c46c78:  stress-ng.spawn.ops_per_sec 2.9% improvement
@ 2023-09-21  7:43 kernel test robot
  0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2023-09-21  7:43 UTC (permalink / raw)
  To: Yu Ma
  Cc: oe-lkp, lkp, linux-kernel, Andrew Morton, Tim Chen, Dan Williams,
	Dave Hansen, Kirill A . Shutemov, Liam R. Howlett, Shakeel Butt,
	Zhu, Lipeng, linux-mm, ying.huang, feng.tang, fengwei.yin,
	oliver.sang



Hello,

kernel test robot noticed a 2.9% improvement of stress-ng.spawn.ops_per_sec on:


commit: 6852c46c783d20a4c0153d14d2990040e5e6e47e ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	class: exec_spawn
	test: spawn
	cpufreq_governor: performance


In addition to that, the commit also has significant impact on the following tests:

+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.score 6.4% improvement                                                        |
| test machine     | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory           |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | nr_task=100%                                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=shell8                                                                                        |
+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | vm-scalability: vm-scalability.throughput 23.5% improvement                                        |
| test machine     | 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory |
| test parameters  | cpufreq_governor=performance                                                                       |
|                  | runtime=300s                                                                                       |
|                  | test=small-allocs                                                                                  |
+------------------+----------------------------------------------------------------------------------------------------+




Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20230921/202309211530.ca9c00a1-oliver.sang@intel.com

=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  exec_spawn/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/spawn/stress-ng/60s

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     10609 ± 11%     -19.9%       8493 ±  9%  numa-meminfo.node0.PageTables
      2666 ± 11%     -22.4%       2069 ±  9%  numa-vmstat.node0.nr_page_table_pages
     73.76            -1.5%      72.67        iostat.cpu.system
     11.69            +3.3%      12.07        iostat.cpu.user
      9706 ±  5%     +35.6%      13159 ± 22%  turbostat.POLL
     86.46            +0.9%      87.20        turbostat.RAMWatt
    301.24 ± 14%     -16.3%     252.23 ±  5%  sched_debug.cfs_rq:/.runnable_avg.stddev
    -60827           +81.7%    -110508        sched_debug.cfs_rq:/.spread0.avg
      1216 ± 13%     +33.8%       1627 ± 12%  sched_debug.cpu.nr_uninterruptible.max
   1282527            +2.9%    1319688        stress-ng.spawn.ops
     21375            +2.9%      21994        stress-ng.spawn.ops_per_sec
    593917           +46.2%     868090        stress-ng.time.major_page_faults
 2.721e+08            +2.9%  2.799e+08        stress-ng.time.minor_page_faults
      2943            -1.5%       2899        stress-ng.time.system_time
    434.40            +4.3%     453.01        stress-ng.time.user_time
    602081 ±  2%      +4.2%     627620        proc-vmstat.nr_anon_pages
    823789 ±  2%      +4.2%     858619        proc-vmstat.nr_inactive_anon
    219961            +3.6%     227832        proc-vmstat.nr_mapped
    346359            +2.1%     353585        proc-vmstat.nr_shmem
    823789 ±  2%      +4.2%     858619        proc-vmstat.nr_zone_inactive_anon
 1.826e+08            +3.0%  1.881e+08        proc-vmstat.numa_hit
 1.825e+08            +3.0%   1.88e+08        proc-vmstat.numa_local
    425017            +2.2%     434312        proc-vmstat.pgactivate
 1.874e+08            +3.0%  1.931e+08        proc-vmstat.pgalloc_normal
 2.767e+08            +3.0%   2.85e+08        proc-vmstat.pgfault
 1.857e+08            +3.0%  1.913e+08        proc-vmstat.pgfree
      5030            +2.6%       5159        proc-vmstat.thp_split_pmd
      0.01 ±  9%     +30.2%       0.01 ± 17%  perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_alloc.alloc_bprm.do_execveat_common
      0.54 ± 22%     -46.9%       0.29 ± 52%  perf-sched.sch_delay.max.ms.__cond_resched.filemap_read.__kernel_read.search_binary_handler.exec_binprm
      0.15 ± 80%     -67.6%       0.05 ± 41%  perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc.__anon_vma_prepare.do_cow_fault.do_fault
      0.63 ± 12%     -32.5%       0.42 ± 12%  perf-sched.sch_delay.max.ms.__cond_resched.remove_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      0.03 ±  2%     +15.9%       0.03 ±  4%  perf-sched.wait_and_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      4292 ±  5%     -14.2%       3683 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
      3397 ±  4%     +16.8%       3966 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
      5346 ±  4%     +14.1%       6100 ±  2%  perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_exc_page_fault
     27421 ±  2%     +49.2%      40919        perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
     18122 ±  5%     -13.9%      15598 ±  3%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     78347 ±  5%     -14.6%      66894 ±  3%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.unlink_file_vma
      2.02 ± 27%     -46.2%       1.09 ± 14%  perf-sched.wait_and_delay.max.ms.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
      0.01 ± 23%    +109.1%       0.02 ± 63%  perf-sched.wait_time.avg.ms.__cond_resched.__alloc_pages.pte_alloc_one.__do_fault.do_cow_fault
      0.02 ±  6%     +68.4%       0.03 ± 29%  perf-sched.wait_time.avg.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
      0.02 ± 21%     +49.0%       0.03 ± 26%  perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.getname_flags.part.0
      0.02 ±  6%     +19.5%       0.03 ± 10%  perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.mprotect_fixup
      0.01 ±  3%     +20.3%       0.01 ±  7%  perf-sched.wait_time.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
      0.04 ± 54%    +138.4%       0.10 ± 68%  perf-sched.wait_time.max.ms.__cond_resched.down_read_killable.iterate_dir.__x64_sys_getdents64.do_syscall_64
      0.08 ± 52%    +290.1%       0.33 ± 61%  perf-sched.wait_time.max.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
      0.03 ± 70%    +101.1%       0.06 ± 16%  perf-sched.wait_time.max.ms.__cond_resched.down_write.exit_mmap.__mmput.exit_mm
      0.54 ±  7%     -48.8%       0.27 ± 69%  perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.vm_mmap_pgoff.elf_map.load_elf_interp
      0.54 ±  3%     +18.9%       0.64 ± 16%  perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open
      2.30 ± 18%     -22.0%       1.79 ±  8%  perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     13.82            +1.5%      14.03        perf-stat.i.MPKI
      1.93            +0.0        1.96        perf-stat.i.branch-miss-rate%
 4.182e+08            +2.3%   4.28e+08        perf-stat.i.branch-misses
 2.549e+08            +1.9%  2.598e+08        perf-stat.i.cache-misses
 1.545e+09            +2.5%  1.584e+09        perf-stat.i.cache-references
      1.86            -2.0%       1.82        perf-stat.i.cpi
    945.76 ±  3%      -5.3%     895.17 ±  2%  perf-stat.i.cycles-between-cache-misses
 1.249e+10            +2.7%  1.283e+10        perf-stat.i.dTLB-stores
      0.55            +3.0%       0.56        perf-stat.i.ipc
      9557           +46.8%      14033        perf-stat.i.major-faults
      2025            +2.4%       2074        perf-stat.i.metric.K/sec
    996.43            +1.3%       1009        perf-stat.i.metric.M/sec
   4413113            +2.7%    4533786        perf-stat.i.minor-faults
  12764008 ±  2%      +3.7%   13236279        perf-stat.i.node-loads
  47197681            +2.6%   48424472        perf-stat.i.node-stores
   4422671            +2.8%    4547820        perf-stat.i.page-faults
     14.16            +1.6%      14.39        perf-stat.overall.MPKI
      1.92            +0.0        1.95        perf-stat.overall.branch-miss-rate%
      1.87            -1.8%       1.84        perf-stat.overall.cpi
    810.88            -2.8%     788.33        perf-stat.overall.cycles-between-cache-misses
      0.53            +1.8%       0.54        perf-stat.overall.ipc
 4.094e+08            +2.3%   4.19e+08        perf-stat.ps.branch-misses
 1.511e+09            +2.5%  1.549e+09        perf-stat.ps.cache-references
     17556 ±  2%      +3.9%      18239        perf-stat.ps.cpu-migrations
 1.226e+10            +2.7%   1.26e+10        perf-stat.ps.dTLB-stores
      9359           +46.0%      13666        perf-stat.ps.major-faults
   4325409            +2.8%    4445394        perf-stat.ps.minor-faults
  12743809 ±  2%      +4.1%   13262041        perf-stat.ps.node-loads
  46090814            +2.7%   47343915        perf-stat.ps.node-stores
   4334769            +2.9%    4459060        perf-stat.ps.page-faults
     17.36            -1.2       16.11        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables
     14.04            -1.1       12.96        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
     69.96            -0.9       69.02        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
     70.01            -0.9       69.08        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     12.28            -0.9       11.37        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma
      9.72            -0.8        8.90        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.exit_mmap
     10.90 ±  2%      -0.8       10.10        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
      9.96            -0.8        9.16        perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.exit_mmap.__mmput
      7.46 ±  2%      -0.8        6.69        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
      7.33 ±  2%      -0.8        6.56        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
     10.95            -0.8       10.20        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.exit_mmap.__mmput.exit_mm
     12.30            -0.7       11.60        perf-profile.calltrace.cycles-pp.free_pgtables.exit_mmap.__mmput.exit_mm.do_exit
      8.52 ±  2%      -0.5        8.03        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.do_vmi_align_munmap
      8.72 ±  2%      -0.5        8.24        perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      7.36            -0.4        6.92        perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      7.95            -0.4        7.52        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.unmap_region
      5.72 ±  2%      -0.4        5.30        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
      3.96 ±  2%      -0.4        3.55        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.mprotect_fixup
      8.40            -0.4        7.98        perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap
      8.12            -0.4        7.71        perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap
      4.10 ±  2%      -0.4        3.70        perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
      8.68            -0.4        8.28        perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      5.16            -0.3        4.84        perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
      9.61            -0.3        9.30        perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
      7.33            -0.2        7.14        perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
      2.84            -0.2        2.66        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
      2.18            -0.2        2.02        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
      1.18            -0.1        1.11        perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
      0.84 ±  3%      -0.1        0.77 ±  3%  perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.elf_map.load_elf_interp.load_elf_binary.search_binary_handler
      0.79 ±  3%      -0.1        0.73 ±  3%  perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.elf_map.load_elf_interp
      0.82            -0.0        0.78        perf-profile.calltrace.cycles-pp.mas_store_prealloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.56            +0.0        0.58        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap
      0.60            +0.0        0.63        perf-profile.calltrace.cycles-pp.__sysconf
      0.57 ±  2%      +0.0        0.60        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.wait4
      0.55 ±  2%      +0.0        0.58        perf-profile.calltrace.cycles-pp.kernel_wait4.__do_sys_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
      0.57 ±  2%      +0.0        0.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
      0.67            +0.0        0.70        perf-profile.calltrace.cycles-pp.sync_regs.asm_exc_page_fault
      0.55            +0.0        0.58 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.61            +0.0        0.64        perf-profile.calltrace.cycles-pp.copy_process.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.04            +0.0        1.08        perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
      1.08            +0.0        1.12        perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.56            +0.0        0.60 ±  3%  perf-profile.calltrace.cycles-pp.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.64            +0.0        0.68        perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.54            +0.0        0.58        perf-profile.calltrace.cycles-pp.do_set_pte.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
      0.54 ±  2%      +0.0        0.58        perf-profile.calltrace.cycles-pp.finish_fault.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
      0.93            +0.0        0.98        perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
      0.94            +0.0        0.99        perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.63 ±  2%      +0.1        0.68        perf-profile.calltrace.cycles-pp._compound_head.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
      1.06            +0.1        1.12        perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.00            +0.1        1.06        perf-profile.calltrace.cycles-pp._dl_addr
      1.07            +0.1        1.12        perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.58 ±  2%      +0.1        0.63 ±  2%  perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
      0.93            +0.1        0.99        perf-profile.calltrace.cycles-pp.kernfs_fop_readdir.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.03            +0.1        1.09        perf-profile.calltrace.cycles-pp.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      1.03            +0.1        1.09        perf-profile.calltrace.cycles-pp.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      0.97            +0.1        1.04        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64
      0.97            +0.1        1.04        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      0.96            +0.1        1.03        perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      0.98            +0.1        1.04        perf-profile.calltrace.cycles-pp.getdents64
      0.97            +0.1        1.03        perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
      1.37            +0.1        1.44        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__clone
      1.33 ±  2%      +0.1        1.40        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
      0.96            +0.1        1.04        perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      1.50            +0.1        1.58        perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
      1.30            +0.1        1.38        perf-profile.calltrace.cycles-pp.wp_page_copy.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      1.51            +0.1        1.60        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.51            +0.1        1.60        perf-profile.calltrace.cycles-pp.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.52            +0.1        1.61        perf-profile.calltrace.cycles-pp.execve.__clone
      1.52            +0.1        1.60        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.execve.__clone
      1.60            +0.1        1.70        perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      1.26 ±  2%      +0.1        1.36        perf-profile.calltrace.cycles-pp.page_remove_rmap.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
      1.20            +0.1        1.30        perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      1.52            +0.1        1.62        perf-profile.calltrace.cycles-pp.next_uptodate_page.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
      0.43 ± 44%      +0.1        0.55 ±  3%  perf-profile.calltrace.cycles-pp.mm_init.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64
      1.86 ±  2%      +0.2        2.03        perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput
      3.24            +0.2        3.42        perf-profile.calltrace.cycles-pp.__clone
      2.30            +0.2        2.50        perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput.exit_mm
      2.33            +0.2        2.52        perf-profile.calltrace.cycles-pp.tlb_finish_mmu.exit_mmap.__mmput.exit_mm.do_exit
      2.64            +0.2        2.84        perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
      2.83            +0.2        3.03        perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      3.13            +0.2        3.34        perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap
      3.34            +0.2        3.55        perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap.__mmput
      3.41            +0.2        3.63        perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.__mmput.exit_mm
      2.02            +0.2        2.24        perf-profile.calltrace.cycles-pp.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      3.62            +0.2        3.86        perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.__mmput.exit_mm.do_exit
      4.88            +0.4        5.31        perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      0.00            +0.5        0.51 ±  2%  perf-profile.calltrace.cycles-pp.copy_page.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
      7.28            +0.6        7.86        perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      7.68            +0.6        8.29        perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      8.90            +0.7        9.59        perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      8.97            +0.7        9.67        perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
      9.85            +0.7       10.58        perf-profile.calltrace.cycles-pp.asm_exc_page_fault
     39.11            -3.0       36.10        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     39.79            -3.0       36.79        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     40.99            -2.9       38.11        perf-profile.children.cycles-pp.down_write
     31.42            -2.5       28.90        perf-profile.children.cycles-pp.osq_lock
     19.60            -1.2       18.41        perf-profile.children.cycles-pp.unlink_file_vma
     21.27            -1.1       20.14        perf-profile.children.cycles-pp.free_pgtables
     40.67            -1.0       39.70        perf-profile.children.cycles-pp.mmap_region
     41.06            -1.0       40.10        perf-profile.children.cycles-pp.do_mmap
     41.28            -0.9       40.34        perf-profile.children.cycles-pp.vm_mmap_pgoff
     79.55            -0.8       78.73        perf-profile.children.cycles-pp.do_syscall_64
     79.62            -0.8       78.82        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     17.54            -0.7       16.80        perf-profile.children.cycles-pp.vma_prepare
      6.88            -0.5        6.38        perf-profile.children.cycles-pp.rwsem_spin_on_owner
     24.95            -0.5       24.48        perf-profile.children.cycles-pp.do_vmi_align_munmap
     25.10            -0.5       24.64        perf-profile.children.cycles-pp.do_vmi_munmap
     21.02            -0.4       20.64        perf-profile.children.cycles-pp.__split_vma
     10.25            -0.3        9.94        perf-profile.children.cycles-pp.unmap_region
      2.79 ±  4%      -0.2        2.54 ±  3%  perf-profile.children.cycles-pp.elf_map
      0.39            -0.1        0.30        perf-profile.children.cycles-pp.mas_wr_walk
      1.50            -0.0        1.46        perf-profile.children.cycles-pp.mas_wr_store_entry
      0.11            +0.0        0.12 ±  3%  perf-profile.children.cycles-pp.security_file_alloc
      0.23 ±  2%      +0.0        0.24 ±  2%  perf-profile.children.cycles-pp.__percpu_counter_sum
      0.16 ±  2%      +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.init_file
      0.16 ±  3%      +0.0        0.17        perf-profile.children.cycles-pp.unmap_single_vma
      0.25            +0.0        0.27        perf-profile.children.cycles-pp.__memcpy
      0.24 ±  3%      +0.0        0.26        perf-profile.children.cycles-pp.native_flush_tlb_one_user
      0.21 ±  2%      +0.0        0.23        perf-profile.children.cycles-pp.mas_push_data
      0.16            +0.0        0.18 ±  2%  perf-profile.children.cycles-pp.free_unref_page_list
      0.14 ±  3%      +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.padzero
      0.38            +0.0        0.40        perf-profile.children.cycles-pp.d_path
      0.30            +0.0        0.31        perf-profile.children.cycles-pp.__vm_area_free
      0.29            +0.0        0.31        perf-profile.children.cycles-pp.mas_find
      0.45            +0.0        0.47        perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.34 ±  2%      +0.0        0.35 ±  2%  perf-profile.children.cycles-pp.find_idlest_group
      0.30            +0.0        0.32        perf-profile.children.cycles-pp.update_sg_wakeup_stats
      0.36 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.find_idlest_cpu
      0.29 ±  2%      +0.0        0.31 ±  2%  perf-profile.children.cycles-pp.__filemap_get_folio
      0.34            +0.0        0.36        perf-profile.children.cycles-pp.link_path_walk
      0.44 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp.__mem_cgroup_charge
      0.41            +0.0        0.43        perf-profile.children.cycles-pp.unlink_anon_vmas
      0.31 ±  2%      +0.0        0.33        perf-profile.children.cycles-pp.alloc_empty_file
      0.25 ±  2%      +0.0        0.27        perf-profile.children.cycles-pp.wake_up_new_task
      0.41 ±  2%      +0.0        0.43 ±  2%  perf-profile.children.cycles-pp.__mmdrop
      0.35 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.mas_next_slot
      0.35 ±  2%      +0.0        0.37        perf-profile.children.cycles-pp.flush_tlb_mm_range
      0.33            +0.0        0.35        perf-profile.children.cycles-pp.find_vma
      0.45 ±  2%      +0.0        0.48 ±  2%  perf-profile.children.cycles-pp.__rb_erase_color
      0.28 ±  2%      +0.0        0.30 ±  2%  perf-profile.children.cycles-pp.filemap_get_entry
      0.27            +0.0        0.29 ±  3%  perf-profile.children.cycles-pp.up_read
      0.51            +0.0        0.53        perf-profile.children.cycles-pp.mas_store_gfp
      0.41 ±  2%      +0.0        0.43        perf-profile.children.cycles-pp.free_swap_cache
      0.44            +0.0        0.47        perf-profile.children.cycles-pp.___perf_sw_event
      0.05 ± 13%      +0.0        0.08 ± 14%  perf-profile.children.cycles-pp.get_any_partial
      0.35            +0.0        0.38 ±  2%  perf-profile.children.cycles-pp.mt_find
      0.37 ±  2%      +0.0        0.39 ±  2%  perf-profile.children.cycles-pp.vfs_read
      0.33 ±  2%      +0.0        0.35 ±  2%  perf-profile.children.cycles-pp.vma_interval_tree_augment_rotate
      0.50 ±  2%      +0.0        0.52 ±  2%  perf-profile.children.cycles-pp.copy_page
      0.33 ±  2%      +0.0        0.36 ±  2%  perf-profile.children.cycles-pp.__percpu_counter_init
      0.24 ±  5%      +0.0        0.26 ±  3%  perf-profile.children.cycles-pp.down_read
      0.31            +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.lru_add_fn
      0.37            +0.0        0.40        perf-profile.children.cycles-pp.__open64_nocancel
      0.39 ±  2%      +0.0        0.42 ±  2%  perf-profile.children.cycles-pp.ksys_read
      0.37 ±  2%      +0.0        0.40        perf-profile.children.cycles-pp.__mod_lruvec_page_state
      0.53            +0.0        0.56        perf-profile.children.cycles-pp.__perf_sw_event
      0.43 ±  2%      +0.0        0.46        perf-profile.children.cycles-pp.free_pages_and_swap_cache
      0.66            +0.0        0.68        perf-profile.children.cycles-pp.get_page_from_freelist
      0.62            +0.0        0.65        perf-profile.children.cycles-pp.__sysconf
      0.55 ±  2%      +0.0        0.58        perf-profile.children.cycles-pp.kernel_wait4
      0.59 ±  2%      +0.0        0.62        perf-profile.children.cycles-pp.wait4
      0.47 ±  2%      +0.0        0.50 ±  3%  perf-profile.children.cycles-pp.try_to_wake_up
      0.53            +0.0        0.56        perf-profile.children.cycles-pp.mas_split
      0.30            +0.0        0.33 ±  2%  perf-profile.children.cycles-pp.folio_add_lru_vma
      0.06 ± 11%      +0.0        0.09 ±  6%  perf-profile.children.cycles-pp.io_schedule
      0.51            +0.0        0.54        perf-profile.children.cycles-pp.lock_mm_and_find_vma
      0.47            +0.0        0.50 ±  2%  perf-profile.children.cycles-pp.__x64_sys_munmap
      0.41            +0.0        0.44 ±  3%  perf-profile.children.cycles-pp.pcpu_alloc
      0.60            +0.0        0.64        perf-profile.children.cycles-pp.mas_wr_bnode
      0.06 ± 11%      +0.0        0.10 ±  5%  perf-profile.children.cycles-pp.folio_wait_bit_common
      0.61 ±  2%      +0.0        0.65        perf-profile.children.cycles-pp.copy_process
      0.48            +0.0        0.51        perf-profile.children.cycles-pp.page_add_file_rmap
      0.75            +0.0        0.79        perf-profile.children.cycles-pp.native_irq_return_iret
      0.69            +0.0        0.73        perf-profile.children.cycles-pp.vm_area_alloc
      0.56            +0.0        0.60 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock
      0.54            +0.0        0.58        perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.46 ±  2%      +0.0        0.50 ±  2%  perf-profile.children.cycles-pp.__pte_offset_map_lock
      0.51            +0.0        0.55 ±  2%  perf-profile.children.cycles-pp.mm_init
      0.15 ±  2%      +0.0        0.19 ±  3%  perf-profile.children.cycles-pp.__wake_up_common
      0.61            +0.0        0.65        perf-profile.children.cycles-pp.mas_walk
      0.67            +0.0        0.71        perf-profile.children.cycles-pp.vma_alloc_folio
      0.88            +0.0        0.92        perf-profile.children.cycles-pp.sync_regs
      0.83            +0.0        0.87        perf-profile.children.cycles-pp.perf_event_mmap_output
      0.56            +0.0        0.60 ±  2%  perf-profile.children.cycles-pp.alloc_bprm
      0.83            +0.0        0.87        perf-profile.children.cycles-pp.mas_wr_node_store
      0.58            +0.0        0.62        perf-profile.children.cycles-pp.__rb_insert_augmented
      0.79            +0.0        0.83 ±  2%  perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
      0.06 ±  8%      +0.0        0.10        perf-profile.children.cycles-pp.folio_wake_bit
      0.92            +0.0        0.97        perf-profile.children.cycles-pp.__alloc_pages
      0.75            +0.0        0.80        perf-profile.children.cycles-pp.mtree_range_walk
      1.04            +0.0        1.09        perf-profile.children.cycles-pp.__slab_free
      0.77            +0.0        0.82        perf-profile.children.cycles-pp.vm_area_dup
      0.55 ±  2%      +0.0        0.60        perf-profile.children.cycles-pp.finish_fault
      1.01            +0.1        1.06        perf-profile.children.cycles-pp._dl_addr
      1.06            +0.1        1.12        perf-profile.children.cycles-pp.perf_iterate_sb
      0.75 ±  2%      +0.1        0.81        perf-profile.children.cycles-pp._compound_head
      1.03            +0.1        1.09        perf-profile.children.cycles-pp.__do_sys_clone
      1.03            +0.1        1.09        perf-profile.children.cycles-pp.kernel_clone
      0.96            +0.1        1.03        perf-profile.children.cycles-pp.iterate_dir
      0.94            +0.1        1.00        perf-profile.children.cycles-pp.kernfs_fop_readdir
      0.98            +0.1        1.04        perf-profile.children.cycles-pp.getdents64
      0.97            +0.1        1.03        perf-profile.children.cycles-pp.__x64_sys_getdents64
      0.36 ±  4%      +0.1        0.42 ±  2%  perf-profile.children.cycles-pp.__unfreeze_partials
      0.01 ±223%      +0.1        0.08 ±  6%  perf-profile.children.cycles-pp.wake_page_function
      0.36            +0.1        0.43 ±  2%  perf-profile.children.cycles-pp.filemap_fault
      1.81            +0.1        1.89        perf-profile.children.cycles-pp.perf_event_mmap_event
      1.42            +0.1        1.49        perf-profile.children.cycles-pp.do_sys_openat2
      0.56            +0.1        0.63        perf-profile.children.cycles-pp.__do_fault
      1.87            +0.1        1.95        perf-profile.children.cycles-pp.perf_event_mmap
      0.76 ±  2%      +0.1        0.84        perf-profile.children.cycles-pp.kmem_cache_free_bulk
      1.42            +0.1        1.50        perf-profile.children.cycles-pp.__x64_sys_openat
      0.32 ±  3%      +0.1        0.40 ±  5%  perf-profile.children.cycles-pp.get_partial_node
      1.40            +0.1        1.48        perf-profile.children.cycles-pp.path_openat
      1.42            +0.1        1.50        perf-profile.children.cycles-pp.do_filp_open
      0.70            +0.1        0.79 ±  2%  perf-profile.children.cycles-pp.lru_add_drain_cpu
      0.71            +0.1        0.79        perf-profile.children.cycles-pp.lru_add_drain
      1.39            +0.1        1.48        perf-profile.children.cycles-pp.wp_page_copy
      1.43            +0.1        1.53        perf-profile.children.cycles-pp.do_set_pte
      3.10            +0.1        3.20        perf-profile.children.cycles-pp.mas_store_prealloc
      2.00            +0.1        2.11        perf-profile.children.cycles-pp.kmem_cache_alloc
      1.35 ±  2%      +0.1        1.46        perf-profile.children.cycles-pp.page_remove_rmap
      0.94            +0.1        1.04        perf-profile.children.cycles-pp.folio_batch_move_lru
      0.76 ±  2%      +0.1        0.87 ±  3%  perf-profile.children.cycles-pp.___slab_alloc
      1.22            +0.1        1.33        perf-profile.children.cycles-pp.mas_destroy
      0.98            +0.1        1.11 ±  2%  perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
      1.11            +0.1        1.24 ±  2%  perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
      0.96 ±  3%      +0.1        1.09        perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
      1.38            +0.1        1.52        perf-profile.children.cycles-pp.mas_preallocate
      1.43            +0.1        1.58        perf-profile.children.cycles-pp.mas_alloc_nodes
      2.02            +0.2        2.19        perf-profile.children.cycles-pp.release_pages
      3.25            +0.2        3.43        perf-profile.children.cycles-pp.__clone
      3.39            +0.2        3.58        perf-profile.children.cycles-pp.vma_complete
      2.40            +0.2        2.60        perf-profile.children.cycles-pp.tlb_batch_pages_flush
      4.42            +0.2        4.62        perf-profile.children.cycles-pp.vma_interval_tree_insert
      2.82            +0.2        3.02        perf-profile.children.cycles-pp.next_uptodate_page
      2.65            +0.2        2.86        perf-profile.children.cycles-pp.tlb_finish_mmu
      3.50            +0.2        3.73        perf-profile.children.cycles-pp.zap_pte_range
      2.10            +0.2        2.33        perf-profile.children.cycles-pp.do_cow_fault
      3.59            +0.2        3.82        perf-profile.children.cycles-pp.zap_pmd_range
      3.73            +0.2        3.97        perf-profile.children.cycles-pp.unmap_page_range
      1.61 ±  2%      +0.3        1.87 ±  2%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      3.99            +0.3        4.26        perf-profile.children.cycles-pp.unmap_vmas
      1.33 ±  2%      +0.3        1.60 ±  2%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      5.00            +0.3        5.33        perf-profile.children.cycles-pp.filemap_map_pages
      5.37            +0.4        5.72        perf-profile.children.cycles-pp.do_read_fault
      7.51            +0.6        8.09        perf-profile.children.cycles-pp.do_fault
     10.64            +0.8       11.41        perf-profile.children.cycles-pp.__handle_mm_fault
     11.19            +0.8       12.00        perf-profile.children.cycles-pp.handle_mm_fault
     12.75            +0.9       13.66        perf-profile.children.cycles-pp.do_user_addr_fault
     12.82            +0.9       13.73        perf-profile.children.cycles-pp.exc_page_fault
     14.29            +1.0       15.27        perf-profile.children.cycles-pp.asm_exc_page_fault
     30.81            -2.5       28.33        perf-profile.self.cycles-pp.osq_lock
      6.72            -0.5        6.23        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.34 ±  2%      -0.1        0.26        perf-profile.self.cycles-pp.mas_wr_walk
      0.09            +0.0        0.10        perf-profile.self.cycles-pp.access_error
      0.20            +0.0        0.21        perf-profile.self.cycles-pp.__memcpy
      0.11 ±  3%      +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.__unfreeze_partials
      0.38            +0.0        0.40        perf-profile.self.cycles-pp.___perf_sw_event
      0.37            +0.0        0.39        perf-profile.self.cycles-pp.free_swap_cache
      0.26            +0.0        0.28 ±  4%  perf-profile.self.cycles-pp.up_read
      0.47            +0.0        0.49 ±  2%  perf-profile.self.cycles-pp.kmem_cache_free_bulk
      0.32 ±  2%      +0.0        0.34 ±  2%  perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
      0.22 ±  4%      +0.0        0.24 ±  3%  perf-profile.self.cycles-pp.down_read
      0.49            +0.0        0.51 ±  2%  perf-profile.self.cycles-pp.copy_page
      0.36            +0.0        0.38        perf-profile.self.cycles-pp.do_set_pte
      0.37            +0.0        0.40        perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
      0.41            +0.0        0.44 ±  3%  perf-profile.self.cycles-pp.___slab_alloc
      0.06 ±  7%      +0.0        0.09        perf-profile.self.cycles-pp.try_to_wake_up
      0.49            +0.0        0.52        perf-profile.self.cycles-pp.memcg_slab_post_alloc_hook
      0.50            +0.0        0.52 ±  2%  perf-profile.self.cycles-pp._raw_spin_lock
      0.41            +0.0        0.44        perf-profile.self.cycles-pp.page_add_file_rmap
      0.58 ±  2%      +0.0        0.62        perf-profile.self.cycles-pp.kmem_cache_alloc
      0.75            +0.0        0.79        perf-profile.self.cycles-pp.native_irq_return_iret
      0.50            +0.0        0.53        perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.55            +0.0        0.59        perf-profile.self.cycles-pp.__rb_insert_augmented
      0.77            +0.0        0.81        perf-profile.self.cycles-pp.filemap_map_pages
      0.87            +0.0        0.91        perf-profile.self.cycles-pp.sync_regs
      0.72            +0.0        0.76        perf-profile.self.cycles-pp.mtree_range_walk
      1.03            +0.0        1.07 ±  2%  perf-profile.self.cycles-pp.__slab_free
      0.89            +0.0        0.94        perf-profile.self.cycles-pp._dl_addr
      1.10            +0.1        1.15        perf-profile.self.cycles-pp.zap_pte_range
      0.69 ±  2%      +0.1        0.74        perf-profile.self.cycles-pp._compound_head
      1.08            +0.1        1.16        perf-profile.self.cycles-pp.release_pages
      1.14 ±  2%      +0.1        1.23        perf-profile.self.cycles-pp.page_remove_rmap
      1.10 ±  2%      +0.1        1.20        perf-profile.self.cycles-pp.down_write
      4.32            +0.2        4.52        perf-profile.self.cycles-pp.vma_interval_tree_insert
      2.72            +0.2        2.92        perf-profile.self.cycles-pp.next_uptodate_page
      1.33 ±  2%      +0.3        1.60 ±  2%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath


***************************************************************************************************
lkp-spr-2sp3: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/300s/lkp-spr-2sp3/shell8/unixbench

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    128621 ± 44%     -35.5%      83021 ± 18%  sched_debug.cfs_rq:/.spread0.stddev
     68.67            -2.7%      66.83        turbostat.PkgTmp
     30.36            +1.2%      30.74        turbostat.RAMWatt
     23470 ±  2%      +6.4%      24970 ±  2%  unixbench.score
    799587 ±  3%     +11.6%     892414 ±  3%  unixbench.time.involuntary_context_switches
     83795 ±  8%     +39.5%     116914 ±  5%  unixbench.time.major_page_faults
 1.003e+08 ±  2%      +6.5%  1.068e+08 ±  2%  unixbench.time.minor_page_faults
    328.24 ±  2%      +6.1%     348.20        unixbench.time.user_time
    894180 ±  2%      +6.7%     953859 ±  2%  unixbench.workload
  70334849 ±  2%      +6.3%   74748722 ±  2%  proc-vmstat.numa_hit
  70084753 ±  2%      +6.3%   74504737 ±  2%  proc-vmstat.numa_local
     54187 ±  2%      +6.3%      57597 ±  2%  proc-vmstat.pgactivate
  74859673 ±  2%      +6.3%   79582401 ±  2%  proc-vmstat.pgalloc_normal
 1.024e+08 ±  2%      +6.4%   1.09e+08 ±  2%  proc-vmstat.pgfault
  74344826 ±  2%      +6.4%   79067399 ±  2%  proc-vmstat.pgfree
   4212400 ±  2%      +6.2%    4474364 ±  2%  proc-vmstat.pgreuse
      3869 ±  2%      +7.1%       4145 ±  3%  proc-vmstat.thp_fault_alloc
   1612101 ±  2%      +6.3%    1713519 ±  2%  proc-vmstat.unevictable_pgs_culled
      5.69 ±  2%      +5.1%       5.98 ±  2%  perf-stat.i.MPKI
      0.94 ±  2%      +0.0        0.98        perf-stat.i.branch-miss-rate%
 2.934e+08 ±  2%      +5.8%  3.105e+08 ±  2%  perf-stat.i.branch-misses
 2.844e+08 ±  2%      +6.1%  3.016e+08 ±  2%  perf-stat.i.cache-misses
  9.76e+08 ±  2%      +5.8%  1.033e+09 ±  2%  perf-stat.i.cache-references
      2.99            -1.3%       2.95        perf-stat.i.cpi
 5.226e+11            -0.9%  5.179e+11        perf-stat.i.cpu-cycles
  33882826 ±  2%      +7.2%   36317869 ±  2%  perf-stat.i.dTLB-store-misses
 9.478e+09 ±  2%      +5.9%  1.004e+10 ±  2%  perf-stat.i.dTLB-stores
      0.36            +2.1%       0.36        perf-stat.i.ipc
      1312 ±  8%     +39.6%       1831 ±  5%  perf-stat.i.major-faults
      2.33            -0.9%       2.31        perf-stat.i.metric.GHz
    513.19 ±  2%      +6.0%     544.03 ±  2%  perf-stat.i.metric.K/sec
   1545804 ±  2%      +6.5%    1646269 ±  2%  perf-stat.i.minor-faults
  60502067 ±  2%      +5.4%   63798162 ±  2%  perf-stat.i.node-load-misses
  14871403 ±  2%      +6.2%   15796551 ±  2%  perf-stat.i.node-loads
   1547117 ±  2%      +6.5%    1648101 ±  2%  perf-stat.i.page-faults
      5.78 ±  2%      +5.4%       6.10 ±  2%  perf-stat.overall.MPKI
      0.87 ±  2%      +0.0        0.92 ±  2%  perf-stat.overall.branch-miss-rate%
      3.10            -1.3%       3.06        perf-stat.overall.cpi
      1839 ±  3%      -6.6%       1718 ±  2%  perf-stat.overall.cycles-between-cache-misses
  12132105 ±  2%      -5.5%   11462241 ±  2%  perf-stat.overall.path-length
 2.901e+08 ±  2%      +5.8%   3.07e+08 ±  2%  perf-stat.ps.branch-misses
 2.815e+08 ±  2%      +6.1%  2.986e+08 ±  2%  perf-stat.ps.cache-misses
 9.661e+08 ±  2%      +5.8%  1.022e+09 ±  2%  perf-stat.ps.cache-references
 5.174e+11            -0.9%  5.128e+11        perf-stat.ps.cpu-cycles
  33544567 ±  2%      +7.2%   35957170 ±  2%  perf-stat.ps.dTLB-store-misses
 9.376e+09 ±  2%      +5.9%  9.933e+09 ±  2%  perf-stat.ps.dTLB-stores
      1299 ±  8%     +39.6%       1813 ±  5%  perf-stat.ps.major-faults
   1529961 ±  2%      +6.5%    1629550 ±  2%  perf-stat.ps.minor-faults
  59908353 ±  2%      +5.4%   63173333 ±  2%  perf-stat.ps.node-load-misses
  14730891 ±  2%      +6.2%   15646714 ±  2%  perf-stat.ps.node-loads
   1531260 ±  2%      +6.5%    1631363 ±  2%  perf-stat.ps.page-faults



***************************************************************************************************
lkp-cpl-4sp2: 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/tbox_group/test/testcase:
  gcc-12/performance/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/300s/lkp-cpl-4sp2/small-allocs/vm-scalability

commit: 
  73e791d738 ("mm: remove clear_page_idle()")
  6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")

73e791d73877e904 6852c46c783d20a4c0153d14d29 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     42167           +23.5%      52085        vm-scalability.median
     55.63            +4.8       60.48 ±  5%  vm-scalability.stddev%
   9445724           +23.5%   11667563        vm-scalability.throughput
    380.73            +5.2%     400.43        vm-scalability.time.elapsed_time
    380.73            +5.2%     400.43        vm-scalability.time.elapsed_time.max
    106067           +51.4%     160541        vm-scalability.time.involuntary_context_switches
 6.307e+08           +23.6%  7.795e+08        vm-scalability.time.minor_page_faults
     78306            +4.6%      81916        vm-scalability.time.system_time
    777.26           +22.8%     954.12        vm-scalability.time.user_time
   2562168 ±  2%     -19.3%    2068789 ±  3%  vm-scalability.time.voluntary_context_switches
 2.838e+09           +23.6%  3.507e+09        vm-scalability.workload
 5.815e+09           +10.4%  6.421e+09 ±  3%  cpuidle..time
      6.66            +0.3        6.99 ±  3%  turbostat.C1%
     17067 ±  2%     -16.7%      14220 ±  2%  vmstat.system.cs
      0.05            -0.0        0.04 ±  3%  mpstat.cpu.all.soft%
      0.96            +0.1        1.11        mpstat.cpu.all.usr%
      2261 ±  5%     +22.8%       2776 ±  6%  perf-c2c.HITM.local
      1329 ±  8%     +20.8%       1605 ±  4%  perf-c2c.HITM.remote
  18499960           +19.0%   22006663        meminfo.Memused
   2990186           +26.8%    3790943        meminfo.PageTables
  10350121           +26.1%   13050399        meminfo.SUnreclaim
  10515083           +25.7%   13215478        meminfo.Slab
  18692470           +19.1%   22260889        meminfo.max_used_kB
    407798 ± 17%     +67.2%     682010 ± 20%  sched_debug.cfs_rq:/.load.max
    869895 ± 16%     -34.9%     566399 ± 18%  sched_debug.cfs_rq:/.spread0.avg
   -743993           +53.2%   -1139615        sched_debug.cfs_rq:/.spread0.min
     17559 ±  2%     -12.8%      15312 ±  2%  sched_debug.cpu.nr_switches.avg
     13015 ±  2%     -10.6%      11629 ±  2%  sched_debug.cpu.nr_switches.min
      4612           -18.2%       3772 ± 10%  sched_debug.cpu.nr_switches.stddev
    223.38 ±  8%     -15.7%     188.21 ± 12%  sched_debug.cpu.nr_uninterruptible.max
   1169323           +13.4%    1326004 ±  4%  numa-numastat.node0.local_node
   1239338           +13.9%    1411769 ±  2%  numa-numastat.node0.numa_hit
   1115777 ±  2%     +16.7%    1302136 ±  3%  numa-numastat.node1.local_node
   1181029 ±  2%     +19.6%    1412041 ±  3%  numa-numastat.node1.numa_hit
     65251 ± 31%     +68.4%     109907 ±  7%  numa-numastat.node1.other_node
   1111909 ±  5%     +15.5%    1284183 ±  2%  numa-numastat.node2.local_node
   1218212 ±  5%     +11.5%    1358029        numa-numastat.node2.numa_hit
   1285007 ±  2%     +25.7%    1615828 ±  2%  numa-numastat.node3.local_node
   1391326 ±  2%     +21.8%    1694543 ±  2%  numa-numastat.node3.numa_hit
    137025 ±  2%      +6.1%     145442        proc-vmstat.nr_active_anon
     95233            -1.5%      93787        proc-vmstat.nr_anon_pages
   4427600            -2.0%    4339914        proc-vmstat.nr_dirty_background_threshold
   8866026            -2.0%    8690440        proc-vmstat.nr_dirty_threshold
  44521360            -2.0%   43643224        proc-vmstat.nr_free_pages
    747466           +26.8%     948001        proc-vmstat.nr_page_table_pages
    168941 ±  2%      +4.9%     177164        proc-vmstat.nr_shmem
   2587999           +26.1%    3264111        proc-vmstat.nr_slab_unreclaimable
    137025 ±  2%      +6.1%     145442        proc-vmstat.nr_zone_active_anon
   5032565           +16.8%    5879446        proc-vmstat.numa_hit
   4684671           +18.1%    5530835        proc-vmstat.numa_local
   7168214           +18.3%    8479220        proc-vmstat.pgalloc_normal
 6.322e+08           +23.5%   7.81e+08        proc-vmstat.pgfault
   6881231           +18.9%    8181469        proc-vmstat.pgfree
   3627776            +4.5%    3791232        proc-vmstat.unevictable_pgs_scanned
   3935473 ±  5%     +34.6%    5297695 ± 16%  numa-meminfo.node0.MemUsed
    748076           +26.7%     947801        numa-meminfo.node0.PageTables
   2597779           +26.1%    3275316        numa-meminfo.node0.SUnreclaim
   2625903           +26.1%    3310684        numa-meminfo.node0.Slab
    747754           +26.9%     948714        numa-meminfo.node1.PageTables
   2580884           +26.5%    3264576        numa-meminfo.node1.SUnreclaim
   2652930           +25.8%    3336480        numa-meminfo.node1.Slab
    141326 ± 10%     -21.1%     111532 ± 19%  numa-meminfo.node2.Inactive(anon)
   3886953 ±  2%     +20.1%    4666698        numa-meminfo.node2.MemUsed
    751265           +26.3%     949188        numa-meminfo.node2.PageTables
   2591514           +25.6%    3255894        numa-meminfo.node2.SUnreclaim
   2622178           +24.9%    3274703        numa-meminfo.node2.Slab
    327366 ± 10%     +39.0%     454988 ± 20%  numa-meminfo.node3.Active
    327339 ± 10%     +39.0%     454931 ± 20%  numa-meminfo.node3.Active(anon)
     17238 ±  7%     +16.9%      20158 ± 11%  numa-meminfo.node3.Mapped
   4217474           +22.9%    5184418 ±  3%  numa-meminfo.node3.MemUsed
    745042           +27.0%     946381        numa-meminfo.node3.PageTables
   2586773           +26.0%    3259792        numa-meminfo.node3.SUnreclaim
    379407 ± 18%     +30.5%     495065 ± 18%  numa-meminfo.node3.Shmem
   2620922           +25.9%    3298794        numa-meminfo.node3.Slab
      0.88 ± 71%     -80.8%       0.17 ± 52%  perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      0.13 ± 53%    +311.7%       0.53 ± 55%  perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
      0.20 ± 13%     +32.6%       0.26 ± 15%  perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
      0.10 ±  9%    +110.7%       0.21 ±  8%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.06 ± 28%    +138.4%       0.13 ± 29%  perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      0.01        +14036.1%       1.70 ±221%  perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
      9.50 ± 34%    +476.1%      54.73 ± 84%  perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
      2.31 ± 39%     -78.8%       0.49 ±216%  perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
    138.30 ± 14%     +48.7%     205.63 ± 23%  perf-sched.sch_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
     13.72 ± 12%    +100.6%      27.53 ± 35%  perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      9.63 ± 18%    +114.0%      20.60 ± 10%  perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      2.26 ±  6%     +17.9%       2.66 ±  3%  perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    213.13 ±  2%     -11.2%     189.35 ±  2%  perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1074 ±  4%     +65.8%       1781 ±  4%  perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
     39318 ±  7%     -17.4%      32472 ±  5%  perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    171.51 ± 29%     +87.8%     322.16 ± 43%  perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
      2.06 ±  6%     +16.5%       2.40 ±  3%  perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    213.03 ±  2%     -11.2%     189.14 ±  2%  perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     10.68 ± 19%     +53.1%      16.34 ± 21%  perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
      0.02 ± 29%     -42.3%       0.01 ±  6%  perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
     27.23 ± 10%     +56.4%      42.58 ± 17%  perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
     72.93 ± 46%    +126.7%     165.37 ± 43%  perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
    186850           +26.8%     236835        numa-vmstat.node0.nr_page_table_pages
    648927           +26.1%     818461        numa-vmstat.node0.nr_slab_unreclaimable
   1239327           +13.9%    1411687 ±  2%  numa-vmstat.node0.numa_hit
   1169311           +13.4%    1325922 ±  4%  numa-vmstat.node0.numa_local
    186769           +26.9%     237070        numa-vmstat.node1.nr_page_table_pages
    644727           +26.5%     815732        numa-vmstat.node1.nr_slab_unreclaimable
   1180822 ±  2%     +19.6%    1411883 ±  3%  numa-vmstat.node1.numa_hit
   1115570 ±  2%     +16.7%    1301978 ±  3%  numa-vmstat.node1.numa_local
     65251 ± 31%     +68.4%     109907 ±  7%  numa-vmstat.node1.numa_other
     35329 ± 10%     -21.1%      27874 ± 19%  numa-vmstat.node2.nr_inactive_anon
    187645           +26.4%     237184        numa-vmstat.node2.nr_page_table_pages
    647359           +25.7%     813590        numa-vmstat.node2.nr_slab_unreclaimable
     35330 ± 10%     -21.1%      27874 ± 19%  numa-vmstat.node2.nr_zone_inactive_anon
   1218056 ±  5%     +11.5%    1357863        numa-vmstat.node2.numa_hit
   1111753 ±  5%     +15.5%    1284018 ±  2%  numa-vmstat.node2.numa_local
     81831 ± 10%     +38.9%     113699 ± 20%  numa-vmstat.node3.nr_active_anon
      4376 ±  8%     +17.1%       5125 ± 10%  numa-vmstat.node3.nr_mapped
    186088           +27.1%     236505        numa-vmstat.node3.nr_page_table_pages
     94854 ± 18%     +30.5%     123744 ± 18%  numa-vmstat.node3.nr_shmem
    646172           +26.1%     814570        numa-vmstat.node3.nr_slab_unreclaimable
     81831 ± 10%     +38.9%     113699 ± 20%  numa-vmstat.node3.nr_zone_active_anon
   1391229 ±  2%     +21.8%    1694455 ±  2%  numa-vmstat.node3.numa_hit
   1284909 ±  2%     +25.7%    1615740 ±  2%  numa-vmstat.node3.numa_local
 2.182e+10            +5.7%  2.307e+10        perf-stat.i.branch-instructions
     67.44            -1.4       66.01        perf-stat.i.cache-miss-rate%
 1.084e+08            +5.0%  1.138e+08 ±  3%  perf-stat.i.cache-references
     17046 ±  2%     -17.1%      14132 ±  2%  perf-stat.i.context-switches
      8.89            -2.0%       8.71        perf-stat.i.cpi
    488.32            +5.3%     514.19        perf-stat.i.cpu-migrations
      0.03            +0.0        0.04 ±  2%  perf-stat.i.dTLB-load-miss-rate%
   9099499           +14.7%   10433460 ±  2%  perf-stat.i.dTLB-load-misses
 2.527e+10            +4.3%  2.636e+10        perf-stat.i.dTLB-loads
   1732945           +15.5%    2001701        perf-stat.i.dTLB-store-misses
 2.005e+09           +12.5%  2.256e+09        perf-stat.i.dTLB-stores
     93.66            -3.6       90.02        perf-stat.i.iTLB-load-miss-rate%
    431003 ±  5%     +42.8%     615338        perf-stat.i.iTLB-loads
 9.349e+10            +4.1%  9.729e+10        perf-stat.i.instructions
     17597            +4.2%      18333 ±  2%  perf-stat.i.instructions-per-iTLB-miss
      0.12            +4.5%       0.12        perf-stat.i.ipc
    218.73            +5.1%     229.81        perf-stat.i.metric.M/sec
   1645267           +16.2%    1911350        perf-stat.i.minor-faults
     80.15            +1.7       81.87        perf-stat.i.node-load-miss-rate%
  22474764 ±  2%      +3.5%   23258402        perf-stat.i.node-load-misses
   6215674 ±  3%      -6.3%    5824416        perf-stat.i.node-loads
   5207478           +17.4%    6113199        perf-stat.i.node-store-misses
    831077           +17.9%     979621        perf-stat.i.node-stores
   1645267           +16.2%    1911350        perf-stat.i.page-faults
     68.37            -2.1       66.31        perf-stat.overall.cache-miss-rate%
      8.49            -4.5%       8.11        perf-stat.overall.cpi
      0.04            +0.0        0.04 ±  2%  perf-stat.overall.dTLB-load-miss-rate%
      0.09            +0.0        0.09        perf-stat.overall.dTLB-store-miss-rate%
     93.41            -2.5       90.92        perf-stat.overall.iTLB-load-miss-rate%
     15175            +3.4%      15687        perf-stat.overall.instructions-per-iTLB-miss
      0.12            +4.7%       0.12        perf-stat.overall.ipc
     78.20            +1.6       79.79        perf-stat.overall.node-load-miss-rate%
     12543           -11.2%      11140        perf-stat.overall.path-length
 2.183e+10            +6.1%  2.316e+10        perf-stat.ps.branch-instructions
 1.102e+08            +5.6%  1.164e+08 ±  2%  perf-stat.ps.cache-references
     17070 ±  2%     -16.9%      14191 ±  2%  perf-stat.ps.context-switches
    486.54            +5.7%     514.33        perf-stat.ps.cpu-migrations
   9294872           +15.8%   10767066 ±  2%  perf-stat.ps.dTLB-load-misses
 2.525e+10            +4.5%  2.639e+10        perf-stat.ps.dTLB-loads
   1747511           +16.6%    2038370        perf-stat.ps.dTLB-store-misses
 2.024e+09           +13.5%  2.298e+09        perf-stat.ps.dTLB-stores
    434045 ±  5%     +42.8%     619721        perf-stat.ps.iTLB-loads
 9.339e+10            +4.3%  9.739e+10        perf-stat.ps.instructions
   1658111           +17.4%    1946167        perf-stat.ps.minor-faults
  22728635 ±  2%      +4.0%   23636858        perf-stat.ps.node-load-misses
   6338224 ±  3%      -5.5%    5988664        perf-stat.ps.node-loads
   5139882           +17.1%    6017185        perf-stat.ps.node-store-misses
    840111           +18.9%     998845        perf-stat.ps.node-stores
   1658111           +17.4%    1946167        perf-stat.ps.page-faults
  3.56e+13            +9.8%  3.907e+13        perf-stat.total.instructions
     94.07            -1.6       92.46        perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
     93.61            -1.6       92.01        perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
     94.14            -1.6       92.57        perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
     94.18            -1.6       92.62        perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
     94.87            -1.3       93.61        perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
     94.94            -1.2       93.69        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     94.95            -1.2       93.70        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     94.99            -1.2       93.76        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     96.35            -0.9       95.49        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
     96.35            -0.9       95.49        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
     96.39            -0.9       95.53        perf-profile.calltrace.cycles-pp.__mmap
      0.77            +0.2        0.98        perf-profile.calltrace.cycles-pp.do_rw_once
      0.55            +0.3        0.81 ±  2%  perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.do_access
      0.56            +0.3        0.82 ±  2%  perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.do_access
      0.80            +0.3        1.12        perf-profile.calltrace.cycles-pp.asm_exc_page_fault.do_access
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.36            +0.4        1.72 ±  5%  perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      1.33            +0.4        1.70 ±  5%  perf-profile.calltrace.cycles-pp.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      1.16            +0.4        1.53 ±  6%  perf-profile.calltrace.cycles-pp.vma_migratable.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare
      1.07            +0.4        1.48        perf-profile.calltrace.cycles-pp.do_access
     94.07            -1.6       92.46        perf-profile.children.cycles-pp.rwsem_optimistic_spin
     93.63            -1.6       92.03        perf-profile.children.cycles-pp.osq_lock
     94.14            -1.6       92.57        perf-profile.children.cycles-pp.rwsem_down_write_slowpath
     94.18            -1.6       92.62        perf-profile.children.cycles-pp.down_write
     94.88            -1.3       93.61        perf-profile.children.cycles-pp.mmap_region
     94.94            -1.2       93.69        perf-profile.children.cycles-pp.do_mmap
     94.95            -1.2       93.70        perf-profile.children.cycles-pp.vm_mmap_pgoff
     94.99            -1.2       93.76        perf-profile.children.cycles-pp.ksys_mmap_pgoff
     96.38            -0.9       95.52        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     96.38            -0.9       95.52        perf-profile.children.cycles-pp.do_syscall_64
     96.39            -0.9       95.54        perf-profile.children.cycles-pp.__mmap
      0.38            -0.0        0.35        perf-profile.children.cycles-pp.rwsem_spin_on_owner
      0.05            +0.0        0.06        perf-profile.children.cycles-pp.mas_walk
      0.05            +0.0        0.06 ±  6%  perf-profile.children.cycles-pp.vm_unmapped_area
      0.05 ±  8%      +0.0        0.07 ±  7%  perf-profile.children.cycles-pp.__perf_sw_event
      0.06 ±  8%      +0.0        0.07        perf-profile.children.cycles-pp.lock_mm_and_find_vma
      0.06 ±  7%      +0.0        0.08        perf-profile.children.cycles-pp.sync_regs
      0.08            +0.0        0.10 ±  3%  perf-profile.children.cycles-pp.mtree_range_walk
      0.10            +0.0        0.13 ±  3%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.06 ±  7%      +0.0        0.09 ±  7%  perf-profile.children.cycles-pp.__unfreeze_partials
      0.17 ±  2%      +0.0        0.21 ±  8%  perf-profile.children.cycles-pp.update_cfs_group
      0.12 ±  4%      +0.0        0.16 ±  3%  perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.06            +0.0        0.10 ± 10%  perf-profile.children.cycles-pp.kmem_cache_free_bulk
      0.28 ±  2%      +0.0        0.33 ±  8%  perf-profile.children.cycles-pp.task_tick_fair
      0.07 ±  7%      +0.0        0.11 ±  8%  perf-profile.children.cycles-pp.___slab_alloc
      0.13 ±  3%      +0.0        0.18 ±  6%  perf-profile.children.cycles-pp.mas_store_prealloc
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.try_to_wake_up
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.wake_up_q
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.perf_event_mmap
      0.00            +0.1        0.05        perf-profile.children.cycles-pp.perf_event_mmap_event
      0.07 ±  7%      +0.1        0.12 ±  7%  perf-profile.children.cycles-pp.mas_destroy
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.___perf_sw_event
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.mt_find
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.mas_empty_area_rev
      0.00            +0.1        0.05 ±  8%  perf-profile.children.cycles-pp.find_vma
      0.10            +0.1        0.16 ±  6%  perf-profile.children.cycles-pp.mas_preallocate
      0.10            +0.1        0.16 ±  6%  perf-profile.children.cycles-pp.mas_alloc_nodes
      0.09            +0.1        0.14 ±  6%  perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
      0.00            +0.1        0.06 ± 13%  perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
      0.09            +0.1        0.15 ±  7%  perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
      0.10 ±  4%      +0.1        0.16 ±  4%  perf-profile.children.cycles-pp.rwsem_wake
      0.00            +0.1        0.06 ± 11%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.00            +0.1        0.07 ± 11%  perf-profile.children.cycles-pp.get_partial_node
      0.15 ±  5%      +0.1        0.22 ±  3%  perf-profile.children.cycles-pp.up_write
      0.17 ±  2%      +0.1        0.24        perf-profile.children.cycles-pp.vma_interval_tree_insert
      0.14 ±  6%      +0.1        0.25 ±  5%  perf-profile.children.cycles-pp._raw_spin_lock_irqsave
      0.08            +0.1        0.20 ±  6%  perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
      0.28 ±  2%      +0.2        0.45 ±  3%  perf-profile.children.cycles-pp.__handle_mm_fault
      0.34 ±  2%      +0.2        0.51 ±  2%  perf-profile.children.cycles-pp.handle_mm_fault
      2.24 ±  3%      +0.2        2.47 ±  5%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      0.90            +0.2        1.15        perf-profile.children.cycles-pp.do_rw_once
      0.56            +0.3        0.81        perf-profile.children.cycles-pp.do_user_addr_fault
      0.57            +0.3        0.82        perf-profile.children.cycles-pp.exc_page_fault
      0.83            +0.3        1.16        perf-profile.children.cycles-pp.asm_exc_page_fault
      1.36            +0.4        1.73 ±  5%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      1.39            +0.4        1.78 ±  5%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      1.38            +0.4        1.77 ±  5%  perf-profile.children.cycles-pp.exit_to_user_mode_loop
      1.38            +0.4        1.77 ±  5%  perf-profile.children.cycles-pp.task_work_run
      1.36            +0.4        1.75 ±  5%  perf-profile.children.cycles-pp.task_numa_work
      1.18            +0.4        1.58 ±  5%  perf-profile.children.cycles-pp.vma_migratable
      1.46            +0.5        1.96        perf-profile.children.cycles-pp.do_access
     92.91            -1.6       91.28        perf-profile.self.cycles-pp.osq_lock
      0.38            -0.0        0.35        perf-profile.self.cycles-pp.rwsem_spin_on_owner
      0.06 ±  7%      +0.0        0.08        perf-profile.self.cycles-pp.sync_regs
      0.10            +0.0        0.12 ±  4%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.07 ±  6%      +0.0        0.10 ±  3%  perf-profile.self.cycles-pp.mtree_range_walk
      0.06            +0.0        0.09 ±  5%  perf-profile.self.cycles-pp.lock_vma_under_rcu
      0.08 ±  5%      +0.0        0.11 ±  6%  perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.17 ±  2%      +0.0        0.21 ±  8%  perf-profile.self.cycles-pp.update_cfs_group
      0.02 ±141%      +0.0        0.06 ±  7%  perf-profile.self.cycles-pp.rwsem_optimistic_spin
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.___perf_sw_event
      0.00            +0.1        0.06 ±  6%  perf-profile.self.cycles-pp.down_write
      0.17 ±  2%      +0.1        0.24        perf-profile.self.cycles-pp.vma_interval_tree_insert
      0.08            +0.1        0.20 ±  6%  perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      0.55            +0.1        0.70        perf-profile.self.cycles-pp.do_access
      0.24 ±  3%      +0.2        0.39 ±  3%  perf-profile.self.cycles-pp.__handle_mm_fault
      0.68            +0.2        0.86        perf-profile.self.cycles-pp.do_rw_once
      1.16            +0.4        1.56 ±  6%  perf-profile.self.cycles-pp.vma_migratable





Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2023-09-21  7:44 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-09-21  7:43 [linus:master] [mm/mmap] 6852c46c78: stress-ng.spawn.ops_per_sec 2.9% improvement kernel test robot

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.