* [linus:master] [mm/mmap] 6852c46c78: stress-ng.spawn.ops_per_sec 2.9% improvement
@ 2023-09-21 7:43 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2023-09-21 7:43 UTC (permalink / raw)
To: Yu Ma
Cc: oe-lkp, lkp, linux-kernel, Andrew Morton, Tim Chen, Dan Williams,
Dave Hansen, Kirill A . Shutemov, Liam R. Howlett, Shakeel Butt,
Zhu, Lipeng, linux-mm, ying.huang, feng.tang, fengwei.yin,
oliver.sang
Hello,
kernel test robot noticed a 2.9% improvement of stress-ng.spawn.ops_per_sec on:
commit: 6852c46c783d20a4c0153d14d2990040e5e6e47e ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:
nr_threads: 100%
disk: 1HDD
testtime: 60s
class: exec_spawn
test: spawn
cpufreq_governor: performance
In addition to that, the commit also has significant impact on the following tests:
+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | unixbench: unixbench.score 6.4% improvement |
| test machine | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory |
| test parameters | cpufreq_governor=performance |
| | nr_task=100% |
| | runtime=300s |
| | test=shell8 |
+------------------+----------------------------------------------------------------------------------------------------+
| testcase: change | vm-scalability: vm-scalability.throughput 23.5% improvement |
| test machine | 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory |
| test parameters | cpufreq_governor=performance |
| | runtime=300s |
| | test=small-allocs |
+------------------+----------------------------------------------------------------------------------------------------+
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20230921/202309211530.ca9c00a1-oliver.sang@intel.com
=========================================================================================
class/compiler/cpufreq_governor/disk/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
exec_spawn/gcc-12/performance/1HDD/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp8/spawn/stress-ng/60s
commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
10609 ± 11% -19.9% 8493 ± 9% numa-meminfo.node0.PageTables
2666 ± 11% -22.4% 2069 ± 9% numa-vmstat.node0.nr_page_table_pages
73.76 -1.5% 72.67 iostat.cpu.system
11.69 +3.3% 12.07 iostat.cpu.user
9706 ± 5% +35.6% 13159 ± 22% turbostat.POLL
86.46 +0.9% 87.20 turbostat.RAMWatt
301.24 ± 14% -16.3% 252.23 ± 5% sched_debug.cfs_rq:/.runnable_avg.stddev
-60827 +81.7% -110508 sched_debug.cfs_rq:/.spread0.avg
1216 ± 13% +33.8% 1627 ± 12% sched_debug.cpu.nr_uninterruptible.max
1282527 +2.9% 1319688 stress-ng.spawn.ops
21375 +2.9% 21994 stress-ng.spawn.ops_per_sec
593917 +46.2% 868090 stress-ng.time.major_page_faults
2.721e+08 +2.9% 2.799e+08 stress-ng.time.minor_page_faults
2943 -1.5% 2899 stress-ng.time.system_time
434.40 +4.3% 453.01 stress-ng.time.user_time
602081 ± 2% +4.2% 627620 proc-vmstat.nr_anon_pages
823789 ± 2% +4.2% 858619 proc-vmstat.nr_inactive_anon
219961 +3.6% 227832 proc-vmstat.nr_mapped
346359 +2.1% 353585 proc-vmstat.nr_shmem
823789 ± 2% +4.2% 858619 proc-vmstat.nr_zone_inactive_anon
1.826e+08 +3.0% 1.881e+08 proc-vmstat.numa_hit
1.825e+08 +3.0% 1.88e+08 proc-vmstat.numa_local
425017 +2.2% 434312 proc-vmstat.pgactivate
1.874e+08 +3.0% 1.931e+08 proc-vmstat.pgalloc_normal
2.767e+08 +3.0% 2.85e+08 proc-vmstat.pgfault
1.857e+08 +3.0% 1.913e+08 proc-vmstat.pgfree
5030 +2.6% 5159 proc-vmstat.thp_split_pmd
0.01 ± 9% +30.2% 0.01 ± 17% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_alloc.alloc_bprm.do_execveat_common
0.54 ± 22% -46.9% 0.29 ± 52% perf-sched.sch_delay.max.ms.__cond_resched.filemap_read.__kernel_read.search_binary_handler.exec_binprm
0.15 ± 80% -67.6% 0.05 ± 41% perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc.__anon_vma_prepare.do_cow_fault.do_fault
0.63 ± 12% -32.5% 0.42 ± 12% perf-sched.sch_delay.max.ms.__cond_resched.remove_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
0.03 ± 2% +15.9% 0.03 ± 4% perf-sched.wait_and_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
4292 ± 5% -14.2% 3683 ± 4% perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
3397 ± 4% +16.8% 3966 ± 4% perf-sched.wait_and_delay.count.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
5346 ± 4% +14.1% 6100 ± 2% perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_exc_page_fault
27421 ± 2% +49.2% 40919 perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
18122 ± 5% -13.9% 15598 ± 3% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
78347 ± 5% -14.6% 66894 ± 3% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.unlink_file_vma
2.02 ± 27% -46.2% 1.09 ± 14% perf-sched.wait_and_delay.max.ms.__cond_resched.zap_pmd_range.isra.0.unmap_page_range
0.01 ± 23% +109.1% 0.02 ± 63% perf-sched.wait_time.avg.ms.__cond_resched.__alloc_pages.pte_alloc_one.__do_fault.do_cow_fault
0.02 ± 6% +68.4% 0.03 ± 29% perf-sched.wait_time.avg.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
0.02 ± 21% +49.0% 0.03 ± 26% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.getname_flags.part.0
0.02 ± 6% +19.5% 0.03 ± 10% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc.vm_area_dup.__split_vma.mprotect_fixup
0.01 ± 3% +20.3% 0.01 ± 7% perf-sched.wait_time.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.04 ± 54% +138.4% 0.10 ± 68% perf-sched.wait_time.max.ms.__cond_resched.down_read_killable.iterate_dir.__x64_sys_getdents64.do_syscall_64
0.08 ± 52% +290.1% 0.33 ± 61% perf-sched.wait_time.max.ms.__cond_resched.down_write.do_brk_flags.vm_brk_flags.load_elf_interp
0.03 ± 70% +101.1% 0.06 ± 16% perf-sched.wait_time.max.ms.__cond_resched.down_write.exit_mmap.__mmput.exit_mm
0.54 ± 7% -48.8% 0.27 ± 69% perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.vm_mmap_pgoff.elf_map.load_elf_interp
0.54 ± 3% +18.9% 0.64 ± 16% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open
2.30 ± 18% -22.0% 1.79 ± 8% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
13.82 +1.5% 14.03 perf-stat.i.MPKI
1.93 +0.0 1.96 perf-stat.i.branch-miss-rate%
4.182e+08 +2.3% 4.28e+08 perf-stat.i.branch-misses
2.549e+08 +1.9% 2.598e+08 perf-stat.i.cache-misses
1.545e+09 +2.5% 1.584e+09 perf-stat.i.cache-references
1.86 -2.0% 1.82 perf-stat.i.cpi
945.76 ± 3% -5.3% 895.17 ± 2% perf-stat.i.cycles-between-cache-misses
1.249e+10 +2.7% 1.283e+10 perf-stat.i.dTLB-stores
0.55 +3.0% 0.56 perf-stat.i.ipc
9557 +46.8% 14033 perf-stat.i.major-faults
2025 +2.4% 2074 perf-stat.i.metric.K/sec
996.43 +1.3% 1009 perf-stat.i.metric.M/sec
4413113 +2.7% 4533786 perf-stat.i.minor-faults
12764008 ± 2% +3.7% 13236279 perf-stat.i.node-loads
47197681 +2.6% 48424472 perf-stat.i.node-stores
4422671 +2.8% 4547820 perf-stat.i.page-faults
14.16 +1.6% 14.39 perf-stat.overall.MPKI
1.92 +0.0 1.95 perf-stat.overall.branch-miss-rate%
1.87 -1.8% 1.84 perf-stat.overall.cpi
810.88 -2.8% 788.33 perf-stat.overall.cycles-between-cache-misses
0.53 +1.8% 0.54 perf-stat.overall.ipc
4.094e+08 +2.3% 4.19e+08 perf-stat.ps.branch-misses
1.511e+09 +2.5% 1.549e+09 perf-stat.ps.cache-references
17556 ± 2% +3.9% 18239 perf-stat.ps.cpu-migrations
1.226e+10 +2.7% 1.26e+10 perf-stat.ps.dTLB-stores
9359 +46.0% 13666 perf-stat.ps.major-faults
4325409 +2.8% 4445394 perf-stat.ps.minor-faults
12743809 ± 2% +4.1% 13262041 perf-stat.ps.node-loads
46090814 +2.7% 47343915 perf-stat.ps.node-stores
4334769 +2.9% 4459060 perf-stat.ps.page-faults
17.36 -1.2 16.11 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables
14.04 -1.1 12.96 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
69.96 -0.9 69.02 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
70.01 -0.9 69.08 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
12.28 -0.9 11.37 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma
9.72 -0.8 8.90 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.exit_mmap
10.90 ± 2% -0.8 10.10 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
9.96 -0.8 9.16 perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.exit_mmap.__mmput
7.46 ± 2% -0.8 6.69 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
7.33 ± 2% -0.8 6.56 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
10.95 -0.8 10.20 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.exit_mmap.__mmput.exit_mm
12.30 -0.7 11.60 perf-profile.calltrace.cycles-pp.free_pgtables.exit_mmap.__mmput.exit_mm.do_exit
8.52 ± 2% -0.5 8.03 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.do_vmi_align_munmap
8.72 ± 2% -0.5 8.24 perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
7.36 -0.4 6.92 perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
7.95 -0.4 7.52 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.unlink_file_vma.free_pgtables.unmap_region
5.72 ± 2% -0.4 5.30 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
3.96 ± 2% -0.4 3.55 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.vma_prepare.__split_vma.mprotect_fixup
8.40 -0.4 7.98 perf-profile.calltrace.cycles-pp.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap
8.12 -0.4 7.71 perf-profile.calltrace.cycles-pp.down_write.unlink_file_vma.free_pgtables.unmap_region.do_vmi_align_munmap
4.10 ± 2% -0.4 3.70 perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
8.68 -0.4 8.28 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
5.16 -0.3 4.84 perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
9.61 -0.3 9.30 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
7.33 -0.2 7.14 perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
2.84 -0.2 2.66 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.unlink_file_vma
2.18 -0.2 2.02 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.vma_prepare
1.18 -0.1 1.11 perf-profile.calltrace.cycles-pp.rwsem_spin_on_owner.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
0.84 ± 3% -0.1 0.77 ± 3% perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.elf_map.load_elf_interp.load_elf_binary.search_binary_handler
0.79 ± 3% -0.1 0.73 ± 3% perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.elf_map.load_elf_interp
0.82 -0.0 0.78 perf-profile.calltrace.cycles-pp.mas_store_prealloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.56 +0.0 0.58 perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap
0.60 +0.0 0.63 perf-profile.calltrace.cycles-pp.__sysconf
0.57 ± 2% +0.0 0.60 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.wait4
0.55 ± 2% +0.0 0.58 perf-profile.calltrace.cycles-pp.kernel_wait4.__do_sys_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
0.57 ± 2% +0.0 0.60 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.wait4
0.67 +0.0 0.70 perf-profile.calltrace.cycles-pp.sync_regs.asm_exc_page_fault
0.55 +0.0 0.58 ± 2% perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
0.61 +0.0 0.64 perf-profile.calltrace.cycles-pp.copy_process.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.04 +0.0 1.08 perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.08 +0.0 1.12 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
0.56 +0.0 0.60 ± 3% perf-profile.calltrace.cycles-pp.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.64 +0.0 0.68 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey
0.54 +0.0 0.58 perf-profile.calltrace.cycles-pp.do_set_pte.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
0.54 ± 2% +0.0 0.58 perf-profile.calltrace.cycles-pp.finish_fault.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
0.93 +0.0 0.98 perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
0.94 +0.0 0.99 perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.63 ± 2% +0.1 0.68 perf-profile.calltrace.cycles-pp._compound_head.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
1.06 +0.1 1.12 perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.00 +0.1 1.06 perf-profile.calltrace.cycles-pp._dl_addr
1.07 +0.1 1.12 perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.58 ± 2% +0.1 0.63 ± 2% perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.mprotect_fixup.do_mprotect_pkey
0.93 +0.1 0.99 perf-profile.calltrace.cycles-pp.kernfs_fop_readdir.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.03 +0.1 1.09 perf-profile.calltrace.cycles-pp.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
1.03 +0.1 1.09 perf-profile.calltrace.cycles-pp.kernel_clone.__do_sys_clone.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
0.97 +0.1 1.04 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64
0.97 +0.1 1.04 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
0.96 +0.1 1.03 perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
0.98 +0.1 1.04 perf-profile.calltrace.cycles-pp.getdents64
0.97 +0.1 1.03 perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64
1.37 +0.1 1.44 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__clone
1.33 ± 2% +0.1 1.40 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__clone
0.96 +0.1 1.04 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap
1.50 +0.1 1.58 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect
1.30 +0.1 1.38 perf-profile.calltrace.cycles-pp.wp_page_copy.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
1.51 +0.1 1.60 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
1.51 +0.1 1.60 perf-profile.calltrace.cycles-pp.__x64_sys_execve.do_syscall_64.entry_SYSCALL_64_after_hwframe.execve.__clone
1.52 +0.1 1.61 perf-profile.calltrace.cycles-pp.execve.__clone
1.52 +0.1 1.60 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.execve.__clone
1.60 +0.1 1.70 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
1.26 ± 2% +0.1 1.36 perf-profile.calltrace.cycles-pp.page_remove_rmap.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
1.20 +0.1 1.30 perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
1.52 +0.1 1.62 perf-profile.calltrace.cycles-pp.next_uptodate_page.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
0.43 ± 44% +0.1 0.55 ± 3% perf-profile.calltrace.cycles-pp.mm_init.alloc_bprm.do_execveat_common.__x64_sys_execve.do_syscall_64
1.86 ± 2% +0.2 2.03 perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput
3.24 +0.2 3.42 perf-profile.calltrace.cycles-pp.__clone
2.30 +0.2 2.50 perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.exit_mmap.__mmput.exit_mm
2.33 +0.2 2.52 perf-profile.calltrace.cycles-pp.tlb_finish_mmu.exit_mmap.__mmput.exit_mm.do_exit
2.64 +0.2 2.84 perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
2.83 +0.2 3.03 perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
3.13 +0.2 3.34 perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap
3.34 +0.2 3.55 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap.__mmput
3.41 +0.2 3.63 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.__mmput.exit_mm
2.02 +0.2 2.24 perf-profile.calltrace.cycles-pp.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
3.62 +0.2 3.86 perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.__mmput.exit_mm.do_exit
4.88 +0.4 5.31 perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
0.00 +0.5 0.51 ± 2% perf-profile.calltrace.cycles-pp.copy_page.do_cow_fault.do_fault.__handle_mm_fault.handle_mm_fault
7.28 +0.6 7.86 perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
7.68 +0.6 8.29 perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
8.90 +0.7 9.59 perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
8.97 +0.7 9.67 perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
9.85 +0.7 10.58 perf-profile.calltrace.cycles-pp.asm_exc_page_fault
39.11 -3.0 36.10 perf-profile.children.cycles-pp.rwsem_optimistic_spin
39.79 -3.0 36.79 perf-profile.children.cycles-pp.rwsem_down_write_slowpath
40.99 -2.9 38.11 perf-profile.children.cycles-pp.down_write
31.42 -2.5 28.90 perf-profile.children.cycles-pp.osq_lock
19.60 -1.2 18.41 perf-profile.children.cycles-pp.unlink_file_vma
21.27 -1.1 20.14 perf-profile.children.cycles-pp.free_pgtables
40.67 -1.0 39.70 perf-profile.children.cycles-pp.mmap_region
41.06 -1.0 40.10 perf-profile.children.cycles-pp.do_mmap
41.28 -0.9 40.34 perf-profile.children.cycles-pp.vm_mmap_pgoff
79.55 -0.8 78.73 perf-profile.children.cycles-pp.do_syscall_64
79.62 -0.8 78.82 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
17.54 -0.7 16.80 perf-profile.children.cycles-pp.vma_prepare
6.88 -0.5 6.38 perf-profile.children.cycles-pp.rwsem_spin_on_owner
24.95 -0.5 24.48 perf-profile.children.cycles-pp.do_vmi_align_munmap
25.10 -0.5 24.64 perf-profile.children.cycles-pp.do_vmi_munmap
21.02 -0.4 20.64 perf-profile.children.cycles-pp.__split_vma
10.25 -0.3 9.94 perf-profile.children.cycles-pp.unmap_region
2.79 ± 4% -0.2 2.54 ± 3% perf-profile.children.cycles-pp.elf_map
0.39 -0.1 0.30 perf-profile.children.cycles-pp.mas_wr_walk
1.50 -0.0 1.46 perf-profile.children.cycles-pp.mas_wr_store_entry
0.11 +0.0 0.12 ± 3% perf-profile.children.cycles-pp.security_file_alloc
0.23 ± 2% +0.0 0.24 ± 2% perf-profile.children.cycles-pp.__percpu_counter_sum
0.16 ± 2% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.init_file
0.16 ± 3% +0.0 0.17 perf-profile.children.cycles-pp.unmap_single_vma
0.25 +0.0 0.27 perf-profile.children.cycles-pp.__memcpy
0.24 ± 3% +0.0 0.26 perf-profile.children.cycles-pp.native_flush_tlb_one_user
0.21 ± 2% +0.0 0.23 perf-profile.children.cycles-pp.mas_push_data
0.16 +0.0 0.18 ± 2% perf-profile.children.cycles-pp.free_unref_page_list
0.14 ± 3% +0.0 0.15 ± 3% perf-profile.children.cycles-pp.padzero
0.38 +0.0 0.40 perf-profile.children.cycles-pp.d_path
0.30 +0.0 0.31 perf-profile.children.cycles-pp.__vm_area_free
0.29 +0.0 0.31 perf-profile.children.cycles-pp.mas_find
0.45 +0.0 0.47 perf-profile.children.cycles-pp.percpu_counter_add_batch
0.34 ± 2% +0.0 0.35 ± 2% perf-profile.children.cycles-pp.find_idlest_group
0.30 +0.0 0.32 perf-profile.children.cycles-pp.update_sg_wakeup_stats
0.36 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.find_idlest_cpu
0.29 ± 2% +0.0 0.31 ± 2% perf-profile.children.cycles-pp.__filemap_get_folio
0.34 +0.0 0.36 perf-profile.children.cycles-pp.link_path_walk
0.44 ± 2% +0.0 0.46 perf-profile.children.cycles-pp.__mem_cgroup_charge
0.41 +0.0 0.43 perf-profile.children.cycles-pp.unlink_anon_vmas
0.31 ± 2% +0.0 0.33 perf-profile.children.cycles-pp.alloc_empty_file
0.25 ± 2% +0.0 0.27 perf-profile.children.cycles-pp.wake_up_new_task
0.41 ± 2% +0.0 0.43 ± 2% perf-profile.children.cycles-pp.__mmdrop
0.35 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.mas_next_slot
0.35 ± 2% +0.0 0.37 perf-profile.children.cycles-pp.flush_tlb_mm_range
0.33 +0.0 0.35 perf-profile.children.cycles-pp.find_vma
0.45 ± 2% +0.0 0.48 ± 2% perf-profile.children.cycles-pp.__rb_erase_color
0.28 ± 2% +0.0 0.30 ± 2% perf-profile.children.cycles-pp.filemap_get_entry
0.27 +0.0 0.29 ± 3% perf-profile.children.cycles-pp.up_read
0.51 +0.0 0.53 perf-profile.children.cycles-pp.mas_store_gfp
0.41 ± 2% +0.0 0.43 perf-profile.children.cycles-pp.free_swap_cache
0.44 +0.0 0.47 perf-profile.children.cycles-pp.___perf_sw_event
0.05 ± 13% +0.0 0.08 ± 14% perf-profile.children.cycles-pp.get_any_partial
0.35 +0.0 0.38 ± 2% perf-profile.children.cycles-pp.mt_find
0.37 ± 2% +0.0 0.39 ± 2% perf-profile.children.cycles-pp.vfs_read
0.33 ± 2% +0.0 0.35 ± 2% perf-profile.children.cycles-pp.vma_interval_tree_augment_rotate
0.50 ± 2% +0.0 0.52 ± 2% perf-profile.children.cycles-pp.copy_page
0.33 ± 2% +0.0 0.36 ± 2% perf-profile.children.cycles-pp.__percpu_counter_init
0.24 ± 5% +0.0 0.26 ± 3% perf-profile.children.cycles-pp.down_read
0.31 +0.0 0.33 ± 2% perf-profile.children.cycles-pp.lru_add_fn
0.37 +0.0 0.40 perf-profile.children.cycles-pp.__open64_nocancel
0.39 ± 2% +0.0 0.42 ± 2% perf-profile.children.cycles-pp.ksys_read
0.37 ± 2% +0.0 0.40 perf-profile.children.cycles-pp.__mod_lruvec_page_state
0.53 +0.0 0.56 perf-profile.children.cycles-pp.__perf_sw_event
0.43 ± 2% +0.0 0.46 perf-profile.children.cycles-pp.free_pages_and_swap_cache
0.66 +0.0 0.68 perf-profile.children.cycles-pp.get_page_from_freelist
0.62 +0.0 0.65 perf-profile.children.cycles-pp.__sysconf
0.55 ± 2% +0.0 0.58 perf-profile.children.cycles-pp.kernel_wait4
0.59 ± 2% +0.0 0.62 perf-profile.children.cycles-pp.wait4
0.47 ± 2% +0.0 0.50 ± 3% perf-profile.children.cycles-pp.try_to_wake_up
0.53 +0.0 0.56 perf-profile.children.cycles-pp.mas_split
0.30 +0.0 0.33 ± 2% perf-profile.children.cycles-pp.folio_add_lru_vma
0.06 ± 11% +0.0 0.09 ± 6% perf-profile.children.cycles-pp.io_schedule
0.51 +0.0 0.54 perf-profile.children.cycles-pp.lock_mm_and_find_vma
0.47 +0.0 0.50 ± 2% perf-profile.children.cycles-pp.__x64_sys_munmap
0.41 +0.0 0.44 ± 3% perf-profile.children.cycles-pp.pcpu_alloc
0.60 +0.0 0.64 perf-profile.children.cycles-pp.mas_wr_bnode
0.06 ± 11% +0.0 0.10 ± 5% perf-profile.children.cycles-pp.folio_wait_bit_common
0.61 ± 2% +0.0 0.65 perf-profile.children.cycles-pp.copy_process
0.48 +0.0 0.51 perf-profile.children.cycles-pp.page_add_file_rmap
0.75 +0.0 0.79 perf-profile.children.cycles-pp.native_irq_return_iret
0.69 +0.0 0.73 perf-profile.children.cycles-pp.vm_area_alloc
0.56 +0.0 0.60 ± 2% perf-profile.children.cycles-pp._raw_spin_lock
0.54 +0.0 0.58 perf-profile.children.cycles-pp.lock_vma_under_rcu
0.46 ± 2% +0.0 0.50 ± 2% perf-profile.children.cycles-pp.__pte_offset_map_lock
0.51 +0.0 0.55 ± 2% perf-profile.children.cycles-pp.mm_init
0.15 ± 2% +0.0 0.19 ± 3% perf-profile.children.cycles-pp.__wake_up_common
0.61 +0.0 0.65 perf-profile.children.cycles-pp.mas_walk
0.67 +0.0 0.71 perf-profile.children.cycles-pp.vma_alloc_folio
0.88 +0.0 0.92 perf-profile.children.cycles-pp.sync_regs
0.83 +0.0 0.87 perf-profile.children.cycles-pp.perf_event_mmap_output
0.56 +0.0 0.60 ± 2% perf-profile.children.cycles-pp.alloc_bprm
0.83 +0.0 0.87 perf-profile.children.cycles-pp.mas_wr_node_store
0.58 +0.0 0.62 perf-profile.children.cycles-pp.__rb_insert_augmented
0.79 +0.0 0.83 ± 2% perf-profile.children.cycles-pp.memcg_slab_post_alloc_hook
0.06 ± 8% +0.0 0.10 perf-profile.children.cycles-pp.folio_wake_bit
0.92 +0.0 0.97 perf-profile.children.cycles-pp.__alloc_pages
0.75 +0.0 0.80 perf-profile.children.cycles-pp.mtree_range_walk
1.04 +0.0 1.09 perf-profile.children.cycles-pp.__slab_free
0.77 +0.0 0.82 perf-profile.children.cycles-pp.vm_area_dup
0.55 ± 2% +0.0 0.60 perf-profile.children.cycles-pp.finish_fault
1.01 +0.1 1.06 perf-profile.children.cycles-pp._dl_addr
1.06 +0.1 1.12 perf-profile.children.cycles-pp.perf_iterate_sb
0.75 ± 2% +0.1 0.81 perf-profile.children.cycles-pp._compound_head
1.03 +0.1 1.09 perf-profile.children.cycles-pp.__do_sys_clone
1.03 +0.1 1.09 perf-profile.children.cycles-pp.kernel_clone
0.96 +0.1 1.03 perf-profile.children.cycles-pp.iterate_dir
0.94 +0.1 1.00 perf-profile.children.cycles-pp.kernfs_fop_readdir
0.98 +0.1 1.04 perf-profile.children.cycles-pp.getdents64
0.97 +0.1 1.03 perf-profile.children.cycles-pp.__x64_sys_getdents64
0.36 ± 4% +0.1 0.42 ± 2% perf-profile.children.cycles-pp.__unfreeze_partials
0.01 ±223% +0.1 0.08 ± 6% perf-profile.children.cycles-pp.wake_page_function
0.36 +0.1 0.43 ± 2% perf-profile.children.cycles-pp.filemap_fault
1.81 +0.1 1.89 perf-profile.children.cycles-pp.perf_event_mmap_event
1.42 +0.1 1.49 perf-profile.children.cycles-pp.do_sys_openat2
0.56 +0.1 0.63 perf-profile.children.cycles-pp.__do_fault
1.87 +0.1 1.95 perf-profile.children.cycles-pp.perf_event_mmap
0.76 ± 2% +0.1 0.84 perf-profile.children.cycles-pp.kmem_cache_free_bulk
1.42 +0.1 1.50 perf-profile.children.cycles-pp.__x64_sys_openat
0.32 ± 3% +0.1 0.40 ± 5% perf-profile.children.cycles-pp.get_partial_node
1.40 +0.1 1.48 perf-profile.children.cycles-pp.path_openat
1.42 +0.1 1.50 perf-profile.children.cycles-pp.do_filp_open
0.70 +0.1 0.79 ± 2% perf-profile.children.cycles-pp.lru_add_drain_cpu
0.71 +0.1 0.79 perf-profile.children.cycles-pp.lru_add_drain
1.39 +0.1 1.48 perf-profile.children.cycles-pp.wp_page_copy
1.43 +0.1 1.53 perf-profile.children.cycles-pp.do_set_pte
3.10 +0.1 3.20 perf-profile.children.cycles-pp.mas_store_prealloc
2.00 +0.1 2.11 perf-profile.children.cycles-pp.kmem_cache_alloc
1.35 ± 2% +0.1 1.46 perf-profile.children.cycles-pp.page_remove_rmap
0.94 +0.1 1.04 perf-profile.children.cycles-pp.folio_batch_move_lru
0.76 ± 2% +0.1 0.87 ± 3% perf-profile.children.cycles-pp.___slab_alloc
1.22 +0.1 1.33 perf-profile.children.cycles-pp.mas_destroy
0.98 +0.1 1.11 ± 2% perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
1.11 +0.1 1.24 ± 2% perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
0.96 ± 3% +0.1 1.09 perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
1.38 +0.1 1.52 perf-profile.children.cycles-pp.mas_preallocate
1.43 +0.1 1.58 perf-profile.children.cycles-pp.mas_alloc_nodes
2.02 +0.2 2.19 perf-profile.children.cycles-pp.release_pages
3.25 +0.2 3.43 perf-profile.children.cycles-pp.__clone
3.39 +0.2 3.58 perf-profile.children.cycles-pp.vma_complete
2.40 +0.2 2.60 perf-profile.children.cycles-pp.tlb_batch_pages_flush
4.42 +0.2 4.62 perf-profile.children.cycles-pp.vma_interval_tree_insert
2.82 +0.2 3.02 perf-profile.children.cycles-pp.next_uptodate_page
2.65 +0.2 2.86 perf-profile.children.cycles-pp.tlb_finish_mmu
3.50 +0.2 3.73 perf-profile.children.cycles-pp.zap_pte_range
2.10 +0.2 2.33 perf-profile.children.cycles-pp.do_cow_fault
3.59 +0.2 3.82 perf-profile.children.cycles-pp.zap_pmd_range
3.73 +0.2 3.97 perf-profile.children.cycles-pp.unmap_page_range
1.61 ± 2% +0.3 1.87 ± 2% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
3.99 +0.3 4.26 perf-profile.children.cycles-pp.unmap_vmas
1.33 ± 2% +0.3 1.60 ± 2% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
5.00 +0.3 5.33 perf-profile.children.cycles-pp.filemap_map_pages
5.37 +0.4 5.72 perf-profile.children.cycles-pp.do_read_fault
7.51 +0.6 8.09 perf-profile.children.cycles-pp.do_fault
10.64 +0.8 11.41 perf-profile.children.cycles-pp.__handle_mm_fault
11.19 +0.8 12.00 perf-profile.children.cycles-pp.handle_mm_fault
12.75 +0.9 13.66 perf-profile.children.cycles-pp.do_user_addr_fault
12.82 +0.9 13.73 perf-profile.children.cycles-pp.exc_page_fault
14.29 +1.0 15.27 perf-profile.children.cycles-pp.asm_exc_page_fault
30.81 -2.5 28.33 perf-profile.self.cycles-pp.osq_lock
6.72 -0.5 6.23 perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.34 ± 2% -0.1 0.26 perf-profile.self.cycles-pp.mas_wr_walk
0.09 +0.0 0.10 perf-profile.self.cycles-pp.access_error
0.20 +0.0 0.21 perf-profile.self.cycles-pp.__memcpy
0.11 ± 3% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.__unfreeze_partials
0.38 +0.0 0.40 perf-profile.self.cycles-pp.___perf_sw_event
0.37 +0.0 0.39 perf-profile.self.cycles-pp.free_swap_cache
0.26 +0.0 0.28 ± 4% perf-profile.self.cycles-pp.up_read
0.47 +0.0 0.49 ± 2% perf-profile.self.cycles-pp.kmem_cache_free_bulk
0.32 ± 2% +0.0 0.34 ± 2% perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
0.22 ± 4% +0.0 0.24 ± 3% perf-profile.self.cycles-pp.down_read
0.49 +0.0 0.51 ± 2% perf-profile.self.cycles-pp.copy_page
0.36 +0.0 0.38 perf-profile.self.cycles-pp.do_set_pte
0.37 +0.0 0.40 perf-profile.self.cycles-pp.__kmem_cache_alloc_bulk
0.41 +0.0 0.44 ± 3% perf-profile.self.cycles-pp.___slab_alloc
0.06 ± 7% +0.0 0.09 perf-profile.self.cycles-pp.try_to_wake_up
0.49 +0.0 0.52 perf-profile.self.cycles-pp.memcg_slab_post_alloc_hook
0.50 +0.0 0.52 ± 2% perf-profile.self.cycles-pp._raw_spin_lock
0.41 +0.0 0.44 perf-profile.self.cycles-pp.page_add_file_rmap
0.58 ± 2% +0.0 0.62 perf-profile.self.cycles-pp.kmem_cache_alloc
0.75 +0.0 0.79 perf-profile.self.cycles-pp.native_irq_return_iret
0.50 +0.0 0.53 perf-profile.self.cycles-pp.rwsem_down_write_slowpath
0.55 +0.0 0.59 perf-profile.self.cycles-pp.__rb_insert_augmented
0.77 +0.0 0.81 perf-profile.self.cycles-pp.filemap_map_pages
0.87 +0.0 0.91 perf-profile.self.cycles-pp.sync_regs
0.72 +0.0 0.76 perf-profile.self.cycles-pp.mtree_range_walk
1.03 +0.0 1.07 ± 2% perf-profile.self.cycles-pp.__slab_free
0.89 +0.0 0.94 perf-profile.self.cycles-pp._dl_addr
1.10 +0.1 1.15 perf-profile.self.cycles-pp.zap_pte_range
0.69 ± 2% +0.1 0.74 perf-profile.self.cycles-pp._compound_head
1.08 +0.1 1.16 perf-profile.self.cycles-pp.release_pages
1.14 ± 2% +0.1 1.23 perf-profile.self.cycles-pp.page_remove_rmap
1.10 ± 2% +0.1 1.20 perf-profile.self.cycles-pp.down_write
4.32 +0.2 4.52 perf-profile.self.cycles-pp.vma_interval_tree_insert
2.72 +0.2 2.92 perf-profile.self.cycles-pp.next_uptodate_page
1.33 ± 2% +0.3 1.60 ± 2% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
***************************************************************************************************
lkp-spr-2sp3: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480L (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/300s/lkp-spr-2sp3/shell8/unixbench
commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
128621 ± 44% -35.5% 83021 ± 18% sched_debug.cfs_rq:/.spread0.stddev
68.67 -2.7% 66.83 turbostat.PkgTmp
30.36 +1.2% 30.74 turbostat.RAMWatt
23470 ± 2% +6.4% 24970 ± 2% unixbench.score
799587 ± 3% +11.6% 892414 ± 3% unixbench.time.involuntary_context_switches
83795 ± 8% +39.5% 116914 ± 5% unixbench.time.major_page_faults
1.003e+08 ± 2% +6.5% 1.068e+08 ± 2% unixbench.time.minor_page_faults
328.24 ± 2% +6.1% 348.20 unixbench.time.user_time
894180 ± 2% +6.7% 953859 ± 2% unixbench.workload
70334849 ± 2% +6.3% 74748722 ± 2% proc-vmstat.numa_hit
70084753 ± 2% +6.3% 74504737 ± 2% proc-vmstat.numa_local
54187 ± 2% +6.3% 57597 ± 2% proc-vmstat.pgactivate
74859673 ± 2% +6.3% 79582401 ± 2% proc-vmstat.pgalloc_normal
1.024e+08 ± 2% +6.4% 1.09e+08 ± 2% proc-vmstat.pgfault
74344826 ± 2% +6.4% 79067399 ± 2% proc-vmstat.pgfree
4212400 ± 2% +6.2% 4474364 ± 2% proc-vmstat.pgreuse
3869 ± 2% +7.1% 4145 ± 3% proc-vmstat.thp_fault_alloc
1612101 ± 2% +6.3% 1713519 ± 2% proc-vmstat.unevictable_pgs_culled
5.69 ± 2% +5.1% 5.98 ± 2% perf-stat.i.MPKI
0.94 ± 2% +0.0 0.98 perf-stat.i.branch-miss-rate%
2.934e+08 ± 2% +5.8% 3.105e+08 ± 2% perf-stat.i.branch-misses
2.844e+08 ± 2% +6.1% 3.016e+08 ± 2% perf-stat.i.cache-misses
9.76e+08 ± 2% +5.8% 1.033e+09 ± 2% perf-stat.i.cache-references
2.99 -1.3% 2.95 perf-stat.i.cpi
5.226e+11 -0.9% 5.179e+11 perf-stat.i.cpu-cycles
33882826 ± 2% +7.2% 36317869 ± 2% perf-stat.i.dTLB-store-misses
9.478e+09 ± 2% +5.9% 1.004e+10 ± 2% perf-stat.i.dTLB-stores
0.36 +2.1% 0.36 perf-stat.i.ipc
1312 ± 8% +39.6% 1831 ± 5% perf-stat.i.major-faults
2.33 -0.9% 2.31 perf-stat.i.metric.GHz
513.19 ± 2% +6.0% 544.03 ± 2% perf-stat.i.metric.K/sec
1545804 ± 2% +6.5% 1646269 ± 2% perf-stat.i.minor-faults
60502067 ± 2% +5.4% 63798162 ± 2% perf-stat.i.node-load-misses
14871403 ± 2% +6.2% 15796551 ± 2% perf-stat.i.node-loads
1547117 ± 2% +6.5% 1648101 ± 2% perf-stat.i.page-faults
5.78 ± 2% +5.4% 6.10 ± 2% perf-stat.overall.MPKI
0.87 ± 2% +0.0 0.92 ± 2% perf-stat.overall.branch-miss-rate%
3.10 -1.3% 3.06 perf-stat.overall.cpi
1839 ± 3% -6.6% 1718 ± 2% perf-stat.overall.cycles-between-cache-misses
12132105 ± 2% -5.5% 11462241 ± 2% perf-stat.overall.path-length
2.901e+08 ± 2% +5.8% 3.07e+08 ± 2% perf-stat.ps.branch-misses
2.815e+08 ± 2% +6.1% 2.986e+08 ± 2% perf-stat.ps.cache-misses
9.661e+08 ± 2% +5.8% 1.022e+09 ± 2% perf-stat.ps.cache-references
5.174e+11 -0.9% 5.128e+11 perf-stat.ps.cpu-cycles
33544567 ± 2% +7.2% 35957170 ± 2% perf-stat.ps.dTLB-store-misses
9.376e+09 ± 2% +5.9% 9.933e+09 ± 2% perf-stat.ps.dTLB-stores
1299 ± 8% +39.6% 1813 ± 5% perf-stat.ps.major-faults
1529961 ± 2% +6.5% 1629550 ± 2% perf-stat.ps.minor-faults
59908353 ± 2% +5.4% 63173333 ± 2% perf-stat.ps.node-load-misses
14730891 ± 2% +6.2% 15646714 ± 2% perf-stat.ps.node-loads
1531260 ± 2% +6.5% 1631363 ± 2% perf-stat.ps.page-faults
***************************************************************************************************
lkp-cpl-4sp2: 224 threads 4 sockets Intel(R) Xeon(R) Platinum 8380H CPU @ 2.90GHz (Cooper Lake) with 192G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/rootfs/runtime/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/300s/lkp-cpl-4sp2/small-allocs/vm-scalability
commit:
73e791d738 ("mm: remove clear_page_idle()")
6852c46c78 ("mm/mmap: move vma operations to mm_struct out of the critical section of file mapping lock")
73e791d73877e904 6852c46c783d20a4c0153d14d29
---------------- ---------------------------
%stddev %change %stddev
\ | \
42167 +23.5% 52085 vm-scalability.median
55.63 +4.8 60.48 ± 5% vm-scalability.stddev%
9445724 +23.5% 11667563 vm-scalability.throughput
380.73 +5.2% 400.43 vm-scalability.time.elapsed_time
380.73 +5.2% 400.43 vm-scalability.time.elapsed_time.max
106067 +51.4% 160541 vm-scalability.time.involuntary_context_switches
6.307e+08 +23.6% 7.795e+08 vm-scalability.time.minor_page_faults
78306 +4.6% 81916 vm-scalability.time.system_time
777.26 +22.8% 954.12 vm-scalability.time.user_time
2562168 ± 2% -19.3% 2068789 ± 3% vm-scalability.time.voluntary_context_switches
2.838e+09 +23.6% 3.507e+09 vm-scalability.workload
5.815e+09 +10.4% 6.421e+09 ± 3% cpuidle..time
6.66 +0.3 6.99 ± 3% turbostat.C1%
17067 ± 2% -16.7% 14220 ± 2% vmstat.system.cs
0.05 -0.0 0.04 ± 3% mpstat.cpu.all.soft%
0.96 +0.1 1.11 mpstat.cpu.all.usr%
2261 ± 5% +22.8% 2776 ± 6% perf-c2c.HITM.local
1329 ± 8% +20.8% 1605 ± 4% perf-c2c.HITM.remote
18499960 +19.0% 22006663 meminfo.Memused
2990186 +26.8% 3790943 meminfo.PageTables
10350121 +26.1% 13050399 meminfo.SUnreclaim
10515083 +25.7% 13215478 meminfo.Slab
18692470 +19.1% 22260889 meminfo.max_used_kB
407798 ± 17% +67.2% 682010 ± 20% sched_debug.cfs_rq:/.load.max
869895 ± 16% -34.9% 566399 ± 18% sched_debug.cfs_rq:/.spread0.avg
-743993 +53.2% -1139615 sched_debug.cfs_rq:/.spread0.min
17559 ± 2% -12.8% 15312 ± 2% sched_debug.cpu.nr_switches.avg
13015 ± 2% -10.6% 11629 ± 2% sched_debug.cpu.nr_switches.min
4612 -18.2% 3772 ± 10% sched_debug.cpu.nr_switches.stddev
223.38 ± 8% -15.7% 188.21 ± 12% sched_debug.cpu.nr_uninterruptible.max
1169323 +13.4% 1326004 ± 4% numa-numastat.node0.local_node
1239338 +13.9% 1411769 ± 2% numa-numastat.node0.numa_hit
1115777 ± 2% +16.7% 1302136 ± 3% numa-numastat.node1.local_node
1181029 ± 2% +19.6% 1412041 ± 3% numa-numastat.node1.numa_hit
65251 ± 31% +68.4% 109907 ± 7% numa-numastat.node1.other_node
1111909 ± 5% +15.5% 1284183 ± 2% numa-numastat.node2.local_node
1218212 ± 5% +11.5% 1358029 numa-numastat.node2.numa_hit
1285007 ± 2% +25.7% 1615828 ± 2% numa-numastat.node3.local_node
1391326 ± 2% +21.8% 1694543 ± 2% numa-numastat.node3.numa_hit
137025 ± 2% +6.1% 145442 proc-vmstat.nr_active_anon
95233 -1.5% 93787 proc-vmstat.nr_anon_pages
4427600 -2.0% 4339914 proc-vmstat.nr_dirty_background_threshold
8866026 -2.0% 8690440 proc-vmstat.nr_dirty_threshold
44521360 -2.0% 43643224 proc-vmstat.nr_free_pages
747466 +26.8% 948001 proc-vmstat.nr_page_table_pages
168941 ± 2% +4.9% 177164 proc-vmstat.nr_shmem
2587999 +26.1% 3264111 proc-vmstat.nr_slab_unreclaimable
137025 ± 2% +6.1% 145442 proc-vmstat.nr_zone_active_anon
5032565 +16.8% 5879446 proc-vmstat.numa_hit
4684671 +18.1% 5530835 proc-vmstat.numa_local
7168214 +18.3% 8479220 proc-vmstat.pgalloc_normal
6.322e+08 +23.5% 7.81e+08 proc-vmstat.pgfault
6881231 +18.9% 8181469 proc-vmstat.pgfree
3627776 +4.5% 3791232 proc-vmstat.unevictable_pgs_scanned
3935473 ± 5% +34.6% 5297695 ± 16% numa-meminfo.node0.MemUsed
748076 +26.7% 947801 numa-meminfo.node0.PageTables
2597779 +26.1% 3275316 numa-meminfo.node0.SUnreclaim
2625903 +26.1% 3310684 numa-meminfo.node0.Slab
747754 +26.9% 948714 numa-meminfo.node1.PageTables
2580884 +26.5% 3264576 numa-meminfo.node1.SUnreclaim
2652930 +25.8% 3336480 numa-meminfo.node1.Slab
141326 ± 10% -21.1% 111532 ± 19% numa-meminfo.node2.Inactive(anon)
3886953 ± 2% +20.1% 4666698 numa-meminfo.node2.MemUsed
751265 +26.3% 949188 numa-meminfo.node2.PageTables
2591514 +25.6% 3255894 numa-meminfo.node2.SUnreclaim
2622178 +24.9% 3274703 numa-meminfo.node2.Slab
327366 ± 10% +39.0% 454988 ± 20% numa-meminfo.node3.Active
327339 ± 10% +39.0% 454931 ± 20% numa-meminfo.node3.Active(anon)
17238 ± 7% +16.9% 20158 ± 11% numa-meminfo.node3.Mapped
4217474 +22.9% 5184418 ± 3% numa-meminfo.node3.MemUsed
745042 +27.0% 946381 numa-meminfo.node3.PageTables
2586773 +26.0% 3259792 numa-meminfo.node3.SUnreclaim
379407 ± 18% +30.5% 495065 ± 18% numa-meminfo.node3.Shmem
2620922 +25.9% 3298794 numa-meminfo.node3.Slab
0.88 ± 71% -80.8% 0.17 ± 52% perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.13 ± 53% +311.7% 0.53 ± 55% perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
0.20 ± 13% +32.6% 0.26 ± 15% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
0.10 ± 9% +110.7% 0.21 ± 8% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.06 ± 28% +138.4% 0.13 ± 29% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.01 +14036.1% 1.70 ±221% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
9.50 ± 34% +476.1% 54.73 ± 84% perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
2.31 ± 39% -78.8% 0.49 ±216% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
138.30 ± 14% +48.7% 205.63 ± 23% perf-sched.sch_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
13.72 ± 12% +100.6% 27.53 ± 35% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
9.63 ± 18% +114.0% 20.60 ± 10% perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.26 ± 6% +17.9% 2.66 ± 3% perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
213.13 ± 2% -11.2% 189.35 ± 2% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
1074 ± 4% +65.8% 1781 ± 4% perf-sched.wait_and_delay.count.__cond_resched.__kmem_cache_alloc_node.kmalloc_trace.perf_event_mmap_event.perf_event_mmap
39318 ± 7% -17.4% 32472 ± 5% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
171.51 ± 29% +87.8% 322.16 ± 43% perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
2.06 ± 6% +16.5% 2.40 ± 3% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
213.03 ± 2% -11.2% 189.14 ± 2% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
10.68 ± 19% +53.1% 16.34 ± 21% perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
0.02 ± 29% -42.3% 0.01 ± 6% perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
27.23 ± 10% +56.4% 42.58 ± 17% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
72.93 ± 46% +126.7% 165.37 ± 43% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.mmap_region
186850 +26.8% 236835 numa-vmstat.node0.nr_page_table_pages
648927 +26.1% 818461 numa-vmstat.node0.nr_slab_unreclaimable
1239327 +13.9% 1411687 ± 2% numa-vmstat.node0.numa_hit
1169311 +13.4% 1325922 ± 4% numa-vmstat.node0.numa_local
186769 +26.9% 237070 numa-vmstat.node1.nr_page_table_pages
644727 +26.5% 815732 numa-vmstat.node1.nr_slab_unreclaimable
1180822 ± 2% +19.6% 1411883 ± 3% numa-vmstat.node1.numa_hit
1115570 ± 2% +16.7% 1301978 ± 3% numa-vmstat.node1.numa_local
65251 ± 31% +68.4% 109907 ± 7% numa-vmstat.node1.numa_other
35329 ± 10% -21.1% 27874 ± 19% numa-vmstat.node2.nr_inactive_anon
187645 +26.4% 237184 numa-vmstat.node2.nr_page_table_pages
647359 +25.7% 813590 numa-vmstat.node2.nr_slab_unreclaimable
35330 ± 10% -21.1% 27874 ± 19% numa-vmstat.node2.nr_zone_inactive_anon
1218056 ± 5% +11.5% 1357863 numa-vmstat.node2.numa_hit
1111753 ± 5% +15.5% 1284018 ± 2% numa-vmstat.node2.numa_local
81831 ± 10% +38.9% 113699 ± 20% numa-vmstat.node3.nr_active_anon
4376 ± 8% +17.1% 5125 ± 10% numa-vmstat.node3.nr_mapped
186088 +27.1% 236505 numa-vmstat.node3.nr_page_table_pages
94854 ± 18% +30.5% 123744 ± 18% numa-vmstat.node3.nr_shmem
646172 +26.1% 814570 numa-vmstat.node3.nr_slab_unreclaimable
81831 ± 10% +38.9% 113699 ± 20% numa-vmstat.node3.nr_zone_active_anon
1391229 ± 2% +21.8% 1694455 ± 2% numa-vmstat.node3.numa_hit
1284909 ± 2% +25.7% 1615740 ± 2% numa-vmstat.node3.numa_local
2.182e+10 +5.7% 2.307e+10 perf-stat.i.branch-instructions
67.44 -1.4 66.01 perf-stat.i.cache-miss-rate%
1.084e+08 +5.0% 1.138e+08 ± 3% perf-stat.i.cache-references
17046 ± 2% -17.1% 14132 ± 2% perf-stat.i.context-switches
8.89 -2.0% 8.71 perf-stat.i.cpi
488.32 +5.3% 514.19 perf-stat.i.cpu-migrations
0.03 +0.0 0.04 ± 2% perf-stat.i.dTLB-load-miss-rate%
9099499 +14.7% 10433460 ± 2% perf-stat.i.dTLB-load-misses
2.527e+10 +4.3% 2.636e+10 perf-stat.i.dTLB-loads
1732945 +15.5% 2001701 perf-stat.i.dTLB-store-misses
2.005e+09 +12.5% 2.256e+09 perf-stat.i.dTLB-stores
93.66 -3.6 90.02 perf-stat.i.iTLB-load-miss-rate%
431003 ± 5% +42.8% 615338 perf-stat.i.iTLB-loads
9.349e+10 +4.1% 9.729e+10 perf-stat.i.instructions
17597 +4.2% 18333 ± 2% perf-stat.i.instructions-per-iTLB-miss
0.12 +4.5% 0.12 perf-stat.i.ipc
218.73 +5.1% 229.81 perf-stat.i.metric.M/sec
1645267 +16.2% 1911350 perf-stat.i.minor-faults
80.15 +1.7 81.87 perf-stat.i.node-load-miss-rate%
22474764 ± 2% +3.5% 23258402 perf-stat.i.node-load-misses
6215674 ± 3% -6.3% 5824416 perf-stat.i.node-loads
5207478 +17.4% 6113199 perf-stat.i.node-store-misses
831077 +17.9% 979621 perf-stat.i.node-stores
1645267 +16.2% 1911350 perf-stat.i.page-faults
68.37 -2.1 66.31 perf-stat.overall.cache-miss-rate%
8.49 -4.5% 8.11 perf-stat.overall.cpi
0.04 +0.0 0.04 ± 2% perf-stat.overall.dTLB-load-miss-rate%
0.09 +0.0 0.09 perf-stat.overall.dTLB-store-miss-rate%
93.41 -2.5 90.92 perf-stat.overall.iTLB-load-miss-rate%
15175 +3.4% 15687 perf-stat.overall.instructions-per-iTLB-miss
0.12 +4.7% 0.12 perf-stat.overall.ipc
78.20 +1.6 79.79 perf-stat.overall.node-load-miss-rate%
12543 -11.2% 11140 perf-stat.overall.path-length
2.183e+10 +6.1% 2.316e+10 perf-stat.ps.branch-instructions
1.102e+08 +5.6% 1.164e+08 ± 2% perf-stat.ps.cache-references
17070 ± 2% -16.9% 14191 ± 2% perf-stat.ps.context-switches
486.54 +5.7% 514.33 perf-stat.ps.cpu-migrations
9294872 +15.8% 10767066 ± 2% perf-stat.ps.dTLB-load-misses
2.525e+10 +4.5% 2.639e+10 perf-stat.ps.dTLB-loads
1747511 +16.6% 2038370 perf-stat.ps.dTLB-store-misses
2.024e+09 +13.5% 2.298e+09 perf-stat.ps.dTLB-stores
434045 ± 5% +42.8% 619721 perf-stat.ps.iTLB-loads
9.339e+10 +4.3% 9.739e+10 perf-stat.ps.instructions
1658111 +17.4% 1946167 perf-stat.ps.minor-faults
22728635 ± 2% +4.0% 23636858 perf-stat.ps.node-load-misses
6338224 ± 3% -5.5% 5988664 perf-stat.ps.node-loads
5139882 +17.1% 6017185 perf-stat.ps.node-store-misses
840111 +18.9% 998845 perf-stat.ps.node-stores
1658111 +17.4% 1946167 perf-stat.ps.page-faults
3.56e+13 +9.8% 3.907e+13 perf-stat.total.instructions
94.07 -1.6 92.46 perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
93.61 -1.6 92.01 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
94.14 -1.6 92.57 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
94.18 -1.6 92.62 perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
94.87 -1.3 93.61 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
94.94 -1.2 93.69 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
94.95 -1.2 93.70 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
94.99 -1.2 93.76 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
96.35 -0.9 95.49 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
96.35 -0.9 95.49 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
96.39 -0.9 95.53 perf-profile.calltrace.cycles-pp.__mmap
0.77 +0.2 0.98 perf-profile.calltrace.cycles-pp.do_rw_once
0.55 +0.3 0.81 ± 2% perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.do_access
0.56 +0.3 0.82 ± 2% perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.do_access
0.80 +0.3 1.12 perf-profile.calltrace.cycles-pp.asm_exc_page_fault.do_access
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.36 +0.4 1.72 ± 5% perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
1.33 +0.4 1.70 ± 5% perf-profile.calltrace.cycles-pp.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
1.16 +0.4 1.53 ± 6% perf-profile.calltrace.cycles-pp.vma_migratable.task_numa_work.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare
1.07 +0.4 1.48 perf-profile.calltrace.cycles-pp.do_access
94.07 -1.6 92.46 perf-profile.children.cycles-pp.rwsem_optimistic_spin
93.63 -1.6 92.03 perf-profile.children.cycles-pp.osq_lock
94.14 -1.6 92.57 perf-profile.children.cycles-pp.rwsem_down_write_slowpath
94.18 -1.6 92.62 perf-profile.children.cycles-pp.down_write
94.88 -1.3 93.61 perf-profile.children.cycles-pp.mmap_region
94.94 -1.2 93.69 perf-profile.children.cycles-pp.do_mmap
94.95 -1.2 93.70 perf-profile.children.cycles-pp.vm_mmap_pgoff
94.99 -1.2 93.76 perf-profile.children.cycles-pp.ksys_mmap_pgoff
96.38 -0.9 95.52 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
96.38 -0.9 95.52 perf-profile.children.cycles-pp.do_syscall_64
96.39 -0.9 95.54 perf-profile.children.cycles-pp.__mmap
0.38 -0.0 0.35 perf-profile.children.cycles-pp.rwsem_spin_on_owner
0.05 +0.0 0.06 perf-profile.children.cycles-pp.mas_walk
0.05 +0.0 0.06 ± 6% perf-profile.children.cycles-pp.vm_unmapped_area
0.05 ± 8% +0.0 0.07 ± 7% perf-profile.children.cycles-pp.__perf_sw_event
0.06 ± 8% +0.0 0.07 perf-profile.children.cycles-pp.lock_mm_and_find_vma
0.06 ± 7% +0.0 0.08 perf-profile.children.cycles-pp.sync_regs
0.08 +0.0 0.10 ± 3% perf-profile.children.cycles-pp.mtree_range_walk
0.10 +0.0 0.13 ± 3% perf-profile.children.cycles-pp.native_irq_return_iret
0.06 ± 7% +0.0 0.09 ± 7% perf-profile.children.cycles-pp.__unfreeze_partials
0.17 ± 2% +0.0 0.21 ± 8% perf-profile.children.cycles-pp.update_cfs_group
0.12 ± 4% +0.0 0.16 ± 3% perf-profile.children.cycles-pp.lock_vma_under_rcu
0.06 +0.0 0.10 ± 10% perf-profile.children.cycles-pp.kmem_cache_free_bulk
0.28 ± 2% +0.0 0.33 ± 8% perf-profile.children.cycles-pp.task_tick_fair
0.07 ± 7% +0.0 0.11 ± 8% perf-profile.children.cycles-pp.___slab_alloc
0.13 ± 3% +0.0 0.18 ± 6% perf-profile.children.cycles-pp.mas_store_prealloc
0.00 +0.1 0.05 perf-profile.children.cycles-pp.try_to_wake_up
0.00 +0.1 0.05 perf-profile.children.cycles-pp.wake_up_q
0.00 +0.1 0.05 perf-profile.children.cycles-pp.perf_event_mmap
0.00 +0.1 0.05 perf-profile.children.cycles-pp.perf_event_mmap_event
0.07 ± 7% +0.1 0.12 ± 7% perf-profile.children.cycles-pp.mas_destroy
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.___perf_sw_event
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.mt_find
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.mas_empty_area_rev
0.00 +0.1 0.05 ± 8% perf-profile.children.cycles-pp.find_vma
0.10 +0.1 0.16 ± 6% perf-profile.children.cycles-pp.mas_preallocate
0.10 +0.1 0.16 ± 6% perf-profile.children.cycles-pp.mas_alloc_nodes
0.09 +0.1 0.14 ± 6% perf-profile.children.cycles-pp.__kmem_cache_alloc_bulk
0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.irqentry_exit_to_user_mode
0.09 +0.1 0.15 ± 7% perf-profile.children.cycles-pp.kmem_cache_alloc_bulk
0.10 ± 4% +0.1 0.16 ± 4% perf-profile.children.cycles-pp.rwsem_wake
0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.00 +0.1 0.07 ± 11% perf-profile.children.cycles-pp.get_partial_node
0.15 ± 5% +0.1 0.22 ± 3% perf-profile.children.cycles-pp.up_write
0.17 ± 2% +0.1 0.24 perf-profile.children.cycles-pp.vma_interval_tree_insert
0.14 ± 6% +0.1 0.25 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.08 +0.1 0.20 ± 6% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
0.28 ± 2% +0.2 0.45 ± 3% perf-profile.children.cycles-pp.__handle_mm_fault
0.34 ± 2% +0.2 0.51 ± 2% perf-profile.children.cycles-pp.handle_mm_fault
2.24 ± 3% +0.2 2.47 ± 5% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
0.90 +0.2 1.15 perf-profile.children.cycles-pp.do_rw_once
0.56 +0.3 0.81 perf-profile.children.cycles-pp.do_user_addr_fault
0.57 +0.3 0.82 perf-profile.children.cycles-pp.exc_page_fault
0.83 +0.3 1.16 perf-profile.children.cycles-pp.asm_exc_page_fault
1.36 +0.4 1.73 ± 5% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
1.39 +0.4 1.78 ± 5% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
1.38 +0.4 1.77 ± 5% perf-profile.children.cycles-pp.exit_to_user_mode_loop
1.38 +0.4 1.77 ± 5% perf-profile.children.cycles-pp.task_work_run
1.36 +0.4 1.75 ± 5% perf-profile.children.cycles-pp.task_numa_work
1.18 +0.4 1.58 ± 5% perf-profile.children.cycles-pp.vma_migratable
1.46 +0.5 1.96 perf-profile.children.cycles-pp.do_access
92.91 -1.6 91.28 perf-profile.self.cycles-pp.osq_lock
0.38 -0.0 0.35 perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.06 ± 7% +0.0 0.08 perf-profile.self.cycles-pp.sync_regs
0.10 +0.0 0.12 ± 4% perf-profile.self.cycles-pp.native_irq_return_iret
0.07 ± 6% +0.0 0.10 ± 3% perf-profile.self.cycles-pp.mtree_range_walk
0.06 +0.0 0.09 ± 5% perf-profile.self.cycles-pp.lock_vma_under_rcu
0.08 ± 5% +0.0 0.11 ± 6% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.17 ± 2% +0.0 0.21 ± 8% perf-profile.self.cycles-pp.update_cfs_group
0.02 ±141% +0.0 0.06 ± 7% perf-profile.self.cycles-pp.rwsem_optimistic_spin
0.00 +0.1 0.05 perf-profile.self.cycles-pp.___perf_sw_event
0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.down_write
0.17 ± 2% +0.1 0.24 perf-profile.self.cycles-pp.vma_interval_tree_insert
0.08 +0.1 0.20 ± 6% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.55 +0.1 0.70 perf-profile.self.cycles-pp.do_access
0.24 ± 3% +0.2 0.39 ± 3% perf-profile.self.cycles-pp.__handle_mm_fault
0.68 +0.2 0.86 perf-profile.self.cycles-pp.do_rw_once
1.16 +0.4 1.56 ± 6% perf-profile.self.cycles-pp.vma_migratable
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2023-09-21 7:44 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-09-21 7:43 [linus:master] [mm/mmap] 6852c46c78: stress-ng.spawn.ops_per_sec 2.9% improvement kernel test robot
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.