Linux-mm Archive on lore.kernel.org
* [linux-next:master] [mm/slub]  0fc52deec1: will-it-scale.per_process_ops 6.9% improvement
From: kernel test robot @ 2026-06-10  8:19 UTC
  To: Hao Li; +Cc: oe-lkp, lkp, Vlastimil Babka, Harry Yoo, linux-mm, oliver.sang



Hello,

kernel test robot noticed a 6.9% improvement of will-it-scale.per_process_ops on:


commit: 0fc52deec1068ea3cc8eaa6e045c96fbf73f20e2 ("mm/slub: detach and reattach partial slabs in batch")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master


testcase: will-it-scale
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 192 threads 2 sockets Intel(R) Xeon(R) 6740E  CPU @ 2.4GHz (Sierra Forest) with 256G memory
parameters:

	nr_task: 100%
	mode: process
	test: mmap1
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260610/202606101634.f786b0f9-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
  gcc-14/performance/x86_64-rhel-9.4/process/100%/debian-13-x86_64-20250902.cgz/lkp-srf-2sp2/mmap1/will-it-scale

commit: 
  7e23073874 ("mm/slub: introduce helpers for node partial slab state")
  0fc52deec1 ("mm/slub: detach and reattach partial slabs in batch")

7e230738746ce9a7 0fc52deec1068ea3cc8eaa6e045 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
  35949149            +6.9%   38424030        will-it-scale.192.processes
    187234            +6.9%     200124        will-it-scale.per_process_ops
  35949149            +6.9%   38424030        will-it-scale.workload
   1130464 ± 14%     -15.9%     951025 ±  6%  numa-meminfo.node1.Mapped
    765203            +6.8%     817048        meminfo.SUnreclaim
    916508            +5.7%     968444        meminfo.Slab
     21415 ±  2%     +35.9%      29105        vmstat.system.cs
    254768            +1.3%     258105        vmstat.system.in
      0.13 ±  3%      +0.0        0.14 ±  2%  mpstat.cpu.all.irq%
     18.10            +2.1       20.23        mpstat.cpu.all.soft%
      1.73            +0.2        1.91        mpstat.cpu.all.usr%
   4375386           -50.0%    2186988 ±  6%  numa-numastat.node0.local_node
   4460280           -49.9%    2236714 ±  5%  numa-numastat.node0.numa_hit
   5597529 ±  2%     -42.1%    3243169 ±  3%  numa-numastat.node1.local_node
   5712900 ±  2%     -40.6%    3393053 ±  3%  numa-numastat.node1.numa_hit
     21530           +35.8%      29239        perf-stat.i.context-switches
    263.83            +2.2%     269.76        perf-stat.i.cpu-migrations
     21371 ±  2%     +36.1%      29093        perf-stat.ps.context-switches
    261.10            +2.2%     266.90        perf-stat.ps.cpu-migrations
   4460290           -49.9%    2236236 ±  5%  numa-vmstat.node0.numa_hit
   4375396           -50.0%    2186510 ±  6%  numa-vmstat.node0.numa_local
    282371 ± 14%     -16.0%     237208 ±  6%  numa-vmstat.node1.nr_mapped
   5712955 ±  2%     -40.6%    3392995 ±  3%  numa-vmstat.node1.numa_hit
   5597585 ±  2%     -42.1%    3243112 ±  3%  numa-vmstat.node1.numa_local
    191228            +6.9%     204459        proc-vmstat.nr_slab_unreclaimable
  10177434           -44.7%    5632836 ±  3%  proc-vmstat.numa_hit
   9977167           -45.5%    5433226 ±  3%  proc-vmstat.numa_local
  33527355           -54.2%   15349765 ±  5%  proc-vmstat.pgalloc_normal
  32033712           -56.8%   13845940 ±  6%  proc-vmstat.pgfree
      0.21           +36.6%       0.29        perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      0.21           +36.6%       0.29        perf-sched.total_sch_delay.average.ms
     26.08 ±  3%     -25.9%      19.32 ±  2%  perf-sched.total_wait_and_delay.average.ms
    110198 ±  2%     +34.6%     148367        perf-sched.total_wait_and_delay.count.ms
     25.87 ±  3%     -26.4%      19.03 ±  2%  perf-sched.total_wait_time.average.ms
     26.08 ±  3%     -25.9%      19.32 ±  2%  perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
    110198 ±  2%     +34.6%     148367        perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
     25.87 ±  3%     -26.4%      19.03 ±  2%  perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
      0.21 ±  6%     +18.6%       0.25 ±  5%  sched_debug.cfs_rq:/.h_nr_queued.stddev
      0.21 ±  5%     +14.6%       0.24 ±  2%  sched_debug.cfs_rq:/.h_nr_runnable.stddev
   3324769 ± 12%     +28.7%    4277348 ± 10%  sched_debug.cfs_rq:/.left_deadline.stddev
   3324734 ± 12%     +28.7%    4277309 ± 10%  sched_debug.cfs_rq:/.left_vruntime.stddev
     27152 ± 13%     +57.1%      42666 ± 16%  sched_debug.cfs_rq:/.load.avg
    127543 ±  9%     +38.2%     176277 ±  9%  sched_debug.cfs_rq:/.load.stddev
      6.75 ± 11%     +91.8%      12.94 ± 14%  sched_debug.cfs_rq:/.load_avg.min
      0.15 ±  7%     +29.3%       0.20 ±  7%  sched_debug.cfs_rq:/.nr_queued.stddev
   3324735 ± 12%     +28.7%    4277309 ± 10%  sched_debug.cfs_rq:/.right_vruntime.stddev
      3746 ± 36%    +147.1%       9256 ± 41%  sched_debug.cfs_rq:/.sum_weight.avg
     39887 ± 41%     +90.1%      75816 ± 21%  sched_debug.cfs_rq:/.sum_weight.stddev
      1195 ±  9%     +24.2%       1483 ±  5%  sched_debug.cpu.curr->pid.stddev
      0.00 ±  3%     +13.8%       0.00 ±  5%  sched_debug.cpu.next_balance.stddev
      0.21 ±  7%     +18.7%       0.25 ±  5%  sched_debug.cpu.nr_running.stddev
     18361 ±  2%     +32.9%      24405        sched_debug.cpu.nr_switches.avg
     15708 ±  2%     +38.1%      21687 ±  2%  sched_debug.cpu.nr_switches.min
     34.64           -21.3       13.38        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof
     34.48           -21.2       13.29        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__refill_objects_node.refill_objects.__pcs_replace_empty_main
     12.24            -6.8        5.47        perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.__refill_objects_node.refill_objects
     12.24            -6.8        5.48        perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.__refill_objects_node.refill_objects.__pcs_replace_empty_main
     12.24            -6.8        5.48        perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof
     12.20            -6.7        5.46        perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.__refill_objects_node
     26.52 ±  3%      -5.4       21.08 ±  7%  perf-profile.calltrace.cycles-pp.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap
     26.02 ±  3%      -5.2       20.87 ±  7%  perf-profile.calltrace.cycles-pp.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp
     31.47 ±  3%      -4.9       26.54 ±  7%  perf-profile.calltrace.cycles-pp.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap
     31.67 ±  3%      -4.9       26.75 ±  7%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
     37.72 ±  2%      -3.5       34.26 ±  6%  perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
     23.32 ±  3%      -2.6       20.73 ±  8%  perf-profile.calltrace.cycles-pp.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate.__mmap_new_vma
     22.88 ±  3%      -2.4       20.52 ±  8%  perf-profile.calltrace.cycles-pp.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate
      2.02            -1.5        0.54 ±  4%  perf-profile.calltrace.cycles-pp.barn_put_full_sheaf.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu
      2.51            -1.1        1.40 ± 14%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.barn_put_full_sheaf.rcu_do_batch.rcu_core
      2.54            -1.0        1.50 ±  3%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.barn_put_full_sheaf.rcu_do_batch.rcu_core.handle_softirqs
      0.77            -0.1        0.67        perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.__mmap_complete.__mmap_region
      1.87            -0.1        1.81        perf-profile.calltrace.cycles-pp.__mmap_complete.__mmap_region.do_mmap.vm_mmap_pgoff.do_syscall_64
      1.22            -0.0        1.18        perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.__mmap_complete.__mmap_region.do_mmap
      1.62            +0.0        1.65        perf-profile.calltrace.cycles-pp.perf_event_mmap.__mmap_complete.__mmap_region.do_mmap.vm_mmap_pgoff
      0.62            +0.0        0.66        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__munmap
      0.62            +0.0        0.67        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__mmap
      0.53            +0.1        0.59        perf-profile.calltrace.cycles-pp.mas_store_gfp.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      0.65            +0.1        0.72        perf-profile.calltrace.cycles-pp.mas_empty_area_rev.unmapped_area_topdown.vm_unmapped_area.arch_get_unmapped_area_topdown.thp_get_unmapped_area_vmflags
      0.59            +0.1        0.68        perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_alloc.__mmap_new_vma.__mmap_region
      1.41            +0.1        1.51        perf-profile.calltrace.cycles-pp.unmapped_area_topdown.vm_unmapped_area.arch_get_unmapped_area_topdown.thp_get_unmapped_area_vmflags.__get_unmapped_area
      0.74            +0.1        0.84        perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.vm_area_alloc.__mmap_new_vma.__mmap_region.do_mmap
      1.43            +0.1        1.54        perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.thp_get_unmapped_area_vmflags.__get_unmapped_area.do_mmap
      1.75            +0.1        1.87        perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.thp_get_unmapped_area_vmflags.__get_unmapped_area.do_mmap.vm_mmap_pgoff
      0.54            +0.1        0.66        perf-profile.calltrace.cycles-pp.kmem_cache_free.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      1.11            +0.1        1.25        perf-profile.calltrace.cycles-pp.vm_area_alloc.__mmap_new_vma.__mmap_region.do_mmap.vm_mmap_pgoff
      1.92            +0.2        2.07        perf-profile.calltrace.cycles-pp.thp_get_unmapped_area_vmflags.__get_unmapped_area.do_mmap.vm_mmap_pgoff.do_syscall_64
      1.25 ±  4%      +0.2        1.42 ±  7%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.barn_get_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate
      1.98            +0.2        2.15        perf-profile.calltrace.cycles-pp.__get_unmapped_area.do_mmap.vm_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
      2.65            +0.2        2.84 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.barn_get_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof
      2.26            +0.2        2.44        perf-profile.calltrace.cycles-pp.free_p4d_range.free_pgd_range.free_pgtables.unmap_region.vms_complete_munmap_vmas
      2.03            +0.2        2.22        perf-profile.calltrace.cycles-pp.free_pud_range.free_p4d_range.free_pgd_range.free_pgtables.unmap_region
      2.39            +0.2        2.58        perf-profile.calltrace.cycles-pp.free_pgd_range.free_pgtables.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap
      2.38            +0.2        2.62        perf-profile.calltrace.cycles-pp.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      2.62            +0.2        2.86        perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      1.45 ±  3%      +0.2        1.69 ±  7%  perf-profile.calltrace.cycles-pp.barn_get_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate.__mmap_new_vma
      2.85            +0.2        3.10        perf-profile.calltrace.cycles-pp.__zap_vma_range.unmap_vmas.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap
      2.63            +0.2        2.88        perf-profile.calltrace.cycles-pp.zap_pmd_range.__zap_vma_range.unmap_vmas.unmap_region.vms_complete_munmap_vmas
      3.01            +0.3        3.26        perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap
      2.29 ±  3%      +0.4        2.68 ±  7%  perf-profile.calltrace.cycles-pp.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_prealloc
      1.58            +0.4        1.98        perf-profile.calltrace.cycles-pp.__pi_memcpy.mas_wr_node_store.mas_store_prealloc.__mmap_new_vma.__mmap_region
      2.47 ±  2%      +0.4        2.88 ±  6%  perf-profile.calltrace.cycles-pp.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_prealloc.__mmap_new_vma
      1.53            +0.4        1.94        perf-profile.calltrace.cycles-pp.__pi_memcpy.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap
      0.55 ±  2%      +0.4        0.98 ±  2%  perf-profile.calltrace.cycles-pp.barn_put_full_sheaf.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd
      2.34 ±  3%      +0.5        2.81 ±  8%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate
      2.71 ±  2%      +0.5        3.18 ±  5%  perf-profile.calltrace.cycles-pp.kvfree_call_rcu.mas_wr_node_store.mas_store_prealloc.__mmap_new_vma.__mmap_region
      0.00            +0.5        0.52 ±  2%  perf-profile.calltrace.cycles-pp.mas_update_gap.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap
      0.00            +0.5        0.52 ±  2%  perf-profile.calltrace.cycles-pp.mas_update_gap.mas_wr_node_store.mas_store_prealloc.__mmap_new_vma.__mmap_region
      6.19            +0.5        6.72        perf-profile.calltrace.cycles-pp.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      0.00            +0.5        0.53        perf-profile.calltrace.cycles-pp.__vma_start_write.vms_gather_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      0.17 ±141%      +0.6        0.73 ±  9%  perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp
      3.62            +0.6        4.19 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store
      3.59            +0.6        4.16 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu
      7.68            +0.6        8.31        perf-profile.calltrace.cycles-pp.vms_complete_munmap_vmas.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      4.96            +0.6        5.61 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof
      2.19            +0.7        2.91 ±  7%  perf-profile.calltrace.cycles-pp.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp
      0.00            +0.7        0.73 ±  7%  perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate
      2.37            +0.7        3.11 ±  7%  perf-profile.calltrace.cycles-pp.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap
      2.86 ±  3%      +0.8        3.64 ±  8%  perf-profile.calltrace.cycles-pp.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_preallocate.__mmap_new_vma
      2.59            +0.8        3.38 ±  6%  perf-profile.calltrace.cycles-pp.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap
     10.48            +1.0       11.49 ±  2%  perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__slab_free.__kmem_cache_free_bulk.rcu_free_sheaf
     10.61            +1.0       11.64 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__slab_free.__kmem_cache_free_bulk.rcu_free_sheaf.rcu_do_batch
      5.37            +1.0        6.40 ±  2%  perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_prealloc.__mmap_new_vma.__mmap_region.do_mmap
     10.96            +1.0       12.00 ±  2%  perf-profile.calltrace.cycles-pp.__slab_free.__kmem_cache_free_bulk.rcu_free_sheaf.rcu_do_batch.rcu_core
     11.04            +1.0       12.08 ±  2%  perf-profile.calltrace.cycles-pp.__kmem_cache_free_bulk.rcu_free_sheaf.rcu_do_batch.rcu_core.handle_softirqs
      5.92            +1.1        7.02 ±  2%  perf-profile.calltrace.cycles-pp.mas_store_prealloc.__mmap_new_vma.__mmap_region.do_mmap.vm_mmap_pgoff
      0.00            +1.2        1.23 ±  3%  perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_get_empty_sheaf
      0.00            +1.2        1.24 ±  3%  perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store
      0.00            +1.2        1.24 ±  3%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_get_empty_sheaf.__kfree_rcu_sheaf.kvfree_call_rcu
      0.00            +1.2        1.24 ±  3%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_get_empty_sheaf.__kfree_rcu_sheaf
      0.17 ±141%      +1.3        1.46 ±  2%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_replace_empty_sheaf.__pcs_replace_empty_main
      0.17 ±141%      +1.3        1.46 ±  3%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_replace_empty_sheaf.__pcs_replace_empty_main.kmem_cache_alloc_noprof
      1.53 ±  3%      +1.3        2.82        perf-profile.calltrace.cycles-pp.rcu_free_sheaf.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd
      0.08 ±223%      +1.4        1.45 ±  3%  perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.barn_replace_empty_sheaf
      5.27            +1.4        6.66 ±  3%  perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      2.16 ±  3%      +1.8        3.95        perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn
      2.16 ±  3%      +1.8        3.95        perf-profile.calltrace.cycles-pp.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork
      2.16 ±  3%      +1.8        3.95        perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread
      2.16 ±  3%      +1.8        3.95        perf-profile.calltrace.cycles-pp.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      2.18 ±  3%      +1.8        3.96        perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm
      2.18 ±  3%      +1.8        3.96        perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm
      2.18 ±  3%      +1.8        3.96        perf-profile.calltrace.cycles-pp.ret_from_fork_asm
      2.17 ±  3%      +1.8        3.96        perf-profile.calltrace.cycles-pp.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
     12.27 ±  2%      +2.2       14.47        perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
     12.26 ±  2%      +2.2       14.46        perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
     12.28 ±  2%      +2.2       14.48        perf-profile.calltrace.cycles-pp.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore
      0.00            +6.3        6.34 ±  2%  perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.get_partial_node_bulk
      0.00            +6.4        6.36 ±  2%  perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.get_partial_node_bulk.__refill_objects_node
      0.00            +6.4        6.36 ±  2%  perf-profile.calltrace.cycles-pp._raw_spin_unlock_irqrestore.get_partial_node_bulk.__refill_objects_node.refill_objects.__pcs_replace_empty_main
      0.00            +6.4        6.36 ±  2%  perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt._raw_spin_unlock_irqrestore.get_partial_node_bulk.__refill_objects_node.refill_objects
      0.00           +14.1       14.13        perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.get_partial_node_bulk.__refill_objects_node.refill_objects
      0.00           +14.2       14.21        perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.get_partial_node_bulk.__refill_objects_node.refill_objects.__pcs_replace_empty_main
      0.00           +20.9       20.94        perf-profile.calltrace.cycles-pp.get_partial_node_bulk.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof
     49.86            -8.0       41.83        perf-profile.children.cycles-pp.refill_objects
     48.92            -7.5       41.41        perf-profile.children.cycles-pp.__refill_objects_node
     59.17            -6.5       52.68        perf-profile.children.cycles-pp.__pcs_replace_empty_main
     60.32            -6.3       53.98        perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
     38.28 ±  2%      -3.4       34.87 ±  5%  perf-profile.children.cycles-pp.mas_store_gfp
     64.39            -2.9       61.51        perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
     64.91            -2.8       62.08        perf-profile.children.cycles-pp._raw_spin_lock_irqsave
     94.44            -2.0       92.40        perf-profile.children.cycles-pp.do_syscall_64
     94.59            -2.0       92.55        perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.92 ±  2%      -0.5        0.40 ±  7%  perf-profile.children.cycles-pp.alloc_from_new_slab
      0.81            -0.1        0.70        perf-profile.children.cycles-pp.perf_iterate_sb
      0.38            -0.1        0.30        perf-profile.children.cycles-pp.up_read
      1.88            -0.1        1.81        perf-profile.children.cycles-pp.__mmap_complete
      0.15 ±  3%      -0.0        0.13        perf-profile.children.cycles-pp.percpu_counter_add_batch
      0.20            +0.0        0.21        perf-profile.children.cycles-pp.tlb_finish_mmu
      0.07            +0.0        0.08        perf-profile.children.cycles-pp.prandom_u32_state
      0.17            +0.0        0.18        perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
      0.09            +0.0        0.10        perf-profile.children.cycles-pp.mas_wr_store_entry
      0.11 ±  3%      +0.0        0.12        perf-profile.children.cycles-pp.testcase
      0.08 ±  4%      +0.0        0.09        perf-profile.children.cycles-pp.userfaultfd_unmap_complete
      0.06 ±  6%      +0.0        0.07        perf-profile.children.cycles-pp.__call_rcu_common
      0.14 ±  2%      +0.0        0.15 ±  2%  perf-profile.children.cycles-pp.mas_prev
      0.14            +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.syscall_return_via_sysret
      0.14            +0.0        0.15 ±  3%  perf-profile.children.cycles-pp.up_write
      0.10            +0.0        0.12        perf-profile.children.cycles-pp.x64_sys_call
      0.07            +0.0        0.09        perf-profile.children.cycles-pp.unlink_file_vma_batch_final
      0.06            +0.0        0.08        perf-profile.children.cycles-pp.vm_get_page_prot
      0.16            +0.0        0.18        perf-profile.children.cycles-pp.may_expand_vm
      0.13 ±  2%      +0.0        0.15        perf-profile.children.cycles-pp.mas_prev_range
      0.24            +0.0        0.26        perf-profile.children.cycles-pp.security_vm_enough_memory_mm
      1.64            +0.0        1.66        perf-profile.children.cycles-pp.perf_event_mmap
      0.24            +0.0        0.27        perf-profile.children.cycles-pp.downgrade_write
      0.36            +0.0        0.39        perf-profile.children.cycles-pp.__vma_start_exclude_readers
      0.29            +0.0        0.32        perf-profile.children.cycles-pp.down_write_killable
      0.32            +0.0        0.35        perf-profile.children.cycles-pp.vma_set_page_prot
      0.46            +0.0        0.50        perf-profile.children.cycles-pp.vma_merge_new_range
      0.53            +0.0        0.58        perf-profile.children.cycles-pp.mas_wr_store_type
      0.46            +0.0        0.50        perf-profile.children.cycles-pp.mas_rev_awalk
      0.70            +0.0        0.74        perf-profile.children.cycles-pp.mas_prev_slot
      0.30            +0.0        0.34        perf-profile.children.cycles-pp.__rcu_free_sheaf_prepare
      0.07            +0.0        0.12 ±  4%  perf-profile.children.cycles-pp.kfree
      0.53            +0.0        0.58        perf-profile.children.cycles-pp.mas_walk
      0.00            +0.1        0.05 ±  7%  perf-profile.children.cycles-pp.mm_get_unmapped_area_vmflags
      0.54            +0.1        0.59        perf-profile.children.cycles-pp.__vma_start_write
      0.45            +0.1        0.52 ±  3%  perf-profile.children.cycles-pp.barn_put_empty_sheaf
      0.00            +0.1        0.07 ±  5%  perf-profile.children.cycles-pp.mmap_region
      0.65            +0.1        0.72        perf-profile.children.cycles-pp.mas_empty_area_rev
      1.00            +0.1        1.07        perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.00            +0.1        0.09 ±  5%  perf-profile.children.cycles-pp.strlen
      0.01 ±223%      +0.1        0.10        perf-profile.children.cycles-pp.vm_area_free
      0.59            +0.1        0.68        perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
      1.28            +0.1        1.37        perf-profile.children.cycles-pp.entry_SYSCALL_64
      1.42            +0.1        1.52        perf-profile.children.cycles-pp.unmapped_area_topdown
      1.44            +0.1        1.54        perf-profile.children.cycles-pp.vm_unmapped_area
      0.54            +0.1        0.66        perf-profile.children.cycles-pp.kmem_cache_free
      1.76            +0.1        1.88        perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
      1.11            +0.1        1.25        perf-profile.children.cycles-pp.vm_area_alloc
      1.92            +0.2        2.07        perf-profile.children.cycles-pp.thp_get_unmapped_area_vmflags
      1.99            +0.2        2.16        perf-profile.children.cycles-pp.__get_unmapped_area
      2.27            +0.2        2.45        perf-profile.children.cycles-pp.free_p4d_range
      2.04            +0.2        2.22        perf-profile.children.cycles-pp.free_pud_range
      2.40            +0.2        2.59        perf-profile.children.cycles-pp.free_pgd_range
      0.82 ±  2%      +0.2        1.04 ±  2%  perf-profile.children.cycles-pp.mas_update_gap
      2.40            +0.2        2.64        perf-profile.children.cycles-pp.vms_gather_munmap_vmas
      2.87            +0.2        3.11        perf-profile.children.cycles-pp.__zap_vma_range
      2.70            +0.2        2.94        perf-profile.children.cycles-pp.free_pgtables
      2.64            +0.2        2.89        perf-profile.children.cycles-pp.zap_pmd_range
      3.02            +0.3        3.27        perf-profile.children.cycles-pp.unmap_vmas
     14.89            +0.5       15.36        perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
      6.20            +0.5        6.72        perf-profile.children.cycles-pp.unmap_region
      7.86            +0.5        8.41        perf-profile.children.cycles-pp.vms_complete_munmap_vmas
      4.10            +0.7        4.83 ±  2%  perf-profile.children.cycles-pp.barn_put_full_sheaf
      3.13            +0.8        3.96        perf-profile.children.cycles-pp.__pi_memcpy
      5.92            +1.1        7.02 ±  2%  perf-profile.children.cycles-pp.mas_store_prealloc
      4.84            +1.2        6.00 ±  2%  perf-profile.children.cycles-pp.__kfree_rcu_sheaf
      6.11            +1.2        7.34 ±  2%  perf-profile.children.cycles-pp.barn_replace_empty_sheaf
      5.31            +1.3        6.58 ±  2%  perf-profile.children.cycles-pp.kvfree_call_rcu
     17.71            +1.3       19.06        perf-profile.children.cycles-pp.__irq_exit_rcu
     17.83            +1.4       19.18        perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
     17.85            +1.4       19.21        perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
      7.59            +1.4        9.01 ±  2%  perf-profile.children.cycles-pp.barn_get_empty_sheaf
      2.16 ±  3%      +1.8        3.95        perf-profile.children.cycles-pp.run_ksoftirqd
      2.18 ±  3%      +1.8        3.96        perf-profile.children.cycles-pp.kthread
      2.18 ±  3%      +1.8        3.96        perf-profile.children.cycles-pp.ret_from_fork
      2.18 ±  3%      +1.8        3.96        perf-profile.children.cycles-pp.ret_from_fork_asm
      2.17 ±  3%      +1.8        3.96        perf-profile.children.cycles-pp.smpboot_thread_fn
     14.64            +2.2       16.86        perf-profile.children.cycles-pp.__kmem_cache_free_bulk
     14.66            +2.3       16.93        perf-profile.children.cycles-pp.__slab_free
     15.25            +2.3       17.57        perf-profile.children.cycles-pp.rcu_free_sheaf
     10.75            +2.5       13.20        perf-profile.children.cycles-pp.mas_wr_node_store
     19.86            +3.1       22.99        perf-profile.children.cycles-pp.rcu_core
     19.87            +3.1       23.00        perf-profile.children.cycles-pp.handle_softirqs
     19.84            +3.1       22.98        perf-profile.children.cycles-pp.rcu_do_batch
      0.00           +20.9       20.94        perf-profile.children.cycles-pp.get_partial_node_bulk
     64.39            -2.9       61.51        perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
      1.90            -0.4        1.48 ±  2%  perf-profile.self.cycles-pp.__refill_objects_node
      0.27            -0.1        0.17        perf-profile.self.cycles-pp.vms_complete_munmap_vmas
      0.58            -0.1        0.49        perf-profile.self.cycles-pp.perf_iterate_sb
      0.34            -0.1        0.26        perf-profile.self.cycles-pp.up_read
      0.25 ±  3%      -0.0        0.23 ±  4%  perf-profile.self.cycles-pp.__memcg_slab_free_hook
      0.12            -0.0        0.11        perf-profile.self.cycles-pp.vm_mmap_pgoff
      0.14            +0.0        0.15        perf-profile.self.cycles-pp.syscall_return_via_sysret
      0.05            +0.0        0.06        perf-profile.self.cycles-pp.do_vmi_munmap
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.mas_prev
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.testcase
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.unmap_region
      0.07            +0.0        0.08        perf-profile.self.cycles-pp.unmap_vmas
      0.13            +0.0        0.14        perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
      0.13            +0.0        0.14        perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
      0.06            +0.0        0.07        perf-profile.self.cycles-pp.prandom_u32_state
      0.13            +0.0        0.14        perf-profile.self.cycles-pp.up_write
      0.06            +0.0        0.07        perf-profile.self.cycles-pp.userfaultfd_unmap_complete
      0.14 ±  3%      +0.0        0.16        perf-profile.self.cycles-pp.may_expand_vm
      0.11            +0.0        0.13 ±  3%  perf-profile.self.cycles-pp.thp_get_unmapped_area_vmflags
      0.17            +0.0        0.19        perf-profile.self.cycles-pp.__vma_start_write
      0.17            +0.0        0.19        perf-profile.self.cycles-pp.barn_get_empty_sheaf
      0.18            +0.0        0.20        perf-profile.self.cycles-pp.mas_empty_area_rev
      0.09            +0.0        0.11 ±  3%  perf-profile.self.cycles-pp.x64_sys_call
      0.40            +0.0        0.42        perf-profile.self.cycles-pp.__vm_munmap
      0.27            +0.0        0.29        perf-profile.self.cycles-pp.__kfree_rcu_sheaf
      0.20            +0.0        0.22        perf-profile.self.cycles-pp.barn_replace_empty_sheaf
      0.40            +0.0        0.42        perf-profile.self.cycles-pp.unmapped_area_topdown
      0.20 ±  2%      +0.0        0.22        perf-profile.self.cycles-pp.free_pgtables
      0.33            +0.0        0.35        perf-profile.self.cycles-pp.vma_merge_new_range
      0.27            +0.0        0.29        perf-profile.self.cycles-pp.down_write_killable
      0.22            +0.0        0.24        perf-profile.self.cycles-pp.downgrade_write
      0.30            +0.0        0.33        perf-profile.self.cycles-pp.__vma_start_exclude_readers
      0.47            +0.0        0.50        perf-profile.self.cycles-pp.__mmap
      0.33            +0.0        0.36        perf-profile.self.cycles-pp.vm_area_alloc
      0.11 ±  3%      +0.0        0.14        perf-profile.self.cycles-pp.do_mmap
      0.64            +0.0        0.66        perf-profile.self.cycles-pp.mas_prev_slot
      0.37            +0.0        0.40        perf-profile.self.cycles-pp.perf_event_mmap
      0.05            +0.0        0.08        perf-profile.self.cycles-pp.vma_set_page_prot
      0.38            +0.0        0.41        perf-profile.self.cycles-pp.vms_gather_munmap_vmas
      0.30            +0.0        0.33        perf-profile.self.cycles-pp.mas_wr_store_type
      0.43            +0.0        0.46        perf-profile.self.cycles-pp.__munmap
      0.37            +0.0        0.40        perf-profile.self.cycles-pp.mas_preallocate
      0.42            +0.0        0.45        perf-profile.self.cycles-pp.mas_rev_awalk
      0.48            +0.0        0.51        perf-profile.self.cycles-pp.mas_walk
      0.06 ±  9%      +0.0        0.09        perf-profile.self.cycles-pp.kfree
      0.40            +0.0        0.44        perf-profile.self.cycles-pp.mas_store_prealloc
      0.22            +0.0        0.26        perf-profile.self.cycles-pp.perf_event_mmap_event
      0.52            +0.0        0.56        perf-profile.self.cycles-pp.kmem_cache_alloc_noprof
      0.30            +0.0        0.34        perf-profile.self.cycles-pp.__rcu_free_sheaf_prepare
      0.57            +0.0        0.61        perf-profile.self.cycles-pp.do_vmi_align_munmap
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.__get_unmapped_area
      0.00            +0.1        0.05        perf-profile.self.cycles-pp.mmap_region
      0.63            +0.1        0.68        perf-profile.self.cycles-pp.entry_SYSCALL_64
      0.52            +0.1        0.57        perf-profile.self.cycles-pp._raw_spin_lock_irqsave
      0.87            +0.1        0.92        perf-profile.self.cycles-pp.mas_store_gfp
      0.48            +0.1        0.54        perf-profile.self.cycles-pp.__slab_free
      0.50            +0.1        0.57        perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook
      0.99            +0.1        1.07        perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
      0.42            +0.1        0.51        perf-profile.self.cycles-pp.kvfree_call_rcu
      0.19            +0.1        0.30        perf-profile.self.cycles-pp.kmem_cache_free
      1.48            +0.1        1.61        perf-profile.self.cycles-pp.mas_wr_node_store
      1.86            +0.1        2.00        perf-profile.self.cycles-pp.free_pud_range
      1.76            +0.1        1.90        perf-profile.self.cycles-pp.__mmap_region
      2.42            +0.2        2.60        perf-profile.self.cycles-pp.zap_pmd_range
      0.75            +0.2        0.94 ±  2%  perf-profile.self.cycles-pp.mas_update_gap
      0.00            +0.3        0.34        perf-profile.self.cycles-pp.get_partial_node_bulk
      2.77            +0.7        3.44        perf-profile.self.cycles-pp.__pi_memcpy




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


