All of lore.kernel.org
 help / color / mirror / Atom feed
* [linus:master] [vfs]  93faf426e3:  stress-ng.dynlib.ops_per_sec 7.2% improvement
@ 2024-01-29  8:05 kernel test robot
  0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2024-01-29  8:05 UTC (permalink / raw)
  To: Mateusz Guzik
  Cc: oe-lkp, lkp, linux-kernel, Christian Brauner, linux-fsdevel,
	ying.huang, feng.tang, fengwei.yin, oliver.sang



Hello,

kernel test robot noticed a 7.2% improvement of stress-ng.dynlib.ops_per_sec on:


commit: 93faf426e3cc000c95f1a5d3510b77ce99adac52 ("vfs: shave work on failed file open")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:

	nr_threads: 10%
	disk: 1HDD
	testtime: 60s
	fs: ext4
	class: os
	test: dynlib
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240129/202401291500.8546fbc3-oliver.sang@intel.com

=========================================================================================
class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  os/gcc-12/performance/1HDD/ext4/x86_64-rhel-8.3/10%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp7/dynlib/stress-ng/60s

commit: 
  6036c5f131 ("fs: simplify misleading code to remove ambiguity regarding ihold()/iput()")
  93faf426e3 ("vfs: shave work on failed file open")

6036c5f131752689 93faf426e3cc000c95f1a5d3510 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      8.49            -0.9%       8.41        iostat.cpu.system
      0.61            -0.2        0.39        mpstat.cpu.all.soft%
     55.50 ± 11%     -44.7%      30.67 ± 21%  perf-c2c.DRAM.local
    165542 ±  2%      -9.7%     149471        numa-meminfo.node1.Active
    161615            -9.0%     147100 ±  2%  numa-meminfo.node1.Active(anon)
     40405            -9.0%      36775 ±  2%  numa-vmstat.node1.nr_active_anon
     40405            -9.0%      36775 ±  2%  numa-vmstat.node1.nr_zone_active_anon
     51245 ± 35%     -42.9%      29277 ±  7%  sched_debug.cfs_rq:/.avg_vruntime.max
      9910 ± 54%     -48.0%       5156 ±  8%  sched_debug.cfs_rq:/.avg_vruntime.stddev
     51247 ± 35%     -42.9%      29277 ±  7%  sched_debug.cfs_rq:/.min_vruntime.max
      9911 ± 54%     -48.0%       5156 ±  8%  sched_debug.cfs_rq:/.min_vruntime.stddev
    249.51            -6.7%     232.83        stress-ng.dynlib.nanosecs_per_dlsym_lookup
    287979            +7.2%     308763        stress-ng.dynlib.ops
      4799            +7.2%       5146        stress-ng.dynlib.ops_per_sec
      2343 ±  5%     -39.9%       1409 ±  5%  stress-ng.time.involuntary_context_switches
  20176141            +7.4%   21671738        stress-ng.time.minor_page_faults
    535.17            +2.6%     548.83        stress-ng.time.percent_of_cpu_this_job_got
    291.91            +2.2%     298.23        stress-ng.time.system_time
     40770            -7.3%      37774 ±  2%  proc-vmstat.nr_active_anon
      3098 ±  2%      +3.4%       3204        proc-vmstat.nr_inactive_file
     34934            -5.1%      33139        proc-vmstat.nr_mapped
     70157            -6.1%      65866        proc-vmstat.nr_shmem
     42286            -3.3%      40888        proc-vmstat.nr_slab_unreclaimable
     40770            -7.3%      37774 ±  2%  proc-vmstat.nr_zone_active_anon
      3098 ±  2%      +3.4%       3204        proc-vmstat.nr_zone_inactive_file
  15514950            -3.9%   14905053        proc-vmstat.numa_hit
  15448751            -3.9%   14838822        proc-vmstat.numa_local
    124008            -5.2%     117617        proc-vmstat.pgactivate
  18884464           -11.2%   16772814        proc-vmstat.pgalloc_normal
  20593501            +7.2%   22078068        proc-vmstat.pgfault
  18735781           -11.2%   16638429        proc-vmstat.pgfree
      0.36 ±179%     -94.5%       0.02 ± 44%  perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
    165.10 ±215%     -97.9%       3.40 ± 37%  perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      0.11 ± 40%    -100.0%       0.00        perf-sched.wait_and_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      0.13 ± 25%     -39.4%       0.08 ± 21%  perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
     23.83 ± 16%    -100.0%       0.00        perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
    545.33 ±  4%     -14.0%     468.83 ±  7%  perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
      1.19 ± 41%    -100.0%       0.00        perf-sched.wait_and_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      0.46 ±124%     -99.3%       0.00 ±146%  perf-sched.wait_time.avg.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open
      0.09 ± 63%     -92.8%       0.01 ± 77%  perf-sched.wait_time.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      0.11 ± 23%     -37.4%       0.07 ± 20%  perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
      2.02 ±  6%     -29.8%       1.42 ± 44%  perf-sched.wait_time.avg.ms.syslog_print.do_syslog.kmsg_read.vfs_read
      0.56 ±104%     -99.3%       0.00 ±156%  perf-sched.wait_time.max.ms.__cond_resched.dput.open_last_lookups.path_openat.do_filp_open
      0.44 ± 81%     -98.5%       0.01 ± 76%  perf-sched.wait_time.max.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      4.02 ±  6%     -12.3%       3.53 ±  8%  perf-sched.wait_time.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
      1.18 ± 42%     -63.7%       0.43 ± 83%  perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      3.24 ± 22%     -44.2%       1.81 ± 19%  perf-sched.wait_time.max.ms.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
      4.05 ±  6%     -29.8%       2.84 ± 44%  perf-sched.wait_time.max.ms.syslog_print.do_syslog.kmsg_read.vfs_read
      0.58           -12.8%       0.51 ±  3%  perf-stat.i.MPKI
   9185336 ±  2%     -11.2%    8152378 ±  3%  perf-stat.i.cache-misses
  51941695           -13.4%   44958941        perf-stat.i.cache-references
      1.29            -2.8%       1.26        perf-stat.i.cpi
 2.039e+10            -0.9%  2.022e+10        perf-stat.i.cpu-cycles
    107.66 ±  3%      -8.6%      98.35 ±  5%  perf-stat.i.cpu-migrations
      2258           +11.1%       2508 ±  3%  perf-stat.i.cycles-between-cache-misses
      0.05 ±  2%      +0.0        0.06 ±  2%  perf-stat.i.dTLB-load-miss-rate%
   2224314 ±  2%      +5.6%    2349507 ±  2%  perf-stat.i.dTLB-load-misses
      0.08            +0.0        0.08        perf-stat.i.dTLB-store-miss-rate%
   1755339            +7.6%    1889357        perf-stat.i.dTLB-store-misses
      0.78            +2.8%       0.80        perf-stat.i.ipc
      0.32            -0.8%       0.32        perf-stat.i.metric.GHz
    965.70           -11.8%     851.51        perf-stat.i.metric.K/sec
    326896            +7.3%     350610        perf-stat.i.minor-faults
     89.04            +3.5       92.53        perf-stat.i.node-load-miss-rate%
    617886 ±  7%     -34.0%     407524 ± 12%  perf-stat.i.node-loads
     49.79 ±  3%      +6.2       56.00        perf-stat.i.node-store-miss-rate%
   1915596 ±  3%      -9.6%    1731427 ±  4%  perf-stat.i.node-store-misses
   2010814 ±  4%     -29.8%    1410941 ±  2%  perf-stat.i.node-stores
    327425            +7.3%     351195        perf-stat.i.page-faults
      0.58           -13.1%       0.51 ±  3%  perf-stat.overall.MPKI
      1.30            -2.9%       1.26        perf-stat.overall.cpi
      2219           +11.8%       2482 ±  3%  perf-stat.overall.cycles-between-cache-misses
      0.05 ±  2%      +0.0        0.06 ±  2%  perf-stat.overall.dTLB-load-miss-rate%
      0.08            +0.0        0.08        perf-stat.overall.dTLB-store-miss-rate%
      0.77            +3.0%       0.79        perf-stat.overall.ipc
     87.69            +5.5       93.21 ±  3%  perf-stat.overall.node-load-miss-rate%
     48.82 ±  3%      +6.3       55.14 ±  2%  perf-stat.overall.node-store-miss-rate%
   9052155 ±  2%     -11.3%    8027092 ±  3%  perf-stat.ps.cache-misses
  51114488           -13.4%   44240106        perf-stat.ps.cache-references
 2.008e+10            -0.9%  1.991e+10        perf-stat.ps.cpu-cycles
    106.26 ±  3%      -8.7%      97.05 ±  5%  perf-stat.ps.cpu-migrations
   2190693 ±  2%      +5.6%    2313598 ±  2%  perf-stat.ps.dTLB-load-misses
   1728462            +7.6%    1860472        perf-stat.ps.dTLB-store-misses
    321900            +7.3%     345241        perf-stat.ps.minor-faults
    621192 ±  7%     -34.3%     407816 ± 12%  perf-stat.ps.node-loads
   1885823 ±  3%      -9.6%    1703871 ±  4%  perf-stat.ps.node-store-misses
   1978205 ±  4%     -30.0%    1385178 ±  2%  perf-stat.ps.node-stores
    322420            +7.3%     345817        perf-stat.ps.page-faults
      6.48            -4.9        1.63 ±  4%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.66            -4.8        1.81 ±  4%  perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.41            -4.8        1.57 ±  4%  perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
      6.06            -4.5        1.53 ±  4%  perf-profile.calltrace.cycles-pp.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
      5.03            -3.6        1.41 ±  5%  perf-profile.calltrace.cycles-pp.__fput.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
      9.04 ±  2%      -2.3        6.78        perf-profile.calltrace.cycles-pp.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
      6.84 ±  2%      -1.6        5.24        perf-profile.calltrace.cycles-pp.init_file.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2
     72.38            -0.9       71.44        perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe
     72.16            -0.9       71.21        perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.99 ±  3%      -0.7        1.31 ±  3%  perf-profile.calltrace.cycles-pp.kmem_cache_alloc.alloc_empty_file.path_openat.do_filp_open.do_sys_openat2
      0.65 ±  6%      -0.4        0.27 ±100%  perf-profile.calltrace.cycles-pp.__do_softirq.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state
      0.69 ±  5%      -0.3        0.38 ± 70%  perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter
      1.37 ±  4%      -0.1        1.22 ±  6%  perf-profile.calltrace.cycles-pp.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap
      1.45 ±  4%      -0.1        1.32 ±  6%  perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.mmap_region.do_mmap.vm_mmap_pgoff
      1.69 ±  3%      -0.1        1.57 ±  6%  perf-profile.calltrace.cycles-pp.down_write.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      0.74 ±  5%      -0.1        0.63 ±  9%  perf-profile.calltrace.cycles-pp.osq_lock.rwsem_optimistic_spin.rwsem_down_write_slowpath.down_write.mmap_region
      0.67 ±  3%      +0.1        0.72        perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      0.94 ±  3%      +0.1        0.99 ±  2%  perf-profile.calltrace.cycles-pp.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      0.94 ±  3%      +0.1        1.01        perf-profile.calltrace.cycles-pp.vma_interval_tree_insert.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      0.75 ±  4%      +0.1        0.83 ±  2%  perf-profile.calltrace.cycles-pp.rcu_segcblist_enqueue.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      0.89 ±  3%      +0.1        0.97 ±  2%  perf-profile.calltrace.cycles-pp.up_write.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      1.06 ±  4%      +0.1        1.14 ±  3%  perf-profile.calltrace.cycles-pp.vma_expand.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
      1.10 ±  2%      +0.1        1.20 ±  2%  perf-profile.calltrace.cycles-pp.release_pages.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap
      1.06 ±  4%      +0.1        1.16 ±  2%  perf-profile.calltrace.cycles-pp.link_path_walk.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
      1.74            +0.1        1.88        perf-profile.calltrace.cycles-pp.tlb_batch_pages_flush.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap
      2.44            +0.2        2.58 ±  2%  perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      2.43 ±  2%      +0.2        2.60        perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap
      2.79            +0.2        2.96 ±  3%  perf-profile.calltrace.cycles-pp.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
      2.84            +0.2        3.01 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe._dl_catch_exception
      2.78            +0.2        2.95 ±  3%  perf-profile.calltrace.cycles-pp.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
      2.17 ±  2%      +0.2        2.34        perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region
      2.82            +0.2        3.00 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe._dl_catch_exception
      2.91 ±  2%      +0.2        3.09 ±  3%  perf-profile.calltrace.cycles-pp.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64.entry_SYSCALL_64_after_hwframe
      1.58 ±  6%      +0.2        1.76 ±  6%  perf-profile.calltrace.cycles-pp.down_write.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap
      2.22 ±  2%      +0.2        2.40        perf-profile.calltrace.cycles-pp.flush_tlb_func.flush_tlb_mm_range.tlb_finish_mmu.unmap_region.do_vmi_align_munmap
      0.26 ±100%      +0.3        0.54 ±  3%  perf-profile.calltrace.cycles-pp.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
      1.86 ±  4%      +0.3        2.14 ±  4%  perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      3.32 ±  2%      +0.3        3.60 ±  3%  perf-profile.calltrace.cycles-pp.vma_prepare.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region
      2.44 ±  3%      +0.3        2.73 ±  4%  perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
      4.31            +0.3        4.62        perf-profile.calltrace.cycles-pp.tlb_finish_mmu.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap
      5.40            +0.3        5.73 ±  3%  perf-profile.calltrace.cycles-pp.next_uptodate_folio.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault
      8.33 ±  3%      +0.4        8.68 ±  4%  perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
      8.40 ±  3%      +0.4        8.75 ±  4%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
      7.06            +0.4        7.47 ±  3%  perf-profile.calltrace.cycles-pp.filemap_map_pages.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault
      7.11            +0.4        7.53 ±  3%  perf-profile.calltrace.cycles-pp.do_read_fault.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault
      8.33            +0.4        8.78 ±  2%  perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault
      1.94 ± 12%      +0.5        2.40 ±  3%  perf-profile.calltrace.cycles-pp.__split_vma.mprotect_fixup.do_mprotect_pkey.__x64_sys_mprotect.do_syscall_64
      9.37            +0.5        9.84 ±  2%  perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
     15.46            +0.5       15.94        perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap
      9.70            +0.5       10.20        perf-profile.calltrace.cycles-pp.handle_mm_fault.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
     10.75            +0.5       11.30        perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault
     10.79            +0.5       11.34        perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault
      7.83            +0.6        8.43        perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap
     11.55            +0.6       12.16        perf-profile.calltrace.cycles-pp.asm_exc_page_fault
     20.74            +0.7       21.44        perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
     20.88            +0.7       21.58        perf-profile.calltrace.cycles-pp.do_vmi_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     20.95            +0.7       21.67        perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
     20.95            +0.7       21.67        perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +0.7        0.74 ±  5%  perf-profile.calltrace.cycles-pp.kmem_cache_free.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
      3.49 ±  2%      +0.8        4.31        perf-profile.calltrace.cycles-pp.apparmor_file_alloc_security.security_file_alloc.init_file.alloc_empty_file.path_openat
      3.96            +0.8        4.78        perf-profile.calltrace.cycles-pp.security_file_alloc.init_file.alloc_empty_file.path_openat.do_filp_open
     11.55            +0.9       12.50 ±  2%  perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff
     11.78            +1.0       12.74 ±  2%  perf-profile.calltrace.cycles-pp.do_vmi_munmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
     23.77            +1.3       25.09        perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     21.46            +1.3       22.79        perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
     22.25            +1.3       23.58        perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     23.91            +1.3       25.25        perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
     14.68            +1.5       16.19        perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64
     14.99            +1.6       16.54        perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
     17.95            +1.7       19.64        perf-profile.calltrace.cycles-pp.do_sys_openat2.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
     18.05            +1.7       19.74        perf-profile.calltrace.cycles-pp.__x64_sys_openat.do_syscall_64.entry_SYSCALL_64_after_hwframe
      0.00            +2.5        2.54 ±  5%  perf-profile.calltrace.cycles-pp.apparmor_file_free_security.security_file_free.release_empty_file.path_openat.do_filp_open
      0.00            +2.6        2.56 ±  5%  perf-profile.calltrace.cycles-pp.security_file_free.release_empty_file.path_openat.do_filp_open.do_sys_openat2
      0.00            +3.2        3.21 ±  4%  perf-profile.calltrace.cycles-pp.release_empty_file.path_openat.do_filp_open.do_sys_openat2.__x64_sys_openat
      6.53            -4.9        1.67 ±  4%  perf-profile.children.cycles-pp.exit_to_user_mode_prepare
      6.44            -4.9        1.58 ±  4%  perf-profile.children.cycles-pp.exit_to_user_mode_loop
      6.72            -4.8        1.88 ±  3%  perf-profile.children.cycles-pp.syscall_exit_to_user_mode
      6.10            -4.6        1.54 ±  4%  perf-profile.children.cycles-pp.task_work_run
      5.05            -3.6        1.42 ±  5%  perf-profile.children.cycles-pp.__fput
      7.18 ±  2%      -2.6        4.60 ±  4%  perf-profile.children.cycles-pp.__do_softirq
      6.77 ±  2%      -2.6        4.20 ±  4%  perf-profile.children.cycles-pp.rcu_core
      6.72 ±  2%      -2.6        4.17 ±  4%  perf-profile.children.cycles-pp.rcu_do_batch
      7.08 ±  2%      -2.5        4.59 ±  3%  perf-profile.children.cycles-pp.__irq_exit_rcu
      9.07 ±  2%      -2.3        6.81        perf-profile.children.cycles-pp.alloc_empty_file
     10.42            -2.1        8.34 ±  4%  perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
     10.17            -2.1        8.09 ±  4%  perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
      6.86 ±  2%      -1.6        5.26        perf-profile.children.cycles-pp.init_file
      3.66 ±  4%      -0.9        2.80 ±  4%  perf-profile.children.cycles-pp.apparmor_file_free_security
      3.70 ±  4%      -0.9        2.84 ±  5%  perf-profile.children.cycles-pp.security_file_free
      1.21 ±  3%      -0.7        0.50 ±  7%  perf-profile.children.cycles-pp.rcu_cblist_dequeue
      1.55 ±  2%      -0.6        0.94 ±  4%  perf-profile.children.cycles-pp.___slab_alloc
      4.97 ±  2%      -0.6        4.38 ±  2%  perf-profile.children.cycles-pp.kmem_cache_alloc
      2.62 ±  2%      -0.6        2.05 ±  4%  perf-profile.children.cycles-pp.__slab_free
      2.26            -0.5        1.71        perf-profile.children.cycles-pp.__call_rcu_common
      0.58 ±  6%      -0.5        0.10 ± 19%  perf-profile.children.cycles-pp.file_free_rcu
      0.63 ±  2%      -0.4        0.18 ±  4%  perf-profile.children.cycles-pp.fput
      0.38 ±  4%      -0.3        0.08 ± 12%  perf-profile.children.cycles-pp.task_work_add
      0.66 ±  2%      -0.3        0.37 ±  7%  perf-profile.children.cycles-pp.allocate_slab
      0.46 ±  3%      -0.2        0.27 ±  8%  perf-profile.children.cycles-pp.shuffle_freelist
      0.29 ±  5%      -0.1        0.15 ±  8%  perf-profile.children.cycles-pp.inc_slabs_node
      0.18 ±  5%      -0.1        0.05 ± 47%  perf-profile.children.cycles-pp._raw_spin_lock_irq
      0.17 ± 10%      -0.1        0.07 ± 48%  perf-profile.children.cycles-pp.smpboot_thread_fn
      0.15 ± 10%      -0.1        0.06 ± 51%  perf-profile.children.cycles-pp.run_ksoftirqd
      0.20 ±  3%      -0.1        0.12 ±  9%  perf-profile.children.cycles-pp.rcu_nocb_try_bypass
      0.20 ±  7%      -0.1        0.13 ±  5%  perf-profile.children.cycles-pp.__unfreeze_partials
      1.16 ±  2%      -0.1        1.10        perf-profile.children.cycles-pp.rcu_segcblist_enqueue
      0.29 ±  5%      -0.1        0.24 ±  5%  perf-profile.children.cycles-pp.refill_obj_stock
      0.22 ±  4%      -0.0        0.17 ± 11%  perf-profile.children.cycles-pp.get_page_from_freelist
      0.14 ± 10%      -0.0        0.09 ±  9%  perf-profile.children.cycles-pp.setup_object
      0.18 ±  7%      -0.0        0.15 ±  8%  perf-profile.children.cycles-pp.__kmem_cache_alloc_node
      0.09 ± 14%      -0.0        0.06 ± 11%  perf-profile.children.cycles-pp.free_unref_page
      0.09 ±  7%      +0.0        0.11 ±  6%  perf-profile.children.cycles-pp.cp_new_stat
      0.20 ±  9%      +0.0        0.24 ±  9%  perf-profile.children.cycles-pp.generic_file_mmap
      0.53 ±  4%      +0.0        0.57 ±  2%  perf-profile.children.cycles-pp.lock_vma_under_rcu
      0.27 ±  5%      +0.0        0.31 ±  6%  perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
      0.16 ± 12%      +0.0        0.21 ±  7%  perf-profile.children.cycles-pp.put_unused_fd
      0.24 ±  6%      +0.1        0.29 ±  7%  perf-profile.children.cycles-pp.path_init
      0.26 ±  7%      +0.1        0.31 ± 10%  perf-profile.children.cycles-pp.inode_permission
      0.01 ±223%      +0.1        0.07 ± 11%  perf-profile.children.cycles-pp.__irqentry_text_end
      0.62 ±  3%      +0.1        0.67 ±  3%  perf-profile.children.cycles-pp.__rb_insert_augmented
      0.11 ± 31%      +0.1        0.18 ± 17%  perf-profile.children.cycles-pp.tick_sched_do_timer
      1.06 ±  4%      +0.1        1.15 ±  3%  perf-profile.children.cycles-pp.vma_expand
      0.12 ± 22%      +0.1        0.21 ± 28%  perf-profile.children.cycles-pp.ktime_get_update_offsets_now
      1.14 ±  2%      +0.1        1.24 ±  2%  perf-profile.children.cycles-pp.release_pages
      1.09 ±  4%      +0.1        1.19 ±  2%  perf-profile.children.cycles-pp.link_path_walk
      2.12 ±  2%      +0.1        2.24 ±  2%  perf-profile.children.cycles-pp.vma_interval_tree_remove
      0.51 ±  5%      +0.1        0.64 ±  4%  perf-profile.children.cycles-pp.percpu_counter_add_batch
      1.75            +0.1        1.90        perf-profile.children.cycles-pp.tlb_batch_pages_flush
      3.61 ±  2%      +0.2        3.77 ±  2%  perf-profile.children.cycles-pp._dl_catch_exception
      2.66 ±  2%      +0.2        2.83        perf-profile.children.cycles-pp.flush_tlb_mm_range
      2.91 ±  2%      +0.2        3.09 ±  3%  perf-profile.children.cycles-pp.mprotect_fixup
      3.36            +0.2        3.54        perf-profile.children.cycles-pp.vma_complete
      2.28            +0.2        2.46        perf-profile.children.cycles-pp.kmem_cache_free
      2.32 ±  2%      +0.2        2.51        perf-profile.children.cycles-pp.native_flush_tlb_one_user
      3.38            +0.2        3.58 ±  2%  perf-profile.children.cycles-pp.__x64_sys_mprotect
      3.38            +0.2        3.58 ±  2%  perf-profile.children.cycles-pp.do_mprotect_pkey
      2.41 ±  2%      +0.2        2.62        perf-profile.children.cycles-pp.flush_tlb_func
      3.09 ±  2%      +0.2        3.33 ±  2%  perf-profile.children.cycles-pp.up_write
      3.12 ±  2%      +0.3        3.37 ±  3%  perf-profile.children.cycles-pp.vma_interval_tree_insert
      4.52            +0.3        4.83        perf-profile.children.cycles-pp.tlb_finish_mmu
      0.21 ± 62%      +0.4        0.57 ± 60%  perf-profile.children.cycles-pp.tick_irq_enter
      5.76            +0.4        6.13 ±  3%  perf-profile.children.cycles-pp.next_uptodate_folio
      4.70 ±  2%      +0.5        5.15 ±  3%  perf-profile.children.cycles-pp.vma_prepare
      7.52            +0.5        7.98 ±  2%  perf-profile.children.cycles-pp.filemap_map_pages
      7.56            +0.5        8.03 ±  2%  perf-profile.children.cycles-pp.do_read_fault
      8.78            +0.5        9.27 ±  2%  perf-profile.children.cycles-pp.do_fault
      9.84            +0.5       10.37        perf-profile.children.cycles-pp.__handle_mm_fault
     10.18            +0.6       10.74        perf-profile.children.cycles-pp.handle_mm_fault
     11.30            +0.6       11.90        perf-profile.children.cycles-pp.exc_page_fault
     11.27            +0.6       11.87        perf-profile.children.cycles-pp.do_user_addr_fault
     12.10            +0.7       12.76        perf-profile.children.cycles-pp.asm_exc_page_fault
     20.95            +0.7       21.67        perf-profile.children.cycles-pp.__vm_munmap
     20.95            +0.7       21.67        perf-profile.children.cycles-pp.__x64_sys_munmap
     10.11            +0.7       10.85        perf-profile.children.cycles-pp.__split_vma
     18.12            +0.8       18.88        perf-profile.children.cycles-pp.unmap_region
      3.51 ±  2%      +0.8        4.34        perf-profile.children.cycles-pp.apparmor_file_alloc_security
      3.98            +0.8        4.81        perf-profile.children.cycles-pp.security_file_alloc
     24.22            +1.3       25.56        perf-profile.children.cycles-pp.vm_mmap_pgoff
     22.69            +1.3       24.04        perf-profile.children.cycles-pp.do_mmap
     21.92            +1.4       23.27        perf-profile.children.cycles-pp.mmap_region
     23.92            +1.4       25.27        perf-profile.children.cycles-pp.ksys_mmap_pgoff
     14.74            +1.5       16.27        perf-profile.children.cycles-pp.path_openat
     15.02            +1.6       16.58        perf-profile.children.cycles-pp.do_filp_open
     32.60            +1.7       34.26        perf-profile.children.cycles-pp.do_vmi_align_munmap
     32.93            +1.7       34.60        perf-profile.children.cycles-pp.do_vmi_munmap
     18.02            +1.7       19.71        perf-profile.children.cycles-pp.do_sys_openat2
     18.07            +1.7       19.77        perf-profile.children.cycles-pp.__x64_sys_openat
      0.00            +3.2        3.22 ±  4%  perf-profile.children.cycles-pp.release_empty_file
      2.63 ±  4%      -2.2        0.41 ±  6%  perf-profile.self.cycles-pp.init_file
      3.44 ±  4%      -0.8        2.68 ±  4%  perf-profile.self.cycles-pp.apparmor_file_free_security
      1.20 ±  3%      -0.7        0.49 ±  7%  perf-profile.self.cycles-pp.rcu_cblist_dequeue
      2.54 ±  2%      -0.5        2.01 ±  4%  perf-profile.self.cycles-pp.__slab_free
      0.57 ±  6%      -0.5        0.09 ± 13%  perf-profile.self.cycles-pp.file_free_rcu
      0.78 ±  3%      -0.3        0.44 ±  4%  perf-profile.self.cycles-pp.__call_rcu_common
      0.30 ±  6%      -0.2        0.07 ± 14%  perf-profile.self.cycles-pp.task_work_add
      0.42 ±  6%      -0.1        0.28 ±  3%  perf-profile.self.cycles-pp.___slab_alloc
      0.28 ±  5%      -0.1        0.14 ±  4%  perf-profile.self.cycles-pp.inc_slabs_node
      0.34 ±  6%      -0.1        0.20 ±  7%  perf-profile.self.cycles-pp.shuffle_freelist
      0.23 ±  3%      -0.1        0.10 ± 12%  perf-profile.self.cycles-pp.fput
      0.16 ±  5%      -0.1        0.04 ± 71%  perf-profile.self.cycles-pp._raw_spin_lock_irq
      0.14 ±  4%      -0.1        0.08 ± 14%  perf-profile.self.cycles-pp.rcu_nocb_try_bypass
      1.13 ±  2%      -0.1        1.08        perf-profile.self.cycles-pp.rcu_segcblist_enqueue
      0.27 ±  4%      -0.0        0.22 ±  6%  perf-profile.self.cycles-pp.refill_obj_stock
      0.10 ±  8%      -0.0        0.06 ±  6%  perf-profile.self.cycles-pp.__unfreeze_partials
      0.11 ±  8%      -0.0        0.08 ± 10%  perf-profile.self.cycles-pp.rcu_do_batch
      0.11 ±  4%      +0.0        0.14 ± 10%  perf-profile.self.cycles-pp.mas_walk
      0.07 ± 16%      +0.0        0.10 ±  6%  perf-profile.self.cycles-pp.atime_needs_update
      0.21 ±  5%      +0.0        0.25 ±  6%  perf-profile.self.cycles-pp.vm_area_dup
      0.89            +0.0        0.93        perf-profile.self.cycles-pp.zap_pte_range
      0.40 ±  5%      +0.0        0.44 ±  2%  perf-profile.self.cycles-pp.__split_vma
      0.44 ±  3%      +0.0        0.48 ±  4%  perf-profile.self.cycles-pp.vma_interval_tree_augment_rotate
      0.15 ±  8%      +0.0        0.20 ±  7%  perf-profile.self.cycles-pp.syscall_enter_from_user_mode
      0.17 ± 11%      +0.0        0.22 ±  7%  perf-profile.self.cycles-pp.path_init
      0.52 ±  3%      +0.0        0.57 ±  3%  perf-profile.self.cycles-pp.__rb_insert_augmented
      0.20 ±  8%      +0.1        0.26 ±  4%  perf-profile.self.cycles-pp.get_obj_cgroup_from_current
      0.01 ±223%      +0.1        0.06 ± 14%  perf-profile.self.cycles-pp.__irqentry_text_end
      0.49 ±  4%      +0.1        0.55 ±  4%  perf-profile.self.cycles-pp.free_swap_cache
      0.60 ±  5%      +0.1        0.66 ±  3%  perf-profile.self.cycles-pp.mmap_region
      0.74 ±  3%      +0.1        0.81        perf-profile.self.cycles-pp._raw_spin_lock
      0.08 ± 53%      +0.1        0.15 ± 20%  perf-profile.self.cycles-pp.tick_sched_do_timer
      0.11 ± 24%      +0.1        0.19 ± 29%  perf-profile.self.cycles-pp.ktime_get_update_offsets_now
      0.35 ± 11%      +0.1        0.44 ±  5%  perf-profile.self.cycles-pp.rwsem_down_write_slowpath
      0.44 ±  7%      +0.1        0.58 ±  3%  perf-profile.self.cycles-pp.percpu_counter_add_batch
      1.33 ±  2%      +0.2        1.48 ±  3%  perf-profile.self.cycles-pp.down_write
      1.92 ±  2%      +0.2        2.09 ±  2%  perf-profile.self.cycles-pp.vma_interval_tree_remove
      2.29 ±  2%      +0.2        2.48        perf-profile.self.cycles-pp.native_flush_tlb_one_user
      1.34            +0.2        1.58 ±  2%  perf-profile.self.cycles-pp.kmem_cache_free
      2.90 ±  2%      +0.2        3.14 ±  2%  perf-profile.self.cycles-pp.up_write
      0.00            +0.3        0.26 ±  3%  perf-profile.self.cycles-pp.release_empty_file
      2.89            +0.3        3.20 ±  3%  perf-profile.self.cycles-pp.vma_interval_tree_insert
      2.54 ±  3%      +0.3        2.85 ±  4%  perf-profile.self.cycles-pp.rwsem_spin_on_owner
      5.28            +0.4        5.73 ±  3%  perf-profile.self.cycles-pp.next_uptodate_folio
      3.22            +0.9        4.07        perf-profile.self.cycles-pp.apparmor_file_alloc_security




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2024-01-29  8:05 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-01-29  8:05 [linus:master] [vfs] 93faf426e3: stress-ng.dynlib.ops_per_sec 7.2% improvement kernel test robot

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.