* [daveh-devel:prefault-20250128] [filemap] bc10506d7c: unixbench.throughput 5.1% improvement
@ 2025-02-12 8:13 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2025-02-12 8:13 UTC (permalink / raw)
To: Dave Hansen
Cc: oe-lkp, lkp, Dave Hansen, linux-xfs, linux-fsdevel, oliver.sang
Hello,
kernel test robot noticed a 5.1% improvement of unixbench.throughput on:
commit: bc10506d7c3cec7a236483876d5c717875d3d5aa ("filemap: Move prefaulting out of hot write path")
https://git.kernel.org/cgit/linux/kernel/git/daveh/devel.git prefault-20250128
testcase: unixbench
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:
runtime: 300s
nr_task: 100%
test: fsbuffer-w
cpufreq_governor: performance
In addition to that, the commit also has significant impact on the following tests:
+------------------+--------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_thread_ops 8.8% improvement |
| test machine | 384 threads 2 sockets Intel(R) Xeon(R) 6972P (Granite Rapids) with 128G memory |
| test parameters | cpufreq_governor=performance |
| | mode=thread |
| | nr_task=100% |
| | test=pwrite1 |
+------------------+--------------------------------------------------------------------------------+
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250212/202502121529.d62a409e-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/300s/lkp-icl-2sp9/fsbuffer-w/unixbench
commit:
v6.13
bc10506d7c ("filemap: Move prefaulting out of hot write path")
v6.13 bc10506d7c3cec7a236483876d5
---------------- ---------------------------
%stddev %change %stddev
\ | \
1.33 ± 31% -55.0% 0.60 ± 58% perf-sched.sch_delay.max.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
1360 ± 54% +60.1% 2178 ± 37% perf-sched.wait_and_delay.max.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
1360 ± 54% +60.1% 2178 ± 37% perf-sched.wait_time.max.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
32677306 +5.1% 34349550 unixbench.throughput
1889 +4.5% 1975 unixbench.time.user_time
1.211e+10 +5.1% 1.273e+10 unixbench.workload
4.557e+10 +1.7% 4.635e+10 perf-stat.i.branch-instructions
1.04 +1.0% 1.05 perf-stat.i.ipc
0.94 -1.0% 0.93 perf-stat.overall.cpi
1.06 +1.1% 1.07 perf-stat.overall.ipc
5942 -4.0% 5705 perf-stat.overall.path-length
4.548e+10 +1.7% 4.626e+10 perf-stat.ps.branch-instructions
5.46 -5.5 0.00 perf-profile.calltrace.cycles-pp.fault_in_iov_iter_readable.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
35.65 -3.1 32.55 perf-profile.calltrace.cycles-pp.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
58.13 -2.2 55.93 perf-profile.calltrace.cycles-pp.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
66.56 -1.7 64.87 perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
71.11 -1.5 69.63 perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
77.66 -1.1 76.52 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
79.00 -1.1 77.91 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
98.58 -0.1 98.50 perf-profile.calltrace.cycles-pp.write
0.60 +0.0 0.64 perf-profile.calltrace.cycles-pp.setattr_should_drop_suidgid.file_remove_privs_flags.__generic_file_write_iter.generic_file_write_iter.vfs_write
0.66 +0.0 0.70 perf-profile.calltrace.cycles-pp.__cond_resched.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
0.58 +0.0 0.61 perf-profile.calltrace.cycles-pp.folio_wait_stable.__filemap_get_folio.simple_write_begin.generic_perform_write.generic_file_write_iter
0.61 +0.0 0.65 perf-profile.calltrace.cycles-pp.w_test
0.86 +0.0 0.90 perf-profile.calltrace.cycles-pp.x64_sys_call.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
0.72 +0.0 0.76 perf-profile.calltrace.cycles-pp.__cond_resched.down_write.generic_file_write_iter.vfs_write.ksys_write
0.93 +0.0 0.98 perf-profile.calltrace.cycles-pp.generic_write_check_limits.generic_write_checks.generic_file_write_iter.vfs_write.ksys_write
0.78 +0.0 0.83 perf-profile.calltrace.cycles-pp.__cond_resched.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.76 ± 3% +0.1 0.81 perf-profile.calltrace.cycles-pp.xattr_resolve_name.__vfs_getxattr.cap_inode_need_killpriv.security_inode_need_killpriv.file_remove_privs_flags
0.80 +0.1 0.86 perf-profile.calltrace.cycles-pp.__cond_resched.__filemap_get_folio.simple_write_begin.generic_perform_write.generic_file_write_iter
0.86 +0.1 0.92 perf-profile.calltrace.cycles-pp.folio_mark_dirty.simple_write_end.generic_perform_write.generic_file_write_iter.vfs_write
1.08 +0.1 1.14 perf-profile.calltrace.cycles-pp.rw_verify_area.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.33 ± 2% +0.1 1.40 perf-profile.calltrace.cycles-pp.ktime_get_coarse_real_ts64_mg.current_time.inode_needs_update_time.file_update_time.__generic_file_write_iter
1.47 +0.1 1.55 perf-profile.calltrace.cycles-pp.__fsnotify_parent.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
3.00 +0.1 3.08 perf-profile.calltrace.cycles-pp.__vfs_getxattr.cap_inode_need_killpriv.security_inode_need_killpriv.file_remove_privs_flags.__generic_file_write_iter
2.07 +0.1 2.16 perf-profile.calltrace.cycles-pp.generic_write_checks.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
1.57 +0.1 1.68 perf-profile.calltrace.cycles-pp.folio_unlock.simple_write_end.generic_perform_write.generic_file_write_iter.vfs_write
1.54 +0.1 1.65 perf-profile.calltrace.cycles-pp.up_write.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
1.00 +0.1 1.12 perf-profile.calltrace.cycles-pp.balance_dirty_pages_ratelimited_flags.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
3.67 +0.1 3.79 perf-profile.calltrace.cycles-pp.cap_inode_need_killpriv.security_inode_need_killpriv.file_remove_privs_flags.__generic_file_write_iter.generic_file_write_iter
2.34 +0.1 2.47 perf-profile.calltrace.cycles-pp.down_write.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
3.06 +0.1 3.19 perf-profile.calltrace.cycles-pp.current_time.inode_needs_update_time.file_update_time.__generic_file_write_iter.generic_file_write_iter
4.44 +0.2 4.60 perf-profile.calltrace.cycles-pp.security_inode_need_killpriv.file_remove_privs_flags.__generic_file_write_iter.generic_file_write_iter.vfs_write
3.42 +0.2 3.59 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
4.23 +0.2 4.44 perf-profile.calltrace.cycles-pp.inode_needs_update_time.file_update_time.__generic_file_write_iter.generic_file_write_iter.vfs_write
5.28 +0.2 5.53 perf-profile.calltrace.cycles-pp.file_update_time.__generic_file_write_iter.generic_file_write_iter.vfs_write.ksys_write
7.41 +0.3 7.69 perf-profile.calltrace.cycles-pp.file_remove_privs_flags.__generic_file_write_iter.generic_file_write_iter.vfs_write.ksys_write
4.92 +0.3 5.22 perf-profile.calltrace.cycles-pp.simple_write_end.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
0.69 +0.4 1.06 perf-profile.calltrace.cycles-pp.xas_start.xas_load.filemap_get_entry.__filemap_get_folio.simple_write_begin
7.27 +0.4 7.69 perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
7.44 +0.5 7.90 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.write
9.73 +0.5 10.20 perf-profile.calltrace.cycles-pp.clear_bhb_loop.write
0.00 +0.5 0.52 ± 2% perf-profile.calltrace.cycles-pp.security_file_permission.rw_verify_area.vfs_write.ksys_write.do_syscall_64
14.06 +0.6 14.64 perf-profile.calltrace.cycles-pp.__generic_file_write_iter.generic_file_write_iter.vfs_write.ksys_write.do_syscall_64
2.19 +0.8 2.96 perf-profile.calltrace.cycles-pp.xas_load.filemap_get_entry.__filemap_get_folio.simple_write_begin.generic_perform_write
5.88 +1.1 7.00 perf-profile.calltrace.cycles-pp.filemap_get_entry.__filemap_get_folio.simple_write_begin.generic_perform_write.generic_file_write_iter
11.08 +1.5 12.60 perf-profile.calltrace.cycles-pp.__filemap_get_folio.simple_write_begin.generic_perform_write.generic_file_write_iter.vfs_write
12.05 +1.6 13.63 perf-profile.calltrace.cycles-pp.simple_write_begin.generic_perform_write.generic_file_write_iter.vfs_write.ksys_write
5.67 -5.7 0.00 perf-profile.children.cycles-pp.fault_in_iov_iter_readable
36.46 -3.2 33.30 perf-profile.children.cycles-pp.generic_perform_write
58.61 -2.2 56.44 perf-profile.children.cycles-pp.generic_file_write_iter
67.18 -1.6 65.54 perf-profile.children.cycles-pp.vfs_write
71.43 -1.5 69.98 perf-profile.children.cycles-pp.ksys_write
78.06 -1.1 76.94 perf-profile.children.cycles-pp.do_syscall_64
79.29 -1.1 78.22 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
98.87 -0.0 98.83 perf-profile.children.cycles-pp.write
0.33 ± 2% +0.0 0.35 perf-profile.children.cycles-pp.__x64_sys_write
0.30 +0.0 0.32 perf-profile.children.cycles-pp.is_bad_inode
0.37 ± 2% +0.0 0.38 perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
0.57 +0.0 0.59 perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
0.30 +0.0 0.32 perf-profile.children.cycles-pp.noop_dirty_folio
0.59 +0.0 0.62 perf-profile.children.cycles-pp.security_file_permission
0.71 +0.0 0.74 perf-profile.children.cycles-pp.setattr_should_drop_suidgid
0.34 +0.0 0.38 perf-profile.children.cycles-pp.inode_to_bdi
1.11 +0.0 1.16 perf-profile.children.cycles-pp.w_test
0.96 +0.0 1.00 perf-profile.children.cycles-pp.x64_sys_call
0.77 +0.0 0.82 perf-profile.children.cycles-pp.folio_wait_stable
1.02 +0.1 1.08 perf-profile.children.cycles-pp.generic_write_check_limits
0.99 +0.1 1.04 perf-profile.children.cycles-pp.folio_mapping
1.06 +0.1 1.12 perf-profile.children.cycles-pp.folio_mark_dirty
0.98 ± 2% +0.1 1.05 perf-profile.children.cycles-pp.xattr_resolve_name
1.44 ± 2% +0.1 1.52 perf-profile.children.cycles-pp.ktime_get_coarse_real_ts64_mg
1.27 +0.1 1.35 perf-profile.children.cycles-pp.rw_verify_area
1.57 +0.1 1.66 perf-profile.children.cycles-pp.__fsnotify_parent
1.63 +0.1 1.72 perf-profile.children.cycles-pp.rcu_all_qs
1.67 +0.1 1.78 perf-profile.children.cycles-pp.folio_unlock
1.64 +0.1 1.75 perf-profile.children.cycles-pp.up_write
2.35 +0.1 2.46 perf-profile.children.cycles-pp.generic_write_checks
3.58 +0.1 3.70 perf-profile.children.cycles-pp.__vfs_getxattr
3.93 +0.1 4.06 perf-profile.children.cycles-pp.cap_inode_need_killpriv
1.20 +0.1 1.33 perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
2.53 +0.1 2.68 perf-profile.children.cycles-pp.down_write
3.26 +0.1 3.41 perf-profile.children.cycles-pp.current_time
4.68 +0.2 4.85 perf-profile.children.cycles-pp.security_inode_need_killpriv
3.62 +0.2 3.80 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
3.38 +0.2 3.57 perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
3.74 +0.2 3.96 perf-profile.children.cycles-pp.__cond_resched
4.44 +0.2 4.67 perf-profile.children.cycles-pp.inode_needs_update_time
5.48 +0.3 5.74 perf-profile.children.cycles-pp.file_update_time
4.24 +0.3 4.51 perf-profile.children.cycles-pp.entry_SYSCALL_64
7.71 +0.3 8.01 perf-profile.children.cycles-pp.file_remove_privs_flags
5.22 +0.3 5.54 perf-profile.children.cycles-pp.simple_write_end
0.80 +0.4 1.16 perf-profile.children.cycles-pp.xas_start
7.38 +0.4 7.82 perf-profile.children.cycles-pp.copy_page_from_iter_atomic
9.82 +0.5 10.30 perf-profile.children.cycles-pp.clear_bhb_loop
14.35 +0.6 14.96 perf-profile.children.cycles-pp.__generic_file_write_iter
2.48 +0.8 3.27 perf-profile.children.cycles-pp.xas_load
6.08 +1.2 7.23 perf-profile.children.cycles-pp.filemap_get_entry
11.46 +1.6 13.01 perf-profile.children.cycles-pp.__filemap_get_folio
12.25 +1.6 13.84 perf-profile.children.cycles-pp.simple_write_begin
4.36 -0.1 4.26 perf-profile.self.cycles-pp.generic_perform_write
0.20 ± 2% +0.0 0.22 ± 2% perf-profile.self.cycles-pp.noop_dirty_folio
0.29 +0.0 0.30 perf-profile.self.cycles-pp.folio_wait_stable
0.36 +0.0 0.38 perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
0.49 +0.0 0.52 perf-profile.self.cycles-pp.security_file_permission
0.76 +0.0 0.79 perf-profile.self.cycles-pp.security_inode_need_killpriv
0.24 +0.0 0.27 perf-profile.self.cycles-pp.inode_to_bdi
0.56 +0.0 0.59 perf-profile.self.cycles-pp.folio_mark_dirty
0.61 +0.0 0.64 perf-profile.self.cycles-pp.setattr_should_drop_suidgid
0.84 +0.0 0.88 perf-profile.self.cycles-pp.x64_sys_call
0.84 +0.0 0.88 perf-profile.self.cycles-pp.generic_write_check_limits
0.68 +0.0 0.72 perf-profile.self.cycles-pp.rw_verify_area
0.79 +0.0 0.84 perf-profile.self.cycles-pp.folio_mapping
0.59 ± 3% +0.0 0.64 ± 2% perf-profile.self.cycles-pp.xattr_resolve_name
1.59 +0.0 1.64 perf-profile.self.cycles-pp.generic_file_write_iter
0.78 +0.1 0.83 perf-profile.self.cycles-pp.simple_write_begin
1.16 +0.1 1.21 perf-profile.self.cycles-pp.__generic_file_write_iter
1.22 +0.1 1.27 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
1.03 +0.1 1.10 perf-profile.self.cycles-pp.entry_SYSCALL_64
1.44 +0.1 1.51 perf-profile.self.cycles-pp.__vfs_getxattr
1.41 +0.1 1.48 perf-profile.self.cycles-pp.generic_write_checks
1.33 ± 2% +0.1 1.40 perf-profile.self.cycles-pp.ktime_get_coarse_real_ts64_mg
1.24 +0.1 1.30 perf-profile.self.cycles-pp.rcu_all_qs
1.81 +0.1 1.88 perf-profile.self.cycles-pp.current_time
1.49 +0.1 1.56 perf-profile.self.cycles-pp.ksys_write
1.46 +0.1 1.54 perf-profile.self.cycles-pp.__fsnotify_parent
1.18 +0.1 1.26 perf-profile.self.cycles-pp.inode_needs_update_time
1.61 +0.1 1.70 perf-profile.self.cycles-pp.down_write
0.85 +0.1 0.95 perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags
1.55 +0.1 1.65 perf-profile.self.cycles-pp.folio_unlock
1.54 +0.1 1.64 perf-profile.self.cycles-pp.up_write
1.89 +0.1 2.00 perf-profile.self.cycles-pp.do_syscall_64
2.42 +0.1 2.52 perf-profile.self.cycles-pp.file_remove_privs_flags
2.10 +0.1 2.24 perf-profile.self.cycles-pp.__cond_resched
2.38 +0.2 2.53 perf-profile.self.cycles-pp.simple_write_end
3.05 +0.2 3.21 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
3.26 +0.2 3.45 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
4.02 +0.2 4.27 perf-profile.self.cycles-pp.write
3.78 +0.3 4.07 perf-profile.self.cycles-pp.__filemap_get_folio
5.10 +0.3 5.43 perf-profile.self.cycles-pp.vfs_write
0.60 ± 2% +0.4 0.96 perf-profile.self.cycles-pp.xas_start
3.58 ± 2% +0.4 3.96 ± 2% perf-profile.self.cycles-pp.filemap_get_entry
1.77 +0.4 2.17 perf-profile.self.cycles-pp.xas_load
7.16 +0.4 7.58 perf-profile.self.cycles-pp.copy_page_from_iter_atomic
9.72 +0.5 10.20 perf-profile.self.cycles-pp.clear_bhb_loop
***************************************************************************************************
lkp-gnr-2ap2: 384 threads 2 sockets Intel(R) Xeon(R) 6972P (Granite Rapids) with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-9.4/thread/100%/debian-12-x86_64-20240206.cgz/lkp-gnr-2ap2/pwrite1/will-it-scale
commit:
v6.13
bc10506d7c ("filemap: Move prefaulting out of hot write path")
v6.13 bc10506d7c3cec7a236483876d5
---------------- ---------------------------
%stddev %change %stddev
\ | \
14.26 +1.5 15.81 mpstat.cpu.all.usr%
3452706 ± 5% -9.7% 3117051 ± 8% numa-meminfo.node1.Active
3452706 ± 5% -9.7% 3117051 ± 8% numa-meminfo.node1.Active(anon)
863593 ± 5% -9.7% 779394 ± 8% numa-vmstat.node1.nr_active_anon
863592 ± 5% -9.7% 779394 ± 8% numa-vmstat.node1.nr_zone_active_anon
1.112e+09 +8.8% 1.209e+09 will-it-scale.384.threads
2895332 +8.8% 3149461 will-it-scale.per_thread_ops
1.112e+09 +8.8% 1.209e+09 will-it-scale.workload
966386 -3.5% 932439 proc-vmstat.nr_active_anon
1575486 -2.6% 1534047 proc-vmstat.nr_file_pages
702352 ± 2% -5.9% 660903 proc-vmstat.nr_shmem
966386 -3.5% 932439 proc-vmstat.nr_zone_active_anon
0.25 ± 53% -45.1% 0.14 ± 9% perf-sched.sch_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
8.04 ± 38% -42.3% 4.64 ± 16% perf-sched.sch_delay.max.ms.__cond_resched.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
0.13 ± 45% -38.8% 0.08 ± 2% perf-sched.total_sch_delay.average.ms
0.50 ± 51% -45.5% 0.27 ± 9% perf-sched.wait_and_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
839.33 ± 5% +11.6% 936.83 ± 3% perf-sched.wait_and_delay.count.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
873.83 ± 6% -18.4% 713.17 ± 6% perf-sched.wait_and_delay.count.__cond_resched.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
16.08 ± 38% -42.3% 9.28 ± 16% perf-sched.wait_and_delay.max.ms.__cond_resched.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
0.25 ± 49% -45.9% 0.14 ± 10% perf-sched.wait_time.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
8.04 ± 38% -42.3% 4.64 ± 16% perf-sched.wait_time.max.ms.__cond_resched.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
3.505e+11 +3.6% 3.632e+11 perf-stat.i.branch-instructions
0.02 -0.0 0.02 ± 2% perf-stat.i.branch-miss-rate%
0.74 -4.3% 0.71 perf-stat.i.cpi
1.266e+12 -1.3% 1.25e+12 perf-stat.i.cpu-cycles
1.719e+12 +3.2% 1.775e+12 perf-stat.i.instructions
1.36 +4.6% 1.42 perf-stat.i.ipc
0.00 ± 8% -0.0 0.00 ± 12% perf-stat.overall.branch-miss-rate%
0.74 -4.4% 0.70 perf-stat.overall.cpi
1.36 +4.6% 1.42 perf-stat.overall.ipc
466638 -5.1% 442782 perf-stat.overall.path-length
3.493e+11 +3.6% 3.619e+11 perf-stat.ps.branch-instructions
1.261e+12 -1.3% 1.245e+12 perf-stat.ps.cpu-cycles
1.713e+12 +3.2% 1.769e+12 perf-stat.ps.instructions
5.188e+14 +3.2% 5.355e+14 perf-stat.total.instructions
45.78 -5.2 40.57 perf-profile.calltrace.cycles-pp.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
59.00 -4.3 54.71 perf-profile.calltrace.cycles-pp.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
67.90 -3.5 64.37 perf-profile.calltrace.cycles-pp.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
75.33 -2.8 72.55 perf-profile.calltrace.cycles-pp.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
11.38 -2.3 9.10 ± 2% perf-profile.calltrace.cycles-pp.shmem_write_begin.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
81.99 -2.0 79.97 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
10.15 -1.9 8.22 ± 2% perf-profile.calltrace.cycles-pp.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter.vfs_write
83.41 -1.8 81.63 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_pwrite
4.16 -1.2 2.98 ± 2% perf-profile.calltrace.cycles-pp.filemap_get_entry.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
94.89 -0.5 94.38 perf-profile.calltrace.cycles-pp.__libc_pwrite
1.66 ± 2% -0.3 1.38 ± 5% perf-profile.calltrace.cycles-pp.folio_mark_accessed.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
0.81 -0.2 0.59 ± 3% perf-profile.calltrace.cycles-pp.xas_load.filemap_get_entry.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write
0.60 -0.1 0.52 perf-profile.calltrace.cycles-pp.file_remove_privs_flags.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
0.71 ± 2% +0.1 0.77 ± 3% perf-profile.calltrace.cycles-pp.__cond_resched.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.63 ± 2% +0.1 0.71 ± 2% perf-profile.calltrace.cycles-pp.generic_write_check_limits.generic_write_checks.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
0.78 ± 8% +0.1 0.90 ± 4% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.__libc_pwrite
0.70 ± 4% +0.1 0.83 ± 8% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_safe_stack.__libc_pwrite
1.44 +0.1 1.58 ± 3% perf-profile.calltrace.cycles-pp.folio_mark_dirty.shmem_write_end.generic_perform_write.shmem_file_write_iter.vfs_write
1.71 +0.2 1.88 perf-profile.calltrace.cycles-pp.fput.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
2.29 ± 3% +0.2 2.46 ± 2% perf-profile.calltrace.cycles-pp.current_time.inode_needs_update_time.file_update_time.shmem_file_write_iter.vfs_write
3.78 ± 2% +0.2 4.00 ± 2% perf-profile.calltrace.cycles-pp.file_update_time.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
1.58 ± 2% +0.2 1.79 ± 3% perf-profile.calltrace.cycles-pp.generic_write_checks.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
3.34 +0.2 3.58 perf-profile.calltrace.cycles-pp.down_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
3.16 ± 2% +0.2 3.40 ± 2% perf-profile.calltrace.cycles-pp.inode_needs_update_time.file_update_time.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
3.78 ± 2% +0.2 4.03 ± 3% perf-profile.calltrace.cycles-pp.fdget.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
2.40 +0.3 2.71 ± 2% perf-profile.calltrace.cycles-pp.folio_unlock.shmem_write_end.generic_perform_write.shmem_file_write_iter.vfs_write
1.67 ± 3% +0.4 2.06 ± 4% perf-profile.calltrace.cycles-pp.__fsnotify_parent.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.94 +0.4 2.36 perf-profile.calltrace.cycles-pp.up_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
0.00 +0.5 0.54 ± 3% perf-profile.calltrace.cycles-pp.testcase
3.36 +0.6 3.97 perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
7.16 +0.8 7.98 ± 2% perf-profile.calltrace.cycles-pp.shmem_write_end.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
16.72 +1.0 17.71 ± 2% perf-profile.calltrace.cycles-pp.copy_page_from_iter_atomic.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
14.98 +1.6 16.57 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__libc_pwrite
46.84 -5.4 41.44 perf-profile.children.cycles-pp.generic_perform_write
60.00 -4.2 55.76 perf-profile.children.cycles-pp.shmem_file_write_iter
68.74 -3.4 65.33 perf-profile.children.cycles-pp.vfs_write
75.77 -2.7 73.02 perf-profile.children.cycles-pp.__x64_sys_pwrite64
11.62 -2.3 9.35 ± 2% perf-profile.children.cycles-pp.shmem_write_begin
82.48 -2.0 80.46 perf-profile.children.cycles-pp.do_syscall_64
10.63 -2.0 8.62 ± 2% perf-profile.children.cycles-pp.shmem_get_folio_gfp
83.72 -1.8 81.92 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
4.37 -1.2 3.15 ± 2% perf-profile.children.cycles-pp.filemap_get_entry
1.18 -0.3 0.87 perf-profile.children.cycles-pp.xas_load
1.77 ± 2% -0.3 1.50 ± 4% perf-profile.children.cycles-pp.folio_mark_accessed
0.79 -0.2 0.58 ± 2% perf-profile.children.cycles-pp.xas_start
0.24 ± 4% -0.1 0.14 ± 3% perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited
0.66 -0.1 0.59 perf-profile.children.cycles-pp.file_remove_privs_flags
0.20 ± 2% -0.0 0.17 ± 2% perf-profile.children.cycles-pp.ksys_write
0.20 ± 2% -0.0 0.18 ± 2% perf-profile.children.cycles-pp.record__pushfn
0.20 ± 2% -0.0 0.18 ± 2% perf-profile.children.cycles-pp.write
0.20 ± 2% -0.0 0.18 ± 2% perf-profile.children.cycles-pp.writen
0.05 ± 7% +0.0 0.07 perf-profile.children.cycles-pp.perf_mmap__read_head
0.05 +0.0 0.07 perf-profile.children.cycles-pp.ring_buffer_read_head
0.44 +0.0 0.48 ± 3% perf-profile.children.cycles-pp.noop_dirty_folio
0.49 ± 3% +0.0 0.53 perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
0.65 ± 3% +0.1 0.73 ± 3% perf-profile.children.cycles-pp.testcase
0.72 +0.1 0.81 perf-profile.children.cycles-pp.generic_write_check_limits
0.85 ± 8% +0.1 0.97 ± 4% perf-profile.children.cycles-pp.syscall_return_via_sysret
0.71 ± 3% +0.1 0.85 ± 8% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
1.72 +0.2 1.89 ± 3% perf-profile.children.cycles-pp.folio_mark_dirty
1.78 +0.2 1.95 perf-profile.children.cycles-pp.fput
2.46 ± 3% +0.2 2.64 ± 2% perf-profile.children.cycles-pp.current_time
3.34 ± 2% +0.2 3.57 ± 2% perf-profile.children.cycles-pp.inode_needs_update_time
3.92 ± 2% +0.2 4.16 ± 3% perf-profile.children.cycles-pp.fdget
1.80 ± 2% +0.2 2.04 ± 2% perf-profile.children.cycles-pp.generic_write_checks
3.53 +0.2 3.78 perf-profile.children.cycles-pp.down_write
2.50 +0.3 2.82 ± 2% perf-profile.children.cycles-pp.folio_unlock
3.43 +0.4 3.81 ± 2% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
1.79 ± 3% +0.4 2.19 ± 4% perf-profile.children.cycles-pp.__fsnotify_parent
2.01 +0.4 2.44 perf-profile.children.cycles-pp.up_write
3.51 +0.6 4.14 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
7.58 +0.9 8.43 ± 2% perf-profile.children.cycles-pp.shmem_write_end
8.38 +0.9 9.26 perf-profile.children.cycles-pp.entry_SYSCALL_64
16.96 +1.0 17.97 ± 2% perf-profile.children.cycles-pp.copy_page_from_iter_atomic
3.15 -0.9 2.24 perf-profile.self.cycles-pp.filemap_get_entry
3.72 -0.6 3.12 perf-profile.self.cycles-pp.shmem_get_folio_gfp
1.03 -0.2 0.81 ± 2% perf-profile.self.cycles-pp.shmem_write_begin
1.53 ± 2% -0.2 1.34 ± 3% perf-profile.self.cycles-pp.folio_mark_accessed
0.56 -0.2 0.39 ± 3% perf-profile.self.cycles-pp.xas_start
0.50 ± 4% -0.1 0.37 ± 2% perf-profile.self.cycles-pp.xas_load
0.58 -0.1 0.51 ± 2% perf-profile.self.cycles-pp.file_remove_privs_flags
0.05 +0.0 0.07 perf-profile.self.cycles-pp.ring_buffer_read_head
0.40 ± 4% +0.0 0.44 perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
0.52 ± 3% +0.1 0.58 ± 4% perf-profile.self.cycles-pp.testcase
0.56 +0.1 0.64 perf-profile.self.cycles-pp.generic_write_check_limits
0.78 +0.1 0.86 ± 2% perf-profile.self.cycles-pp.folio_mark_dirty
0.94 ± 3% +0.1 1.02 ± 4% perf-profile.self.cycles-pp.rw_verify_area
0.85 ± 9% +0.1 0.97 ± 4% perf-profile.self.cycles-pp.syscall_return_via_sysret
1.06 ± 2% +0.1 1.19 ± 3% perf-profile.self.cycles-pp.generic_write_checks
0.68 ± 4% +0.1 0.82 ± 8% perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
1.63 +0.2 1.80 perf-profile.self.cycles-pp.fput
1.63 ± 3% +0.2 1.81 ± 3% perf-profile.self.cycles-pp.shmem_file_write_iter
1.75 +0.2 1.97 ± 2% perf-profile.self.cycles-pp.__x64_sys_pwrite64
1.32 ± 4% +0.2 1.55 ± 4% perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
3.69 ± 2% +0.2 3.93 ± 3% perf-profile.self.cycles-pp.fdget
2.35 ± 2% +0.3 2.64 ± 2% perf-profile.self.cycles-pp.down_write
2.30 +0.3 2.59 ± 3% perf-profile.self.cycles-pp.folio_unlock
1.59 ± 3% +0.3 1.88 ± 3% perf-profile.self.cycles-pp.__fsnotify_parent
3.17 ± 2% +0.4 3.54 ± 3% perf-profile.self.cycles-pp.shmem_write_end
3.35 ± 2% +0.4 3.74 ± 3% perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
1.84 +0.4 2.25 perf-profile.self.cycles-pp.up_write
4.57 +0.4 4.99 ± 2% perf-profile.self.cycles-pp.vfs_write
3.04 +0.6 3.60 perf-profile.self.cycles-pp.syscall_exit_to_user_mode
8.42 +0.9 9.29 perf-profile.self.cycles-pp.__libc_pwrite
16.42 +1.0 17.45 perf-profile.self.cycles-pp.copy_page_from_iter_atomic
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2025-02-12 8:14 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-02-12 8:13 [daveh-devel:prefault-20250128] [filemap] bc10506d7c: unixbench.throughput 5.1% improvement kernel test robot
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox