From: kernel test robot <oliver.sang@intel.com>
To: Brian Foster <bfoster@redhat.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
<linux-kernel@vger.kernel.org>, Theodore Ts'o <tytso@mit.edu>,
<linux-ext4@vger.kernel.org>, <oliver.sang@intel.com>
Subject: [linus:master] [ext4] c7fc0366c6: stress-ng.metamix.ops_per_sec 22.9% improvement
Date: Fri, 6 Dec 2024 17:21:53 +0800 [thread overview]
Message-ID: <202412061635.d962a313-lkp@intel.com> (raw)
Hello,
kernel test robot noticed a 22.9% improvement of stress-ng.metamix.ops_per_sec on:
commit: c7fc0366c65628fd69bfc310affec4918199aae2 ("ext4: partial zero eof block on unaligned inode size extension")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (Ivy Bridge) with 16G memory
parameters:
nr_threads: 100%
disk: 1HDD
testtime: 60s
fs: ext4
test: metamix
cpufreq_governor: performance
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241206/202412061635.d962a313-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/1HDD/ext4/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-ivb-d01/metamix/stress-ng/60s
commit:
25f51ea8ac ("ext4: disambiguate the return value of ext4_dio_write_end_io()")
c7fc0366c6 ("ext4: partial zero eof block on unaligned inode size extension")
25f51ea8ac8144af c7fc0366c65628fd69bfc310aff
---------------- ---------------------------
%stddev %change %stddev
\ | \
553.38 ± 4% +12.1% 620.15 ± 2% uptime.idle
26348 ± 3% -40.4% 15711 ± 7% meminfo.Dirty
16924 ± 2% -32.5% 11430 ± 7% meminfo.Writeback
6.91 ± 9% +163.1% 18.17 ± 7% iostat.cpu.idle
90.85 -13.0% 79.04 iostat.cpu.iowait
1.45 +33.6% 1.93 iostat.cpu.system
1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.avg_vruntime.avg
8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.avg_vruntime.min
1.117e+08 ± 4% +15.9% 1.295e+08 ± 6% sched_debug.cfs_rq:/.min_vruntime.avg
8990780 ± 48% +201.7% 27121462 ± 23% sched_debug.cfs_rq:/.min_vruntime.min
4.41 ± 14% +11.6 16.01 ± 9% mpstat.cpu.all.idle%
93.54 -12.1 81.39 mpstat.cpu.all.iowait%
0.23 ± 2% +0.1 0.31 ± 2% mpstat.cpu.all.irq%
0.07 ± 2% +0.0 0.07 mpstat.cpu.all.soft%
1.13 +0.4 1.54 mpstat.cpu.all.sys%
6.94 ± 9% +162.5% 18.21 ± 7% vmstat.cpu.id
90.85 -13.0% 79.05 vmstat.cpu.wa
31576 +20.8% 38129 vmstat.io.bo
64.71 ± 3% -33.6% 42.98 ± 8% vmstat.procs.b
2893 +2.9% 2976 vmstat.system.in
1911 ± 2% +21.8% 2328 stress-ng.metamix.ops
30.37 +22.9% 37.32 ± 2% stress-ng.metamix.ops_per_sec
4111968 ± 2% +20.4% 4952304 stress-ng.time.file_system_outputs
24323 +8.5% 26386 stress-ng.time.minor_page_faults
6.17 ± 6% +54.1% 9.50 ± 5% stress-ng.time.percent_of_cpu_this_job_got
23964 ± 2% +18.6% 28423 stress-ng.time.voluntary_context_switches
514496 ± 2% +20.4% 619656 proc-vmstat.nr_dirtied
6606 ± 3% -42.4% 3807 ± 5% proc-vmstat.nr_dirty
4338 +3.4% 4486 ± 2% proc-vmstat.nr_page_table_pages
4266 ± 2% -33.9% 2820 ± 7% proc-vmstat.nr_writeback
514089 ± 2% +20.5% 619358 proc-vmstat.nr_written
10850 ± 3% -39.1% 6607 ± 6% proc-vmstat.nr_zone_write_pending
666231 ± 2% +15.4% 769072 proc-vmstat.numa_hit
666231 ± 2% +15.5% 769184 proc-vmstat.numa_local
698850 ± 2% +14.8% 802298 proc-vmstat.pgalloc_normal
678127 ± 3% +15.7% 784461 ± 2% proc-vmstat.pgfree
2088116 ± 2% +19.9% 2503923 proc-vmstat.pgpgout
94023007 +15.2% 1.083e+08 perf-stat.i.branch-instructions
2405282 +15.0% 2765956 perf-stat.i.branch-misses
14.62 ± 2% -1.2 13.38 perf-stat.i.cache-miss-rate%
583290 ± 2% +17.5% 685524 ± 2% perf-stat.i.cache-misses
3648640 ± 2% +29.2% 4714050 perf-stat.i.cache-references
5.461e+08 +24.7% 6.808e+08 ± 2% perf-stat.i.cpu-cycles
164.88 ± 2% +24.0% 204.37 perf-stat.i.cpu-migrations
833.08 +7.0% 891.37 ± 2% perf-stat.i.cycles-between-cache-misses
4.545e+08 +16.6% 5.301e+08 perf-stat.i.instructions
0.61 -1.1% 0.60 perf-stat.i.ipc
15.99 -1.4 14.54 perf-stat.overall.cache-miss-rate%
1.20 +6.9% 1.28 perf-stat.overall.cpi
936.52 ± 2% +6.1% 993.24 perf-stat.overall.cycles-between-cache-misses
0.83 -6.4% 0.78 perf-stat.overall.ipc
92548187 +15.2% 1.066e+08 perf-stat.ps.branch-instructions
2367842 +15.0% 2723489 perf-stat.ps.branch-misses
574185 ± 2% +17.5% 674749 ± 2% perf-stat.ps.cache-misses
3591758 ± 2% +29.2% 4640021 perf-stat.ps.cache-references
5.375e+08 +24.7% 6.701e+08 ± 2% perf-stat.ps.cpu-cycles
162.31 ± 2% +23.9% 201.17 perf-stat.ps.cpu-migrations
4.474e+08 +16.6% 5.218e+08 perf-stat.ps.instructions
2.874e+10 +15.8% 3.328e+10 ± 2% perf-stat.total.instructions
0.02 ± 9% +25.2% 0.02 ± 6% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.48 ± 44% +216.9% 1.52 ±106% perf-sched.sch_delay.avg.ms.io_schedule.bit_wait_io.__wait_on_bit.out_of_line_wait_on_bit
0.00 ± 16% +148.1% 0.01 ± 76% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.02 ± 10% +26.8% 0.03 ± 8% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
7.94 ± 52% -76.1% 1.90 ±131% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
0.02 ± 55% +1263.0% 0.21 ±185% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
252.86 ± 13% -46.6% 134.96 ± 19% perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
291.92 ± 4% -38.6% 179.21 ± 17% perf-sched.wait_and_delay.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
45.27 ± 9% -11.1% 40.23 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
26.86 ± 2% +22.0% 32.76 ± 6% perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
292.00 ± 12% +71.9% 501.83 ± 16% perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
323.83 ± 7% +29.0% 417.83 ± 12% perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
2817 ± 2% -17.3% 2329 ± 8% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
999.12 ± 4% +71.7% 1715 ± 20% perf-sched.wait_and_delay.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
909.77 ± 11% +88.7% 1716 ± 20% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
970.08 ± 4% +51.7% 1471 ± 28% perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
252.77 ± 13% -46.6% 134.90 ± 19% perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
291.02 ± 4% -38.8% 178.11 ± 17% perf-sched.wait_time.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
21.25 ±192% +2117.6% 471.24 ± 90% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
45.22 ± 9% -11.1% 40.19 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
26.84 ± 2% +22.0% 32.73 ± 6% perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
998.61 ± 4% +71.3% 1710 ± 20% perf-sched.wait_time.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
908.10 ± 12% +88.5% 1711 ± 20% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
969.71 ± 4% +51.4% 1468 ± 28% perf-sched.wait_time.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
28.45 ±194% +3281.7% 962.20 ± 56% perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
44.88 ± 3% -4.3 40.58 perf-profile.calltrace.cycles-pp.common_startup_64
40.10 ± 3% -4.3 35.80 ± 2% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
36.18 ± 4% -4.2 31.96 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
39.12 ± 3% -4.2 34.91 ± 2% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
39.55 ± 4% -4.1 35.44 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
22.45 ± 4% -1.7 20.71 ± 4% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
3.22 ± 14% -1.6 1.58 ± 15% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.97 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync
6.96 ± 7% -1.6 5.40 ± 5% perf-profile.calltrace.cycles-pp.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.03 ± 7% -1.6 5.46 ± 6% perf-profile.calltrace.cycles-pp.fdatasync.stress_metamix
6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.99 ± 7% -1.6 5.43 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
6.17 ± 7% -1.5 4.71 ± 7% perf-profile.calltrace.cycles-pp.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64
5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync
5.84 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync
5.83 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file
5.82 ± 8% -1.3 4.51 ± 7% perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range
5.82 ± 9% -1.3 4.50 ± 7% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range
5.87 ± 9% -0.9 4.99 ± 6% perf-profile.calltrace.cycles-pp.llseek.stress_metamix
3.42 ± 6% -0.6 2.78 ± 5% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
3.20 ± 5% -0.6 2.64 ± 14% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.42 ± 6% -0.5 1.96 ± 15% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
2.09 ± 8% -0.4 1.66 ± 9% perf-profile.calltrace.cycles-pp.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages
1.00 ± 16% -0.4 0.59 ± 45% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
1.79 ± 5% -0.4 1.39 ± 15% perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.llseek.stress_metamix
1.37 ± 16% -0.3 1.03 ± 11% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.llseek.stress_metamix
1.62 ± 8% -0.3 1.34 ± 9% perf-profile.calltrace.cycles-pp.mpage_submit_folio.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages
0.89 ± 16% -0.2 0.68 ± 17% perf-profile.calltrace.cycles-pp.sched_balance_rq.sched_balance_domains.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
0.00 +0.6 0.62 ± 17% perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +0.8 0.76 ± 13% perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write
0.00 +0.8 0.76 ± 25% perf-profile.calltrace.cycles-pp.__ext4_get_inode_loc.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end
0.00 +0.8 0.76 ± 24% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
0.00 +0.8 0.78 ± 19% perf-profile.calltrace.cycles-pp.crc32c_pcl_intel_update.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread
0.00 +0.8 0.78 ± 23% perf-profile.calltrace.cycles-pp.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork
0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
0.19 ±142% +0.9 1.10 ± 21% perf-profile.calltrace.cycles-pp.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
0.00 +1.0 0.96 ± 9% perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.0 0.97 ± 16% perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.0 1.03 ± 16% perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.1 1.08 ± 8% perf-profile.calltrace.cycles-pp.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty
0.00 +1.1 1.08 ± 18% perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write
0.00 +1.2 1.15 ± 9% perf-profile.calltrace.cycles-pp.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty
0.00 +1.2 1.16 ± 19% perf-profile.calltrace.cycles-pp.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.7 1.70 ± 11% perf-profile.calltrace.cycles-pp.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end
0.00 +1.7 1.71 ± 10% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +1.8 1.78 ± 9% perf-profile.calltrace.cycles-pp.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +1.9 1.88 ± 13% perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
0.00 +2.2 2.16 ± 9% perf-profile.calltrace.cycles-pp.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
0.00 +2.3 2.34 ± 8% perf-profile.calltrace.cycles-pp.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
0.00 +3.4 3.42 ± 5% perf-profile.calltrace.cycles-pp.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
43.79 ± 4% +5.7 49.45 perf-profile.calltrace.cycles-pp.stress_metamix
13.98 ± 10% +8.6 22.60 ± 3% perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64
15.52 ± 10% +8.7 24.25 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write.stress_metamix
15.46 ± 10% +8.7 24.21 ± 3% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
14.55 ± 10% +8.8 23.32 ± 2% perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
14.97 ± 10% +8.8 23.79 ± 2% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
15.16 ± 10% +8.8 24.01 ± 2% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
17.88 ± 10% +8.9 26.79 ± 2% perf-profile.calltrace.cycles-pp.write.stress_metamix
0.00 +10.0 10.04 ± 6% perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
40.18 ± 3% -4.3 35.85 ± 2% perf-profile.children.cycles-pp.start_secondary
44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.common_startup_64
44.88 ± 3% -4.3 40.58 perf-profile.children.cycles-pp.cpu_startup_entry
44.87 ± 3% -4.3 40.57 perf-profile.children.cycles-pp.do_idle
43.72 ± 3% -4.2 39.52 perf-profile.children.cycles-pp.cpuidle_idle_call
40.17 ± 4% -4.1 36.03 perf-profile.children.cycles-pp.cpuidle_enter_state
40.30 ± 4% -4.1 36.18 perf-profile.children.cycles-pp.cpuidle_enter
22.45 ± 4% -1.7 20.71 ± 4% perf-profile.children.cycles-pp.intel_idle
3.44 ± 14% -1.7 1.79 ± 16% perf-profile.children.cycles-pp.poll_idle
7.44 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.do_fsync
7.43 ± 7% -1.6 5.84 ± 6% perf-profile.children.cycles-pp.ext4_sync_file
7.01 ± 7% -1.6 5.45 ± 6% perf-profile.children.cycles-pp.__x64_sys_fdatasync
7.07 ± 7% -1.5 5.54 ± 6% perf-profile.children.cycles-pp.fdatasync
6.20 ± 7% -1.5 4.74 ± 7% perf-profile.children.cycles-pp.file_write_and_wait_range
5.89 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.do_writepages
5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.__filemap_fdatawrite_range
5.87 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
5.86 ± 8% -1.3 4.54 ± 7% perf-profile.children.cycles-pp.ext4_writepages
6.07 ± 9% -0.9 5.19 ± 6% perf-profile.children.cycles-pp.llseek
3.42 ± 6% -0.6 2.78 ± 6% perf-profile.children.cycles-pp.mpage_map_and_submit_extent
3.21 ± 5% -0.6 2.64 ± 14% perf-profile.children.cycles-pp.worker_thread
3.42 ± 10% -0.5 2.90 ± 9% perf-profile.children.cycles-pp.handle_softirqs
2.42 ± 6% -0.5 1.96 ± 15% perf-profile.children.cycles-pp.process_one_work
2.09 ± 8% -0.4 1.66 ± 9% perf-profile.children.cycles-pp.mpage_map_and_submit_buffers
0.34 ± 16% -0.2 0.13 ± 55% perf-profile.children.cycles-pp.mpage_process_page_bufs
0.50 ± 23% -0.2 0.33 ± 28% perf-profile.children.cycles-pp.native_irq_return_iret
0.24 ± 31% -0.1 0.13 ± 36% perf-profile.children.cycles-pp.ext4_file_read_iter
0.25 ± 20% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged
0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.hpage_collapse_scan_pmd
0.24 ± 17% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.khugepaged_scan_mm_slot
0.23 ± 13% -0.1 0.15 ± 12% perf-profile.children.cycles-pp.collapse_huge_page
0.17 ± 13% -0.0 0.14 ± 7% perf-profile.children.cycles-pp.__collapse_huge_page_copy
0.07 ± 51% +0.1 0.14 ± 27% perf-profile.children.cycles-pp.tmigr_quick_check
0.11 ± 48% +0.1 0.21 ± 15% perf-profile.children.cycles-pp.tmigr_requires_handle_remote
0.00 +0.1 0.14 ± 41% perf-profile.children.cycles-pp.jbd2_journal_dirty_metadata
0.12 ± 72% +0.2 0.27 ± 19% perf-profile.children.cycles-pp.inode_to_bdi
0.13 ± 55% +0.2 0.32 ± 25% perf-profile.children.cycles-pp.jbd2_journal_get_write_access
0.01 ±223% +0.2 0.21 ± 45% perf-profile.children.cycles-pp.add_transaction_credits
0.30 ± 42% +0.2 0.50 ± 14% perf-profile.children.cycles-pp.__find_get_block
0.06 ± 75% +0.2 0.27 ± 20% perf-profile.children.cycles-pp.folio_mark_accessed
0.31 ± 41% +0.2 0.54 ± 13% perf-profile.children.cycles-pp.bdev_getblk
0.03 ±145% +0.2 0.28 ± 27% perf-profile.children.cycles-pp.__ext4_handle_dirty_metadata
0.01 ±223% +0.2 0.26 ± 27% perf-profile.children.cycles-pp.stop_this_handle
0.05 ±165% +0.3 0.31 ± 13% perf-profile.children.cycles-pp.ext4_inode_journal_mode
0.00 +0.3 0.28 ± 26% perf-profile.children.cycles-pp.filemap_dirty_folio
0.37 ± 15% +0.3 0.65 ± 23% perf-profile.children.cycles-pp.__cond_resched
0.14 ± 52% +0.3 0.43 ± 18% perf-profile.children.cycles-pp.__ext4_journal_get_write_access
0.00 +0.3 0.29 ± 33% perf-profile.children.cycles-pp.folio_redirty_for_writepage
0.36 ± 25% +0.4 0.75 ± 10% perf-profile.children.cycles-pp.filemap_get_entry
0.10 ± 80% +0.4 0.55 ± 9% perf-profile.children.cycles-pp.jbd2_journal_stop
0.00 +0.5 0.45 ± 18% perf-profile.children.cycles-pp.jbd2_journal_file_inode
0.10 ± 80% +0.5 0.61 ± 11% perf-profile.children.cycles-pp.__ext4_journal_stop
0.22 ± 57% +0.6 0.86 ± 27% perf-profile.children.cycles-pp.__ext4_get_inode_loc
0.22 ± 58% +0.6 0.87 ± 27% perf-profile.children.cycles-pp.ext4_get_inode_loc
0.44 ± 27% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.kjournald2
0.44 ± 28% +0.7 1.10 ± 21% perf-profile.children.cycles-pp.jbd2_journal_commit_transaction
0.05 ±103% +0.7 0.72 ± 11% perf-profile.children.cycles-pp.start_this_handle
0.42 ± 15% +0.8 1.18 ± 13% perf-profile.children.cycles-pp.crc32c_pcl_intel_update
0.00 +0.8 0.78 ± 23% perf-profile.children.cycles-pp.ext4_normal_submit_inode_data_buffers
0.28 ± 41% +1.0 1.23 ± 17% perf-profile.children.cycles-pp.ext4_reserve_inode_write
0.08 ± 74% +1.0 1.04 ± 6% perf-profile.children.cycles-pp.jbd2__journal_start
0.26 ± 23% +1.0 1.23 ± 5% perf-profile.children.cycles-pp.ext4_inode_csum
0.28 ± 19% +1.0 1.32 ± 6% perf-profile.children.cycles-pp.ext4_inode_csum_set
3.99 ± 13% +1.1 5.13 ± 6% perf-profile.children.cycles-pp.__filemap_get_folio
3.80 ± 18% +1.2 4.95 ± 6% perf-profile.children.cycles-pp.memset_orig
3.80 ± 18% +1.4 5.20 ± 5% perf-profile.children.cycles-pp.zero_user_segments
0.40 ± 19% +1.6 1.96 ± 11% perf-profile.children.cycles-pp.ext4_fill_raw_inode
0.00 +1.8 1.80 ± 9% perf-profile.children.cycles-pp.ext4_zero_partial_blocks
0.55 ± 16% +2.0 2.55 ± 7% perf-profile.children.cycles-pp.ext4_do_update_inode
0.58 ± 19% +2.2 2.78 ± 6% perf-profile.children.cycles-pp.ext4_mark_iloc_dirty
0.85 ± 16% +3.2 4.03 ± 4% perf-profile.children.cycles-pp.__ext4_mark_inode_dirty
0.00 +3.6 3.61 ± 9% perf-profile.children.cycles-pp.__ext4_block_zero_page_range
38.33 ± 4% +5.5 43.88 ± 2% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
38.13 ± 4% +5.6 43.69 ± 2% perf-profile.children.cycles-pp.do_syscall_64
43.79 ± 4% +5.7 49.45 perf-profile.children.cycles-pp.stress_metamix
13.98 ± 10% +8.6 22.61 ± 3% perf-profile.children.cycles-pp.generic_perform_write
14.57 ± 10% +8.8 23.33 ± 2% perf-profile.children.cycles-pp.ext4_buffered_write_iter
15.14 ± 9% +8.8 23.92 ± 2% perf-profile.children.cycles-pp.vfs_write
15.35 ± 9% +8.8 24.15 ± 2% perf-profile.children.cycles-pp.ksys_write
18.28 ± 9% +8.8 27.11 ± 2% perf-profile.children.cycles-pp.write
0.00 +10.0 10.04 ± 6% perf-profile.children.cycles-pp.ext4_da_do_write_end
22.43 ± 4% -1.7 20.71 ± 4% perf-profile.self.cycles-pp.intel_idle
3.24 ± 15% -1.6 1.69 ± 18% perf-profile.self.cycles-pp.poll_idle
0.34 ± 16% -0.2 0.13 ± 53% perf-profile.self.cycles-pp.mpage_process_page_bufs
0.50 ± 23% -0.2 0.33 ± 28% perf-profile.self.cycles-pp.native_irq_return_iret
0.24 ± 31% -0.1 0.12 ± 30% perf-profile.self.cycles-pp.ext4_file_read_iter
0.01 ±223% +0.1 0.10 ± 33% perf-profile.self.cycles-pp.mmap_region
0.00 +0.1 0.11 ± 34% perf-profile.self.cycles-pp.jbd2_journal_dirty_metadata
0.01 ±223% +0.1 0.15 ± 55% perf-profile.self.cycles-pp.jbd2_journal_get_write_access
0.10 ± 85% +0.1 0.24 ± 20% perf-profile.self.cycles-pp.inode_to_bdi
0.00 +0.1 0.14 ± 32% perf-profile.self.cycles-pp.jbd2__journal_start
0.00 +0.1 0.14 ± 25% perf-profile.self.cycles-pp.jbd2_journal_file_inode
0.18 ± 43% +0.2 0.34 ± 11% perf-profile.self.cycles-pp.__cond_resched
0.05 ±102% +0.2 0.22 ± 26% perf-profile.self.cycles-pp.ext4_do_update_inode
0.07 ± 78% +0.2 0.24 ± 43% perf-profile.self.cycles-pp.__ext4_get_inode_loc
0.06 ± 75% +0.2 0.24 ± 33% perf-profile.self.cycles-pp.folio_mark_accessed
0.00 +0.2 0.18 ± 43% perf-profile.self.cycles-pp.ext4_da_do_write_end
0.00 +0.2 0.19 ± 38% perf-profile.self.cycles-pp.ext4_mark_iloc_dirty
0.03 ±142% +0.2 0.22 ± 32% perf-profile.self.cycles-pp.ext4_inode_csum
0.00 +0.2 0.20 ± 45% perf-profile.self.cycles-pp.add_transaction_credits
0.00 +0.2 0.20 ± 38% perf-profile.self.cycles-pp.__ext4_block_zero_page_range
0.00 +0.2 0.21 ± 37% perf-profile.self.cycles-pp.stop_this_handle
0.71 ± 19% +0.2 0.93 ± 15% perf-profile.self.cycles-pp._raw_spin_lock
0.10 ± 55% +0.2 0.34 ± 9% perf-profile.self.cycles-pp.filemap_get_entry
0.05 ±165% +0.2 0.29 ± 12% perf-profile.self.cycles-pp.ext4_inode_journal_mode
0.07 ± 50% +0.3 0.34 ± 32% perf-profile.self.cycles-pp.zero_user_segments
0.00 +0.3 0.34 ± 7% perf-profile.self.cycles-pp.start_this_handle
0.05 ±103% +0.5 0.50 ± 24% perf-profile.self.cycles-pp.ext4_fill_raw_inode
0.19 ± 19% +0.7 0.89 ± 9% perf-profile.self.cycles-pp.crc32c_pcl_intel_update
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
reply other threads:[~2024-12-06 9:22 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202412061635.d962a313-lkp@intel.com \
--to=oliver.sang@intel.com \
--cc=bfoster@redhat.com \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=lkp@intel.com \
--cc=oe-lkp@lists.linux.dev \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.