All of lore.kernel.org
 help / color / mirror / Atom feed
* [linus:master] [ext4]  c7fc0366c6:  stress-ng.metamix.ops_per_sec 22.9% improvement
@ 2024-12-06  9:21 kernel test robot
  0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2024-12-06  9:21 UTC (permalink / raw)
  To: Brian Foster
  Cc: oe-lkp, lkp, linux-kernel, Theodore Ts'o, linux-ext4,
	oliver.sang



Hello,

kernel test robot noticed a 22.9% improvement of stress-ng.metamix.ops_per_sec on:


commit: c7fc0366c65628fd69bfc310affec4918199aae2 ("ext4: partial zero eof block on unaligned inode size extension")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master


testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (Ivy Bridge) with 16G memory
parameters:

	nr_threads: 100%
	disk: 1HDD
	testtime: 60s
	fs: ext4
	test: metamix
	cpufreq_governor: performance



Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241206/202412061635.d962a313-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/1HDD/ext4/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-ivb-d01/metamix/stress-ng/60s

commit: 
  25f51ea8ac ("ext4: disambiguate the return value of ext4_dio_write_end_io()")
  c7fc0366c6 ("ext4: partial zero eof block on unaligned inode size extension")

25f51ea8ac8144af c7fc0366c65628fd69bfc310aff 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
    553.38 ±  4%     +12.1%     620.15 ±  2%  uptime.idle
     26348 ±  3%     -40.4%      15711 ±  7%  meminfo.Dirty
     16924 ±  2%     -32.5%      11430 ±  7%  meminfo.Writeback
      6.91 ±  9%    +163.1%      18.17 ±  7%  iostat.cpu.idle
     90.85           -13.0%      79.04        iostat.cpu.iowait
      1.45           +33.6%       1.93        iostat.cpu.system
 1.117e+08 ±  4%     +15.9%  1.295e+08 ±  6%  sched_debug.cfs_rq:/.avg_vruntime.avg
   8990780 ± 48%    +201.7%   27121462 ± 23%  sched_debug.cfs_rq:/.avg_vruntime.min
 1.117e+08 ±  4%     +15.9%  1.295e+08 ±  6%  sched_debug.cfs_rq:/.min_vruntime.avg
   8990780 ± 48%    +201.7%   27121462 ± 23%  sched_debug.cfs_rq:/.min_vruntime.min
      4.41 ± 14%     +11.6       16.01 ±  9%  mpstat.cpu.all.idle%
     93.54           -12.1       81.39        mpstat.cpu.all.iowait%
      0.23 ±  2%      +0.1        0.31 ±  2%  mpstat.cpu.all.irq%
      0.07 ±  2%      +0.0        0.07        mpstat.cpu.all.soft%
      1.13            +0.4        1.54        mpstat.cpu.all.sys%
      6.94 ±  9%    +162.5%      18.21 ±  7%  vmstat.cpu.id
     90.85           -13.0%      79.05        vmstat.cpu.wa
     31576           +20.8%      38129        vmstat.io.bo
     64.71 ±  3%     -33.6%      42.98 ±  8%  vmstat.procs.b
      2893            +2.9%       2976        vmstat.system.in
      1911 ±  2%     +21.8%       2328        stress-ng.metamix.ops
     30.37           +22.9%      37.32 ±  2%  stress-ng.metamix.ops_per_sec
   4111968 ±  2%     +20.4%    4952304        stress-ng.time.file_system_outputs
     24323            +8.5%      26386        stress-ng.time.minor_page_faults
      6.17 ±  6%     +54.1%       9.50 ±  5%  stress-ng.time.percent_of_cpu_this_job_got
     23964 ±  2%     +18.6%      28423        stress-ng.time.voluntary_context_switches
    514496 ±  2%     +20.4%     619656        proc-vmstat.nr_dirtied
      6606 ±  3%     -42.4%       3807 ±  5%  proc-vmstat.nr_dirty
      4338            +3.4%       4486 ±  2%  proc-vmstat.nr_page_table_pages
      4266 ±  2%     -33.9%       2820 ±  7%  proc-vmstat.nr_writeback
    514089 ±  2%     +20.5%     619358        proc-vmstat.nr_written
     10850 ±  3%     -39.1%       6607 ±  6%  proc-vmstat.nr_zone_write_pending
    666231 ±  2%     +15.4%     769072        proc-vmstat.numa_hit
    666231 ±  2%     +15.5%     769184        proc-vmstat.numa_local
    698850 ±  2%     +14.8%     802298        proc-vmstat.pgalloc_normal
    678127 ±  3%     +15.7%     784461 ±  2%  proc-vmstat.pgfree
   2088116 ±  2%     +19.9%    2503923        proc-vmstat.pgpgout
  94023007           +15.2%  1.083e+08        perf-stat.i.branch-instructions
   2405282           +15.0%    2765956        perf-stat.i.branch-misses
     14.62 ±  2%      -1.2       13.38        perf-stat.i.cache-miss-rate%
    583290 ±  2%     +17.5%     685524 ±  2%  perf-stat.i.cache-misses
   3648640 ±  2%     +29.2%    4714050        perf-stat.i.cache-references
 5.461e+08           +24.7%  6.808e+08 ±  2%  perf-stat.i.cpu-cycles
    164.88 ±  2%     +24.0%     204.37        perf-stat.i.cpu-migrations
    833.08            +7.0%     891.37 ±  2%  perf-stat.i.cycles-between-cache-misses
 4.545e+08           +16.6%  5.301e+08        perf-stat.i.instructions
      0.61            -1.1%       0.60        perf-stat.i.ipc
     15.99            -1.4       14.54        perf-stat.overall.cache-miss-rate%
      1.20            +6.9%       1.28        perf-stat.overall.cpi
    936.52 ±  2%      +6.1%     993.24        perf-stat.overall.cycles-between-cache-misses
      0.83            -6.4%       0.78        perf-stat.overall.ipc
  92548187           +15.2%  1.066e+08        perf-stat.ps.branch-instructions
   2367842           +15.0%    2723489        perf-stat.ps.branch-misses
    574185 ±  2%     +17.5%     674749 ±  2%  perf-stat.ps.cache-misses
   3591758 ±  2%     +29.2%    4640021        perf-stat.ps.cache-references
 5.375e+08           +24.7%  6.701e+08 ±  2%  perf-stat.ps.cpu-cycles
    162.31 ±  2%     +23.9%     201.17        perf-stat.ps.cpu-migrations
 4.474e+08           +16.6%  5.218e+08        perf-stat.ps.instructions
 2.874e+10           +15.8%  3.328e+10 ±  2%  perf-stat.total.instructions
      0.02 ±  9%     +25.2%       0.02 ±  6%  perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
      0.48 ± 44%    +216.9%       1.52 ±106%  perf-sched.sch_delay.avg.ms.io_schedule.bit_wait_io.__wait_on_bit.out_of_line_wait_on_bit
      0.00 ± 16%    +148.1%       0.01 ± 76%  perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
      0.02 ± 10%     +26.8%       0.03 ±  8%  perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      7.94 ± 52%     -76.1%       1.90 ±131%  perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
      0.02 ± 55%   +1263.0%       0.21 ±185%  perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
    252.86 ± 13%     -46.6%     134.96 ± 19%  perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
    291.92 ±  4%     -38.6%     179.21 ± 17%  perf-sched.wait_and_delay.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
     45.27 ±  9%     -11.1%      40.23 ±  7%  perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
     26.86 ±  2%     +22.0%      32.76 ±  6%  perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
    292.00 ± 12%     +71.9%     501.83 ± 16%  perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
    323.83 ±  7%     +29.0%     417.83 ± 12%  perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
      2817 ±  2%     -17.3%       2329 ±  8%  perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
    999.12 ±  4%     +71.7%       1715 ± 20%  perf-sched.wait_and_delay.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
    909.77 ± 11%     +88.7%       1716 ± 20%  perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
    970.08 ±  4%     +51.7%       1471 ± 28%  perf-sched.wait_and_delay.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
    252.77 ± 13%     -46.6%     134.90 ± 19%  perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
    291.02 ±  4%     -38.8%     178.11 ± 17%  perf-sched.wait_time.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
     21.25 ±192%   +2117.6%     471.24 ± 90%  perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
     45.22 ±  9%     -11.1%      40.19 ±  7%  perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
     26.84 ±  2%     +22.0%      32.73 ±  6%  perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
    998.61 ±  4%     +71.3%       1710 ± 20%  perf-sched.wait_time.max.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
    908.10 ± 12%     +88.5%       1711 ± 20%  perf-sched.wait_time.max.ms.jbd2_log_wait_commit.__jbd2_journal_force_commit.jbd2_journal_force_commit.ext4_sync_file
    969.71 ±  4%     +51.4%       1468 ± 28%  perf-sched.wait_time.max.ms.jbd2_log_wait_commit.ext4_sync_file.do_fsync.__x64_sys_fdatasync
     28.45 ±194%   +3281.7%     962.20 ± 56%  perf-sched.wait_time.max.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.do_unlinkat
     40.18 ±  3%      -4.3       35.85 ±  2%  perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
     40.18 ±  3%      -4.3       35.85 ±  2%  perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
     44.88 ±  3%      -4.3       40.58        perf-profile.calltrace.cycles-pp.common_startup_64
     40.10 ±  3%      -4.3       35.80 ±  2%  perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
     36.18 ±  4%      -4.2       31.96 ±  2%  perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
     39.12 ±  3%      -4.2       34.91 ±  2%  perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
     39.55 ±  4%      -4.1       35.44        perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
     22.45 ±  4%      -1.7       20.71 ±  4%  perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      3.22 ± 14%      -1.6        1.58 ± 15%  perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
      6.97 ±  7%      -1.6        5.40 ±  5%  perf-profile.calltrace.cycles-pp.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
      6.97 ±  7%      -1.6        5.40 ±  5%  perf-profile.calltrace.cycles-pp.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync
      6.96 ±  7%      -1.6        5.40 ±  5%  perf-profile.calltrace.cycles-pp.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64.entry_SYSCALL_64_after_hwframe
      7.03 ±  7%      -1.6        5.46 ±  6%  perf-profile.calltrace.cycles-pp.fdatasync.stress_metamix
      6.99 ±  7%      -1.6        5.43 ±  6%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
      6.99 ±  7%      -1.6        5.43 ±  6%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.fdatasync.stress_metamix
      6.17 ±  7%      -1.5        4.71 ±  7%  perf-profile.calltrace.cycles-pp.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync.do_syscall_64
      5.84 ±  8%      -1.3        4.51 ±  7%  perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync.__x64_sys_fdatasync
      5.84 ±  8%      -1.3        4.51 ±  7%  perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.do_fsync
      5.83 ±  8%      -1.3        4.51 ±  7%  perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file
      5.82 ±  8%      -1.3        4.51 ±  7%  perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range
      5.82 ±  9%      -1.3        4.50 ±  7%  perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range
      5.87 ±  9%      -0.9        4.99 ±  6%  perf-profile.calltrace.cycles-pp.llseek.stress_metamix
      3.42 ±  6%      -0.6        2.78 ±  5%  perf-profile.calltrace.cycles-pp.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
      3.20 ±  5%      -0.6        2.64 ± 14%  perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      2.42 ±  6%      -0.5        1.96 ± 15%  perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
      2.09 ±  8%      -0.4        1.66 ±  9%  perf-profile.calltrace.cycles-pp.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages.do_writepages
      1.00 ± 16%      -0.4        0.59 ± 45%  perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
      1.79 ±  5%      -0.4        1.39 ± 15%  perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.llseek.stress_metamix
      1.37 ± 16%      -0.3        1.03 ± 11%  perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.llseek.stress_metamix
      1.62 ±  8%      -0.3        1.34 ±  9%  perf-profile.calltrace.cycles-pp.mpage_submit_folio.mpage_map_and_submit_buffers.mpage_map_and_submit_extent.ext4_do_writepages.ext4_writepages
      0.89 ± 16%      -0.2        0.68 ± 17%  perf-profile.calltrace.cycles-pp.sched_balance_rq.sched_balance_domains.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
      0.00            +0.6        0.62 ± 17%  perf-profile.calltrace.cycles-pp.start_this_handle.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +0.8        0.76 ± 13%  perf-profile.calltrace.cycles-pp.mark_buffer_dirty.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write
      0.00            +0.8        0.76 ± 25%  perf-profile.calltrace.cycles-pp.__ext4_get_inode_loc.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end
      0.00            +0.8        0.76 ± 24%  perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2
      0.00            +0.8        0.78 ± 23%  perf-profile.calltrace.cycles-pp.ext4_get_inode_loc.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
      0.00            +0.8        0.78 ± 19%  perf-profile.calltrace.cycles-pp.crc32c_pcl_intel_update.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode
      0.00            +0.8        0.78 ± 23%  perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread
      0.00            +0.8        0.78 ± 23%  perf-profile.calltrace.cycles-pp.ext4_normal_submit_inode_data_buffers.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork
      0.19 ±142%      +0.9        1.10 ± 21%  perf-profile.calltrace.cycles-pp.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
      0.19 ±142%      +0.9        1.10 ± 21%  perf-profile.calltrace.cycles-pp.kjournald2.kthread.ret_from_fork.ret_from_fork_asm
      0.00            +1.0        0.96 ±  9%  perf-profile.calltrace.cycles-pp.jbd2__journal_start.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.00            +1.0        0.97 ± 16%  perf-profile.calltrace.cycles-pp.__block_commit_write.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +1.0        1.03 ± 16%  perf-profile.calltrace.cycles-pp.ext4_reserve_inode_write.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +1.0        1.03 ± 16%  perf-profile.calltrace.cycles-pp.block_write_end.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.00            +1.1        1.08 ±  8%  perf-profile.calltrace.cycles-pp.ext4_inode_csum.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty
      0.00            +1.1        1.08 ± 18%  perf-profile.calltrace.cycles-pp.memset_orig.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write
      0.00            +1.2        1.15 ±  9%  perf-profile.calltrace.cycles-pp.ext4_inode_csum_set.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty
      0.00            +1.2        1.16 ± 19%  perf-profile.calltrace.cycles-pp.zero_user_segments.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +1.7        1.70 ± 11%  perf-profile.calltrace.cycles-pp.ext4_fill_raw_inode.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end
      0.00            +1.7        1.71 ± 10%  perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +1.8        1.78 ±  9%  perf-profile.calltrace.cycles-pp.ext4_zero_partial_blocks.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.00            +1.9        1.88 ± 13%  perf-profile.calltrace.cycles-pp.__ext4_block_zero_page_range.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
      0.00            +2.2        2.16 ±  9%  perf-profile.calltrace.cycles-pp.ext4_do_update_inode.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write
      0.00            +2.3        2.34 ±  8%  perf-profile.calltrace.cycles-pp.ext4_mark_iloc_dirty.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter
      0.00            +3.4        3.42 ±  5%  perf-profile.calltrace.cycles-pp.__ext4_mark_inode_dirty.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write
     43.79 ±  4%      +5.7       49.45        perf-profile.calltrace.cycles-pp.stress_metamix
     13.98 ± 10%      +8.6       22.60 ±  3%  perf-profile.calltrace.cycles-pp.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64
     15.52 ± 10%      +8.7       24.25 ±  3%  perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write.stress_metamix
     15.46 ± 10%      +8.7       24.21 ±  3%  perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
     14.55 ± 10%      +8.8       23.32 ±  2%  perf-profile.calltrace.cycles-pp.ext4_buffered_write_iter.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
     14.97 ± 10%      +8.8       23.79 ±  2%  perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
     15.16 ± 10%      +8.8       24.01 ±  2%  perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write.stress_metamix
     17.88 ± 10%      +8.9       26.79 ±  2%  perf-profile.calltrace.cycles-pp.write.stress_metamix
      0.00           +10.0       10.04 ±  6%  perf-profile.calltrace.cycles-pp.ext4_da_do_write_end.generic_perform_write.ext4_buffered_write_iter.vfs_write.ksys_write
     40.18 ±  3%      -4.3       35.85 ±  2%  perf-profile.children.cycles-pp.start_secondary
     44.88 ±  3%      -4.3       40.58        perf-profile.children.cycles-pp.common_startup_64
     44.88 ±  3%      -4.3       40.58        perf-profile.children.cycles-pp.cpu_startup_entry
     44.87 ±  3%      -4.3       40.57        perf-profile.children.cycles-pp.do_idle
     43.72 ±  3%      -4.2       39.52        perf-profile.children.cycles-pp.cpuidle_idle_call
     40.17 ±  4%      -4.1       36.03        perf-profile.children.cycles-pp.cpuidle_enter_state
     40.30 ±  4%      -4.1       36.18        perf-profile.children.cycles-pp.cpuidle_enter
     22.45 ±  4%      -1.7       20.71 ±  4%  perf-profile.children.cycles-pp.intel_idle
      3.44 ± 14%      -1.7        1.79 ± 16%  perf-profile.children.cycles-pp.poll_idle
      7.44 ±  7%      -1.6        5.84 ±  6%  perf-profile.children.cycles-pp.do_fsync
      7.43 ±  7%      -1.6        5.84 ±  6%  perf-profile.children.cycles-pp.ext4_sync_file
      7.01 ±  7%      -1.6        5.45 ±  6%  perf-profile.children.cycles-pp.__x64_sys_fdatasync
      7.07 ±  7%      -1.5        5.54 ±  6%  perf-profile.children.cycles-pp.fdatasync
      6.20 ±  7%      -1.5        4.74 ±  7%  perf-profile.children.cycles-pp.file_write_and_wait_range
      5.89 ±  8%      -1.3        4.54 ±  7%  perf-profile.children.cycles-pp.do_writepages
      5.87 ±  8%      -1.3        4.54 ±  7%  perf-profile.children.cycles-pp.__filemap_fdatawrite_range
      5.87 ±  8%      -1.3        4.54 ±  7%  perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
      5.86 ±  8%      -1.3        4.54 ±  7%  perf-profile.children.cycles-pp.ext4_writepages
      6.07 ±  9%      -0.9        5.19 ±  6%  perf-profile.children.cycles-pp.llseek
      3.42 ±  6%      -0.6        2.78 ±  6%  perf-profile.children.cycles-pp.mpage_map_and_submit_extent
      3.21 ±  5%      -0.6        2.64 ± 14%  perf-profile.children.cycles-pp.worker_thread
      3.42 ± 10%      -0.5        2.90 ±  9%  perf-profile.children.cycles-pp.handle_softirqs
      2.42 ±  6%      -0.5        1.96 ± 15%  perf-profile.children.cycles-pp.process_one_work
      2.09 ±  8%      -0.4        1.66 ±  9%  perf-profile.children.cycles-pp.mpage_map_and_submit_buffers
      0.34 ± 16%      -0.2        0.13 ± 55%  perf-profile.children.cycles-pp.mpage_process_page_bufs
      0.50 ± 23%      -0.2        0.33 ± 28%  perf-profile.children.cycles-pp.native_irq_return_iret
      0.24 ± 31%      -0.1        0.13 ± 36%  perf-profile.children.cycles-pp.ext4_file_read_iter
      0.25 ± 20%      -0.1        0.15 ±  8%  perf-profile.children.cycles-pp.khugepaged
      0.24 ± 17%      -0.1        0.15 ±  8%  perf-profile.children.cycles-pp.hpage_collapse_scan_pmd
      0.24 ± 17%      -0.1        0.15 ±  8%  perf-profile.children.cycles-pp.khugepaged_scan_mm_slot
      0.23 ± 13%      -0.1        0.15 ± 12%  perf-profile.children.cycles-pp.collapse_huge_page
      0.17 ± 13%      -0.0        0.14 ±  7%  perf-profile.children.cycles-pp.__collapse_huge_page_copy
      0.07 ± 51%      +0.1        0.14 ± 27%  perf-profile.children.cycles-pp.tmigr_quick_check
      0.11 ± 48%      +0.1        0.21 ± 15%  perf-profile.children.cycles-pp.tmigr_requires_handle_remote
      0.00            +0.1        0.14 ± 41%  perf-profile.children.cycles-pp.jbd2_journal_dirty_metadata
      0.12 ± 72%      +0.2        0.27 ± 19%  perf-profile.children.cycles-pp.inode_to_bdi
      0.13 ± 55%      +0.2        0.32 ± 25%  perf-profile.children.cycles-pp.jbd2_journal_get_write_access
      0.01 ±223%      +0.2        0.21 ± 45%  perf-profile.children.cycles-pp.add_transaction_credits
      0.30 ± 42%      +0.2        0.50 ± 14%  perf-profile.children.cycles-pp.__find_get_block
      0.06 ± 75%      +0.2        0.27 ± 20%  perf-profile.children.cycles-pp.folio_mark_accessed
      0.31 ± 41%      +0.2        0.54 ± 13%  perf-profile.children.cycles-pp.bdev_getblk
      0.03 ±145%      +0.2        0.28 ± 27%  perf-profile.children.cycles-pp.__ext4_handle_dirty_metadata
      0.01 ±223%      +0.2        0.26 ± 27%  perf-profile.children.cycles-pp.stop_this_handle
      0.05 ±165%      +0.3        0.31 ± 13%  perf-profile.children.cycles-pp.ext4_inode_journal_mode
      0.00            +0.3        0.28 ± 26%  perf-profile.children.cycles-pp.filemap_dirty_folio
      0.37 ± 15%      +0.3        0.65 ± 23%  perf-profile.children.cycles-pp.__cond_resched
      0.14 ± 52%      +0.3        0.43 ± 18%  perf-profile.children.cycles-pp.__ext4_journal_get_write_access
      0.00            +0.3        0.29 ± 33%  perf-profile.children.cycles-pp.folio_redirty_for_writepage
      0.36 ± 25%      +0.4        0.75 ± 10%  perf-profile.children.cycles-pp.filemap_get_entry
      0.10 ± 80%      +0.4        0.55 ±  9%  perf-profile.children.cycles-pp.jbd2_journal_stop
      0.00            +0.5        0.45 ± 18%  perf-profile.children.cycles-pp.jbd2_journal_file_inode
      0.10 ± 80%      +0.5        0.61 ± 11%  perf-profile.children.cycles-pp.__ext4_journal_stop
      0.22 ± 57%      +0.6        0.86 ± 27%  perf-profile.children.cycles-pp.__ext4_get_inode_loc
      0.22 ± 58%      +0.6        0.87 ± 27%  perf-profile.children.cycles-pp.ext4_get_inode_loc
      0.44 ± 27%      +0.7        1.10 ± 21%  perf-profile.children.cycles-pp.kjournald2
      0.44 ± 28%      +0.7        1.10 ± 21%  perf-profile.children.cycles-pp.jbd2_journal_commit_transaction
      0.05 ±103%      +0.7        0.72 ± 11%  perf-profile.children.cycles-pp.start_this_handle
      0.42 ± 15%      +0.8        1.18 ± 13%  perf-profile.children.cycles-pp.crc32c_pcl_intel_update
      0.00            +0.8        0.78 ± 23%  perf-profile.children.cycles-pp.ext4_normal_submit_inode_data_buffers
      0.28 ± 41%      +1.0        1.23 ± 17%  perf-profile.children.cycles-pp.ext4_reserve_inode_write
      0.08 ± 74%      +1.0        1.04 ±  6%  perf-profile.children.cycles-pp.jbd2__journal_start
      0.26 ± 23%      +1.0        1.23 ±  5%  perf-profile.children.cycles-pp.ext4_inode_csum
      0.28 ± 19%      +1.0        1.32 ±  6%  perf-profile.children.cycles-pp.ext4_inode_csum_set
      3.99 ± 13%      +1.1        5.13 ±  6%  perf-profile.children.cycles-pp.__filemap_get_folio
      3.80 ± 18%      +1.2        4.95 ±  6%  perf-profile.children.cycles-pp.memset_orig
      3.80 ± 18%      +1.4        5.20 ±  5%  perf-profile.children.cycles-pp.zero_user_segments
      0.40 ± 19%      +1.6        1.96 ± 11%  perf-profile.children.cycles-pp.ext4_fill_raw_inode
      0.00            +1.8        1.80 ±  9%  perf-profile.children.cycles-pp.ext4_zero_partial_blocks
      0.55 ± 16%      +2.0        2.55 ±  7%  perf-profile.children.cycles-pp.ext4_do_update_inode
      0.58 ± 19%      +2.2        2.78 ±  6%  perf-profile.children.cycles-pp.ext4_mark_iloc_dirty
      0.85 ± 16%      +3.2        4.03 ±  4%  perf-profile.children.cycles-pp.__ext4_mark_inode_dirty
      0.00            +3.6        3.61 ±  9%  perf-profile.children.cycles-pp.__ext4_block_zero_page_range
     38.33 ±  4%      +5.5       43.88 ±  2%  perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
     38.13 ±  4%      +5.6       43.69 ±  2%  perf-profile.children.cycles-pp.do_syscall_64
     43.79 ±  4%      +5.7       49.45        perf-profile.children.cycles-pp.stress_metamix
     13.98 ± 10%      +8.6       22.61 ±  3%  perf-profile.children.cycles-pp.generic_perform_write
     14.57 ± 10%      +8.8       23.33 ±  2%  perf-profile.children.cycles-pp.ext4_buffered_write_iter
     15.14 ±  9%      +8.8       23.92 ±  2%  perf-profile.children.cycles-pp.vfs_write
     15.35 ±  9%      +8.8       24.15 ±  2%  perf-profile.children.cycles-pp.ksys_write
     18.28 ±  9%      +8.8       27.11 ±  2%  perf-profile.children.cycles-pp.write
      0.00           +10.0       10.04 ±  6%  perf-profile.children.cycles-pp.ext4_da_do_write_end
     22.43 ±  4%      -1.7       20.71 ±  4%  perf-profile.self.cycles-pp.intel_idle
      3.24 ± 15%      -1.6        1.69 ± 18%  perf-profile.self.cycles-pp.poll_idle
      0.34 ± 16%      -0.2        0.13 ± 53%  perf-profile.self.cycles-pp.mpage_process_page_bufs
      0.50 ± 23%      -0.2        0.33 ± 28%  perf-profile.self.cycles-pp.native_irq_return_iret
      0.24 ± 31%      -0.1        0.12 ± 30%  perf-profile.self.cycles-pp.ext4_file_read_iter
      0.01 ±223%      +0.1        0.10 ± 33%  perf-profile.self.cycles-pp.mmap_region
      0.00            +0.1        0.11 ± 34%  perf-profile.self.cycles-pp.jbd2_journal_dirty_metadata
      0.01 ±223%      +0.1        0.15 ± 55%  perf-profile.self.cycles-pp.jbd2_journal_get_write_access
      0.10 ± 85%      +0.1        0.24 ± 20%  perf-profile.self.cycles-pp.inode_to_bdi
      0.00            +0.1        0.14 ± 32%  perf-profile.self.cycles-pp.jbd2__journal_start
      0.00            +0.1        0.14 ± 25%  perf-profile.self.cycles-pp.jbd2_journal_file_inode
      0.18 ± 43%      +0.2        0.34 ± 11%  perf-profile.self.cycles-pp.__cond_resched
      0.05 ±102%      +0.2        0.22 ± 26%  perf-profile.self.cycles-pp.ext4_do_update_inode
      0.07 ± 78%      +0.2        0.24 ± 43%  perf-profile.self.cycles-pp.__ext4_get_inode_loc
      0.06 ± 75%      +0.2        0.24 ± 33%  perf-profile.self.cycles-pp.folio_mark_accessed
      0.00            +0.2        0.18 ± 43%  perf-profile.self.cycles-pp.ext4_da_do_write_end
      0.00            +0.2        0.19 ± 38%  perf-profile.self.cycles-pp.ext4_mark_iloc_dirty
      0.03 ±142%      +0.2        0.22 ± 32%  perf-profile.self.cycles-pp.ext4_inode_csum
      0.00            +0.2        0.20 ± 45%  perf-profile.self.cycles-pp.add_transaction_credits
      0.00            +0.2        0.20 ± 38%  perf-profile.self.cycles-pp.__ext4_block_zero_page_range
      0.00            +0.2        0.21 ± 37%  perf-profile.self.cycles-pp.stop_this_handle
      0.71 ± 19%      +0.2        0.93 ± 15%  perf-profile.self.cycles-pp._raw_spin_lock
      0.10 ± 55%      +0.2        0.34 ±  9%  perf-profile.self.cycles-pp.filemap_get_entry
      0.05 ±165%      +0.2        0.29 ± 12%  perf-profile.self.cycles-pp.ext4_inode_journal_mode
      0.07 ± 50%      +0.3        0.34 ± 32%  perf-profile.self.cycles-pp.zero_user_segments
      0.00            +0.3        0.34 ±  7%  perf-profile.self.cycles-pp.start_this_handle
      0.05 ±103%      +0.5        0.50 ± 24%  perf-profile.self.cycles-pp.ext4_fill_raw_inode
      0.19 ± 19%      +0.7        0.89 ±  9%  perf-profile.self.cycles-pp.crc32c_pcl_intel_update




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2024-12-06  9:22 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-12-06  9:21 [linus:master] [ext4] c7fc0366c6: stress-ng.metamix.ops_per_sec 22.9% improvement kernel test robot

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.