* [brauner-vfs:vfs.misc.backing_file] [fs/pipe] 8114dc703a: stress-ng.pipeherd.ops_per_sec 70.9% improvement
@ 2023-10-20 14:16 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2023-10-20 14:16 UTC (permalink / raw)
To: Max Kellermann
Cc: oe-lkp, lkp, Christian Brauner, Christian Brauner, David Howells,
linux-fsdevel, ying.huang, feng.tang, fengwei.yin, oliver.sang
Hello,
kernel test robot noticed a 70.9% improvement of stress-ng.pipeherd.ops_per_sec on:
commit: 8114dc703a4833be4a98a37f5ed0a3abb55dcb34 ("fs/pipe: use spinlock in pipe_read() only if there is a watch_queue")
https://git.kernel.org/cgit/linux/kernel/git/vfs/vfs.git vfs.misc.backing_file
testcase: stress-ng
test machine: 36 threads 1 sockets Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz (Cascade Lake) with 128G memory
parameters:
nr_threads: 1
testtime: 60s
class: pipe
test: pipeherd
cpufreq_governor: performance
In addition to that, the commit also has significant impact on the following tests:
+------------------+-------------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.pipeherd.ops_per_sec 64.1% improvement |
| test machine | 36 threads 1 sockets Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz (Cascade Lake) with 128G memory |
| test parameters | class=memory |
| | cpufreq_governor=performance |
| | nr_threads=1 |
| | test=pipeherd |
| | testtime=60s |
+------------------+-------------------------------------------------------------------------------------------------+
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20231020/202310202136.48fa5db-oliver.sang@intel.com
=========================================================================================
class/compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
pipe/gcc-12/performance/x86_64-rhel-8.3/1/debian-11.1-x86_64-20220510.cgz/lkp-csl-d02/pipeherd/stress-ng/60s
commit:
cc03a5d65a ("fs/pipe: remove unnecessary spinlock from pipe_write()")
8114dc703a ("fs/pipe: use spinlock in pipe_read() only if there is a watch_queue")
cc03a5d65a4032f8 8114dc703a4833be4a98a37f5ed
---------------- ---------------------------
%stddev %change %stddev
\ | \
2662 -68.4% 841.59 uptime.idle
1.96e+09 -92.7% 1.427e+08 ± 6% cpuidle..time
26722190 ± 2% -99.2% 211047 ± 3% cpuidle..usage
86.45 -81.2 5.26 ± 13% mpstat.cpu.all.idle%
0.02 ± 9% -0.0 0.00 ± 34% mpstat.cpu.all.soft%
12.05 ± 3% +81.1 93.19 mpstat.cpu.all.sys%
86.80 -90.5% 8.29 ± 7% vmstat.cpu.id
12.32 ± 2% +637.7% 90.90 vmstat.cpu.sy
3777392 -14.4% 3235329 vmstat.memory.cache
4.88 ± 3% +1753.7% 90.47 vmstat.procs.r
759821 ± 3% -76.8% 176114 ± 8% vmstat.system.cs
57282 +58.2% 90632 vmstat.system.in
1.00 -73.0% 0.27 ± 19% stress-ng.pipeherd.context_switches_per_bogo_op
399152 ± 2% -54.8% 180363 ± 8% stress-ng.pipeherd.context_switches_per_sec
23934072 ± 2% +70.9% 40914905 ± 10% stress-ng.pipeherd.ops
398822 ± 2% +70.9% 681769 ± 10% stress-ng.pipeherd.ops_per_sec
14105 ± 3% +15176.6% 2154810 stress-ng.time.involuntary_context_switches
448.83 ± 3% +665.7% 3436 stress-ng.time.percent_of_cpu_this_job_got
275.60 ± 3% +671.8% 2127 stress-ng.time.system_time
23935206 ± 2% -63.8% 8667217 ± 11% stress-ng.time.voluntary_context_switches
804558 ± 2% -55.3% 359243 ± 2% meminfo.Active
804410 ± 2% -55.4% 359101 ± 2% meminfo.Active(anon)
47565 ± 20% -22.3% 36948 ± 11% meminfo.AnonHugePages
3690350 -14.5% 3154124 meminfo.Cached
2366174 -22.8% 1827058 meminfo.Committed_AS
450238 -21.1% 355264 meminfo.Inactive
450058 -21.1% 355078 meminfo.Inactive(anon)
117976 -38.6% 72400 meminfo.Mapped
4473104 -12.4% 3919536 meminfo.Memused
944222 ± 2% -56.8% 407996 ± 2% meminfo.Shmem
4641483 -12.3% 4068866 meminfo.max_used_kB
616.50 +478.8% 3568 turbostat.Avg_MHz
16.27 +77.7 93.93 turbostat.Busy%
3905828 ± 7% -99.8% 9270 ± 27% turbostat.C1
8.52 ± 6% -8.5 0.02 ± 17% turbostat.C1%
22605226 -99.7% 61106 ± 4% turbostat.C1E
70.38 -70.2 0.22 ± 6% turbostat.C1E%
82.93 -93.8% 5.18 ± 3% turbostat.CPU%c1
0.11 -72.7% 0.03 turbostat.IPC
3750696 +58.4% 5939978 turbostat.IRQ
68893 ± 2% -95.1% 3352 ± 5% turbostat.POLL
0.04 -0.0 0.00 turbostat.POLL%
41.00 +5.7% 43.33 ± 2% turbostat.PkgTmp
97.64 +48.3% 144.76 turbostat.PkgWatt
10.24 -1.4% 10.10 turbostat.RAMWatt
201103 ± 2% -55.4% 89775 ± 2% proc-vmstat.nr_active_anon
77504 -1.4% 76431 proc-vmstat.nr_anon_pages
922608 -14.5% 788573 proc-vmstat.nr_file_pages
112532 -21.1% 88805 proc-vmstat.nr_inactive_anon
29507 -38.5% 18149 proc-vmstat.nr_mapped
236070 ± 2% -56.8% 102035 ± 2% proc-vmstat.nr_shmem
19797 -1.7% 19458 proc-vmstat.nr_slab_reclaimable
201103 ± 2% -55.4% 89775 ± 2% proc-vmstat.nr_zone_active_anon
112532 -21.1% 88805 proc-vmstat.nr_zone_inactive_anon
579999 -32.9% 389134 proc-vmstat.numa_hit
583256 ± 2% -33.3% 389117 proc-vmstat.numa_local
303598 ± 2% -53.8% 140238 ± 2% proc-vmstat.pgactivate
610962 -31.6% 417639 proc-vmstat.pgalloc_normal
272733 -3.4% 263482 proc-vmstat.pgfault
256096 -3.6% 246831 proc-vmstat.pgfree
8409 -4.5% 8033 proc-vmstat.pgreuse
23449 ± 4% +4249.9% 1020036 sched_debug.cfs_rq:/.avg_vruntime.avg
48864 ± 5% +2132.2% 1090754 sched_debug.cfs_rq:/.avg_vruntime.max
18796 ± 5% +5228.8% 1001644 sched_debug.cfs_rq:/.avg_vruntime.min
6530 ± 5% +187.4% 18766 ± 15% sched_debug.cfs_rq:/.avg_vruntime.stddev
0.29 ± 4% +465.6% 1.64 ± 3% sched_debug.cfs_rq:/.h_nr_running.avg
1.17 ± 20% +157.1% 3.00 ± 19% sched_debug.cfs_rq:/.h_nr_running.max
0.44 ± 3% +45.9% 0.65 ± 6% sched_debug.cfs_rq:/.h_nr_running.stddev
0.83 ± 28% +820.0% 7.67 ± 36% sched_debug.cfs_rq:/.load_avg.min
23449 ± 4% +4249.9% 1020036 sched_debug.cfs_rq:/.min_vruntime.avg
48864 ± 5% +2132.2% 1090754 sched_debug.cfs_rq:/.min_vruntime.max
18796 ± 5% +5228.8% 1001644 sched_debug.cfs_rq:/.min_vruntime.min
6530 ± 5% +187.4% 18766 ± 15% sched_debug.cfs_rq:/.min_vruntime.stddev
0.29 ± 3% +147.6% 0.71 ± 2% sched_debug.cfs_rq:/.nr_running.avg
0.44 ± 3% -31.3% 0.30 ± 14% sched_debug.cfs_rq:/.nr_running.stddev
398.86 ± 4% +326.4% 1700 ± 3% sched_debug.cfs_rq:/.runnable_avg.avg
993.17 ± 10% +186.3% 2843 ± 4% sched_debug.cfs_rq:/.runnable_avg.max
17.08 ± 32% +3470.7% 610.00 ± 39% sched_debug.cfs_rq:/.runnable_avg.min
224.04 ± 7% +131.7% 519.15 ± 10% sched_debug.cfs_rq:/.runnable_avg.stddev
396.99 ± 4% +107.1% 822.14 ± 2% sched_debug.cfs_rq:/.util_avg.avg
990.17 ± 10% +57.1% 1556 ± 9% sched_debug.cfs_rq:/.util_avg.max
17.00 ± 33% +1357.8% 247.83 ± 21% sched_debug.cfs_rq:/.util_avg.min
223.09 ± 8% +38.0% 307.92 ± 9% sched_debug.cfs_rq:/.util_avg.stddev
42.74 ± 5% +1003.6% 471.69 ± 3% sched_debug.cfs_rq:/.util_est_enqueued.avg
411.92 ± 15% +135.5% 970.00 ± 14% sched_debug.cfs_rq:/.util_est_enqueued.max
107.34 ± 8% +77.5% 190.57 ± 5% sched_debug.cfs_rq:/.util_est_enqueued.stddev
16850 ± 12% -51.9% 8099 ± 25% sched_debug.cpu.avg_idle.min
0.78 ± 7% +148.6% 1.93 ± 28% sched_debug.cpu.clock.stddev
986.99 ± 11% +145.8% 2426 sched_debug.cpu.curr->pid.avg
1503 ± 4% -42.4% 865.59 ± 13% sched_debug.cpu.curr->pid.stddev
0.31 ± 10% +436.1% 1.65 ± 4% sched_debug.cpu.nr_running.avg
1.17 ± 20% +157.1% 3.00 ± 19% sched_debug.cpu.nr_running.max
0.45 ± 4% +43.3% 0.65 ± 7% sched_debug.cpu.nr_running.stddev
662694 ± 2% -76.2% 157727 ± 7% sched_debug.cpu.nr_switches.avg
730901 ± 2% -71.1% 211353 ± 3% sched_debug.cpu.nr_switches.max
588629 ± 4% -76.6% 137816 ± 10% sched_debug.cpu.nr_switches.min
27216 ± 6% -36.3% 17323 ± 18% sched_debug.cpu.nr_switches.stddev
0.17 -52.5% 0.08 ± 4% perf-stat.i.MPKI
1.591e+09 +107.7% 3.304e+09 perf-stat.i.branch-instructions
1.98 -1.2 0.75 perf-stat.i.branch-miss-rate%
32451085 -25.8% 24094321 perf-stat.i.branch-misses
2.36 ± 5% -0.3 2.04 ± 2% perf-stat.i.cache-miss-rate%
1832514 -34.8% 1194385 ± 4% perf-stat.i.cache-misses
797265 ± 2% -76.9% 184467 ± 8% perf-stat.i.context-switches
3.22 ± 2% +188.0% 9.27 perf-stat.i.cpi
2.261e+10 +479.8% 1.311e+11 perf-stat.i.cpu-cycles
311030 ± 7% +440.7% 1681617 ± 9% perf-stat.i.cycles-between-cache-misses
0.05 ± 8% -0.0 0.01 ± 14% perf-stat.i.dTLB-load-miss-rate%
1006370 ± 10% -69.6% 305743 ± 15% perf-stat.i.dTLB-load-misses
1.964e+09 +82.1% 3.577e+09 perf-stat.i.dTLB-loads
61148 ± 14% -44.2% 34119 ± 9% perf-stat.i.dTLB-store-misses
9.704e+08 -30.7% 6.72e+08 ± 2% perf-stat.i.dTLB-stores
36.77 +39.1 75.84 ± 3% perf-stat.i.iTLB-load-miss-rate%
6585163 ± 2% -81.6% 1214954 ± 8% perf-stat.i.iTLB-loads
7.672e+09 +84.7% 1.417e+10 perf-stat.i.instructions
2339 +65.5% 3871 ± 5% perf-stat.i.instructions-per-iTLB-miss
0.36 ± 2% -58.9% 0.15 perf-stat.i.ipc
0.63 +479.8% 3.64 perf-stat.i.metric.GHz
236.49 ± 2% -72.7% 64.64 ± 5% perf-stat.i.metric.K/sec
129.12 +64.8% 212.85 perf-stat.i.metric.M/sec
2841 -4.5% 2712 perf-stat.i.minor-faults
283421 ± 2% -38.8% 173569 ± 3% perf-stat.i.node-loads
229499 -24.6% 173089 ± 3% perf-stat.i.node-stores
2841 -4.5% 2712 perf-stat.i.page-faults
0.24 -64.7% 0.08 ± 5% perf-stat.overall.MPKI
2.04 -1.3 0.73 perf-stat.overall.branch-miss-rate%
1.49 ± 9% -0.4 1.08 ± 5% perf-stat.overall.cache-miss-rate%
2.95 ± 2% +213.9% 9.25 perf-stat.overall.cpi
12338 ± 3% +792.2% 110078 ± 4% perf-stat.overall.cycles-between-cache-misses
0.05 ± 10% -0.0 0.01 ± 15% perf-stat.overall.dTLB-load-miss-rate%
36.98 +39.7 76.67 ± 3% perf-stat.overall.iTLB-load-miss-rate%
1986 +78.5% 3546 ± 5% perf-stat.overall.instructions-per-iTLB-miss
0.34 ± 2% -68.2% 0.11 perf-stat.overall.ipc
1.566e+09 +107.6% 3.252e+09 perf-stat.ps.branch-instructions
31948762 -25.8% 23700932 perf-stat.ps.branch-misses
1804771 -34.9% 1174750 ± 4% perf-stat.ps.cache-misses
784569 ± 2% -76.9% 181522 ± 8% perf-stat.ps.context-switches
2.225e+10 +479.8% 1.29e+11 perf-stat.ps.cpu-cycles
990666 ± 10% -69.6% 301054 ± 15% perf-stat.ps.dTLB-load-misses
1.933e+09 +82.1% 3.52e+09 perf-stat.ps.dTLB-loads
60212 ± 14% -44.3% 33564 ± 9% perf-stat.ps.dTLB-store-misses
9.551e+08 -30.8% 6.614e+08 ± 2% perf-stat.ps.dTLB-stores
6480331 ± 2% -81.6% 1195545 ± 8% perf-stat.ps.iTLB-loads
7.551e+09 +84.7% 1.394e+10 perf-stat.ps.instructions
2797 -4.6% 2668 perf-stat.ps.minor-faults
279235 ± 2% -38.8% 170754 ± 3% perf-stat.ps.node-loads
225941 -24.4% 170732 ± 3% perf-stat.ps.node-stores
2797 -4.6% 2669 perf-stat.ps.page-faults
4.765e+11 +84.7% 8.8e+11 perf-stat.total.instructions
0.00 ± 14% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.__mutex_lock.constprop.0.pipe_read
0.00 +2066.7% 0.06 ± 13% perf-sched.sch_delay.avg.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.00 ± 17% +26530.8% 0.58 ± 19% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.00 -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.00 +17154.2% 0.69 ± 44% perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.01 ± 43% +16828.6% 0.99 ±125% perf-sched.sch_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.00 +1283.3% 0.03 ± 5% perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.00 +18950.0% 0.38 ± 25% perf-sched.sch_delay.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.00 +4054.2% 0.17 ± 18% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.01 ± 38% +127.3% 0.01 ± 12% perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.00 +5075.0% 0.10 ± 10% perf-sched.sch_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
0.00 ± 12% +427.3% 0.02 ± 21% perf-sched.sch_delay.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 9% +25143.5% 0.97 ± 42% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.do_select.core_sys_select.kern_select
0.00 ± 14% +196.6% 0.01 ± 24% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.00 ± 13% +9488.2% 0.27 ± 12% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.00 +66.7% 0.01 ± 29% perf-sched.sch_delay.avg.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
0.00 ± 14% +730.0% 0.03 ± 43% perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
0.01 ± 29% +953.8% 0.07 ± 68% perf-sched.sch_delay.avg.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.00 ± 20% +21554.5% 0.40 ± 29% perf-sched.sch_delay.avg.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
0.00 ± 7% +289.7% 0.02 ± 24% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 69% +43684.2% 2.77 ± 63% perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.03 ± 41% +43520.5% 10.98 ± 20% perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.00 ± 70% +57461.5% 2.49 ± 20% perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.00 ± 14% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.task_work_run.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode
0.01 ± 19% +34219.4% 2.06 ± 32% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.01 ± 50% +13909.1% 1.03 ±118% perf-sched.sch_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.10 ±102% +4712.8% 5.02 ± 17% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.01 ± 54% +30528.3% 3.06 ± 31% perf-sched.sch_delay.max.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.01 ± 7% +19300.0% 1.55 ± 50% perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.99 ± 63% +895.9% 9.90 ± 22% perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
2.13 ± 12% +543.6% 13.70 ± 12% perf-sched.sch_delay.max.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
0.01 ± 9% +1363.9% 0.09 ± 28% perf-sched.sch_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 33% +19865.2% 1.53 ± 49% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
0.00 ± 10% +35717.9% 1.67 ± 70% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.do_select.core_sys_select.kern_select
0.01 ± 62% +423.0% 0.06 ± 34% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.01 ± 65% +28549.2% 3.01 ± 47% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.01 ± 11% +10415.4% 0.68 ± 94% perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
83.01 ±104% -98.5% 1.28 ± 52% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 31% +1049.1% 0.11 ± 87% perf-sched.sch_delay.max.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.00 ± 20% +1.1e+05% 2.59 ± 27% perf-sched.sch_delay.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
0.05 ± 37% +1383.7% 0.81 ± 74% perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 17% +3800.0% 0.08 ± 12% perf-sched.total_sch_delay.average.ms
0.62 ± 2% +126.8% 1.41 ± 2% perf-sched.total_wait_and_delay.average.ms
1476521 ± 2% -61.2% 573478 ± 3% perf-sched.total_wait_and_delay.count.ms
0.62 ± 2% +114.2% 1.32 ± 3% perf-sched.total_wait_time.average.ms
5.46 ± 67% +130.3% 12.58 ± 22% perf-sched.wait_and_delay.avg.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
0.11 ± 4% +74.0% 0.20 ± 10% perf-sched.wait_and_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
281.31 ± 6% -13.6% 243.07 ± 4% perf-sched.wait_and_delay.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.38 ± 2% +138.1% 0.90 ± 3% perf-sched.wait_and_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
4.10 ± 8% -23.7% 3.13 ± 5% perf-sched.wait_and_delay.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
27.03 ± 4% +150.7% 67.77 ± 2% perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
97.23 ± 6% -32.0% 66.15 ± 10% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
101.49 ± 6% +97.0% 199.95 ± 6% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
578.13 ± 3% +16.0% 670.87 ± 2% perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
17.83 ± 32% +562.6% 118.17 ± 43% perf-sched.wait_and_delay.count.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
3.67 ± 60% +1022.7% 41.17 ± 20% perf-sched.wait_and_delay.count.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
75116 -41.9% 43673 ± 3% perf-sched.wait_and_delay.count.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
89.33 ±223% +71413.1% 63885 ± 4% perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
1397168 ± 2% -69.8% 422092 ± 5% perf-sched.wait_and_delay.count.pipe_read.vfs_read.ksys_read.do_syscall_64
15.00 ± 3% +21.1% 18.17 ± 5% perf-sched.wait_and_delay.count.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
578.33 ± 2% -62.3% 218.17 ± 3% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
49.17 ± 7% +53.6% 75.50 ± 11% perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
1832 ± 5% -47.7% 959.00 ± 6% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
227.50 -13.8% 196.00 ± 2% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
10.50 ± 67% +1100.6% 126.04 ± 53% perf-sched.wait_and_delay.max.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
166.68 ±223% +501.3% 1002 perf-sched.wait_and_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
5.00 -15.1% 4.24 ± 7% perf-sched.wait_and_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
2503 ± 38% +41.0% 3528 ± 13% perf-sched.wait_and_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.34 ± 8% -47.1% 0.18 ± 30% perf-sched.wait_time.avg.ms.__cond_resched.__mutex_lock.constprop.0.pipe_read
0.00 ±100% +2335.0% 0.08 ± 76% perf-sched.wait_time.avg.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
5.46 ± 67% +130.3% 12.58 ± 22% perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
0.33 ± 2% +31.4% 0.44 ± 3% perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.32 ± 4% -38.2% 0.20 ± 20% perf-sched.wait_time.avg.ms.__cond_resched.pipe_read.vfs_read.ksys_read.do_syscall_64
0.11 ± 4% +51.2% 0.17 ± 12% perf-sched.wait_time.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
281.30 ± 6% -13.7% 242.69 ± 4% perf-sched.wait_time.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.24 +257.3% 0.86 ± 20% perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.01 ±223% +40360.0% 4.05 ±202% perf-sched.wait_time.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
0.66 ±101% -54.8% 0.30 ± 8% perf-sched.wait_time.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.38 ± 2% +112.2% 0.80 ± 4% perf-sched.wait_time.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
4.09 ± 8% -24.1% 3.11 ± 5% perf-sched.wait_time.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
27.03 ± 4% +150.7% 67.76 ± 2% perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
0.27 ± 23% -58.2% 0.11 ± 55% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.pipe_write
0.33 ± 2% +356.1% 1.48 ± 11% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
97.23 ± 6% -32.0% 66.12 ± 10% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
101.43 ± 6% +97.1% 199.93 ± 6% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 50% +10655.6% 0.16 ± 72% perf-sched.wait_time.avg.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
578.12 ± 3% +16.0% 670.85 ± 2% perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.00 ±100% +13881.8% 0.51 ± 72% perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
10.50 ± 67% +1100.6% 126.04 ± 53% perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
0.42 ± 4% +2499.7% 10.86 ± 30% perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.32 ± 44% +486.3% 1.88 ±110% perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.pipe_write.vfs_write.ksys_write
0.71 ± 2% +416.0% 3.68 ± 20% perf-sched.wait_time.max.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.01 ±223% +1.8e+06% 180.76 ±203% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
185.31 ±196% +440.6% 1001 perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
4.99 -16.0% 4.20 ± 8% perf-sched.wait_time.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.42 ± 5% +351.1% 1.89 ± 51% perf-sched.wait_time.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.pipe_read
0.74 ± 3% +680.0% 5.73 ± 30% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
2503 ± 38% +41.0% 3528 ± 13% perf-sched.wait_time.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 9% +18576.1% 1.43 ± 79% perf-sched.wait_time.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
56.76 -56.8 0.00 perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
54.92 -54.9 0.00 perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify
54.92 -54.9 0.00 perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
54.89 -54.9 0.00 perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
51.90 -51.9 0.00 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
51.48 -51.5 0.00 perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
50.29 -50.3 0.00 perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
45.09 -45.1 0.00 perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
10.71 -10.4 0.29 ±100% perf-profile.calltrace.cycles-pp.__mutex_lock.pipe_read.vfs_read.ksys_read.do_syscall_64
6.34 ± 2% -6.3 0.00 perf-profile.calltrace.cycles-pp.osq_lock.__mutex_lock.pipe_read.vfs_read.ksys_read
4.98 ± 7% -5.0 0.00 perf-profile.calltrace.cycles-pp.intel_idle_irq.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
1.84 ± 4% -1.6 0.27 ±100% perf-profile.calltrace.cycles-pp.schedule.pipe_read.vfs_read.ksys_read.do_syscall_64
1.80 ± 4% -1.5 0.26 ±100% perf-profile.calltrace.cycles-pp.__schedule.schedule.pipe_read.vfs_read.ksys_read
1.49 ± 5% +0.3 1.84 perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.pipe_write.vfs_write.ksys_write
1.21 ± 6% +0.4 1.56 ± 2% perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write.vfs_write
1.13 ± 6% +0.4 1.53 ± 2% perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write
0.00 +0.7 0.74 ± 5% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
7.79 ± 4% +12.5 20.33 ± 4% perf-profile.calltrace.cycles-pp.finish_wait.pipe_read.vfs_read.ksys_read.do_syscall_64
7.29 ± 4% +13.0 20.27 ± 4% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.pipe_read.vfs_read.ksys_read
6.85 ± 4% +13.4 20.20 ± 4% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.pipe_read.vfs_read
36.28 +16.4 52.68 ± 6% perf-profile.calltrace.cycles-pp.read
35.78 +16.7 52.52 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
35.73 +16.8 52.50 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
35.50 +16.9 52.44 ± 6% perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
35.45 +17.0 52.41 ± 6% perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
35.27 +17.0 52.30 ± 6% perf-profile.calltrace.cycles-pp.pipe_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
9.00 ± 3% +21.0 29.98 ± 7% perf-profile.calltrace.cycles-pp.prepare_to_wait_event.pipe_read.vfs_read.ksys_read.do_syscall_64
6.68 ± 4% +22.9 29.63 ± 7% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.prepare_to_wait_event.pipe_read.vfs_read.ksys_read
5.74 ± 4% +23.8 29.52 ± 7% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.prepare_to_wait_event.pipe_read.vfs_read
6.40 ± 2% +40.8 47.23 ± 7% perf-profile.calltrace.cycles-pp.write
6.13 ± 3% +40.9 46.99 ± 7% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
6.06 ± 3% +40.9 46.96 ± 7% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
6.22 ± 3% +40.9 47.13 ± 7% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
6.20 ± 3% +40.9 47.12 ± 7% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
5.90 ± 3% +41.0 46.88 ± 7% perf-profile.calltrace.cycles-pp.pipe_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.02 ± 3% +42.2 44.26 ± 7% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.pipe_write.vfs_write.ksys_write
1.82 ± 4% +42.3 44.14 ± 7% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__wake_up_common_lock.pipe_write.vfs_write
3.54 ± 3% +42.7 46.23 ± 7% perf-profile.calltrace.cycles-pp.__wake_up_common_lock.pipe_write.vfs_write.ksys_write.do_syscall_64
56.76 -56.8 0.00 perf-profile.children.cycles-pp.secondary_startup_64_no_verify
56.76 -56.8 0.00 perf-profile.children.cycles-pp.cpu_startup_entry
56.73 -56.7 0.00 perf-profile.children.cycles-pp.do_idle
54.92 -54.9 0.00 perf-profile.children.cycles-pp.start_secondary
53.20 -53.2 0.00 perf-profile.children.cycles-pp.cpuidle_idle_call
51.96 -52.0 0.00 perf-profile.children.cycles-pp.cpuidle_enter
51.94 -51.9 0.00 perf-profile.children.cycles-pp.cpuidle_enter_state
45.09 -45.1 0.00 perf-profile.children.cycles-pp.intel_idle
12.21 -11.6 0.57 ± 23% perf-profile.children.cycles-pp.__mutex_lock
7.32 ± 2% -7.1 0.20 ± 28% perf-profile.children.cycles-pp.osq_lock
6.04 ± 6% -6.0 0.00 perf-profile.children.cycles-pp.intel_idle_irq
3.16 ± 3% -2.6 0.57 ± 9% perf-profile.children.cycles-pp.__schedule
2.52 ± 2% -2.4 0.11 ± 41% perf-profile.children.cycles-pp.mutex_spin_on_owner
1.92 ± 4% -1.4 0.54 ± 9% perf-profile.children.cycles-pp.schedule
1.63 ± 4% -1.4 0.26 ± 3% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
1.49 -1.0 0.48 ± 4% perf-profile.children.cycles-pp.mutex_lock
1.14 ± 2% -0.8 0.35 ± 3% perf-profile.children.cycles-pp.mutex_unlock
0.76 ± 6% -0.7 0.10 ± 10% perf-profile.children.cycles-pp.dequeue_entity
0.83 ± 5% -0.6 0.20 ± 10% perf-profile.children.cycles-pp.dequeue_task_fair
0.62 ± 3% -0.6 0.07 ± 11% perf-profile.children.cycles-pp._raw_spin_lock
0.84 ± 5% -0.5 0.32 ± 12% perf-profile.children.cycles-pp.ttwu_do_activate
0.66 ± 5% -0.5 0.15 ± 10% perf-profile.children.cycles-pp.enqueue_entity
0.78 ± 5% -0.5 0.28 ± 12% perf-profile.children.cycles-pp.activate_task
0.57 ± 3% -0.5 0.08 ± 9% perf-profile.children.cycles-pp.__list_del_entry_valid_or_report
0.54 ± 6% -0.5 0.05 ± 45% perf-profile.children.cycles-pp.__list_add_valid_or_report
0.70 ± 9% -0.5 0.23 ± 4% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
0.73 ± 5% -0.5 0.27 ± 11% perf-profile.children.cycles-pp.enqueue_task_fair
0.54 ± 9% -0.3 0.22 ± 4% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
0.54 ± 9% -0.3 0.22 ± 3% perf-profile.children.cycles-pp.hrtimer_interrupt
0.37 ± 5% -0.3 0.07 ± 7% perf-profile.children.cycles-pp.switch_mm_irqs_off
0.42 ± 7% -0.3 0.12 ± 10% perf-profile.children.cycles-pp.pick_next_task_fair
0.49 ± 4% -0.3 0.20 ± 9% perf-profile.children.cycles-pp.update_load_avg
0.45 ± 8% -0.3 0.19 ± 4% perf-profile.children.cycles-pp.__hrtimer_run_queues
0.36 ± 4% -0.2 0.12 ± 8% perf-profile.children.cycles-pp.select_task_rq
0.34 ± 5% -0.2 0.11 ± 10% perf-profile.children.cycles-pp.select_task_rq_fair
0.32 ± 9% -0.2 0.16 ± 3% perf-profile.children.cycles-pp.tick_sched_timer
0.29 ± 7% -0.1 0.15 ± 4% perf-profile.children.cycles-pp.tick_sched_handle
0.29 ± 5% -0.1 0.15 ± 3% perf-profile.children.cycles-pp.update_process_times
0.16 ± 8% -0.1 0.03 ± 70% perf-profile.children.cycles-pp.__update_load_avg_se
0.18 ± 5% -0.1 0.06 ± 6% perf-profile.children.cycles-pp.perf_event_task_tick
0.25 ± 7% -0.1 0.13 ± 2% perf-profile.children.cycles-pp.scheduler_tick
0.18 ± 5% -0.1 0.06 ± 6% perf-profile.children.cycles-pp.perf_adjust_freq_unthr_context
0.15 ± 8% -0.1 0.07 ± 12% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.21 ± 6% -0.1 0.14 ± 8% perf-profile.children.cycles-pp.update_curr
0.14 ± 8% -0.1 0.06 ± 11% perf-profile.children.cycles-pp.__entry_text_start
0.22 ± 7% -0.1 0.15 ± 6% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.18 ± 10% -0.1 0.12 ± 6% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.14 ± 11% -0.1 0.07 ± 9% perf-profile.children.cycles-pp.security_file_permission
0.13 ± 7% -0.1 0.07 ± 14% perf-profile.children.cycles-pp.reweight_entity
0.18 ± 7% -0.1 0.13 ± 10% perf-profile.children.cycles-pp.copy_page_to_iter
0.10 ± 11% -0.1 0.04 ± 44% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
0.12 ± 10% -0.1 0.06 ± 11% perf-profile.children.cycles-pp.apparmor_file_permission
0.10 ± 14% -0.1 0.04 ± 45% perf-profile.children.cycles-pp.anon_pipe_buf_release
0.17 ± 9% -0.1 0.12 ± 12% perf-profile.children.cycles-pp._copy_to_iter
0.08 ± 11% -0.0 0.04 ± 71% perf-profile.children.cycles-pp.__fdget_pos
0.16 ± 7% -0.0 0.11 ± 10% perf-profile.children.cycles-pp.copyout
0.14 ± 10% -0.0 0.10 ± 9% perf-profile.children.cycles-pp.rep_movs_alternative
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.exit_to_user_mode_loop
0.00 +0.1 0.05 ± 8% perf-profile.children.cycles-pp.task_tick_fair
0.00 +0.1 0.07 ± 10% perf-profile.children.cycles-pp.wake_affine
1.49 ± 5% +0.3 1.84 perf-profile.children.cycles-pp.__wake_up_common
1.19 ± 5% +0.4 1.54 perf-profile.children.cycles-pp.try_to_wake_up
1.21 ± 6% +0.4 1.57 perf-profile.children.cycles-pp.autoremove_wake_function
7.80 ± 4% +12.5 20.33 ± 4% perf-profile.children.cycles-pp.finish_wait
36.32 +16.4 52.70 ± 6% perf-profile.children.cycles-pp.read
35.50 +16.9 52.44 ± 6% perf-profile.children.cycles-pp.ksys_read
35.45 +17.0 52.41 ± 6% perf-profile.children.cycles-pp.vfs_read
35.30 +17.0 52.31 ± 6% perf-profile.children.cycles-pp.pipe_read
9.01 ± 3% +21.0 30.00 ± 7% perf-profile.children.cycles-pp.prepare_to_wait_event
6.45 ± 2% +40.8 47.26 ± 7% perf-profile.children.cycles-pp.write
6.14 ± 3% +40.9 47.00 ± 7% perf-profile.children.cycles-pp.ksys_write
6.06 ± 3% +40.9 46.97 ± 7% perf-profile.children.cycles-pp.vfs_write
5.90 ± 3% +41.0 46.88 ± 7% perf-profile.children.cycles-pp.pipe_write
3.54 ± 3% +42.7 46.25 ± 7% perf-profile.children.cycles-pp.__wake_up_common_lock
42.17 +57.5 99.70 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
42.09 +57.6 99.67 perf-profile.children.cycles-pp.do_syscall_64
16.92 ± 3% +76.9 93.87 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
16.29 ± 3% +78.6 94.91 perf-profile.children.cycles-pp._raw_spin_lock_irqsave
45.09 -45.1 0.00 perf-profile.self.cycles-pp.intel_idle
7.30 ± 2% -7.1 0.20 ± 28% perf-profile.self.cycles-pp.osq_lock
5.78 ± 6% -5.8 0.00 perf-profile.self.cycles-pp.intel_idle_irq
2.51 ± 2% -2.4 0.11 ± 41% perf-profile.self.cycles-pp.mutex_spin_on_owner
1.99 ± 2% -1.8 0.23 ± 12% perf-profile.self.cycles-pp.__mutex_lock
1.76 -1.5 0.24 ± 8% perf-profile.self.cycles-pp.prepare_to_wait_event
1.46 -1.0 0.44 ± 5% perf-profile.self.cycles-pp.mutex_lock
1.88 ± 2% -0.8 1.05 ± 4% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
1.14 ± 2% -0.8 0.35 ± 3% perf-profile.self.cycles-pp.mutex_unlock
0.60 ± 4% -0.5 0.06 ± 11% perf-profile.self.cycles-pp._raw_spin_lock
0.54 ± 6% -0.5 0.05 ± 45% perf-profile.self.cycles-pp.__list_add_valid_or_report
0.56 ± 3% -0.5 0.08 ± 9% perf-profile.self.cycles-pp.__list_del_entry_valid_or_report
0.66 ± 10% -0.5 0.20 ± 26% perf-profile.self.cycles-pp.pipe_read
0.36 ± 6% -0.3 0.06 ± 7% perf-profile.self.cycles-pp.switch_mm_irqs_off
0.30 ± 5% -0.2 0.06 ± 8% perf-profile.self.cycles-pp.__schedule
0.15 ± 8% -0.1 0.03 ± 70% perf-profile.self.cycles-pp.__update_load_avg_se
0.17 ± 4% -0.1 0.08 ± 7% perf-profile.self.cycles-pp.update_load_avg
0.15 ± 7% -0.1 0.07 ± 14% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.09 ± 13% -0.1 0.03 ± 70% perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
0.14 ± 27% -0.1 0.08 ± 39% perf-profile.self.cycles-pp.pipe_write
0.10 ± 13% -0.0 0.05 ± 8% perf-profile.self.cycles-pp.apparmor_file_permission
0.09 ± 14% -0.0 0.04 ± 45% perf-profile.self.cycles-pp.anon_pipe_buf_release
0.12 ± 7% -0.0 0.10 ± 10% perf-profile.self.cycles-pp.rep_movs_alternative
0.04 ± 71% +0.2 0.28 ± 4% perf-profile.self.cycles-pp.try_to_wake_up
16.92 ± 3% +76.9 93.87 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
***************************************************************************************************
lkp-csl-d02: 36 threads 1 sockets Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz (Cascade Lake) with 128G memory
=========================================================================================
class/compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
memory/gcc-12/performance/x86_64-rhel-8.3/1/debian-11.1-x86_64-20220510.cgz/lkp-csl-d02/pipeherd/stress-ng/60s
commit:
cc03a5d65a ("fs/pipe: remove unnecessary spinlock from pipe_write()")
8114dc703a ("fs/pipe: use spinlock in pipe_read() only if there is a watch_queue")
cc03a5d65a4032f8 8114dc703a4833be4a98a37f5ed
---------------- ---------------------------
%stddev %change %stddev
\ | \
2709 -69.3% 830.62 uptime.idle
2e+09 -93.2% 1.357e+08 ± 2% cpuidle..time
28090525 ± 3% -99.3% 203996 ± 2% cpuidle..usage
87.74 -82.7 5.02 ± 13% mpstat.cpu.all.idle%
0.03 ± 10% -0.0 0.00 ± 37% mpstat.cpu.all.soft%
10.73 ± 8% +82.7 93.44 mpstat.cpu.all.sys%
88.01 -91.1% 7.83 ± 8% vmstat.cpu.id
11.03 ± 8% +728.1% 91.38 vmstat.cpu.sy
3828646 -15.5% 3234352 vmstat.memory.cache
4.52 ± 6% +1924.7% 91.56 vmstat.procs.r
796488 ± 3% -78.0% 174864 ± 6% vmstat.system.cs
57177 +60.1% 91552 ± 3% vmstat.system.in
1.00 -73.7% 0.26 ± 16% stress-ng.pipeherd.context_switches_per_bogo_op
421613 ± 4% -57.7% 178307 ± 7% stress-ng.pipeherd.context_switches_per_sec
25283532 ± 4% +64.1% 41497127 ± 9% stress-ng.pipeherd.ops
421306 ± 4% +64.1% 691477 ± 9% stress-ng.pipeherd.ops_per_sec
12565 ± 10% +17024.2% 2151679 stress-ng.time.involuntary_context_switches
403.83 ± 8% +751.1% 3436 stress-ng.time.percent_of_cpu_this_job_got
247.39 ± 8% +759.9% 2127 stress-ng.time.system_time
25284435 ± 4% -66.2% 8546895 ± 9% stress-ng.time.voluntary_context_switches
851649 ± 3% -58.0% 357966 ± 3% meminfo.Active
851501 ± 3% -58.0% 357818 ± 3% meminfo.Active(anon)
3742441 -15.7% 3153275 meminfo.Cached
2409010 -24.0% 1831617 meminfo.Committed_AS
451343 -21.6% 353822 meminfo.Inactive
451163 -21.6% 353642 meminfo.Inactive(anon)
118870 ± 2% -38.5% 73112 meminfo.Mapped
4514553 -13.3% 3913828 meminfo.Memused
996307 ± 3% -59.1% 407148 ± 3% meminfo.Shmem
4682131 -13.2% 4062916 meminfo.max_used_kB
212877 ± 3% -58.0% 89455 ± 3% proc-vmstat.nr_active_anon
935621 -15.7% 788336 proc-vmstat.nr_file_pages
112796 -21.6% 88425 proc-vmstat.nr_inactive_anon
29729 ± 2% -38.4% 18302 proc-vmstat.nr_mapped
249082 ± 3% -59.1% 101798 ± 3% proc-vmstat.nr_shmem
19817 -2.0% 19424 proc-vmstat.nr_slab_reclaimable
212877 ± 3% -58.0% 89455 ± 3% proc-vmstat.nr_zone_active_anon
112796 -21.6% 88425 proc-vmstat.nr_zone_inactive_anon
596114 -34.8% 388508 proc-vmstat.numa_hit
596130 -34.8% 388478 proc-vmstat.numa_local
316752 ± 2% -55.6% 140488 ± 4% proc-vmstat.pgactivate
626933 -33.3% 418232 proc-vmstat.pgalloc_normal
271840 -3.8% 261580 proc-vmstat.pgfault
8141 -2.6% 7930 proc-vmstat.pgreuse
571.50 ± 5% +526.2% 3578 turbostat.Avg_MHz
15.11 ± 5% +79.1 94.20 turbostat.Busy%
4469624 ± 9% -99.8% 9104 ± 19% turbostat.C1
9.54 ± 8% -9.5 0.02 ± 20% turbostat.C1%
23403867 ± 2% -99.7% 60893 ± 3% turbostat.C1E
70.10 -69.9 0.21 ± 5% turbostat.C1E%
150042 ± 8% -15.6% 126569 ± 3% turbostat.C6
6.59 ± 7% -1.0 5.62 ± 2% turbostat.C6%
83.84 -94.0% 5.02 turbostat.CPU%c1
0.12 ± 5% -74.6% 0.03 turbostat.IPC
3762229 +58.7% 5970483 ± 3% turbostat.IRQ
63321 ± 2% -94.2% 3649 ± 10% turbostat.POLL
0.04 ± 11% -0.0 0.00 turbostat.POLL%
41.00 +6.5% 43.67 ± 2% turbostat.PkgTmp
96.48 +50.2% 144.88 turbostat.PkgWatt
10.23 -1.5% 10.08 turbostat.RAMWatt
20253 ± 11% +4934.8% 1019726 sched_debug.cfs_rq:/.avg_vruntime.avg
42488 ± 7% +2500.3% 1104832 ± 3% sched_debug.cfs_rq:/.avg_vruntime.max
15819 ± 14% +6224.0% 1000401 sched_debug.cfs_rq:/.avg_vruntime.min
6095 ± 3% +246.2% 21099 ± 26% sched_debug.cfs_rq:/.avg_vruntime.stddev
0.29 ± 9% +475.8% 1.65 ± 3% sched_debug.cfs_rq:/.h_nr_running.avg
1.25 ± 20% +113.3% 2.67 ± 8% sched_debug.cfs_rq:/.h_nr_running.max
0.58 ± 31% +1600.0% 9.92 ± 5% sched_debug.cfs_rq:/.load_avg.min
20253 ± 11% +4934.8% 1019726 sched_debug.cfs_rq:/.min_vruntime.avg
42488 ± 7% +2500.3% 1104832 ± 3% sched_debug.cfs_rq:/.min_vruntime.max
15819 ± 14% +6224.0% 1000401 sched_debug.cfs_rq:/.min_vruntime.min
6095 ± 3% +246.2% 21099 ± 26% sched_debug.cfs_rq:/.min_vruntime.stddev
0.28 ± 11% +146.3% 0.70 ± 3% sched_debug.cfs_rq:/.nr_running.avg
0.45 ± 6% -41.7% 0.26 ± 11% sched_debug.cfs_rq:/.nr_running.stddev
371.87 ± 7% +351.1% 1677 ± 5% sched_debug.cfs_rq:/.runnable_avg.avg
1049 ± 15% +152.8% 2652 ± 8% sched_debug.cfs_rq:/.runnable_avg.max
9.33 ± 36% +8476.8% 800.50 ± 33% sched_debug.cfs_rq:/.runnable_avg.min
224.61 ± 14% +92.7% 432.74 ± 10% sched_debug.cfs_rq:/.runnable_avg.stddev
369.83 ± 7% +115.1% 795.51 ± 4% sched_debug.cfs_rq:/.util_avg.avg
1046 ± 15% +37.1% 1435 ± 7% sched_debug.cfs_rq:/.util_avg.max
8.92 ± 36% +3443.0% 315.92 ± 18% sched_debug.cfs_rq:/.util_avg.min
33.93 ± 34% +1322.2% 482.56 ± 4% sched_debug.cfs_rq:/.util_est_enqueued.avg
401.08 ± 21% +144.0% 978.75 ± 5% sched_debug.cfs_rq:/.util_est_enqueued.max
99.29 ± 25% +75.4% 174.17 ± 7% sched_debug.cfs_rq:/.util_est_enqueued.stddev
589817 ± 5% -12.9% 513465 ± 3% sched_debug.cpu.avg_idle.max
17625 ± 11% -51.8% 8492 ± 20% sched_debug.cpu.avg_idle.min
0.73 ± 4% +182.0% 2.07 ± 28% sched_debug.cpu.clock.stddev
889.48 ± 17% +175.5% 2450 sched_debug.cpu.curr->pid.avg
1441 ± 8% -41.2% 847.75 ± 9% sched_debug.cpu.curr->pid.stddev
0.00 ± 14% +98.0% 0.00 ± 20% sched_debug.cpu.next_balance.stddev
0.29 ± 17% +455.1% 1.63 ± 4% sched_debug.cpu.nr_running.avg
1.25 ± 20% +120.0% 2.75 ± 13% sched_debug.cpu.nr_running.max
697944 ± 3% -77.7% 155957 ± 6% sched_debug.cpu.nr_switches.avg
766842 ± 4% -73.1% 206139 ± 5% sched_debug.cpu.nr_switches.max
630908 ± 4% -78.9% 133104 ± 10% sched_debug.cpu.nr_switches.min
29178 ± 15% -42.1% 16903 ± 22% sched_debug.cpu.nr_switches.stddev
0.17 ± 2% -52.1% 0.08 ± 2% perf-stat.i.MPKI
1.582e+09 +109.1% 3.308e+09 perf-stat.i.branch-instructions
2.00 -1.3 0.75 perf-stat.i.branch-miss-rate%
32433729 -25.7% 24102582 perf-stat.i.branch-misses
2.46 ± 8% -0.4 2.07 ± 2% perf-stat.i.cache-miss-rate%
1826870 -34.1% 1203961 ± 2% perf-stat.i.cache-misses
838779 ± 3% -78.3% 182290 ± 7% perf-stat.i.context-switches
3.01 ± 5% +206.9% 9.25 perf-stat.i.cpi
2.105e+10 ± 5% +522.7% 1.311e+11 perf-stat.i.cpu-cycles
17229 ± 8% +16.1% 19996 ± 7% perf-stat.i.cpu-migrations
299091 ± 9% +529.6% 1883163 ± 10% perf-stat.i.cycles-between-cache-misses
0.05 ± 4% -0.0 0.01 ± 9% perf-stat.i.dTLB-load-miss-rate%
946957 ± 5% -63.0% 350764 ± 8% perf-stat.i.dTLB-load-misses
1.963e+09 +82.5% 3.584e+09 perf-stat.i.dTLB-loads
53749 ± 13% -32.4% 36351 ± 10% perf-stat.i.dTLB-store-misses
1.004e+09 ± 2% -32.7% 6.756e+08 ± 2% perf-stat.i.dTLB-stores
37.17 +39.2 76.33 ± 2% perf-stat.i.iTLB-load-miss-rate%
6918328 ± 3% -82.7% 1198409 ± 6% perf-stat.i.iTLB-loads
7.646e+09 +85.6% 1.419e+10 perf-stat.i.instructions
2207 ± 4% +73.4% 3828 ± 4% perf-stat.i.instructions-per-iTLB-miss
0.38 ± 4% -61.3% 0.15 perf-stat.i.ipc
0.58 ± 5% +522.7% 3.64 perf-stat.i.metric.GHz
247.76 ± 3% -74.1% 64.28 ± 4% perf-stat.i.metric.K/sec
129.35 +64.8% 213.17 perf-stat.i.metric.M/sec
2817 -4.5% 2690 perf-stat.i.minor-faults
287862 ± 2% -40.8% 170491 ± 2% perf-stat.i.node-loads
232290 ± 4% -26.6% 170522 ± 6% perf-stat.i.node-stores
2817 -4.5% 2690 perf-stat.i.page-faults
0.24 -64.5% 0.08 ± 2% perf-stat.overall.MPKI
2.05 -1.3 0.73 perf-stat.overall.branch-miss-rate%
1.72 ± 13% -0.6 1.11 perf-stat.overall.cache-miss-rate%
2.75 ± 5% +235.6% 9.24 perf-stat.overall.cpi
11529 ± 7% +845.4% 109003 ± 2% perf-stat.overall.cycles-between-cache-misses
0.05 ± 4% -0.0 0.01 ± 8% perf-stat.overall.dTLB-load-miss-rate%
37.38 +39.8 77.16 ± 2% perf-stat.overall.iTLB-load-miss-rate%
1855 ± 4% +88.8% 3503 ± 4% perf-stat.overall.instructions-per-iTLB-miss
0.36 ± 5% -70.3% 0.11 perf-stat.overall.ipc
1.557e+09 +109.0% 3.256e+09 perf-stat.ps.branch-instructions
31931731 -25.8% 23706440 perf-stat.ps.branch-misses
1799223 -34.2% 1184288 ± 2% perf-stat.ps.cache-misses
825430 ± 3% -78.3% 179378 ± 7% perf-stat.ps.context-switches
2.072e+10 ± 5% +522.7% 1.29e+11 perf-stat.ps.cpu-cycles
16955 ± 8% +16.0% 19676 ± 7% perf-stat.ps.cpu-migrations
932029 ± 5% -62.9% 345425 ± 8% perf-stat.ps.dTLB-load-misses
1.932e+09 +82.5% 3.527e+09 perf-stat.ps.dTLB-loads
52911 ± 13% -32.4% 35765 ± 10% perf-stat.ps.dTLB-store-misses
9.885e+08 ± 2% -32.7% 6.649e+08 ± 2% perf-stat.ps.dTLB-stores
6808264 ± 3% -82.7% 1179244 ± 6% perf-stat.ps.iTLB-loads
7.526e+09 +85.5% 1.396e+10 perf-stat.ps.instructions
2773 -4.6% 2647 perf-stat.ps.minor-faults
283598 ± 2% -40.8% 167758 ± 2% perf-stat.ps.node-loads
228693 ± 4% -26.4% 168245 ± 6% perf-stat.ps.node-stores
2773 -4.6% 2647 perf-stat.ps.page-faults
4.761e+11 +84.6% 8.788e+11 perf-stat.total.instructions
0.01 ± 75% +929.7% 0.06 ± 10% perf-sched.sch_delay.avg.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.00 ± 20% +36754.5% 0.68 ± 26% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.00 ± 14% +16358.6% 0.80 ± 23% perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.00 +1250.0% 0.03 ± 5% perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.00 ± 20% +19114.3% 0.45 ± 16% perf-sched.sch_delay.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.00 +3366.7% 0.14 ± 36% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.00 ± 56% +188.5% 0.01 ± 7% perf-sched.sch_delay.avg.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.00 +5083.3% 0.10 ± 9% perf-sched.sch_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
0.00 ± 14% +980.0% 0.04 ±134% perf-sched.sch_delay.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 14% +26266.7% 0.92 ± 45% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.do_select.core_sys_select.kern_select
0.00 ± 8% +956.0% 0.04 ± 82% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.00 ± 19% +14193.3% 0.36 ± 17% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.00 +66.7% 0.01 ± 29% perf-sched.sch_delay.avg.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
0.00 ± 9% +387.0% 0.02 ± 25% perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
0.01 ± 18% +1119.0% 0.09 ±147% perf-sched.sch_delay.avg.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.00 +13975.0% 0.28 ± 41% perf-sched.sch_delay.avg.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
0.00 ± 10% +239.3% 0.02 ± 46% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 29% +3763.8% 0.30 ±185% perf-sched.sch_delay.max.ms.__cond_resched.__mutex_lock.constprop.0.pipe_read
0.00 ± 23% +40978.3% 1.57 ± 56% perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.45 ±131% +1924.0% 9.21 ± 31% perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.00 ± 17% +1.3e+05% 2.79 ± 32% perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.01 ± 21% +36733.3% 2.21 ± 34% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.01 ± 45% +2469.0% 0.30 ±134% perf-sched.sch_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.04 ± 16% +11181.6% 4.61 ± 10% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.01 ± 19% +34075.9% 3.08 ± 24% perf-sched.sch_delay.max.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.do_syscall_64
0.01 ± 30% +15051.8% 1.41 ± 63% perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.43 ±137% +2356.9% 10.45 ± 33% perf-sched.sch_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
2.08 ± 13% +540.7% 13.32 ± 24% perf-sched.sch_delay.max.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
0.01 ± 17% +7221.9% 0.39 ±190% perf-sched.sch_delay.max.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 18% +14615.0% 0.98 ± 45% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
0.00 ± 7% +31034.5% 1.50 ± 40% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.do_select.core_sys_select.kern_select
0.01 ± 10% +5090.9% 0.38 ±114% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.01 ± 55% +34995.7% 2.69 ± 15% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.01 ± 27% +3184.0% 0.27 ±109% perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
49.15 ±138% -94.6% 2.64 ±174% perf-sched.sch_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.01 ± 25% +1541.8% 0.15 ±169% perf-sched.sch_delay.max.ms.syslog_print.do_syslog.kmsg_read.vfs_read
0.00 ± 76% +47250.0% 1.58 ± 35% perf-sched.sch_delay.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
0.05 ± 24% +625.5% 0.34 ±109% perf-sched.sch_delay.max.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 17% +3800.0% 0.08 ± 10% perf-sched.total_sch_delay.average.ms
0.59 ± 3% +138.6% 1.40 ± 2% perf-sched.total_wait_and_delay.average.ms
1558477 ± 3% -63.4% 570364 ± 3% perf-sched.total_wait_and_delay.count.ms
0.58 ± 3% +125.1% 1.31 ± 3% perf-sched.total_wait_time.average.ms
0.12 +66.1% 0.19 ± 9% perf-sched.wait_and_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.36 ± 3% +152.2% 0.91 ± 2% perf-sched.wait_and_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
4.02 ± 9% -21.7% 3.15 ± 5% perf-sched.wait_and_delay.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
25.74 ± 7% +162.9% 67.68 ± 2% perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
101.37 ± 5% -30.7% 70.21 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
118.32 ± 9% +63.5% 193.49 ± 6% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
569.65 ± 2% +14.6% 653.05 perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.83 ±128% +960.0% 8.83 ± 61% perf-sched.wait_and_delay.count.__cond_resched.__do_fault.do_read_fault.do_fault.__handle_mm_fault
4.00 ± 32% +970.8% 42.83 ± 22% perf-sched.wait_and_delay.count.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
75183 -41.6% 43888 ± 2% perf-sched.wait_and_delay.count.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
153.50 ±142% +41336.4% 63604 ± 4% perf-sched.wait_and_delay.count.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
1479460 ± 3% -71.6% 419944 ± 4% perf-sched.wait_and_delay.count.pipe_read.vfs_read.ksys_read.do_syscall_64
598.17 ± 2% -63.1% 220.50 ± 3% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
47.83 ± 4% +47.7% 70.67 ± 7% perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
1609 ± 9% -40.3% 960.00 ± 5% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
221.00 ± 4% -10.6% 197.50 ± 2% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
333.52 ±141% +200.3% 1001 perf-sched.wait_and_delay.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
0.31 ± 4% +37.1% 0.43 ± 2% perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.11 +43.8% 0.16 ± 11% perf-sched.wait_time.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.24 ± 2% +248.2% 0.82 ± 16% perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.36 ± 3% +124.9% 0.80 ± 4% perf-sched.wait_time.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
4.02 ± 9% -22.6% 3.11 ± 5% perf-sched.wait_time.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
25.74 ± 7% +162.9% 67.67 ± 2% perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
0.32 ± 2% +374.5% 1.53 ± 12% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
101.36 ± 5% -30.8% 70.19 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
118.28 ± 9% +63.6% 193.47 ± 6% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.00 ± 99% +23733.3% 0.12 ± 54% perf-sched.wait_time.avg.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
569.65 ± 2% +14.6% 653.04 perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.44 ± 22% +1755.3% 8.17 ± 7% perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.pipe_read.vfs_read.ksys_read
0.39 ± 19% +370.3% 1.83 ± 44% perf-sched.wait_time.max.ms.__cond_resched.mutex_lock.pipe_write.vfs_write.ksys_write
0.39 ± 7% +407.5% 1.98 ± 59% perf-sched.wait_time.max.ms.__cond_resched.pipe_read.vfs_read.ksys_read.do_syscall_64
0.71 ± 2% +343.6% 3.16 ± 21% perf-sched.wait_time.max.ms.do_wait.kernel_wait4.__do_sys_wait4.do_syscall_64
0.19 ± 90% +14957.4% 28.81 ±136% perf-sched.wait_time.max.ms.exit_to_user_mode_loop.exit_to_user_mode_prepare.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt
0.49 ± 31% +337.6% 2.14 ± 56% perf-sched.wait_time.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.pipe_read
0.70 ± 3% +711.4% 5.71 ± 20% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.01 ± 52% +14837.8% 0.92 ± 33% perf-sched.wait_time.max.ms.wait_for_partner.fifo_open.do_dentry_open.do_open
58.50 -58.5 0.00 perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
56.75 -56.8 0.00 perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify
56.74 -56.7 0.00 perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
56.71 -56.7 0.00 perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
53.10 -53.1 0.00 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
52.94 -52.9 0.00 perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
51.58 -51.6 0.00 perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
45.34 -45.3 0.00 perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
9.48 ± 9% -9.2 0.28 ±100% perf-profile.calltrace.cycles-pp.__mutex_lock.pipe_read.vfs_read.ksys_read.do_syscall_64
5.82 ± 10% -5.8 0.00 perf-profile.calltrace.cycles-pp.intel_idle_irq.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
1.95 ± 5% -1.7 0.27 ±100% perf-profile.calltrace.cycles-pp.schedule.pipe_read.vfs_read.ksys_read.do_syscall_64
1.89 ± 4% -1.6 0.26 ±100% perf-profile.calltrace.cycles-pp.__schedule.schedule.pipe_read.vfs_read.ksys_read
1.67 ± 7% +0.2 1.84 perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.pipe_write.vfs_write.ksys_write
1.37 ± 8% +0.2 1.58 perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write.vfs_write
1.28 ± 7% +0.3 1.54 perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.pipe_write
0.00 +0.8 0.75 ± 6% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
6.96 ± 9% +13.3 20.28 ± 4% perf-profile.calltrace.cycles-pp.finish_wait.pipe_read.vfs_read.ksys_read.do_syscall_64
6.48 ± 10% +13.7 20.22 ± 4% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.pipe_read.vfs_read.ksys_read
6.04 ± 11% +14.1 20.16 ± 4% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.pipe_read.vfs_read
34.07 ± 4% +18.3 52.39 ± 5% perf-profile.calltrace.cycles-pp.read
33.53 ± 4% +18.7 52.24 ± 5% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.read
33.45 ± 4% +18.8 52.22 ± 5% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
33.17 ± 4% +19.0 52.16 ± 5% perf-profile.calltrace.cycles-pp.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
33.11 ± 4% +19.0 52.12 ± 5% perf-profile.calltrace.cycles-pp.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe.read
32.91 ± 4% +19.1 52.01 ± 5% perf-profile.calltrace.cycles-pp.pipe_read.vfs_read.ksys_read.do_syscall_64.entry_SYSCALL_64_after_hwframe
8.23 ± 5% +21.5 29.75 ± 6% perf-profile.calltrace.cycles-pp.prepare_to_wait_event.pipe_read.vfs_read.ksys_read.do_syscall_64
5.83 ± 9% +23.6 29.41 ± 6% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.prepare_to_wait_event.pipe_read.vfs_read.ksys_read
4.78 ± 14% +24.5 29.30 ± 6% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.prepare_to_wait_event.pipe_read.vfs_read
6.81 ± 5% +40.7 47.52 ± 6% perf-profile.calltrace.cycles-pp.write
6.51 ± 5% +40.8 47.28 ± 6% perf-profile.calltrace.cycles-pp.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
6.60 ± 5% +40.8 47.42 ± 6% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.write
6.58 ± 5% +40.8 47.40 ± 6% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
6.42 ± 5% +40.8 47.24 ± 6% perf-profile.calltrace.cycles-pp.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe.write
6.22 ± 4% +40.9 47.16 ± 6% perf-profile.calltrace.cycles-pp.pipe_write.vfs_write.ksys_write.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.89 ± 4% +42.6 44.53 ± 6% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.pipe_write.vfs_write.ksys_write
1.67 ± 6% +42.7 44.41 ± 6% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__wake_up_common_lock.pipe_write.vfs_write
3.59 +42.9 46.51 ± 6% perf-profile.calltrace.cycles-pp.__wake_up_common_lock.pipe_write.vfs_write.ksys_write.do_syscall_64
58.50 -58.5 0.00 perf-profile.children.cycles-pp.secondary_startup_64_no_verify
58.50 -58.5 0.00 perf-profile.children.cycles-pp.cpu_startup_entry
58.47 -58.5 0.00 perf-profile.children.cycles-pp.do_idle
56.75 -56.8 0.00 perf-profile.children.cycles-pp.start_secondary
54.57 -54.6 0.00 perf-profile.children.cycles-pp.cpuidle_idle_call
53.16 -53.2 0.00 perf-profile.children.cycles-pp.cpuidle_enter
53.14 -53.1 0.00 perf-profile.children.cycles-pp.cpuidle_enter_state
45.34 -45.3 0.00 perf-profile.children.cycles-pp.intel_idle
11.02 ± 6% -10.5 0.55 ± 24% perf-profile.children.cycles-pp.__mutex_lock
6.92 ± 9% -6.9 0.00 perf-profile.children.cycles-pp.intel_idle_irq
6.16 ± 12% -6.0 0.19 ± 30% perf-profile.children.cycles-pp.osq_lock
3.36 ± 5% -2.8 0.57 ± 8% perf-profile.children.cycles-pp.__schedule
2.48 ± 2% -2.4 0.11 ± 40% perf-profile.children.cycles-pp.mutex_spin_on_owner
2.02 ± 5% -1.5 0.54 ± 9% perf-profile.children.cycles-pp.schedule
1.74 ± 4% -1.5 0.26 ± 2% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
1.51 -1.0 0.50 ± 5% perf-profile.children.cycles-pp.mutex_lock
1.20 ± 4% -0.8 0.36 ± 2% perf-profile.children.cycles-pp.mutex_unlock
0.82 ± 8% -0.7 0.10 ± 10% perf-profile.children.cycles-pp.dequeue_entity
0.89 ± 7% -0.7 0.20 ± 10% perf-profile.children.cycles-pp.dequeue_task_fair
0.91 ± 8% -0.6 0.32 ± 9% perf-profile.children.cycles-pp.ttwu_do_activate
0.85 ± 8% -0.6 0.28 ± 9% perf-profile.children.cycles-pp.activate_task
0.80 ± 7% -0.6 0.23 ± 2% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
0.72 ± 9% -0.6 0.15 ± 11% perf-profile.children.cycles-pp.enqueue_entity
0.62 ± 5% -0.6 0.07 ± 9% perf-profile.children.cycles-pp._raw_spin_lock
0.79 ± 9% -0.5 0.26 ± 9% perf-profile.children.cycles-pp.enqueue_task_fair
0.57 ± 5% -0.5 0.05 ± 45% perf-profile.children.cycles-pp.__list_add_valid_or_report
0.54 ± 4% -0.5 0.08 ± 6% perf-profile.children.cycles-pp.__list_del_entry_valid_or_report
0.61 ± 7% -0.4 0.22 ± 3% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
0.61 ± 7% -0.4 0.22 ± 3% perf-profile.children.cycles-pp.hrtimer_interrupt
0.38 ± 6% -0.4 0.02 ± 99% perf-profile.children.cycles-pp.prepare_task_switch
0.40 ± 8% -0.3 0.06 ± 7% perf-profile.children.cycles-pp.switch_mm_irqs_off
0.52 ± 6% -0.3 0.20 ± 8% perf-profile.children.cycles-pp.update_load_avg
0.44 ± 6% -0.3 0.12 ± 7% perf-profile.children.cycles-pp.pick_next_task_fair
0.50 ± 7% -0.3 0.19 ± 2% perf-profile.children.cycles-pp.__hrtimer_run_queues
0.40 ± 6% -0.3 0.12 ± 11% perf-profile.children.cycles-pp.select_task_rq
0.38 ± 6% -0.3 0.12 ± 8% perf-profile.children.cycles-pp.select_task_rq_fair
0.36 ± 8% -0.2 0.15 ± 3% perf-profile.children.cycles-pp.tick_sched_timer
0.33 ± 8% -0.2 0.15 ± 5% perf-profile.children.cycles-pp.tick_sched_handle
0.32 ± 8% -0.2 0.14 ± 5% perf-profile.children.cycles-pp.update_process_times
0.28 ± 7% -0.1 0.13 ± 2% perf-profile.children.cycles-pp.scheduler_tick
0.20 ± 8% -0.1 0.06 ± 6% perf-profile.children.cycles-pp.perf_event_task_tick
0.20 ± 8% -0.1 0.06 ± 8% perf-profile.children.cycles-pp.perf_adjust_freq_unthr_context
0.16 ± 9% -0.1 0.04 ± 71% perf-profile.children.cycles-pp.__update_load_avg_se
0.26 ± 6% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.22 ± 7% -0.1 0.12 ± 7% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.23 ± 10% -0.1 0.14 ± 10% perf-profile.children.cycles-pp.update_curr
0.16 ± 7% -0.1 0.07 ± 9% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.15 ± 11% -0.1 0.07 ± 13% perf-profile.children.cycles-pp.__entry_text_start
0.16 ± 15% -0.1 0.08 ± 12% perf-profile.children.cycles-pp.security_file_permission
0.14 ± 17% -0.1 0.07 ± 11% perf-profile.children.cycles-pp.reweight_entity
0.20 ± 8% -0.1 0.13 ± 11% perf-profile.children.cycles-pp.copy_page_to_iter
0.19 ± 8% -0.1 0.12 ± 11% perf-profile.children.cycles-pp._copy_to_iter
0.10 ± 15% -0.1 0.04 ± 71% perf-profile.children.cycles-pp.anon_pipe_buf_release
0.13 ± 15% -0.1 0.07 ± 13% perf-profile.children.cycles-pp.apparmor_file_permission
0.10 ± 17% -0.1 0.04 ± 45% perf-profile.children.cycles-pp.__fdget_pos
0.17 ± 9% -0.1 0.12 ± 12% perf-profile.children.cycles-pp.copyout
0.11 ± 16% -0.1 0.05 ± 8% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
0.16 ± 13% -0.0 0.11 ± 13% perf-profile.children.cycles-pp.rep_movs_alternative
0.00 +0.1 0.05 ± 8% perf-profile.children.cycles-pp.task_tick_fair
0.00 +0.1 0.07 ± 11% perf-profile.children.cycles-pp.wake_affine
1.67 ± 7% +0.2 1.84 perf-profile.children.cycles-pp.__wake_up_common
1.38 ± 8% +0.2 1.58 perf-profile.children.cycles-pp.autoremove_wake_function
1.32 ± 7% +0.2 1.56 perf-profile.children.cycles-pp.try_to_wake_up
6.96 ± 9% +13.3 20.28 ± 4% perf-profile.children.cycles-pp.finish_wait
34.12 ± 4% +18.3 52.42 ± 5% perf-profile.children.cycles-pp.read
33.18 ± 4% +19.0 52.16 ± 5% perf-profile.children.cycles-pp.ksys_read
33.11 ± 4% +19.0 52.12 ± 5% perf-profile.children.cycles-pp.vfs_read
32.94 ± 4% +19.1 52.02 ± 5% perf-profile.children.cycles-pp.pipe_read
8.24 ± 5% +21.5 29.77 ± 6% perf-profile.children.cycles-pp.prepare_to_wait_event
6.86 ± 5% +40.7 47.55 ± 6% perf-profile.children.cycles-pp.write
6.52 ± 5% +40.8 47.29 ± 6% perf-profile.children.cycles-pp.ksys_write
6.43 ± 5% +40.8 47.26 ± 6% perf-profile.children.cycles-pp.vfs_write
6.23 ± 4% +40.9 47.16 ± 6% perf-profile.children.cycles-pp.pipe_write
3.59 +42.9 46.53 ± 6% perf-profile.children.cycles-pp.__wake_up_common_lock
40.31 ± 3% +59.4 99.70 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
40.20 ± 3% +59.5 99.67 perf-profile.children.cycles-pp.do_syscall_64
14.86 ± 10% +79.0 93.86 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
14.54 ± 8% +80.4 94.93 perf-profile.children.cycles-pp._raw_spin_lock_irqsave
45.34 -45.3 0.00 perf-profile.self.cycles-pp.intel_idle
6.59 ± 9% -6.6 0.00 perf-profile.self.cycles-pp.intel_idle_irq
6.15 ± 12% -6.0 0.19 ± 30% perf-profile.self.cycles-pp.osq_lock
2.46 -2.4 0.10 ± 41% perf-profile.self.cycles-pp.mutex_spin_on_owner
1.98 -1.8 0.22 ± 13% perf-profile.self.cycles-pp.__mutex_lock
1.80 ± 3% -1.6 0.23 ± 7% perf-profile.self.cycles-pp.prepare_to_wait_event
1.48 ± 2% -1.0 0.44 ± 4% perf-profile.self.cycles-pp.mutex_lock
2.04 ± 9% -1.0 1.07 ± 4% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
1.14 ± 28% -0.9 0.20 ± 27% perf-profile.self.cycles-pp.pipe_read
1.20 ± 4% -0.8 0.36 perf-profile.self.cycles-pp.mutex_unlock
0.61 ± 5% -0.5 0.06 ± 11% perf-profile.self.cycles-pp._raw_spin_lock
0.57 ± 5% -0.5 0.05 ± 45% perf-profile.self.cycles-pp.__list_add_valid_or_report
0.54 ± 4% -0.5 0.08 ± 6% perf-profile.self.cycles-pp.__list_del_entry_valid_or_report
0.39 ± 9% -0.3 0.06 ± 7% perf-profile.self.cycles-pp.switch_mm_irqs_off
0.34 ± 6% -0.3 0.05 ± 44% perf-profile.self.cycles-pp.__schedule
0.28 ± 33% -0.2 0.08 ± 29% perf-profile.self.cycles-pp.pipe_write
0.16 ± 9% -0.1 0.03 ± 70% perf-profile.self.cycles-pp.__update_load_avg_se
0.18 ± 9% -0.1 0.08 ± 5% perf-profile.self.cycles-pp.update_load_avg
0.16 ± 9% -0.1 0.07 ± 10% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.10 ± 16% -0.1 0.04 ± 71% perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
0.10 ± 18% -0.1 0.04 ± 71% perf-profile.self.cycles-pp.anon_pipe_buf_release
0.12 ± 14% -0.1 0.06 ± 15% perf-profile.self.cycles-pp.apparmor_file_permission
0.08 ± 20% -0.0 0.03 ± 70% perf-profile.self.cycles-pp.vfs_read
0.14 ± 13% -0.0 0.10 ± 11% perf-profile.self.cycles-pp.rep_movs_alternative
0.06 ± 18% +0.2 0.28 ± 3% perf-profile.self.cycles-pp.try_to_wake_up
14.85 ± 10% +79.0 93.86 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2023-10-20 14:17 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-10-20 14:16 [brauner-vfs:vfs.misc.backing_file] [fs/pipe] 8114dc703a: stress-ng.pipeherd.ops_per_sec 70.9% improvement kernel test robot
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).