* [peterz-queue:sched/core] [sched/deadline] 7bca60ce65: will-it-scale.per_process_ops 63.0% regression
@ 2025-06-03 13:48 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2025-06-03 13:48 UTC (permalink / raw)
To: Peter Zijlstra
Cc: oe-lkp, lkp, Chris Mason, linux-kernel, aubrey.li, yu.c.chen,
oliver.sang
Hello,
kernel test robot noticed a 63.0% regression of will-it-scale.per_process_ops on:
commit: 7bca60ce656978b01caafc4e2285272ae9631953 ("sched/deadline: Less agressive dl_server handling")
https://git.kernel.org/cgit/linux/kernel/git/peterz/queue.git sched/core
testcase: will-it-scale
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 256 threads 2 sockets Intel(R) Xeon(R) 6767P CPU @ 2.4GHz (Granite Rapids) with 256G memory
parameters:
nr_task: 100%
mode: process
test: pwrite2
cpufreq_governor: performance
In addition to that, the commit also has significant impact on the following tests:
+------------------+---------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.min-nanosleep.ops_per_sec 8.0% improvement |
| test machine | 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory |
| test parameters | cpufreq_governor=performance |
| | nr_threads=100% |
| | test=min-nanosleep |
| | testtime=60s |
+------------------+---------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.rawudp.packets_32_bytes_received_per_sec 45.9% improvement |
| test machine | 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory |
| test parameters | cpufreq_governor=performance |
| | nr_threads=100% |
| | test=rawudp |
| | testtime=60s |
+------------------+---------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.membarrier.ops_per_sec 18.2% improvement |
| test machine | 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory |
| test parameters | cpufreq_governor=performance |
| | nr_threads=100% |
| | test=membarrier |
| | testtime=60s |
+------------------+---------------------------------------------------------------------------------------------+
| testcase: change | stress-ng: stress-ng.timer.ops_per_sec 265.1% improvement |
| test machine | 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480CTDX (Sapphire Rapids) with 512G memory |
| test parameters | cpufreq_governor=performance |
| | nr_threads=100% |
| | test=timer |
| | testtime=60s |
+------------------+---------------------------------------------------------------------------------------------+
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202506032142.4f7b26c9-lkp@intel.com
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250603/202506032142.4f7b26c9-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
gcc-12/performance/x86_64-rhel-9.4/process/100%/debian-12-x86_64-20240206.cgz/lkp-gnr-2sp3/pwrite2/will-it-scale
commit:
032e1003ad ("sched: Make clangd usable")
7bca60ce65 ("sched/deadline: Less agressive dl_server handling")
032e1003ad358e4c 7bca60ce656978b01caafc4e228
---------------- ---------------------------
%stddev %change %stddev
\ | \
62171 +33.9% 83247 ± 4% uptime.idle
2.143e+09 +986.3% 2.328e+10 ± 16% cpuidle..time
5220618 ± 13% +300.1% 20887937 ± 10% cpuidle..usage
9915050 ± 10% -17.6% 8167082 ± 6% meminfo.DirectMap2M
1378641 -15.6% 1164078 ± 4% meminfo.Shmem
22480 ± 5% -17.7% 18493 ± 2% perf-c2c.DRAM.remote
14405 ± 7% -19.3% 11619 perf-c2c.HITM.remote
32559 ± 5% -14.0% 28016 ± 2% perf-c2c.HITM.total
522414 -63.0% 193394 ± 13% will-it-scale.256.processes
2.23 ± 2% +1197.8% 28.98 ± 16% will-it-scale.256.processes_idle
2040 -63.0% 754.83 ± 13% will-it-scale.per_process_ops
522414 -63.0% 193394 ± 13% will-it-scale.workload
2.73 ± 2% +26.6 29.35 ± 16% mpstat.cpu.all.idle%
0.48 +0.7 1.13 ± 6% mpstat.cpu.all.irq%
0.01 ± 5% +0.1 0.09 ± 19% mpstat.cpu.all.soft%
96.71 -27.4 69.35 ± 6% mpstat.cpu.all.sys%
99.21 -26.1% 73.30 ± 8% mpstat.max_utilization_pct
3.15 ± 4% +843.3% 29.74 ± 16% vmstat.cpu.id
96.77 -27.5% 70.18 ± 6% vmstat.cpu.sy
226.34 -20.5% 179.86 ± 6% vmstat.procs.r
44868 ± 12% +164.0% 118461 ± 14% vmstat.system.cs
338462 -9.1% 307666 ± 6% vmstat.system.in
554522 -9.6% 501067 ± 2% proc-vmstat.nr_inactive_anon
1227403 -4.4% 1173704 proc-vmstat.nr_mapped
127426 -1.3% 125816 proc-vmstat.nr_slab_reclaimable
344630 -15.6% 290933 ± 4% proc-vmstat.nr_writeback_temp
554522 -9.6% 501067 ± 2% proc-vmstat.nr_zone_active_anon
10697 ± 27% +133.3% 24951 ± 15% proc-vmstat.numa_hint_faults
1539573 -5.9% 1449401 proc-vmstat.numa_interleave
1914686 -3.5% 1847747 proc-vmstat.pgalloc_dma32
4.307e+10 -25.7% 3.2e+10 ± 5% perf-stat.i.branch-instructions
0.02 +0.0 0.03 ± 5% perf-stat.i.branch-miss-rate%
37.57 ± 2% -4.4 33.13 ± 2% perf-stat.i.cache-miss-rate%
24711200 ± 2% -24.3% 18713324 ± 8% perf-stat.i.cache-misses
69291497 ± 5% -16.8% 57684629 ± 8% perf-stat.i.cache-references
45155 ± 12% +165.4% 119829 ± 14% perf-stat.i.context-switches
4.16 -2.8% 4.04 perf-stat.i.cpi
8.973e+11 -27.7% 6.483e+11 ± 6% perf-stat.i.cpu-cycles
2.153e+11 -25.7% 1.599e+11 ± 5% perf-stat.i.instructions
0.24 +2.9% 0.25 perf-stat.i.ipc
0.01 +0.0 0.02 ± 7% perf-stat.overall.branch-miss-rate%
35.74 ± 4% -3.9 31.80 ± 3% perf-stat.overall.cache-miss-rate%
4.17 -2.7% 4.05 perf-stat.overall.cpi
0.24 +2.8% 0.25 perf-stat.overall.ipc
1.242e+08 +103.6% 2.529e+08 ± 11% perf-stat.overall.path-length
4.292e+10 -25.7% 3.191e+10 ± 5% perf-stat.ps.branch-instructions
24658370 ± 2% -24.0% 18732132 ± 8% perf-stat.ps.cache-misses
69159431 ± 5% -14.7% 59026367 ± 9% perf-stat.ps.cache-references
44956 ± 12% +164.5% 118893 ± 14% perf-stat.ps.context-switches
8.943e+11 -27.7% 6.466e+11 ± 6% perf-stat.ps.cpu-cycles
2.146e+11 -25.7% 1.594e+11 ± 5% perf-stat.ps.instructions
6.488e+13 -25.6% 4.825e+13 ± 5% perf-stat.total.instructions
36962481 -46.0% 19945596 ± 15% sched_debug.cfs_rq:/.avg_vruntime.avg
37099616 -24.3% 28080467 ± 8% sched_debug.cfs_rq:/.avg_vruntime.max
32455889 ± 4% -87.4% 4074413 ± 13% sched_debug.cfs_rq:/.avg_vruntime.min
320613 ± 30% +3264.0% 10785304 ± 5% sched_debug.cfs_rq:/.avg_vruntime.stddev
0.77 ± 2% -22.7% 0.60 ± 7% sched_debug.cfs_rq:/.h_nr_queued.avg
1.81 ± 6% -38.5% 1.11 ± 16% sched_debug.cfs_rq:/.h_nr_queued.max
0.25 ± 14% +57.4% 0.40 ± 8% sched_debug.cfs_rq:/.h_nr_queued.stddev
0.77 ± 2% -22.7% 0.60 ± 7% sched_debug.cfs_rq:/.h_nr_runnable.avg
1.81 ± 6% -43.1% 1.03 ± 6% sched_debug.cfs_rq:/.h_nr_runnable.max
0.25 ± 14% +57.6% 0.40 ± 8% sched_debug.cfs_rq:/.h_nr_runnable.stddev
1.67 ± 14% -80.0% 0.33 ± 40% sched_debug.cfs_rq:/.load_avg.min
36962481 -46.0% 19945596 ± 15% sched_debug.cfs_rq:/.min_vruntime.avg
37099616 -24.3% 28080467 ± 8% sched_debug.cfs_rq:/.min_vruntime.max
32455889 ± 4% -87.4% 4074413 ± 13% sched_debug.cfs_rq:/.min_vruntime.min
320613 ± 30% +3264.0% 10785304 ± 5% sched_debug.cfs_rq:/.min_vruntime.stddev
0.77 ± 2% -22.3% 0.60 ± 7% sched_debug.cfs_rq:/.nr_queued.avg
0.24 ± 13% +67.1% 0.40 ± 8% sched_debug.cfs_rq:/.nr_queued.stddev
781.09 -25.1% 584.71 ± 8% sched_debug.cfs_rq:/.runnable_avg.avg
1446 ± 9% -28.5% 1033 ± 6% sched_debug.cfs_rq:/.runnable_avg.max
419.36 ± 12% -83.6% 68.78 ± 39% sched_debug.cfs_rq:/.runnable_avg.min
100.55 ± 4% +238.4% 340.26 ± 7% sched_debug.cfs_rq:/.runnable_avg.stddev
779.02 -25.0% 584.31 ± 9% sched_debug.cfs_rq:/.util_avg.avg
1232 ± 7% -18.4% 1005 ± 2% sched_debug.cfs_rq:/.util_avg.max
382.89 ± 18% -83.2% 64.39 ± 40% sched_debug.cfs_rq:/.util_avg.min
94.04 ± 4% +261.5% 339.97 ± 7% sched_debug.cfs_rq:/.util_avg.stddev
736.25 ± 2% -25.7% 547.37 ± 9% sched_debug.cfs_rq:/.util_est.avg
1379 ± 13% -28.6% 984.69 ± 2% sched_debug.cfs_rq:/.util_est.max
224.52 ± 14% +73.9% 390.37 ± 7% sched_debug.cfs_rq:/.util_est.stddev
711444 ± 6% +28.6% 914604 sched_debug.cpu.avg_idle.avg
325251 ± 3% -30.9% 224817 ± 7% sched_debug.cpu.avg_idle.stddev
483.08 +228.2% 1585 ± 30% sched_debug.cpu.clock_task.stddev
4766 ± 2% -22.4% 3697 ± 7% sched_debug.cpu.curr->pid.avg
8228 ± 8% +18.1% 9717 sched_debug.cpu.curr->pid.max
1453 ± 13% +69.5% 2464 ± 8% sched_debug.cpu.curr->pid.stddev
0.00 ± 6% +17.8% 0.00 ± 9% sched_debug.cpu.next_balance.stddev
0.77 ± 2% -23.1% 0.59 ± 7% sched_debug.cpu.nr_running.avg
1.78 ± 8% -37.5% 1.11 ± 16% sched_debug.cpu.nr_running.max
0.25 ± 13% +57.6% 0.40 ± 8% sched_debug.cpu.nr_running.stddev
27430 ± 11% +154.7% 69851 ± 14% sched_debug.cpu.nr_switches.avg
12269 ± 6% -87.3% 1561 ± 10% sched_debug.cpu.nr_switches.min
17965 ± 43% +441.4% 97256 ± 24% sched_debug.cpu.nr_switches.stddev
0.07 ± 30% +248.0% 0.25 ± 19% sched_debug.cpu.nr_uninterruptible.avg
0.02 ±104% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
0.09 ± 8% +28.4% 0.11 ± 3% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.07 ±169% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.01 ± 22% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
0.00 ± 19% +288.9% 0.01 ± 27% perf-sched.sch_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
0.07 ± 51% -97.7% 0.00 ±223% perf-sched.sch_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
0.03 ± 85% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_write_begin
0.19 ± 17% -96.5% 0.01 ± 10% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.02 ±139% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
1.09 ± 33% -99.1% 0.01 ± 26% perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.04 ± 43% -82.8% 0.01 ± 12% perf-sched.sch_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
0.02 ± 74% -84.6% 0.00 ± 11% perf-sched.sch_delay.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
0.10 ± 42% -90.3% 0.01 ± 10% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.02 ± 75% -100.0% 0.00 perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
0.01 ± 48% -100.0% 0.00 perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
0.03 ±168% -100.0% 0.00 perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown].[unknown]
0.01 ± 17% -100.0% 0.00 perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
0.66 ± 5% -90.0% 0.07 ± 68% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
0.21 ± 30% -95.7% 0.01 ± 49% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.02 ± 18% -76.4% 0.01 ± 13% perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
0.07 ± 61% -90.4% 0.01 ± 15% perf-sched.sch_delay.avg.ms.wait_for_partner.fifo_open.do_dentry_open.vfs_open
0.01 ± 16% +24449.4% 3.31 ±220% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
1.01 ±139% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
0.36 ±203% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.02 ± 26% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
0.01 ± 35% +216.3% 0.02 ± 97% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
3.59 ± 7% -99.9% 0.00 ±223% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
1.52 ±100% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_write_begin
2.48 ± 19% -99.6% 0.01 ± 20% perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.13 ±201% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
2.41 ± 44% -99.4% 0.02 ± 29% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
3.35 ± 23% -99.1% 0.03 ± 31% perf-sched.sch_delay.max.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
2.82 ± 26% -58.6% 1.17 ± 39% perf-sched.sch_delay.max.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
2.37 ± 33% -98.3% 0.04 ± 46% perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.05 ± 84% -100.0% 0.00 perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
0.15 ± 84% -100.0% 0.00 perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
0.44 ±194% -100.0% 0.00 perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown].[unknown]
0.02 ± 42% -100.0% 0.00 perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
0.13 ± 61% -94.3% 0.01 ±135% perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
3.64 ± 5% -76.5% 0.86 ± 43% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
3.32 ± 18% -94.3% 0.19 ±194% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
2.84 ± 12% -98.5% 0.04 ± 15% perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
2.37 ± 48% -98.9% 0.03 ± 48% perf-sched.sch_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
1.58 ± 61% -99.0% 0.02 ± 7% perf-sched.sch_delay.max.ms.wait_for_partner.fifo_open.do_dentry_open.vfs_open
14.39 ± 8% -41.5% 8.42 ± 9% perf-sched.total_wait_and_delay.average.ms
110674 ± 7% +111.8% 234429 ± 8% perf-sched.total_wait_and_delay.count.ms
4196 ± 4% -17.8% 3448 ± 12% perf-sched.total_wait_and_delay.max.ms
14.36 ± 8% -41.5% 8.40 ± 9% perf-sched.total_wait_time.average.ms
4196 ± 4% -17.8% 3448 ± 12% perf-sched.total_wait_time.max.ms
103.13 ± 33% +97.9% 204.04 ± 2% perf-sched.wait_and_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
40.08 ± 58% -51.0% 19.63 ± 2% perf-sched.wait_and_delay.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
0.36 ± 15% +403.1% 1.83 ± 14% perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.shmem_file_write_iter
655.89 ± 2% -10.9% 584.12 ± 5% perf-sched.wait_and_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
403.33 ± 64% -99.3% 2.83 ± 89% perf-sched.wait_and_delay.count.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
439.00 ± 38% -55.0% 197.67 ± 2% perf-sched.wait_and_delay.count.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
871.17 ± 39% +63.0% 1419 ± 2% perf-sched.wait_and_delay.count.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
102542 ± 7% +121.9% 227581 ± 9% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.shmem_file_write_iter
653.50 ± 2% -72.0% 183.17 ±141% perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
1944 +13.7% 2211 ± 3% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
439.50 ± 4% -35.6% 283.00 ± 12% perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
1.19 ±145% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
0.01 ± 65% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.01 ± 22% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
2.52 ± 28% +115.7% 5.45 ± 12% perf-sched.wait_time.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
4.99 ±132% +713.8% 40.64 ± 70% perf-sched.wait_time.avg.ms.__cond_resched.mutex_lock.perf_poll.do_poll.constprop
83.35 ±223% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
103.08 ± 33% +97.9% 204.03 ± 2% perf-sched.wait_time.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
40.06 ± 58% -51.0% 19.63 ± 2% perf-sched.wait_time.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
0.10 ±128% -100.0% 0.00 perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
0.05 ± 85% -100.0% 0.00 perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
0.04 ±125% -100.0% 0.00 perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown].[unknown]
0.01 ± 48% -100.0% 0.00 perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
1.55 ± 39% +184.1% 4.40 ± 27% perf-sched.wait_time.avg.ms.rcu_gp_kthread.kthread.ret_from_fork.ret_from_fork_asm
0.34 ± 16% +431.0% 1.82 ± 15% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.shmem_file_write_iter
0.93 ± 17% -30.3% 0.65 ± 4% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
7.37 ± 2% +13.2% 8.34 ± 4% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
655.87 ± 2% -11.4% 580.80 ± 4% perf-sched.wait_time.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
80.44 ±140% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.__alloc_frozen_pages_noprof.alloc_pages_mpol.folio_alloc_mpol_noprof.shmem_alloc_folio
0.02 ± 58% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.02 ± 26% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
174.94 ±211% -95.6% 7.61 ± 10% perf-sched.wait_time.max.ms.__cond_resched.shmem_get_folio_gfp.shmem_write_begin.generic_perform_write.shmem_file_write_iter
166.80 ±223% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
0.28 ±133% -100.0% 0.00 perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
0.82 ± 97% -100.0% 0.00 perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown]
0.70 ±120% -100.0% 0.00 perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_call_function_single.[unknown].[unknown]
0.05 ±110% -100.0% 0.00 perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
334.66 ±141% -99.9% 0.21 ±220% perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
9.25 ±103% -71.0% 2.68 ± 9% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
98.80 -1.8 97.02 perf-profile.calltrace.cycles-pp.osq_lock.rwsem_down_write_slowpath.down_write.shmem_file_write_iter.vfs_write
99.20 -1.0 98.24 perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64
99.22 -0.9 98.29 perf-profile.calltrace.cycles-pp.down_write.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64
99.59 -0.6 99.02 perf-profile.calltrace.cycles-pp.shmem_file_write_iter.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe
99.60 -0.6 99.03 perf-profile.calltrace.cycles-pp.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
99.60 -0.6 99.03 perf-profile.calltrace.cycles-pp.vfs_write.__x64_sys_pwrite64.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
99.65 -0.5 99.12 perf-profile.calltrace.cycles-pp.__libc_pwrite
99.64 -0.5 99.12 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_pwrite
99.64 -0.5 99.12 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__libc_pwrite
0.51 +0.9 1.43 ± 6% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.osq_lock.rwsem_down_write_slowpath.down_write.shmem_file_write_iter
0.00 +1.2 1.15 ± 6% perf-profile.calltrace.cycles-pp.update_process_times.tick_nohz_handler.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt
0.00 +1.2 1.20 ± 6% perf-profile.calltrace.cycles-pp.tick_nohz_handler.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt
0.00 +1.2 1.21 ± 6% perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.00 +1.4 1.38 ± 6% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.osq_lock
0.00 +1.4 1.38 ± 6% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.osq_lock.rwsem_down_write_slowpath
0.00 +1.4 1.41 ± 6% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.osq_lock.rwsem_down_write_slowpath.down_write
98.82 -1.8 97.04 perf-profile.children.cycles-pp.osq_lock
99.20 -1.0 98.24 perf-profile.children.cycles-pp.rwsem_down_write_slowpath
99.23 -0.9 98.29 perf-profile.children.cycles-pp.down_write
99.60 -0.6 99.03 perf-profile.children.cycles-pp.__x64_sys_pwrite64
99.66 -0.5 99.12 perf-profile.children.cycles-pp.__libc_pwrite
99.70 -0.5 99.20 perf-profile.children.cycles-pp.shmem_file_write_iter
99.71 -0.5 99.22 perf-profile.children.cycles-pp.vfs_write
99.78 -0.4 99.33 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
99.77 -0.4 99.33 perf-profile.children.cycles-pp.do_syscall_64
0.02 ±141% +0.0 0.06 ± 9% perf-profile.children.cycles-pp.shmem_get_folio_gfp
0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.update_rq_clock_task
0.00 +0.1 0.06 ± 8% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.00 +0.1 0.06 ± 9% perf-profile.children.cycles-pp.wake_q_add
0.05 +0.1 0.12 ± 18% perf-profile.children.cycles-pp.shmem_write_end
0.00 +0.1 0.06 ± 17% perf-profile.children.cycles-pp.sched_ttwu_pending
0.20 ± 4% +0.1 0.27 ± 10% perf-profile.children.cycles-pp.copy_page_from_iter_atomic
0.10 ± 13% +0.1 0.17 ± 19% perf-profile.children.cycles-pp.perf_mmap__read_head
0.00 +0.1 0.07 ± 18% perf-profile.children.cycles-pp.idle_cpu
0.06 ± 6% +0.1 0.13 ± 6% perf-profile.children.cycles-pp.clockevents_program_event
0.09 ± 12% +0.1 0.17 ± 18% perf-profile.children.cycles-pp.ring_buffer_read_head
0.00 +0.1 0.08 ± 8% perf-profile.children.cycles-pp.raw_spin_rq_lock_nested
0.00 +0.1 0.08 ± 8% perf-profile.children.cycles-pp._raw_spin_lock
0.00 +0.1 0.08 ± 13% perf-profile.children.cycles-pp.__flush_smp_call_function_queue
0.00 +0.1 0.09 ± 12% perf-profile.children.cycles-pp.__sysvec_call_function_single
0.00 +0.1 0.09 ± 12% perf-profile.children.cycles-pp.kick_ilb
0.06 +0.1 0.15 ± 3% perf-profile.children.cycles-pp.get_jiffies_update
0.06 +0.1 0.15 ± 4% perf-profile.children.cycles-pp.tmigr_requires_handle_remote
0.00 +0.1 0.10 ± 19% perf-profile.children.cycles-pp.folio_unlock
0.05 +0.1 0.15 ± 5% perf-profile.children.cycles-pp.ktime_get
0.00 +0.1 0.10 ± 7% perf-profile.children.cycles-pp.update_curr
0.00 +0.1 0.11 ± 26% perf-profile.children.cycles-pp._nohz_idle_balance
0.23 ± 3% +0.1 0.36 ± 7% perf-profile.children.cycles-pp.rwsem_spin_on_owner
0.00 +0.1 0.12 ± 4% perf-profile.children.cycles-pp.update_load_avg
0.00 +0.1 0.13 ± 14% perf-profile.children.cycles-pp.try_to_wake_up
0.12 ± 6% +0.1 0.25 ± 10% perf-profile.children.cycles-pp.update_cfs_group
0.00 +0.1 0.13 ± 22% perf-profile.children.cycles-pp.handle_softirqs
0.00 +0.1 0.14 ± 22% perf-profile.children.cycles-pp.__irq_exit_rcu
0.00 +0.1 0.14 ± 14% perf-profile.children.cycles-pp.wake_up_q
0.34 ± 5% +0.2 0.49 ± 6% perf-profile.children.cycles-pp.generic_perform_write
0.03 ±100% +0.2 0.20 ± 14% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.00 +0.2 0.21 ± 15% perf-profile.children.cycles-pp.sysvec_call_function_single
0.00 +0.2 0.21 ± 31% perf-profile.children.cycles-pp.update_sg_lb_stats
0.00 +0.2 0.22 ± 30% perf-profile.children.cycles-pp.sched_balance_find_src_group
0.00 +0.2 0.22 ± 30% perf-profile.children.cycles-pp.update_sd_lb_stats
0.00 +0.2 0.23 ± 31% perf-profile.children.cycles-pp.sched_balance_newidle
0.00 +0.2 0.24 ± 15% perf-profile.children.cycles-pp.asm_sysvec_call_function_single
0.00 +0.2 0.24 ± 32% perf-profile.children.cycles-pp.sched_balance_rq
0.00 +0.2 0.24 ± 28% perf-profile.children.cycles-pp.pick_next_task_fair
0.00 +0.3 0.26 ± 27% perf-profile.children.cycles-pp.__pick_next_task
0.08 ± 11% +0.3 0.34 ± 10% perf-profile.children.cycles-pp.rwsem_wake
0.19 ± 4% +0.3 0.45 ± 7% perf-profile.children.cycles-pp.task_tick_fair
0.11 ± 8% +0.3 0.40 ± 11% perf-profile.children.cycles-pp.up_write
0.00 +0.3 0.30 ± 20% perf-profile.children.cycles-pp.schedule_preempt_disabled
0.00 +0.3 0.30 ± 15% perf-profile.children.cycles-pp.nohz_balancer_kick
0.00 +0.3 0.31 ± 20% perf-profile.children.cycles-pp.schedule
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.acpi_idle_do_entry
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.acpi_idle_enter
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.acpi_safe_halt
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.pv_native_safe_halt
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.cpuidle_enter
0.00 +0.3 0.32 ± 14% perf-profile.children.cycles-pp.cpuidle_enter_state
0.24 ± 3% +0.3 0.57 ± 7% perf-profile.children.cycles-pp.sched_tick
0.01 ±223% +0.3 0.35 ± 16% perf-profile.children.cycles-pp.__schedule
0.00 +0.4 0.35 ± 15% perf-profile.children.cycles-pp.cpuidle_idle_call
0.00 +0.4 0.36 ± 18% perf-profile.children.cycles-pp.start_secondary
0.00 +0.4 0.37 ± 24% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.00 +0.4 0.41 ± 14% perf-profile.children.cycles-pp.common_startup_64
0.00 +0.4 0.41 ± 14% perf-profile.children.cycles-pp.cpu_startup_entry
0.00 +0.4 0.41 ± 14% perf-profile.children.cycles-pp.do_idle
0.00 +0.4 0.45 ± 24% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
0.36 ± 2% +0.8 1.20 ± 6% perf-profile.children.cycles-pp.update_process_times
0.38 +0.9 1.26 ± 6% perf-profile.children.cycles-pp.tick_nohz_handler
0.38 +0.9 1.27 ± 6% perf-profile.children.cycles-pp.__hrtimer_run_queues
0.48 +1.0 1.46 ± 5% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
0.48 +1.0 1.46 ± 5% perf-profile.children.cycles-pp.hrtimer_interrupt
0.50 +1.0 1.51 ± 5% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
0.52 +1.0 1.55 ± 5% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
98.32 -2.7 95.61 perf-profile.self.cycles-pp.osq_lock
0.02 ± 99% +0.0 0.07 ± 10% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.06 ± 7% +0.1 0.12 ± 21% perf-profile.self.cycles-pp.rwsem_down_write_slowpath
0.00 +0.1 0.05 ± 8% perf-profile.self.cycles-pp.up_write
0.00 +0.1 0.06 ± 9% perf-profile.self.cycles-pp.wake_q_add
0.20 ± 4% +0.1 0.26 ± 10% perf-profile.self.cycles-pp.copy_page_from_iter_atomic
0.00 +0.1 0.06 ± 17% perf-profile.self.cycles-pp._raw_spin_lock
0.00 +0.1 0.07 ± 10% perf-profile.self.cycles-pp.update_load_avg
0.00 +0.1 0.07 ± 18% perf-profile.self.cycles-pp.idle_cpu
0.09 ± 12% +0.1 0.16 ± 18% perf-profile.self.cycles-pp.ring_buffer_read_head
0.06 +0.1 0.15 ± 3% perf-profile.self.cycles-pp.get_jiffies_update
0.05 +0.1 0.15 ± 7% perf-profile.self.cycles-pp.ktime_get
0.00 +0.1 0.10 ± 19% perf-profile.self.cycles-pp.folio_unlock
0.23 ± 3% +0.1 0.35 ± 6% perf-profile.self.cycles-pp.rwsem_spin_on_owner
0.12 ± 6% +0.1 0.25 ± 10% perf-profile.self.cycles-pp.update_cfs_group
0.00 +0.2 0.16 ± 31% perf-profile.self.cycles-pp.update_sg_lb_stats
0.00 +0.2 0.25 ± 13% perf-profile.self.cycles-pp.nohz_balancer_kick
0.00 +0.4 0.45 ± 24% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
***************************************************************************************************
lkp-srf-2sp2: 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-srf-2sp2/min-nanosleep/stress-ng/60s
commit:
032e1003ad ("sched: Make clangd usable")
7bca60ce65 ("sched/deadline: Less agressive dl_server handling")
032e1003ad358e4c 7bca60ce656978b01caafc4e228
---------------- ---------------------------
%stddev %change %stddev
\ | \
51713 ± 91% +171.0% 140151 ± 27% numa-numastat.node1.other_node
51713 ± 91% +171.0% 140151 ± 27% numa-vmstat.node1.numa_local
7.78 ± 7% -6.8 0.96 ± 2% mpstat.cpu.all.irq%
0.52 ± 4% -0.2 0.33 ± 3% mpstat.cpu.all.sys%
11.42 ± 19% -60.5% 4.51 ± 4% mpstat.max_utilization_pct
206334 +8.0% 222876 stress-ng.min-nanosleep.ops
3437 +8.0% 3713 stress-ng.min-nanosleep.ops_per_sec
72619226 +8.0% 78450215 stress-ng.time.voluntary_context_switches
91.34 +7.6% 98.24 vmstat.cpu.id
2300297 +8.0% 2485443 vmstat.system.cs
1296371 +5.1% 1362893 vmstat.system.in
1165012 +4.4% 1216559 proc-vmstat.nr_inactive_anon
1878335 +2.8% 1930467 proc-vmstat.nr_mapped
3264015 +1.6% 3315154 proc-vmstat.nr_unaccepted
990621 +5.3% 1042760 proc-vmstat.nr_writeback_temp
1165012 +4.4% 1216559 proc-vmstat.nr_zone_active_anon
1911876 +3.8% 1984414 proc-vmstat.numa_interleave
2154719 +3.6% 2232864 proc-vmstat.pgalloc_dma32
903.88 ± 11% -30.3% 629.56 ± 14% sched_debug.cfs_rq:/.avg_vruntime.min
6336 ± 29% -59.2% 2583 ± 8% sched_debug.cfs_rq:/.left_deadline.max
555.30 ± 26% -52.9% 261.73 ± 14% sched_debug.cfs_rq:/.left_deadline.stddev
6284 ± 30% -59.0% 2579 ± 8% sched_debug.cfs_rq:/.left_vruntime.max
551.08 ± 26% -52.6% 261.08 ± 14% sched_debug.cfs_rq:/.left_vruntime.stddev
187380 ± 5% +35.3% 253537 ± 5% sched_debug.cfs_rq:/.load.max
903.88 ± 11% -30.3% 629.56 ± 14% sched_debug.cfs_rq:/.min_vruntime.min
6284 ± 30% -59.0% 2579 ± 8% sched_debug.cfs_rq:/.right_vruntime.max
551.08 ± 26% -52.6% 261.08 ± 14% sched_debug.cfs_rq:/.right_vruntime.stddev
12.39 ± 4% +26.9% 15.72 ± 7% sched_debug.cpu.clock.stddev
192.90 ± 18% -29.5% 136.07 ± 8% sched_debug.cpu.curr->pid.avg
891.74 ± 8% -16.2% 746.94 ± 3% sched_debug.cpu.curr->pid.stddev
438984 +34.6% 590692 ± 2% sched_debug.cpu.nr_switches.max
180298 ± 11% -85.3% 26562 ± 99% sched_debug.cpu.nr_switches.min
34604 ± 7% +291.7% 135541 ± 12% sched_debug.cpu.nr_switches.stddev
0.01 ± 7% +30.1% 0.02 ± 13% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.01 ± 11% +43.2% 0.01 ± 12% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.01 ± 11% +30.0% 0.01 ± 8% perf-sched.sch_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
0.02 ± 4% +16.5% 0.03 ± 6% perf-sched.sch_delay.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
0.01 -70.0% 0.00 perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.01 ± 3% +23.1% 0.01 ± 3% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.19 ± 50% +92.7% 0.38 ± 12% perf-sched.sch_delay.max.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
3.21 ± 73% -64.9% 1.13 ± 27% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.07 ± 87% -73.1% 0.02 ± 27% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.05 ± 31% -64.7% 0.02 ± 11% perf-sched.sch_delay.max.ms.schedule_timeout.kcompactd.kthread.ret_from_fork
0.09 ± 35% -76.9% 0.02 ± 11% perf-sched.sch_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
0.01 -70.0% 0.00 perf-sched.total_sch_delay.average.ms
0.07 ±223% +563.7% 0.47 ± 7% perf-sched.wait_and_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.67 ± 3% +16.3% 0.78 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.28 ±223% +575.7% 1.86 ± 13% perf-sched.wait_and_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.39 ± 3% +17.1% 0.45 ± 7% perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.66 ± 3% +16.4% 0.77 ± 7% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
0.56 ± 2% -23.9% 0.43 ± 3% perf-stat.i.MPKI
5.319e+09 -26.4% 3.914e+09 perf-stat.i.branch-instructions
0.91 -0.0 0.87 perf-stat.i.branch-miss-rate%
45430893 -29.7% 31944159 perf-stat.i.branch-misses
6.57 ± 2% +13.0 19.54 ± 5% perf-stat.i.cache-miss-rate%
15264617 ± 2% -33.7% 10123277 ± 3% perf-stat.i.cache-misses
2.363e+08 ± 5% -81.5% 43729498 ± 5% perf-stat.i.cache-references
2384996 +8.0% 2575726 perf-stat.i.context-switches
3.93 ± 5% -62.2% 1.49 perf-stat.i.cpi
9.365e+10 ± 5% -70.8% 2.737e+10 perf-stat.i.cpu-cycles
8978 ± 11% +187.1% 25778 ± 15% perf-stat.i.cpu-migrations
7934 ± 9% -36.7% 5019 ± 4% perf-stat.i.cycles-between-cache-misses
2.57e+10 -23.8% 1.958e+10 perf-stat.i.instructions
0.29 ± 4% +140.6% 0.69 perf-stat.i.ipc
12.42 +8.0% 13.42 perf-stat.i.metric.K/sec
0.59 ± 2% -12.9% 0.52 ± 3% perf-stat.overall.MPKI
0.85 -0.0 0.82 perf-stat.overall.branch-miss-rate%
6.47 ± 2% +16.8 23.23 ± 6% perf-stat.overall.cache-miss-rate%
3.64 ± 5% -61.6% 1.40 perf-stat.overall.cpi
6142 ± 7% -56.0% 2703 ± 2% perf-stat.overall.cycles-between-cache-misses
0.28 ± 5% +160.0% 0.72 perf-stat.overall.ipc
5.234e+09 -26.4% 3.853e+09 perf-stat.ps.branch-instructions
44685648 -29.7% 31428779 perf-stat.ps.branch-misses
15022399 ± 2% -33.6% 9973785 ± 3% perf-stat.ps.cache-misses
2.325e+08 ± 5% -81.5% 43045648 ± 5% perf-stat.ps.cache-references
2346125 +8.0% 2533751 perf-stat.ps.context-switches
9.212e+10 ± 5% -70.8% 2.693e+10 perf-stat.ps.cpu-cycles
8832 ± 11% +187.1% 25360 ± 15% perf-stat.ps.cpu-migrations
2.529e+10 -23.8% 1.927e+10 perf-stat.ps.instructions
1.554e+12 -23.8% 1.184e+12 perf-stat.total.instructions
***************************************************************************************************
lkp-srf-2sp3: 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-srf-2sp3/rawudp/stress-ng/60s
commit:
032e1003ad ("sched: Make clangd usable")
7bca60ce65 ("sched/deadline: Less agressive dl_server handling")
032e1003ad358e4c 7bca60ce656978b01caafc4e228
---------------- ---------------------------
%stddev %change %stddev
\ | \
4.053e+08 ± 8% +72.3% 6.985e+08 ± 4% cpuidle..time
252110 ± 14% +346.5% 1125793 ± 11% cpuidle..usage
2.86 ± 7% +2.1 4.94 ± 5% mpstat.cpu.all.idle%
0.42 +0.1 0.53 ± 2% mpstat.cpu.all.usr%
384.45 -12.9% 335.02 vmstat.procs.r
31579 ± 10% +161.6% 82622 ± 9% vmstat.system.cs
454840 +3.5% 470902 vmstat.system.in
8193567 ± 7% -37.3% 5137179 ± 6% numa-numastat.node0.local_node
8333499 ± 7% -37.5% 5207944 ± 6% numa-numastat.node0.numa_hit
9576611 ± 2% -48.1% 4974997 ± 6% numa-numastat.node1.local_node
9634622 ± 2% -47.1% 5100815 ± 7% numa-numastat.node1.numa_hit
1328402 +17.4% 1559668 ± 2% meminfo.Active
1328402 +17.4% 1559668 ± 2% meminfo.Active(anon)
415239 ± 3% +16.7% 484410 ± 6% meminfo.Mapped
2919547 ± 3% -23.4% 2236770 ± 3% meminfo.SUnreclaim
610696 ± 3% +38.5% 846062 ± 4% meminfo.Shmem
3132101 ± 2% -22.0% 2443796 ± 3% meminfo.Slab
2494509 +46.3% 3650414 stress-ng.rawudp.ops
41563 +46.3% 60826 stress-ng.rawudp.ops_per_sec
215.09 +45.9% 313.72 stress-ng.rawudp.packets_32_bytes_received_per_sec
871151 ± 6% +47.8% 1287573 ± 5% stress-ng.time.involuntary_context_switches
5443 ± 3% +7.4% 5848 stress-ng.time.percent_of_cpu_this_job_got
3259 ± 3% +7.2% 3493 stress-ng.time.system_time
24.68 +40.0% 34.56 stress-ng.time.user_time
751066 ± 17% +184.8% 2139221 ± 11% stress-ng.time.voluntary_context_switches
103510 ± 4% +18.6% 122788 ± 5% proc-vmstat.nr_anon_pages
332513 +17.4% 390211 ± 2% proc-vmstat.nr_inactive_anon
1040924 +5.6% 1099375 proc-vmstat.nr_mapped
721178 ± 2% -20.5% 573139 ± 3% proc-vmstat.nr_slab_reclaimable
10829041 ± 3% -33.7% 7181747 ± 2% proc-vmstat.nr_unaccepted
52896 -1.5% 52098 proc-vmstat.nr_unevictable
153210 ± 3% +38.1% 211658 ± 4% proc-vmstat.nr_writeback_temp
332513 +17.4% 390211 ± 2% proc-vmstat.nr_zone_active_anon
17772587 ± 4% -43.1% 10106636 ± 3% proc-vmstat.numa_interleave
73014310 ± 4% -44.5% 40530405 ± 3% proc-vmstat.pgalloc_dma32
72586921 ± 4% -45.4% 39615004 ± 3% proc-vmstat.pgskip_device
2270969 ± 6% -18.3% 1855414 ± 4% sched_debug.cfs_rq:/.avg_vruntime.min
162743 ± 22% +38.0% 224614 ± 7% sched_debug.cfs_rq:/.avg_vruntime.stddev
2270969 ± 6% -18.3% 1855421 ± 4% sched_debug.cfs_rq:/.min_vruntime.min
162752 ± 22% +38.0% 224615 ± 7% sched_debug.cfs_rq:/.min_vruntime.stddev
1299 ± 4% -14.8% 1107 ± 6% sched_debug.cfs_rq:/.runnable_avg.avg
1092 ± 12% -17.7% 899.04 ± 6% sched_debug.cfs_rq:/.runnable_avg.stddev
1370 ± 6% +21.9% 1671 ± 10% sched_debug.cfs_rq:/.util_avg.max
204.56 ± 8% +33.1% 272.19 ± 12% sched_debug.cfs_rq:/.util_avg.stddev
765.31 ± 3% +80.8% 1383 ± 2% sched_debug.cpu.clock_task.stddev
6433 ± 8% +126.2% 14552 ± 9% sched_debug.cpu.nr_switches.avg
4046 ± 11% +128.8% 9257 ± 8% sched_debug.cpu.nr_switches.min
0.00 ± 44% +3130.0% 0.14 ± 17% sched_debug.cpu.nr_uninterruptible.avg
11.24 ± 27% +76.6% 19.85 ± 22% sched_debug.cpu.nr_uninterruptible.stddev
227050 ± 51% +234.8% 760066 ± 40% numa-meminfo.node0.Active
227050 ± 51% +234.8% 760066 ± 40% numa-meminfo.node0.Active(anon)
1349743 ±119% +142.7% 3276163 ± 40% numa-meminfo.node0.FilePages
39511 ± 97% +664.4% 302031 ± 33% numa-meminfo.node0.Mapped
3998073 ± 42% +50.4% 6015073 ± 20% numa-meminfo.node0.MemUsed
11816 ± 14% +132.4% 27462 ± 25% numa-meminfo.node0.PageTables
1392016 ± 2% -15.0% 1182833 ± 4% numa-meminfo.node0.SUnreclaim
64403 ± 46% +497.6% 384896 ± 36% numa-meminfo.node0.Shmem
1486559 ± 4% -12.2% 1305586 ± 3% numa-meminfo.node0.Slab
20595 ± 4% -14.0% 17717 ± 8% numa-meminfo.node1.KernelStack
5849357 ± 28% -39.9% 3518264 ± 35% numa-meminfo.node1.MemUsed
31576 ± 6% -52.6% 14979 ± 47% numa-meminfo.node1.PageTables
1435745 -22.6% 1111726 ± 2% numa-meminfo.node1.SUnreclaim
1551910 ± 2% -22.8% 1197513 ± 3% numa-meminfo.node1.Slab
9918 ± 99% +656.9% 75075 ± 34% numa-vmstat.node0.nr_anon_pages
56806 ± 51% +234.2% 189858 ± 40% numa-vmstat.node0.nr_inactive_anon
2948 ± 14% +132.1% 6841 ± 25% numa-vmstat.node0.nr_kernel_stack
337527 ±119% +142.6% 818901 ± 40% numa-vmstat.node0.nr_mapped
360490 ± 4% -17.1% 298949 ± 4% numa-vmstat.node0.nr_slab_reclaimable
5016234 ± 10% -20.8% 3973477 ± 8% numa-vmstat.node0.nr_unaccepted
16192 ± 46% +493.4% 96084 ± 36% numa-vmstat.node0.nr_writeback_temp
56806 ± 51% +234.2% 189858 ± 40% numa-vmstat.node0.nr_zone_active_anon
8193068 ± 7% -37.3% 5138236 ± 6% numa-vmstat.node0.numa_interleave
20586 ± 4% -13.9% 17725 ± 8% numa-vmstat.node1.nr_foll_pin_released
7871 ± 6% -52.3% 3752 ± 48% numa-vmstat.node1.nr_kernel_stack
374773 ± 3% -25.1% 280890 ± 4% numa-vmstat.node1.nr_slab_reclaimable
5818856 ± 8% -44.5% 3228232 ± 13% numa-vmstat.node1.nr_unaccepted
9575821 ± 2% -48.1% 4973900 ± 6% numa-vmstat.node1.numa_interleave
5.63 -12.7% 4.92 perf-stat.i.MPKI
1.919e+10 -5.6% 1.812e+10 perf-stat.i.branch-instructions
0.62 -0.0 0.60 perf-stat.i.branch-miss-rate%
1.161e+08 -7.6% 1.073e+08 perf-stat.i.branch-misses
40.35 -0.6 39.71 perf-stat.i.cache-miss-rate%
5.201e+08 -18.2% 4.253e+08 perf-stat.i.cache-misses
1.286e+09 -16.8% 1.07e+09 perf-stat.i.cache-references
32886 ± 9% +162.1% 86182 ± 9% perf-stat.i.context-switches
6.63 +3.4% 6.85 perf-stat.i.cpi
6.117e+11 -3.1% 5.928e+11 perf-stat.i.cpu-cycles
6685 ± 12% +84.8% 12353 ± 9% perf-stat.i.cpu-migrations
1177 +18.4% 1394 perf-stat.i.cycles-between-cache-misses
9.179e+10 -6.1% 8.623e+10 perf-stat.i.instructions
0.15 -3.5% 0.15 perf-stat.i.ipc
5.67 -12.9% 4.94 perf-stat.overall.MPKI
0.60 -0.0 0.59 perf-stat.overall.branch-miss-rate%
40.46 -0.7 39.79 perf-stat.overall.cache-miss-rate%
6.67 +3.1% 6.88 perf-stat.overall.cpi
1176 +18.4% 1393 perf-stat.overall.cycles-between-cache-misses
0.15 -3.0% 0.15 perf-stat.overall.ipc
1.85e+10 -4.9% 1.76e+10 perf-stat.ps.branch-instructions
1.114e+08 -6.8% 1.038e+08 perf-stat.ps.branch-misses
5.02e+08 -17.6% 4.137e+08 perf-stat.ps.cache-misses
1.241e+09 -16.2% 1.039e+09 perf-stat.ps.cache-references
30883 ± 10% +167.1% 82500 ± 10% perf-stat.ps.context-switches
5.905e+11 -2.4% 5.761e+11 perf-stat.ps.cpu-cycles
6173 ± 14% +87.6% 11579 ± 10% perf-stat.ps.cpu-migrations
8.851e+10 -5.4% 8.375e+10 perf-stat.ps.instructions
5.395e+12 -5.3% 5.111e+12 perf-stat.total.instructions
5.60 ± 67% -57.9% 2.36 ± 38% perf-sched.sch_delay.avg.ms.__cond_resched.__dentry_kill.dput.__fput.__x64_sys_close
10.76 ± 40% -68.5% 3.39 ± 15% perf-sched.sch_delay.avg.ms.__cond_resched.__mutex_lock.constprop.0.ip_ra_control
8.89 ± 10% -54.3% 4.07 ± 5% perf-sched.sch_delay.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
10.27 ± 7% -73.1% 2.77 ± 12% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
7.75 ± 34% -69.3% 2.38 ± 59% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_lru_noprof.sock_alloc_inode.alloc_inode.sock_alloc
7.36 ± 12% -38.8% 4.50 ± 14% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_node_noprof.__alloc_skb.alloc_skb_with_frags.sock_alloc_send_pskb
5.29 ± 42% -47.0% 2.80 ± 26% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_noprof.alloc_empty_file.alloc_file_pseudo.sock_alloc_file
6.19 ± 16% -53.2% 2.90 ± 45% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_noprof.security_inode_alloc.inode_init_always_gfp.alloc_inode
6.07 ± 28% -61.4% 2.34 ± 30% perf-sched.sch_delay.avg.ms.__cond_resched.kmem_cache_alloc_noprof.sk_prot_alloc.sk_alloc.inet_create
8.97 ± 14% -71.1% 2.59 ± 17% perf-sched.sch_delay.avg.ms.__cond_resched.lock_sock_nested.raw_destroy.sk_common_release.inet_release
0.86 ± 15% +25.0% 1.08 ± 13% perf-sched.sch_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
4.92 ± 32% -60.9% 1.92 ± 48% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.10 ± 56% +1195.4% 1.28 ±112% perf-sched.sch_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
25.42 ± 76% -82.0% 4.57 ±142% perf-sched.sch_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.03 ±170% +3613.4% 1.24 ±193% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
17.92 ± 7% -68.9% 5.58 ± 27% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
16.30 ± 41% -76.5% 3.83 ± 43% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown].[unknown]
9.61 ± 16% -78.2% 2.09 ± 11% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.ip_ra_control
4.46 ± 13% -83.6% 0.73 ± 16% perf-sched.sch_delay.avg.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
0.67 ± 12% -56.4% 0.29 ± 46% perf-sched.sch_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
13.88 ± 7% -58.1% 5.82 ± 9% perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
66.14 ± 27% -57.2% 28.29 ± 57% perf-sched.sch_delay.max.ms.__cond_resched.__dentry_kill.dput.__fput.__x64_sys_close
217.80 ± 20% -48.9% 111.31 ± 31% perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
54.66 ± 37% -48.5% 28.14 ± 29% perf-sched.sch_delay.max.ms.__cond_resched.kmem_cache_alloc_noprof.alloc_empty_file.alloc_file_pseudo.sock_alloc_file
23.20 ± 42% +73.8% 40.32 ± 23% perf-sched.sch_delay.max.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
36.10 ± 62% +1009.7% 400.61 ±115% perf-sched.sch_delay.max.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
25.42 ± 76% -81.9% 4.60 ±141% perf-sched.sch_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.06 ±191% +2498.0% 1.53 ±150% perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_exc_page_fault.[unknown].[unknown]
638.81 ± 23% -39.7% 384.97 ± 23% perf-sched.sch_delay.max.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
8.50 ± 10% -76.9% 1.96 ± 16% perf-sched.total_sch_delay.average.ms
35.01 ± 9% -71.0% 10.14 ± 11% perf-sched.total_wait_and_delay.average.ms
142342 ± 10% +220.0% 455497 ± 9% perf-sched.total_wait_and_delay.count.ms
3405 ± 14% -36.7% 2156 ± 8% perf-sched.total_wait_and_delay.max.ms
26.51 ± 9% -69.1% 8.18 ± 10% perf-sched.total_wait_time.average.ms
3405 ± 14% -41.4% 1996 ± 15% perf-sched.total_wait_time.max.ms
17.80 ± 10% -36.8% 11.24 ± 2% perf-sched.wait_and_delay.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
435.74 ± 73% -93.6% 27.69 ± 28% perf-sched.wait_and_delay.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
20.60 ± 7% -92.9% 1.46 ±223% perf-sched.wait_and_delay.avg.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
60.91 ± 8% -35.7% 39.14 ± 7% perf-sched.wait_and_delay.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
44.07 ± 24% +92.4% 84.81 ± 41% perf-sched.wait_and_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
481.12 ± 7% +21.5% 584.73 ± 4% perf-sched.wait_and_delay.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
5.54 ±223% +476.8% 31.97 ± 10% perf-sched.wait_and_delay.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
249.16 ± 23% -44.8% 137.52 ± 14% perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
9.50 ± 12% -82.9% 1.63 ± 15% perf-sched.wait_and_delay.avg.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
5.87 ± 10% -100.0% 0.00 perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
157.39 ± 7% -63.8% 57.02 ± 8% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
28.10 ± 7% -51.3% 13.68 ± 8% perf-sched.wait_and_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
3937 ± 13% -39.0% 2400 ± 7% perf-sched.wait_and_delay.count.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
11.83 ± 83% +484.5% 69.17 ± 46% perf-sched.wait_and_delay.count.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
2951 ± 11% -92.5% 222.00 ±223% perf-sched.wait_and_delay.count.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
794.33 ± 11% +144.0% 1938 ± 14% perf-sched.wait_and_delay.count.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
2.50 ±223% +2246.7% 58.67 ± 12% perf-sched.wait_and_delay.count.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
959.17 ± 18% -45.3% 524.33 ± 37% perf-sched.wait_and_delay.count.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
58.67 ± 5% -21.0% 46.33 ± 6% perf-sched.wait_and_delay.count.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
54.00 ± 24% +73.1% 93.50 ± 12% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
57079 ± 17% +442.4% 309621 ± 11% perf-sched.wait_and_delay.count.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
1010 ± 7% -100.0% 0.00 perf-sched.wait_and_delay.count.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
9873 ± 9% +188.3% 28469 ± 7% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
61865 ± 5% +66.2% 102833 ± 6% perf-sched.wait_and_delay.count.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
435.61 ± 20% -85.0% 65.18 ±223% perf-sched.wait_and_delay.max.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
34.85 ±223% +1078.1% 410.53 ± 65% perf-sched.wait_and_delay.max.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
1208 ± 17% -36.0% 774.17 ± 23% perf-sched.wait_and_delay.max.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
212.55 ±158% -100.0% 0.00 perf-sched.wait_and_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
1841 ± 9% -16.0% 1547 ± 9% perf-sched.wait_and_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
26.10 ± 61% -58.6% 10.80 ± 23% perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
418.96 ± 75% -94.6% 22.51 ± 31% perf-sched.wait_time.avg.ms.__cond_resched.generic_perform_write.shmem_file_write_iter.vfs_write.ksys_write
10.33 ± 7% -40.1% 6.18 ± 3% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
6.19 ± 16% -43.3% 3.51 ± 26% perf-sched.wait_time.avg.ms.__cond_resched.kmem_cache_alloc_noprof.security_inode_alloc.inode_init_always_gfp.alloc_inode
10.11 ± 25% +333.1% 43.78 ± 15% perf-sched.wait_time.avg.ms.__cond_resched.lock_sock_nested.raw_destroy.sk_common_release.inet_release
60.05 ± 8% -36.6% 38.06 ± 7% perf-sched.wait_time.avg.ms.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
5.18 ± 91% +184.3% 14.71 ± 42% perf-sched.wait_time.avg.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
43.97 ± 24% +90.0% 83.54 ± 41% perf-sched.wait_time.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
479.10 ± 7% +20.8% 578.54 ± 4% perf-sched.wait_time.avg.ms.anon_pipe_read.vfs_read.ksys_read.do_syscall_64
4.37 ± 27% -53.2% 2.04 ± 33% perf-sched.wait_time.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
18.92 ± 6% -67.6% 6.13 ± 23% perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
1.93 ± 62% +557.9% 12.73 ± 53% perf-sched.wait_time.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
244.40 ± 23% -46.0% 131.93 ± 13% perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
9.76 ± 16% -47.6% 5.12 ± 7% perf-sched.wait_time.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.ip_ra_control
5.04 ± 12% -82.3% 0.89 ± 15% perf-sched.wait_time.avg.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
156.71 ± 7% -63.8% 56.73 ± 8% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
14.22 ± 7% -44.7% 7.86 ± 7% perf-sched.wait_time.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
242.75 ± 20% +63.8% 397.69 ± 10% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_lru_noprof.__d_alloc.d_alloc_pseudo.alloc_file_pseudo
110.27 ± 45% +123.6% 246.56 ± 25% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_node_noprof.__alloc_skb.alloc_skb_with_frags.sock_alloc_send_pskb
66.55 ± 38% +245.7% 230.06 ± 57% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_noprof.sk_prot_alloc.sk_alloc.inet_create
15.52 ± 9% +1583.8% 261.33 ±105% perf-sched.wait_time.max.ms.__cond_resched.shmem_inode_acct_blocks.shmem_alloc_and_add_folio.shmem_get_folio_gfp.shmem_write_begin
38.52 ± 84% +855.3% 367.94 ± 80% perf-sched.wait_time.max.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
34.34 ± 82% +681.2% 268.28 ± 50% perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
131.26 ± 64% +195.2% 387.45 ± 7% perf-sched.wait_time.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.ip_ra_control
645.30 ± 24% -39.5% 390.56 ± 23% perf-sched.wait_time.max.ms.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_datagram.skb_recv_datagram
1841 ± 9% -16.0% 1547 ± 9% perf-sched.wait_time.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
***************************************************************************************************
lkp-srf-2sp2: 192 threads 2 sockets Intel(R) Xeon(R) 6740E CPU @ 2.4GHz (Sierra Forest) with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-srf-2sp2/membarrier/stress-ng/60s
commit:
032e1003ad ("sched: Make clangd usable")
7bca60ce65 ("sched/deadline: Less agressive dl_server handling")
032e1003ad358e4c 7bca60ce656978b01caafc4e228
---------------- ---------------------------
%stddev %change %stddev
\ | \
2.649e+09 +104.8% 5.425e+09 ± 6% cpuidle..time
3815319 +104.4% 7797511 ± 5% cpuidle..usage
387254 +4.6% 405223 ± 4% proc-vmstat.nr_inactive_anon
1005623 +1.8% 1023290 proc-vmstat.nr_mapped
117909 +15.0% 135583 ± 12% proc-vmstat.nr_writeback_temp
387254 +4.6% 405223 ± 4% proc-vmstat.nr_zone_active_anon
24.15 +90.6% 46.04 ± 6% vmstat.cpu.id
165.74 ± 2% -35.8% 106.42 ± 8% vmstat.procs.r
52841 +71.6% 90667 ± 3% vmstat.system.cs
3030991 -26.8% 2219763 ± 3% vmstat.system.in
22.70 +22.1 44.83 ± 6% mpstat.cpu.all.idle%
1.80 -0.4 1.38 ± 2% mpstat.cpu.all.irq%
75.22 -21.7 53.48 ± 5% mpstat.cpu.all.sys%
8.33 ± 34% +424.0% 43.67 ± 21% mpstat.max_utilization.seconds
82.47 -28.8% 58.72 ± 4% mpstat.max_utilization_pct
183.75 +18.2% 217.18 ± 3% stress-ng.membarrier.membarrier_calls_per_sec
106067 +18.1% 125306 ± 3% stress-ng.membarrier.ops
1765 +18.2% 2086 ± 3% stress-ng.membarrier.ops_per_sec
507928 -80.1% 100928 ± 31% stress-ng.time.involuntary_context_switches
14890 -30.0% 10421 ± 5% stress-ng.time.percent_of_cpu_this_job_got
8973 -30.0% 6279 ± 5% stress-ng.time.system_time
1595572 +80.9% 2886509 ± 2% stress-ng.time.voluntary_context_switches
0.16 ± 2% +21.3% 0.20 ± 3% perf-stat.i.MPKI
1.799e+10 -28.9% 1.279e+10 ± 5% perf-stat.i.branch-instructions
0.15 ± 2% +0.1 0.21 ± 5% perf-stat.i.branch-miss-rate%
15.43 -2.1 13.31 ± 3% perf-stat.i.cache-miss-rate%
11411999 -13.4% 9882107 ± 3% perf-stat.i.cache-misses
54258 +72.7% 93707 ± 3% perf-stat.i.context-switches
4.84e+11 -29.5% 3.412e+11 ± 5% perf-stat.i.cpu-cycles
17914 +8.6% 19454 perf-stat.i.cpu-migrations
45484 -17.5% 37527 ± 3% perf-stat.i.cycles-between-cache-misses
8.952e+10 -29.1% 6.343e+10 ± 5% perf-stat.i.instructions
0.19 +2.4% 0.19 perf-stat.i.ipc
0.13 +22.3% 0.16 ± 3% perf-stat.overall.MPKI
0.12 +0.1 0.18 ± 6% perf-stat.overall.branch-miss-rate%
15.19 -2.2 12.98 ± 4% perf-stat.overall.cache-miss-rate%
42490 -18.5% 34619 ± 4% perf-stat.overall.cycles-between-cache-misses
1.769e+10 -28.9% 1.258e+10 ± 5% perf-stat.ps.branch-instructions
11203175 -13.4% 9698099 ± 2% perf-stat.ps.cache-misses
53359 +72.7% 92170 ± 3% perf-stat.ps.context-switches
4.76e+11 -29.5% 3.358e+11 ± 5% perf-stat.ps.cpu-cycles
17618 +8.6% 19130 perf-stat.ps.cpu-migrations
8.804e+10 -29.1% 6.242e+10 ± 5% perf-stat.ps.instructions
5.366e+12 -28.7% 3.827e+12 ± 5% perf-stat.total.instructions
4027699 -53.5% 1873472 ± 10% sched_debug.cfs_rq:/.avg_vruntime.avg
4311516 -40.2% 2577501 ± 5% sched_debug.cfs_rq:/.avg_vruntime.max
3909898 -77.1% 894939 ± 10% sched_debug.cfs_rq:/.avg_vruntime.min
59088 ± 5% +1096.6% 707027 ± 3% sched_debug.cfs_rq:/.avg_vruntime.stddev
0.47 ± 6% -40.9% 0.28 ± 24% sched_debug.cfs_rq:/.h_nr_queued.avg
2.42 ± 14% -37.9% 1.50 ± 19% sched_debug.cfs_rq:/.h_nr_queued.max
0.44 ± 8% -15.9% 0.37 ± 6% sched_debug.cfs_rq:/.h_nr_queued.stddev
0.47 ± 6% -40.8% 0.28 ± 24% sched_debug.cfs_rq:/.h_nr_runnable.avg
2.42 ± 14% -37.9% 1.50 ± 19% sched_debug.cfs_rq:/.h_nr_runnable.max
0.44 ± 8% -15.8% 0.37 ± 6% sched_debug.cfs_rq:/.h_nr_runnable.stddev
4027699 -53.5% 1873472 ± 10% sched_debug.cfs_rq:/.min_vruntime.avg
4311516 -40.2% 2577501 ± 5% sched_debug.cfs_rq:/.min_vruntime.max
3909898 -77.1% 894939 ± 10% sched_debug.cfs_rq:/.min_vruntime.min
59088 ± 5% +1096.6% 707027 ± 3% sched_debug.cfs_rq:/.min_vruntime.stddev
0.41 ± 5% -33.0% 0.27 ± 23% sched_debug.cfs_rq:/.nr_queued.avg
574.68 ± 3% -36.2% 366.88 ± 7% sched_debug.cfs_rq:/.runnable_avg.avg
1582 ± 8% -34.4% 1037 ± 3% sched_debug.cfs_rq:/.runnable_avg.max
230.83 ± 8% -55.2% 103.42 ± 30% sched_debug.cfs_rq:/.runnable_avg.min
257.39 ± 4% -23.1% 197.85 ± 6% sched_debug.cfs_rq:/.runnable_avg.stddev
490.01 ± 2% -26.0% 362.55 ± 7% sched_debug.cfs_rq:/.util_avg.avg
1342 ± 5% -24.9% 1008 ± 3% sched_debug.cfs_rq:/.util_avg.max
180.83 ± 10% -43.5% 102.25 ± 29% sched_debug.cfs_rq:/.util_avg.min
152.13 ± 8% -62.1% 57.69 ± 29% sched_debug.cfs_rq:/.util_est.avg
954.33 ± 17% -37.1% 599.83 ± 8% sched_debug.cfs_rq:/.util_est.max
144.41 ± 11% -28.8% 102.89 ± 13% sched_debug.cfs_rq:/.util_est.stddev
2119 ± 5% -34.0% 1398 ± 25% sched_debug.cpu.curr->pid.avg
0.00 ± 5% -13.2% 0.00 ± 11% sched_debug.cpu.next_balance.stddev
0.47 ± 6% -41.1% 0.28 ± 25% sched_debug.cpu.nr_running.avg
2.42 ± 14% -37.9% 1.50 ± 19% sched_debug.cpu.nr_running.max
0.44 ± 8% -16.8% 0.37 ± 6% sched_debug.cpu.nr_running.stddev
9997 +62.1% 16208 ± 2% sched_debug.cpu.nr_switches.avg
8163 ± 2% -38.0% 5063 ± 2% sched_debug.cpu.nr_switches.min
5189 ± 4% +190.8% 15089 ± 4% sched_debug.cpu.nr_switches.stddev
238.83 ± 19% +563.5% 1584 ± 34% sched_debug.cpu.nr_uninterruptible.max
-143.67 +542.7% -923.33 sched_debug.cpu.nr_uninterruptible.min
57.69 ± 3% +1206.4% 753.61 ± 24% sched_debug.cpu.nr_uninterruptible.stddev
2.25 ± 50% -99.9% 0.00 ±223% perf-sched.sch_delay.avg.ms.__cond_resched.__kmalloc_node_noprof.alloc_cpumask_var_node.membarrier_private_expedited.__do_sys_membarrier
0.00 ±143% +7057.1% 0.17 ±197% perf-sched.sch_delay.avg.ms.__cond_resched.__mutex_lock.constprop.0.membarrier_global_expedited
0.12 ± 11% -18.8% 0.10 ± 8% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.54 ± 33% -94.6% 0.03 ± 39% perf-sched.sch_delay.avg.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
1.90 ± 26% -99.6% 0.01 ± 50% perf-sched.sch_delay.avg.ms.__cond_resched.cpus_read_lock.membarrier_global_expedited.__do_sys_membarrier.do_syscall_64
0.01 ± 86% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.01 ± 41% -100.0% 0.00 perf-sched.sch_delay.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
0.56 ± 70% -95.9% 0.02 ± 31% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.migrate_task_to.task_numa_migrate.isra
1.49 ± 29% -99.8% 0.00 ±107% perf-sched.sch_delay.avg.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.29 ± 91% -97.6% 0.01 ± 23% perf-sched.sch_delay.avg.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.03 ± 41% -75.1% 0.01 ± 38% perf-sched.sch_delay.avg.ms.anon_pipe_read.fifo_pipe_read.vfs_read.ksys_read
0.24 ± 92% -97.6% 0.01 ± 19% perf-sched.sch_delay.avg.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.05 ± 52% -76.7% 0.01 ± 22% perf-sched.sch_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.23 ±117% -95.0% 0.01 ± 26% perf-sched.sch_delay.avg.ms.io_schedule.migration_entry_wait_on_locked.__handle_mm_fault.handle_mm_fault
1.41 ± 55% -99.5% 0.01 ± 91% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
0.05 ±114% -86.1% 0.01 ± 14% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
0.22 ± 60% -90.6% 0.02 ± 69% perf-sched.sch_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
1.48 -88.4% 0.17 ± 29% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
1.35 -98.1% 0.03 ± 16% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_private_expedited
0.56 ± 2% -95.4% 0.03 ± 17% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.__wait_rcu_gp
0.23 ± 21% -97.4% 0.01 ± 9% perf-sched.sch_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
1.19 ± 4% -97.7% 0.03 ± 23% perf-sched.sch_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
2.51 ± 44% -99.9% 0.00 ±223% perf-sched.sch_delay.max.ms.__cond_resched.__kmalloc_node_noprof.alloc_cpumask_var_node.membarrier_private_expedited.__do_sys_membarrier
0.00 ±143% +2.1e+05% 4.82 ±219% perf-sched.sch_delay.max.ms.__cond_resched.__mutex_lock.constprop.0.membarrier_global_expedited
5.38 ± 13% -82.7% 0.93 ± 32% perf-sched.sch_delay.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
4.51 ± 27% -99.8% 0.01 ± 51% perf-sched.sch_delay.max.ms.__cond_resched.cpus_read_lock.membarrier_global_expedited.__do_sys_membarrier.do_syscall_64
0.02 ±119% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.down_write_killable.exec_mmap.begin_new_exec.load_elf_binary
0.01 ± 38% -100.0% 0.00 perf-sched.sch_delay.max.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
11.24 ±118% -91.5% 0.96 ± 26% perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.migrate_task_to.task_numa_migrate.isra
0.58 ± 88% -98.0% 0.01 ± 22% perf-sched.sch_delay.max.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
4.25 ± 20% -99.9% 0.00 ±110% perf-sched.sch_delay.max.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.23 ±106% -99.1% 0.01 ± 28% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.94 ± 81% -99.0% 0.01 ± 13% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
2.33 ± 50% -92.6% 0.17 ±175% perf-sched.sch_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.34 ± 52% -99.7% 0.01 ± 91% perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
3.96 ± 19% -99.3% 0.03 ± 62% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
16.95 ± 75% -97.6% 0.41 ±104% perf-sched.sch_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
0.94 -93.3% 0.06 ± 24% perf-sched.total_sch_delay.average.ms
35.08 -24.6% 26.43 ± 2% perf-sched.total_wait_and_delay.average.ms
172758 +33.5% 230694 perf-sched.total_wait_and_delay.count.ms
4826 -11.0% 4295 ± 4% perf-sched.total_wait_and_delay.max.ms
34.14 -22.7% 26.37 ± 2% perf-sched.total_wait_time.average.ms
4826 -11.0% 4295 ± 4% perf-sched.total_wait_time.max.ms
521.72 ± 10% -66.9% 172.46 ± 10% perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
17.81 +36.8% 24.35 perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
18.37 -16.7% 15.29 ± 7% perf-sched.wait_and_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_private_expedited
31.46 -40.7% 18.67 ± 2% perf-sched.wait_and_delay.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.__wait_rcu_gp
528.64 ± 2% -15.5% 446.58 ± 2% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
33.01 -100.0% 0.00 perf-sched.wait_and_delay.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
24.50 ± 3% +249.7% 85.67 ± 7% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
46991 +19.1% 55945 ± 6% perf-sched.wait_and_delay.count.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
31298 +128.6% 71533 ± 3% perf-sched.wait_and_delay.count.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_private_expedited
1858 +17.7% 2187 ± 2% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
906.33 ± 4% -100.0% 0.00 perf-sched.wait_and_delay.count.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
157.84 ±223% +519.2% 977.33 perf-sched.wait_and_delay.max.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
173.28 ± 14% -62.8% 64.47 ± 40% perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
173.41 ± 14% -63.6% 63.19 ± 35% perf-sched.wait_and_delay.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_private_expedited
172.89 ± 13% -63.1% 63.75 ± 41% perf-sched.wait_and_delay.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.__wait_rcu_gp
15.67 ± 47% -67.8% 5.05 perf-sched.wait_and_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
863.19 ± 35% -100.0% 0.00 perf-sched.wait_and_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
31.51 ± 6% -93.7% 1.98 ±223% perf-sched.wait_time.avg.ms.__cond_resched.__kmalloc_node_noprof.alloc_cpumask_var_node.membarrier_private_expedited.__do_sys_membarrier
7.30 ±145% +263.7% 26.54 ± 6% perf-sched.wait_time.avg.ms.__cond_resched.__mutex_lock.constprop.0.membarrier_global_expedited
31.79 ± 5% -26.7% 23.29 ± 3% perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
29.60 ± 2% -52.7% 13.99 ± 50% perf-sched.wait_time.avg.ms.__cond_resched.cpus_read_lock.membarrier_global_expedited.__do_sys_membarrier.do_syscall_64
0.01 ± 41% -100.0% 0.00 perf-sched.wait_time.avg.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
31.61 ± 4% -32.3% 21.42 ± 2% perf-sched.wait_time.avg.ms.__cond_resched.stop_one_cpu.migrate_task_to.task_numa_migrate.isra
28.18 ± 16% -40.3% 16.83 ± 30% perf-sched.wait_time.avg.ms.io_schedule.migration_entry_wait_on_locked.__handle_mm_fault.handle_mm_fault
521.50 ± 10% -66.9% 172.44 ± 10% perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_poll
16.33 +48.1% 24.18 perf-sched.wait_time.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
30.90 -39.7% 18.64 ± 2% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.__wait_rcu_gp
0.84 ± 4% -28.9% 0.60 ± 2% perf-sched.wait_time.avg.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
528.63 ± 2% -15.5% 446.57 ± 2% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
31.82 -36.4% 20.24 ± 10% perf-sched.wait_time.avg.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
32.50 ± 5% -93.9% 1.98 ±223% perf-sched.wait_time.max.ms.__cond_resched.__kmalloc_node_noprof.alloc_cpumask_var_node.membarrier_private_expedited.__do_sys_membarrier
7.30 ±145% +367.3% 34.11 ± 8% perf-sched.wait_time.max.ms.__cond_resched.__mutex_lock.constprop.0.membarrier_global_expedited
131.53 ± 30% -65.6% 45.28 ± 52% perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.stop_two_cpus.migrate_swap.task_numa_migrate
35.67 ± 3% -56.0% 15.69 ± 61% perf-sched.wait_time.max.ms.__cond_resched.cpus_read_lock.membarrier_global_expedited.__do_sys_membarrier.do_syscall_64
0.01 ± 38% -100.0% 0.00 perf-sched.wait_time.max.ms.__cond_resched.dput.__fput.__x64_sys_close.do_syscall_64
58.67 ± 80% -78.9% 12.40 ±106% perf-sched.wait_time.max.ms.__cond_resched.task_work_run.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe
337.05 ±140% -99.7% 1.17 ± 3% perf-sched.wait_time.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
35.86 ± 11% -57.8% 15.12 ± 84% perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_apic_timer_interrupt.[unknown]
166.02 ± 16% -62.3% 62.52 ± 42% perf-sched.wait_time.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_global_expedited
168.85 ± 15% -66.0% 57.38 ± 44% perf-sched.wait_time.max.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.membarrier_private_expedited
166.11 ± 15% -63.4% 60.73 ± 44% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.__wait_rcu_gp
4.66 ± 12% -50.4% 2.31 ± 4% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
13.65 ± 49% -63.2% 5.02 perf-sched.wait_time.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
861.59 ± 36% -95.7% 37.30 ± 19% perf-sched.wait_time.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
***************************************************************************************************
lkp-spr-2sp4: 224 threads 2 sockets Intel(R) Xeon(R) Platinum 8480CTDX (Sapphire Rapids) with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/lkp-spr-2sp4/timer/stress-ng/60s
commit:
032e1003ad ("sched: Make clangd usable")
7bca60ce65 ("sched/deadline: Less agressive dl_server handling")
032e1003ad358e4c 7bca60ce656978b01caafc4e228
---------------- ---------------------------
%stddev %change %stddev
\ | \
9.65e+08 ± 20% -44.3% 5.377e+08 ± 6% cpuidle..time
471918 ± 13% -36.6% 299078 ± 3% cpuidle..usage
1393 ± 27% +162.5% 3658 ± 2% perf-c2c.DRAM.local
3285 ± 19% -46.2% 1768 ± 12% perf-c2c.DRAM.remote
49987 ± 7% -93.9% 3069 ± 5% perf-c2c.HITM.local
51112 ± 7% -92.2% 3983 ± 6% perf-c2c.HITM.total
11.12 ± 17% -39.8% 6.70 ± 9% vmstat.cpu.id
63.78 ± 9% +176.1% 176.06 vmstat.procs.r
14250660 ± 4% +279.8% 54130066 vmstat.system.cs
18058657 ± 6% +255.1% 64127870 vmstat.system.in
8.33 ± 23% -4.4 3.96 ± 6% mpstat.cpu.all.idle%
72.47 ± 5% -47.7 24.80 mpstat.cpu.all.irq%
0.13 ± 7% +0.9 1.02 mpstat.cpu.all.soft%
15.77 ± 9% +45.9 61.71 mpstat.cpu.all.sys%
3.30 ± 8% +5.2 8.50 mpstat.cpu.all.usr%
30202899 ± 27% -74.5% 7713523 ± 15% stress-ng.time.involuntary_context_switches
3451 ± 9% +271.5% 12821 stress-ng.time.percent_of_cpu_this_job_got
1808 ± 9% +279.4% 6860 stress-ng.time.system_time
269.18 ± 10% +218.9% 858.49 stress-ng.time.user_time
4.734e+08 ± 5% +266.2% 1.734e+09 stress-ng.time.voluntary_context_switches
5.573e+08 ± 6% +265.1% 2.035e+09 stress-ng.timer.ops
9288895 ± 6% +265.1% 33915351 stress-ng.timer.ops_per_sec
11366431 ± 2% -33.7% 7539947 ± 2% meminfo.Active
11366431 ± 2% -33.7% 7539947 ± 2% meminfo.Active(anon)
13966810 ± 2% -26.7% 10238040 ± 3% meminfo.Cached
13228581 ± 2% -28.5% 9461059 ± 3% meminfo.Committed_AS
978798 ± 3% -24.2% 742213 ± 2% meminfo.Mapped
18188884 -21.1% 14349727 meminfo.Memused
10415785 ± 3% -35.8% 6687008 ± 4% meminfo.Shmem
18314446 -21.1% 14457915 meminfo.max_used_kB
245720 ± 3% -24.3% 186050 ± 2% proc-vmstat.nr_anon_pages
2843787 ± 2% -33.9% 1880585 ± 2% proc-vmstat.nr_inactive_anon
3493822 ± 2% -26.9% 2554863 ± 3% proc-vmstat.nr_mapped
4993412 -19.1% 4037784 proc-vmstat.nr_unaccepted
44879 -4.6% 42832 proc-vmstat.nr_unevictable
2606064 ± 3% -36.0% 1667105 ± 5% proc-vmstat.nr_writeback_temp
2843787 ± 2% -33.9% 1880585 ± 2% proc-vmstat.nr_zone_active_anon
4624388 ± 4% -29.8% 3246513 ± 4% proc-vmstat.numa_interleave
4995767 ± 3% -28.6% 3567888 ± 3% proc-vmstat.pgalloc_dma32
22.69 ±100% -22.7 0.00 perf-profile.calltrace.cycles-pp.dl_server_start.enqueue_task_fair.enqueue_task.ttwu_do_activate.try_to_wake_up
22.68 ±100% -22.7 0.00 perf-profile.calltrace.cycles-pp.enqueue_dl_entity.dl_server_start.enqueue_task_fair.enqueue_task.ttwu_do_activate
22.59 ±100% -22.6 0.00 perf-profile.calltrace.cycles-pp.start_dl_timer.enqueue_dl_entity.dl_server_start.enqueue_task_fair.enqueue_task
22.54 ±100% -22.5 0.00 perf-profile.calltrace.cycles-pp.hrtimer_start_range_ns.start_dl_timer.enqueue_dl_entity.dl_server_start.enqueue_task_fair
14.58 ±100% -14.6 0.00 perf-profile.calltrace.cycles-pp.dl_server_stop.dequeue_entities.dequeue_task_fair.try_to_block_task.__schedule
14.38 ±100% -14.4 0.00 perf-profile.calltrace.cycles-pp.hrtimer_try_to_cancel.dl_server_stop.dequeue_entities.dequeue_task_fair.try_to_block_task
14.24 ±100% -14.2 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.hrtimer_try_to_cancel.dl_server_stop.dequeue_entities.dequeue_task_fair
14.19 ±100% -14.2 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.hrtimer_try_to_cancel.dl_server_stop.dequeue_entities
13.74 ±100% -13.7 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.hrtimer_start_range_ns.start_dl_timer.enqueue_dl_entity.dl_server_start
13.69 ±100% -13.7 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.hrtimer_start_range_ns.start_dl_timer.enqueue_dl_entity
8.62 ±100% -8.6 0.00 perf-profile.calltrace.cycles-pp.switch_hrtimer_base.hrtimer_start_range_ns.start_dl_timer.enqueue_dl_entity.dl_server_start
8.52 ±100% -8.5 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock.switch_hrtimer_base.hrtimer_start_range_ns.start_dl_timer.enqueue_dl_entity
8.48 ±100% -8.5 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.switch_hrtimer_base.hrtimer_start_range_ns.start_dl_timer
36.86 ±100% -36.9 0.00 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
22.70 ±100% -22.7 0.00 perf-profile.children.cycles-pp.dl_server_start
22.70 ±100% -22.7 0.00 perf-profile.children.cycles-pp.enqueue_dl_entity
22.61 ±100% -22.6 0.00 perf-profile.children.cycles-pp.start_dl_timer
14.58 ±100% -14.6 0.00 perf-profile.children.cycles-pp.dl_server_stop
36.86 ±100% -36.9 0.00 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.20 ± 10% -75.3% 0.05 ± 4% perf-stat.i.MPKI
3.739e+10 ± 3% +160.6% 9.743e+10 perf-stat.i.branch-instructions
0.32 ± 2% -0.1 0.17 ± 2% perf-stat.i.branch-miss-rate%
1.134e+08 ± 5% +36.0% 1.541e+08 perf-stat.i.branch-misses
13.85 ± 4% +8.4 22.21 ± 3% perf-stat.i.cache-miss-rate%
35907287 ± 12% -44.3% 20014868 ± 4% perf-stat.i.cache-misses
2.356e+08 ± 11% -66.9% 78055859 ± 5% perf-stat.i.cache-references
14772924 ± 4% +282.4% 56485490 perf-stat.i.context-switches
3.40 ± 5% -62.9% 1.26 perf-stat.i.cpi
6.126e+11 +3.6% 6.344e+11 perf-stat.i.cpu-cycles
45650 ± 30% -59.9% 18293 ± 15% perf-stat.i.cpu-migrations
25747 ± 26% +116.1% 55648 ± 6% perf-stat.i.cycles-between-cache-misses
1.844e+11 ± 3% +175.4% 5.079e+11 perf-stat.i.instructions
0.30 ± 5% +163.7% 0.80 perf-stat.i.ipc
65.89 ± 4% +282.7% 252.13 perf-stat.i.metric.K/sec
0.19 ± 9% -79.9% 0.04 ± 3% perf-stat.overall.MPKI
0.30 -0.1 0.16 perf-stat.overall.branch-miss-rate%
15.16 ± 2% +10.3 25.49 ± 3% perf-stat.overall.cache-miss-rate%
3.34 ± 5% -62.5% 1.25 perf-stat.overall.cpi
17517 ± 14% +83.8% 32206 ± 3% perf-stat.overall.cycles-between-cache-misses
0.30 ± 5% +166.0% 0.80 perf-stat.overall.ipc
3.665e+10 ± 3% +160.9% 9.561e+10 perf-stat.ps.branch-instructions
1.104e+08 ± 4% +35.6% 1.497e+08 perf-stat.ps.branch-misses
35038923 ± 12% -44.6% 19408879 ± 3% perf-stat.ps.cache-misses
2.307e+08 ± 10% -66.9% 76282209 ± 5% perf-stat.ps.cache-references
14499051 ± 4% +284.5% 55749126 perf-stat.ps.context-switches
6.024e+11 +3.7% 6.244e+11 perf-stat.ps.cpu-cycles
44625 ± 30% -59.3% 18160 ± 16% perf-stat.ps.cpu-migrations
1.808e+11 ± 3% +175.9% 4.987e+11 perf-stat.ps.instructions
1.109e+13 ± 3% +176.5% 3.065e+13 perf-stat.total.instructions
468618 ± 13% +555.4% 3071214 ± 4% sched_debug.cfs_rq:/.avg_vruntime.avg
1760225 ± 9% +135.1% 4137583 ± 4% sched_debug.cfs_rq:/.avg_vruntime.max
126460 ± 13% +2212.5% 2924391 ± 4% sched_debug.cfs_rq:/.avg_vruntime.min
165925 ± 8% -34.9% 108063 ± 7% sched_debug.cfs_rq:/.avg_vruntime.stddev
0.43 ± 5% -20.4% 0.35 ± 3% sched_debug.cfs_rq:/.h_nr_queued.avg
0.43 ± 6% -22.1% 0.33 ± 5% sched_debug.cfs_rq:/.h_nr_runnable.avg
1028326 ± 38% +209.6% 3183766 ± 4% sched_debug.cfs_rq:/.left_deadline.max
221871 ± 14% +259.8% 798312 ± 18% sched_debug.cfs_rq:/.left_deadline.stddev
1028182 ± 38% +209.6% 3183503 ± 4% sched_debug.cfs_rq:/.left_vruntime.max
221805 ± 14% +259.9% 798242 ± 18% sched_debug.cfs_rq:/.left_vruntime.stddev
6383 ± 23% -60.2% 2539 ± 5% sched_debug.cfs_rq:/.load.avg
196131 ±195% -90.4% 18756 ± 8% sched_debug.cfs_rq:/.load.max
468618 ± 13% +555.4% 3071214 ± 4% sched_debug.cfs_rq:/.min_vruntime.avg
1760225 ± 9% +135.1% 4137583 ± 4% sched_debug.cfs_rq:/.min_vruntime.max
126460 ± 13% +2212.5% 2924391 ± 4% sched_debug.cfs_rq:/.min_vruntime.min
165925 ± 8% -34.9% 108063 ± 7% sched_debug.cfs_rq:/.min_vruntime.stddev
0.43 ± 5% -28.3% 0.31 ± 6% sched_debug.cfs_rq:/.nr_queued.avg
0.28 ± 9% +27.0% 0.36 ± 3% sched_debug.cfs_rq:/.nr_queued.stddev
1028182 ± 38% +209.6% 3183503 ± 4% sched_debug.cfs_rq:/.right_vruntime.max
221805 ± 14% +259.9% 798242 ± 18% sched_debug.cfs_rq:/.right_vruntime.stddev
255.53 ± 6% +70.3% 435.29 ± 2% sched_debug.cfs_rq:/.runnable_avg.avg
196.91 ± 6% +118.7% 430.59 ± 3% sched_debug.cfs_rq:/.util_avg.avg
372.78 ± 4% -15.0% 316.74 ± 4% sched_debug.cfs_rq:/.util_est.avg
17.52 ± 4% +33.7% 23.42 ± 7% sched_debug.cpu.clock.stddev
69446 ± 5% +10.8% 76960 ± 2% sched_debug.cpu.clock_task.avg
55201 ± 5% +13.2% 62494 ± 3% sched_debug.cpu.clock_task.min
1380 ± 3% -23.0% 1063 ± 2% sched_debug.cpu.clock_task.stddev
1764 ± 11% +49.9% 2646 sched_debug.cpu.curr->pid.avg
0.00 ± 18% +171.9% 0.00 ± 90% sched_debug.cpu.next_balance.stddev
0.11 ± 22% +293.6% 0.44 sched_debug.cpu.nr_running.avg
1969374 ± 4% +284.2% 7566629 sched_debug.cpu.nr_switches.avg
3819538 ± 13% +122.0% 8477830 sched_debug.cpu.nr_switches.max
8064 ± 21% +38772.5% 3134971 ± 21% sched_debug.cpu.nr_switches.min
317241 ± 15% +68.5% 534501 ± 6% sched_debug.cpu.nr_switches.stddev
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2025-06-03 13:48 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-06-03 13:48 [peterz-queue:sched/core] [sched/deadline] 7bca60ce65: will-it-scale.per_process_ops 63.0% regression kernel test robot
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.