From: kernel test robot <oliver.sang@intel.com>
To: Li Lingfeng <lilingfeng3@huawei.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
Trond Myklebust <trond.myklebust@hammerspace.com>,
<linux-nfs@vger.kernel.org>, <oliver.sang@intel.com>
Subject: [linux-next:master] [nfs] b6dea6c7fe: fsmark.files_per_sec 17.6% regression
Date: Sat, 30 Nov 2024 18:44:15 +0800 [thread overview]
Message-ID: <202411301633.3ed8df2-lkp@intel.com> (raw)
Hello,
kernel test robot noticed a 17.6% regression of fsmark.files_per_sec on:
commit: b6dea6c7fe2d8187050f882fe6f872d30e495ffe ("nfs: pass flags to second superblock")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
[test failed on linux-next/master cfba9f07a1d6aeca38f47f1f472cfb0ba133d341]
testcase: fsmark
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory
parameters:
iterations: 1x
nr_threads: 32t
disk: 1SSD
fs: xfs
fs2: nfsv4
filesize: 8K
test_size: 400M
sync_method: fsyncBeforeClose
nr_directories: 16d
nr_files_per_directory: 256fpd
cpufreq_governor: performance
In addition to that, the commit also has significant impact on the following tests:
+------------------+------------------------------------------------------------------------------------------------+
| testcase: change | fsmark: fsmark.files_per_sec 9.4% regression |
| test machine | 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory |
| test parameters | cpufreq_governor=performance |
| | disk=1SSD |
| | filesize=9B |
| | fs2=nfsv4 |
| | fs=ext4 |
| | iterations=1x |
| | nr_directories=16d |
| | nr_files_per_directory=256fpd |
| | nr_threads=32t |
| | sync_method=fsyncBeforeClose |
| | test_size=400M |
+------------------+------------------------------------------------------------------------------------------------+
| testcase: change | fsmark: fsmark.files_per_sec 15.9% regression |
| test machine | 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory |
| test parameters | cpufreq_governor=performance |
| | disk=1SSD |
| | filesize=9B |
| | fs2=nfsv4 |
| | fs=btrfs |
| | iterations=1x |
| | nr_directories=16d |
| | nr_files_per_directory=256fpd |
| | nr_threads=32t |
| | sync_method=fsyncBeforeClose |
| | test_size=400M |
+------------------+------------------------------------------------------------------------------------------------+
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202411301633.3ed8df2-lkp@intel.com
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241130/202411301633.3ed8df2-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/disk/filesize/fs2/fs/iterations/kconfig/nr_directories/nr_files_per_directory/nr_threads/rootfs/sync_method/tbox_group/test_size/testcase:
gcc-12/performance/1SSD/8K/nfsv4/xfs/1x/x86_64-rhel-9.4/16d/256fpd/32t/debian-12-x86_64-20240206.cgz/fsyncBeforeClose/lkp-ivb-2ep2/400M/fsmark
commit:
66f9dac907 ("Revert "nfs: don't reuse partially completed requests in nfs_lock_and_join_requests"")
b6dea6c7fe ("nfs: pass flags to second superblock")
66f9dac9077c9c06 b6dea6c7fe2d8187050f882fe6f
---------------- ---------------------------
%stddev %change %stddev
\ | \
5.937e+08 ± 2% +12.7% 6.69e+08 cpuidle..time
2360703 +75.8% 4150723 cpuidle..usage
381905 ± 4% +70.8% 652235 ± 2% numa-numastat.node0.local_node
410324 +65.9% 680670 ± 3% numa-numastat.node0.numa_hit
0.48 ±223% +2.8 3.27 ± 55% perf-profile.calltrace.cycles-pp.__mmput.exit_mm.do_exit.do_group_exit.__x64_sys_exit_group
0.48 ±223% +2.8 3.27 ± 55% perf-profile.calltrace.cycles-pp.exit_mm.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
75.28 +5.2% 79.20 iostat.cpu.idle
16.87 -36.2% 10.76 iostat.cpu.iowait
5.30 ± 2% +42.1% 7.52 ± 2% iostat.cpu.system
19.34 -7.3 12.07 mpstat.cpu.all.iowait%
0.65 ± 3% +0.3 0.94 ± 2% mpstat.cpu.all.soft%
4.88 ± 3% +2.1 7.00 ± 3% mpstat.cpu.all.sys%
11.82 ± 3% +20.4% 14.24 mpstat.max_utilization_pct
72239710 ± 2% +55.2% 1.121e+08 fsmark.app_overhead
4389 ± 2% -17.6% 3614 fsmark.files_per_sec
54.67 ± 2% +106.1% 112.67 fsmark.time.percent_of_cpu_this_job_got
5.94 +160.1% 15.45 fsmark.time.system_time
285139 +307.7% 1162591 fsmark.time.voluntary_context_switches
368790 -47.8% 192655 ± 2% meminfo.Inactive
368790 -47.8% 192655 ± 2% meminfo.Inactive(file)
173587 -23.0% 133612 meminfo.KReclaimable
173587 -23.0% 133612 meminfo.SReclaimable
317373 -13.0% 276190 meminfo.Slab
16.92 -36.4% 10.76 vmstat.cpu.wa
58264 ± 3% -11.4% 51622 ± 2% vmstat.io.bo
8.45 ± 7% -37.6% 5.27 ± 7% vmstat.procs.b
269409 ± 2% +66.6% 448791 ± 3% vmstat.system.cs
61548 +16.2% 71538 vmstat.system.in
53681 -19.8% 43073 ± 8% numa-vmstat.node0.nr_inactive_file
53681 -19.8% 43073 ± 8% numa-vmstat.node0.nr_zone_inactive_file
410388 +66.1% 681472 ± 3% numa-vmstat.node0.numa_hit
381970 ± 4% +71.0% 653038 ± 2% numa-vmstat.node0.numa_local
60942 ± 22% -48.1% 31659 ± 28% numa-vmstat.node1.nr_file_pages
38775 ± 5% -87.9% 4703 ± 62% numa-vmstat.node1.nr_inactive_file
12374 ± 5% -60.4% 4902 ± 17% numa-vmstat.node1.nr_slab_reclaimable
38775 ± 5% -87.9% 4703 ± 62% numa-vmstat.node1.nr_zone_inactive_file
998853 -4.5% 954271 proc-vmstat.nr_file_pages
92299 ± 2% -48.3% 47689 ± 2% proc-vmstat.nr_inactive_file
43433 -23.4% 33269 proc-vmstat.nr_slab_reclaimable
92299 ± 2% -48.3% 47689 ± 2% proc-vmstat.nr_zone_inactive_file
646052 +46.8% 948130 proc-vmstat.numa_hit
596308 +50.7% 898452 proc-vmstat.numa_local
1067778 +28.3% 1369682 proc-vmstat.pgalloc_normal
791047 ± 7% +39.5% 1103617 ± 2% proc-vmstat.pgfree
214306 -19.8% 171874 ± 8% numa-meminfo.node0.Inactive
214306 -19.8% 171874 ± 8% numa-meminfo.node0.Inactive(file)
243433 ± 22% -48.0% 126616 ± 28% numa-meminfo.node1.FilePages
154772 ± 5% -87.9% 18798 ± 62% numa-meminfo.node1.Inactive
154772 ± 5% -87.9% 18798 ± 62% numa-meminfo.node1.Inactive(file)
49305 ± 5% -60.2% 19601 ± 17% numa-meminfo.node1.KReclaimable
1226203 ± 10% -21.5% 962004 ± 4% numa-meminfo.node1.MemUsed
49305 ± 5% -60.2% 19601 ± 17% numa-meminfo.node1.SReclaimable
113120 ± 4% -30.5% 78616 ± 6% numa-meminfo.node1.Slab
1.09 ± 2% +17.2% 1.28 ± 2% perf-stat.i.MPKI
2.083e+09 +19.9% 2.496e+09 perf-stat.i.branch-instructions
5.09 -0.6 4.46 perf-stat.i.branch-miss-rate%
1.047e+08 +6.2% 1.112e+08 perf-stat.i.branch-misses
10791820 ± 2% +39.9% 15097313 perf-stat.i.cache-misses
2.272e+08 +34.6% 3.058e+08 perf-stat.i.cache-references
326576 ± 2% +60.3% 523588 perf-stat.i.context-switches
1.62 +8.2% 1.75 perf-stat.i.cpi
1.584e+10 +30.3% 2.063e+10 perf-stat.i.cpu-cycles
2314 ± 5% +68.5% 3899 ± 2% perf-stat.i.cpu-migrations
1484 -7.9% 1366 perf-stat.i.cycles-between-cache-misses
1.002e+10 +18.8% 1.191e+10 perf-stat.i.instructions
0.64 -7.8% 0.59 perf-stat.i.ipc
6.98 ± 2% +58.4% 11.06 perf-stat.i.metric.K/sec
9083 ± 4% -12.8% 7919 ± 5% perf-stat.i.minor-faults
9084 ± 4% -12.8% 7919 ± 5% perf-stat.i.page-faults
1.08 ± 2% +17.7% 1.27 perf-stat.overall.MPKI
5.03 -0.6 4.46 perf-stat.overall.branch-miss-rate%
4.75 +0.2 4.94 perf-stat.overall.cache-miss-rate%
1.58 +9.6% 1.73 perf-stat.overall.cpi
1468 -6.9% 1366 perf-stat.overall.cycles-between-cache-misses
0.63 -8.8% 0.58 perf-stat.overall.ipc
1.923e+09 +21.2% 2.33e+09 perf-stat.ps.branch-instructions
96657387 +7.4% 1.038e+08 perf-stat.ps.branch-misses
9961712 ± 2% +41.5% 14091659 perf-stat.ps.cache-misses
2.097e+08 +36.1% 2.854e+08 perf-stat.ps.cache-references
301501 ± 2% +62.1% 488770 perf-stat.ps.context-switches
44319 +1.1% 44810 perf-stat.ps.cpu-clock
1.462e+10 +31.7% 1.926e+10 perf-stat.ps.cpu-cycles
2137 ± 5% +70.3% 3640 ± 2% perf-stat.ps.cpu-migrations
9.25e+09 +20.2% 1.112e+10 perf-stat.ps.instructions
44319 +1.1% 44810 perf-stat.ps.task-clock
1.206e+11 +38.7% 1.673e+11 perf-stat.total.instructions
***************************************************************************************************
lkp-ivb-2ep2: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory
=========================================================================================
compiler/cpufreq_governor/disk/filesize/fs2/fs/iterations/kconfig/nr_directories/nr_files_per_directory/nr_threads/rootfs/sync_method/tbox_group/test_size/testcase:
gcc-12/performance/1SSD/9B/nfsv4/ext4/1x/x86_64-rhel-9.4/16d/256fpd/32t/debian-12-x86_64-20240206.cgz/fsyncBeforeClose/lkp-ivb-2ep2/400M/fsmark
commit:
66f9dac907 ("Revert "nfs: don't reuse partially completed requests in nfs_lock_and_join_requests"")
b6dea6c7fe ("nfs: pass flags to second superblock")
66f9dac9077c9c06 b6dea6c7fe2d8187050f882fe6f
---------------- ---------------------------
%stddev %change %stddev
\ | \
5106420 +67.0% 8530260 cpuidle..usage
-7.83 +34.0% -10.50 sched_debug.cpu.nr_uninterruptible.min
74.15 +4.5% 77.46 iostat.cpu.idle
18.76 -31.4% 12.87 iostat.cpu.iowait
4.63 +49.8% 6.94 iostat.cpu.system
2.46 ± 2% +10.9% 2.73 iostat.cpu.user
693560 ± 10% +42.7% 989531 ± 10% numa-numastat.node0.local_node
707512 ± 9% +42.3% 1007109 ± 9% numa-numastat.node0.numa_hit
506340 ± 12% +85.9% 941327 ± 10% numa-numastat.node1.local_node
542074 ± 11% +79.6% 973549 ± 9% numa-numastat.node1.numa_hit
2.236e+08 +34.9% 3.016e+08 fsmark.app_overhead
2969 -9.4% 2692 fsmark.files_per_sec
40.17 +138.2% 95.67 fsmark.time.percent_of_cpu_this_job_got
12.75 +176.3% 35.23 fsmark.time.system_time
568715 +308.9% 2325201 fsmark.time.voluntary_context_switches
19.76 -6.3 13.49 mpstat.cpu.all.iowait%
0.59 +0.3 0.88 mpstat.cpu.all.soft%
3.90 +2.1 5.99 mpstat.cpu.all.sys%
2.38 ± 2% +0.3 2.71 mpstat.cpu.all.usr%
2.00 +1783.3% 37.67 mpstat.max_utilization.seconds
10.03 +37.5% 13.79 mpstat.max_utilization_pct
18.74 -31.0% 12.92 vmstat.cpu.wa
107144 -5.5% 101202 vmstat.io.bo
9.71 ± 3% -35.7% 6.24 ± 10% vmstat.procs.b
4.53 ± 5% +27.6% 5.78 ± 6% vmstat.procs.r
197002 +79.6% 353838 vmstat.system.cs
58233 +13.7% 66197 vmstat.system.in
853950 +11.7% 954196 meminfo.Active
853950 +11.7% 954196 meminfo.Active(anon)
215530 +11.0% 239344 meminfo.Buffers
625583 -28.1% 449956 meminfo.Inactive
625583 -28.1% 449956 meminfo.Inactive(file)
241001 -27.6% 174436 meminfo.KReclaimable
106011 ± 8% +49.3% 158268 ± 10% meminfo.Mapped
241001 -27.6% 174436 meminfo.SReclaimable
46597 ± 21% +201.0% 140273 ± 8% meminfo.Shmem
388729 -19.3% 313805 meminfo.Slab
82817 ± 40% +45.7% 120654 ± 7% numa-vmstat.node0.nr_active_anon
76526 ± 43% +41.7% 108400 ± 2% numa-vmstat.node0.nr_anon_pages
951.19 ± 20% -22.7% 735.13 ± 7% numa-vmstat.node0.nr_dirty
80529 ± 21% -37.8% 50085 ± 10% numa-vmstat.node0.nr_inactive_file
21948 ± 8% +24.0% 27212 ± 10% numa-vmstat.node0.nr_mapped
39688 ± 5% -25.1% 29725 ± 4% numa-vmstat.node0.nr_slab_reclaimable
21176 ± 2% -8.3% 19425 ± 3% numa-vmstat.node0.nr_slab_unreclaimable
82817 ± 40% +45.7% 120653 ± 7% numa-vmstat.node0.nr_zone_active_anon
80529 ± 21% -37.8% 50085 ± 10% numa-vmstat.node0.nr_zone_inactive_file
707539 ± 9% +42.5% 1008378 ± 9% numa-vmstat.node0.numa_hit
693587 ± 10% +42.9% 990800 ± 10% numa-vmstat.node0.numa_local
5107 ± 58% +162.7% 13416 ± 30% numa-vmstat.node1.nr_mapped
5650 ± 24% +315.7% 23492 ± 41% numa-vmstat.node1.nr_shmem
20564 ± 10% -32.2% 13935 ± 9% numa-vmstat.node1.nr_slab_reclaimable
541047 ± 11% +80.2% 974981 ± 9% numa-vmstat.node1.numa_hit
505313 ± 13% +86.6% 942759 ± 10% numa-vmstat.node1.numa_local
6.38 ± 46% -3.5 2.85 ±141% perf-profile.calltrace.cycles-pp.__cmd_record
6.38 ± 46% -3.5 2.85 ±141% perf-profile.calltrace.cycles-pp.perf_session__process_events.record__finish_output.__cmd_record
6.38 ± 46% -3.5 2.85 ±141% perf-profile.calltrace.cycles-pp.reader__read_event.perf_session__process_events.record__finish_output.__cmd_record
6.38 ± 46% -3.5 2.85 ±141% perf-profile.calltrace.cycles-pp.record__finish_output.__cmd_record
5.88 ± 35% -3.4 2.47 ±142% perf-profile.calltrace.cycles-pp.ordered_events__queue.process_simple.reader__read_event.perf_session__process_events.record__finish_output
5.88 ± 35% -3.4 2.47 ±142% perf-profile.calltrace.cycles-pp.queue_event.ordered_events__queue.process_simple.reader__read_event.perf_session__process_events
4.37 ± 56% -1.5 2.85 ±141% perf-profile.calltrace.cycles-pp.process_simple.reader__read_event.perf_session__process_events.record__finish_output.__cmd_record
7.90 ± 12% -5.0 2.85 ±141% perf-profile.children.cycles-pp.perf_session__process_events
7.90 ± 12% -5.0 2.85 ±141% perf-profile.children.cycles-pp.reader__read_event
7.90 ± 12% -5.0 2.85 ±141% perf-profile.children.cycles-pp.record__finish_output
5.88 ± 35% -3.4 2.47 ±142% perf-profile.children.cycles-pp.ordered_events__queue
5.88 ± 35% -3.4 2.47 ±142% perf-profile.children.cycles-pp.queue_event
5.88 ± 35% -3.0 2.85 ±141% perf-profile.children.cycles-pp.process_simple
5.74 ± 74% +5.4 11.18 ± 11% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
31.02 ± 11% +10.6 41.66 ± 12% perf-profile.children.cycles-pp.do_syscall_64
31.29 ± 11% +10.7 42.01 ± 12% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
5.47 ± 35% -3.3 2.14 ±141% perf-profile.self.cycles-pp.queue_event
331331 ± 40% +45.4% 481839 ± 6% numa-meminfo.node0.Active
331331 ± 40% +45.4% 481839 ± 6% numa-meminfo.node0.Active(anon)
36095 ± 34% +185.6% 103075 ± 61% numa-meminfo.node0.AnonHugePages
306688 ± 43% +41.3% 433493 ± 2% numa-meminfo.node0.AnonPages
352666 ± 34% +39.9% 493368 ± 2% numa-meminfo.node0.AnonPages.max
3781 ± 20% -22.7% 2921 ± 7% numa-meminfo.node0.Dirty
321823 ± 21% -37.9% 199875 ± 10% numa-meminfo.node0.Inactive
321823 ± 21% -37.9% 199875 ± 10% numa-meminfo.node0.Inactive(file)
158690 ± 5% -25.1% 118818 ± 4% numa-meminfo.node0.KReclaimable
86534 ± 7% +23.8% 107125 ± 10% numa-meminfo.node0.Mapped
158690 ± 5% -25.1% 118818 ± 4% numa-meminfo.node0.SReclaimable
84681 ± 2% -8.2% 77738 ± 3% numa-meminfo.node0.SUnreclaim
243372 ± 3% -19.2% 196556 ± 3% numa-meminfo.node0.Slab
546078 ± 22% -22.0% 425684 ± 3% numa-meminfo.node1.AnonPages.max
82272 ± 10% -32.3% 55659 ± 9% numa-meminfo.node1.KReclaimable
19884 ± 58% +162.6% 52218 ± 29% numa-meminfo.node1.Mapped
82272 ± 10% -32.3% 55659 ± 9% numa-meminfo.node1.SReclaimable
22229 ± 25% +316.9% 92680 ± 41% numa-meminfo.node1.Shmem
145380 ± 5% -19.3% 117350 ± 5% numa-meminfo.node1.Slab
213612 +11.8% 238855 proc-vmstat.nr_active_anon
1073968 -1.9% 1053722 proc-vmstat.nr_file_pages
156460 -28.0% 112595 proc-vmstat.nr_inactive_file
26834 ± 8% +49.6% 40137 ± 10% proc-vmstat.nr_mapped
11754 ± 21% +200.7% 35348 ± 8% proc-vmstat.nr_shmem
60226 -27.6% 43625 proc-vmstat.nr_slab_reclaimable
36950 -5.6% 34868 proc-vmstat.nr_slab_unreclaimable
212974 -1.3% 210204 proc-vmstat.nr_written
213612 +11.8% 238855 proc-vmstat.nr_zone_active_anon
156460 -28.0% 112595 proc-vmstat.nr_zone_inactive_file
1933 ±131% +2667.4% 53507 ± 10% proc-vmstat.numa_hint_faults
1881 ±136% +1483.1% 29778 ± 17% proc-vmstat.numa_hint_faults_local
1251477 +58.6% 1984851 proc-vmstat.numa_hit
1201787 +60.9% 1933842 proc-vmstat.numa_local
44.50 ± 97% +17611.6% 7881 ± 45% proc-vmstat.numa_pages_migrated
104657 ± 37% +114.9% 224925 ± 28% proc-vmstat.numa_pte_updates
1702749 +43.7% 2446144 proc-vmstat.pgalloc_normal
296195 ± 2% +22.1% 361669 ± 2% proc-vmstat.pgfault
1182870 ± 5% +53.0% 1809606 proc-vmstat.pgfree
44.50 ± 97% +17611.6% 7881 ± 45% proc-vmstat.pgmigrate_success
4105642 +2.2% 4195938 proc-vmstat.pgpgout
1.28 ± 4% +11.1% 1.43 perf-stat.i.MPKI
1.739e+09 +34.1% 2.331e+09 perf-stat.i.branch-instructions
6.12 -1.1 5.00 perf-stat.i.branch-miss-rate%
1.039e+08 +9.3% 1.136e+08 perf-stat.i.branch-misses
10490708 ± 4% +46.2% 15335649 perf-stat.i.cache-misses
1.682e+08 +46.6% 2.467e+08 perf-stat.i.cache-references
212706 +77.6% 377834 perf-stat.i.context-switches
1.61 +6.3% 1.71 perf-stat.i.cpi
1.309e+10 +40.3% 1.836e+10 perf-stat.i.cpu-cycles
2446 ± 11% +59.1% 3891 ± 3% perf-stat.i.cpu-migrations
1267 ± 4% -5.2% 1201 perf-stat.i.cycles-between-cache-misses
8.326e+09 +32.8% 1.106e+10 perf-stat.i.instructions
0.63 -5.3% 0.60 perf-stat.i.ipc
4.47 +77.3% 7.93 perf-stat.i.metric.K/sec
5993 ± 3% +21.2% 7266 ± 3% perf-stat.i.minor-faults
5994 ± 3% +21.2% 7266 ± 3% perf-stat.i.page-faults
1.26 ± 4% +10.1% 1.39 perf-stat.overall.MPKI
5.98 -1.1 4.87 perf-stat.overall.branch-miss-rate%
1.57 +5.6% 1.66 perf-stat.overall.cpi
1249 ± 4% -4.2% 1197 perf-stat.overall.cycles-between-cache-misses
0.64 -5.3% 0.60 perf-stat.overall.ipc
1.69e+09 +34.4% 2.271e+09 perf-stat.ps.branch-instructions
1.01e+08 +9.5% 1.106e+08 perf-stat.ps.branch-misses
10198918 ± 4% +46.5% 14941127 perf-stat.ps.cache-misses
1.636e+08 +47.0% 2.404e+08 perf-stat.ps.cache-references
206791 +78.0% 368107 perf-stat.ps.context-switches
1.272e+10 +40.6% 1.788e+10 perf-stat.ps.cpu-cycles
2378 ± 11% +59.4% 3791 ± 3% perf-stat.ps.cpu-migrations
8.094e+09 +33.1% 1.077e+10 perf-stat.ps.instructions
5824 ± 3% +21.5% 7077 ± 3% perf-stat.ps.minor-faults
5824 ± 3% +21.5% 7077 ± 3% perf-stat.ps.page-faults
2.924e+11 ± 2% +43.5% 4.197e+11 perf-stat.total.instructions
0.00 ±223% +1020.0% 0.01 ± 59% perf-sched.sch_delay.avg.ms.__cond_resched.down_read.walk_component.link_path_walk.part
0.00 ±223% +560.0% 0.01 ± 37% perf-sched.sch_delay.avg.ms.__cond_resched.jbd2_log_wait_commit.ext4_sync_file.ext4_buffered_write_iter.do_iter_readv_writev
0.01 ± 7% -26.4% 0.01 ± 21% perf-sched.sch_delay.avg.ms.__cond_resched.stop_one_cpu.sched_exec.bprm_execve.part
0.01 -20.0% 0.00 perf-sched.sch_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_recvmsg.inet6_recvmsg
0.01 -20.0% 0.00 perf-sched.sch_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_sendmsg.sock_sendmsg
0.01 -30.0% 0.00 ± 14% perf-sched.sch_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
0.01 -11.1% 0.01 perf-sched.sch_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
0.01 ± 74% +84.8% 0.01 ± 17% perf-sched.sch_delay.avg.ms.io_schedule.rq_qos_wait.wbt_wait.__rq_qos_throttle
0.02 ± 39% +78.6% 0.03 ± 34% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown].[unknown]
0.01 ± 5% -30.2% 0.01 perf-sched.sch_delay.avg.ms.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd4_create_file
0.01 -22.2% 0.01 perf-sched.sch_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.__rpc_execute
0.01 -12.5% 0.01 perf-sched.sch_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_do_close
0.01 ± 4% -20.3% 0.01 ± 5% perf-sched.sch_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_run_open_task
0.01 ± 10% -44.6% 0.01 ± 7% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.__mutex_lock.constprop.0.svc_tcp_sendto
0.01 ± 14% -22.4% 0.01 ± 12% perf-sched.sch_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
0.01 ± 5% -50.0% 0.01 perf-sched.sch_delay.avg.ms.svc_recv.nfsd.kthread.ret_from_fork
0.01 ±117% +245.2% 0.02 ± 36% perf-sched.sch_delay.max.ms.__cond_resched.__release_sock.release_sock.tcp_recvmsg.inet6_recvmsg
0.00 ±223% +710.3% 0.04 ± 31% perf-sched.sch_delay.max.ms.__cond_resched.__rpc_execute.rpc_execute.rpc_run_task.nfs4_call_sync_sequence
0.00 ±223% +1520.0% 0.01 ± 75% perf-sched.sch_delay.max.ms.__cond_resched.down_read.walk_component.link_path_walk.part
0.00 ±223% +600.0% 0.01 ± 26% perf-sched.sch_delay.max.ms.__cond_resched.jbd2_log_wait_commit.ext4_sync_file.ext4_buffered_write_iter.do_iter_readv_writev
0.00 ±126% +359.1% 0.02 ± 73% perf-sched.sch_delay.max.ms.__cond_resched.lock_sock_nested.tcp_sendmsg.sock_sendmsg.svc_tcp_sendmsg
0.00 ±169% +769.2% 0.02 ± 80% perf-sched.sch_delay.max.ms.__cond_resched.mutex_lock.svc_tcp_sendto.svc_send.svc_handle_xprt
0.01 ± 85% +186.7% 0.03 ± 30% perf-sched.sch_delay.max.ms.__cond_resched.xs_stream_data_receive_workfn.process_one_work.worker_thread.kthread
0.06 ± 34% -33.1% 0.04 ± 32% perf-sched.sch_delay.max.ms.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd_create_locked
3.59 ± 11% -52.9% 1.69 ± 75% perf-sched.sch_delay.max.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_do_close
0.09 ± 34% +3544.0% 3.30 ± 42% perf-sched.sch_delay.max.ms.schedule_timeout.__wait_for_common.__synchronize_srcu.part.0
0.01 ± 25% +50.0% 0.01 ± 14% perf-sched.sch_delay.max.ms.start_this_handle.jbd2_journal_start_reserved.__ext4_journal_start_reserved.ext4_convert_unwritten_io_end_vec
0.02 ± 51% +89.1% 0.03 ± 21% perf-sched.sch_delay.max.ms.wait_transaction_locked.add_transaction_credits.start_this_handle.jbd2__journal_start
0.01 ± 5% -16.3% 0.01 perf-sched.total_sch_delay.average.ms
2.35 ± 4% -40.7% 1.39 perf-sched.total_wait_and_delay.average.ms
473705 ± 6% +83.6% 869820 perf-sched.total_wait_and_delay.count.ms
2.34 ± 4% -40.8% 1.39 perf-sched.total_wait_time.average.ms
14.98 ± 6% -6.8% 13.95 perf-sched.wait_and_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.14 ± 3% -48.0% 0.07 ± 2% perf-sched.wait_and_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_recvmsg.inet6_recvmsg
0.11 ± 5% -53.7% 0.05 ± 3% perf-sched.wait_and_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_sendmsg.sock_sendmsg
0.16 ± 4% -55.2% 0.07 ± 40% perf-sched.wait_and_delay.avg.ms.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
21.65 ± 14% +50.9% 32.67 perf-sched.wait_and_delay.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
1.74 ± 2% -36.1% 1.11 perf-sched.wait_and_delay.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
0.22 ± 3% -20.6% 0.18 ± 2% perf-sched.wait_and_delay.avg.ms.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd4_create_file
1.00 -58.9% 0.41 perf-sched.wait_and_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.__rpc_execute
3.31 ± 2% -33.1% 2.22 perf-sched.wait_and_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_do_close
1.99 ± 2% -53.0% 0.94 perf-sched.wait_and_delay.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_run_open_task
91.01 ± 16% -50.9% 44.65 perf-sched.wait_and_delay.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
246.90 ± 5% -39.0% 150.63 ± 3% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.42 ± 3% -60.8% 0.16 ± 2% perf-sched.wait_and_delay.avg.ms.svc_recv.nfsd.kthread.ret_from_fork
0.83 ±107% +340.0% 3.67 ± 46% perf-sched.wait_and_delay.count.__cond_resched.mutex_lock.srcu_gp_end.process_srcu.process_one_work
24.83 ± 19% +59.1% 39.50 ± 16% perf-sched.wait_and_delay.count.__cond_resched.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
62182 ± 7% +156.7% 159647 perf-sched.wait_and_delay.count.__lock_sock.lock_sock_nested.tcp_recvmsg.inet6_recvmsg
3956 ± 7% +206.9% 12142 ± 2% perf-sched.wait_and_delay.count.__lock_sock.lock_sock_nested.tcp_sendmsg.sock_sendmsg
8707 ± 6% +320.1% 36582 ± 2% perf-sched.wait_and_delay.count.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
26809 ± 6% -9.6% 24244 perf-sched.wait_and_delay.count.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
26615 ± 7% -12.6% 23273 perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd4_create_file
26752 ± 6% -11.5% 23677 perf-sched.wait_and_delay.count.jbd2_log_wait_commit.ext4_sync_file.ext4_buffered_write_iter.do_iter_readv_writev
30657 ± 7% +600.8% 214838 perf-sched.wait_and_delay.count.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.__rpc_execute
13405 ± 6% -9.6% 12116 perf-sched.wait_and_delay.count.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_do_close
13399 ± 6% -9.5% 12122 perf-sched.wait_and_delay.count.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_run_open_task
207.17 ± 5% +74.8% 362.17 perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
1532 ± 6% +88.8% 2892 ± 2% perf-sched.wait_and_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
34990 ± 7% +154.4% 89028 ± 2% perf-sched.wait_and_delay.count.svc_recv.nfsd.kthread.ret_from_fork
192563 ± 6% +15.3% 222007 perf-sched.wait_and_delay.count.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
27.54 ±107% +2216.5% 638.01 ±179% perf-sched.wait_and_delay.max.ms.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
60.18 ±105% -91.6% 5.05 perf-sched.wait_and_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
3455 ± 22% -48.1% 1792 ± 25% perf-sched.wait_and_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
203.43 ± 94% -95.2% 9.74 ± 7% perf-sched.wait_and_delay.max.ms.svc_recv.nfsd.kthread.ret_from_fork
14.95 ± 6% -6.8% 13.93 perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.00 ±223% +46408.3% 0.93 ± 21% perf-sched.wait_time.avg.ms.__cond_resched.down_read.walk_component.link_path_walk.part
0.30 ± 2% -16.0% 0.25 perf-sched.wait_time.avg.ms.__cond_resched.jbd2_journal_commit_transaction.kjournald2.kthread.ret_from_fork
0.01 ±223% +1045.5% 0.15 ± 70% perf-sched.wait_time.avg.ms.__cond_resched.jbd2_log_wait_commit.ext4_sync_file.ext4_buffered_write_iter.do_iter_readv_writev
0.01 ± 23% +52.4% 0.02 ± 19% perf-sched.wait_time.avg.ms.__cond_resched.xs_stream_data_receive_workfn.process_one_work.worker_thread.kthread
0.14 ± 3% -49.1% 0.07 ± 2% perf-sched.wait_time.avg.ms.__lock_sock.lock_sock_nested.tcp_recvmsg.inet6_recvmsg
0.11 ± 6% -55.1% 0.05 ± 4% perf-sched.wait_time.avg.ms.__lock_sock.lock_sock_nested.tcp_sendmsg.sock_sendmsg
0.15 ± 4% -56.1% 0.07 ± 43% perf-sched.wait_time.avg.ms.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
21.65 ± 14% +50.9% 32.67 perf-sched.wait_time.avg.ms.do_task_dead.do_exit.do_group_exit.__x64_sys_exit_group.x64_sys_call
1.73 ± 2% -36.2% 1.11 perf-sched.wait_time.avg.ms.io_schedule.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range
0.20 ± 3% +17.3% 0.24 ± 3% perf-sched.wait_time.avg.ms.jbd2_journal_wait_updates.jbd2_journal_commit_transaction.kjournald2.kthread
0.22 ± 3% -20.5% 0.17 ± 2% perf-sched.wait_time.avg.ms.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd4_create_file
0.19 ± 5% -19.3% 0.16 ± 13% perf-sched.wait_time.avg.ms.jbd2_log_wait_commit.ext4_nfs_commit_metadata.nfsd_create_setattr.nfsd_create_locked
0.99 ± 2% -59.2% 0.40 perf-sched.wait_time.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.__rpc_execute
3.30 ± 2% -33.1% 2.21 perf-sched.wait_time.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_do_close
1.98 ± 2% -53.2% 0.93 perf-sched.wait_time.avg.ms.rpc_wait_bit_killable.__wait_on_bit.out_of_line_wait_on_bit.nfs4_run_open_task
91.01 ± 16% -50.9% 44.65 perf-sched.wait_time.avg.ms.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
246.89 ± 5% -39.0% 150.61 ± 3% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.41 ± 3% -61.1% 0.16 ± 2% perf-sched.wait_time.avg.ms.svc_recv.nfsd.kthread.ret_from_fork
3.64 ± 10% -65.9% 1.24 ±103% perf-sched.wait_time.max.ms.__cond_resched.__kmalloc_cache_noprof.nfs_get_lock_context.nfs_page_create_from_folio.nfs_writepage_setup
0.02 ±126% +1465.3% 0.26 ± 81% perf-sched.wait_time.max.ms.__cond_resched.__release_sock.release_sock.tcp_recvmsg.inet6_recvmsg
0.30 ± 95% +190.8% 0.88 ± 12% perf-sched.wait_time.max.ms.__cond_resched.__release_sock.release_sock.tcp_sendmsg.sock_sendmsg
0.31 ±223% +557.0% 2.06 ± 19% perf-sched.wait_time.max.ms.__cond_resched.__rpc_execute.rpc_execute.rpc_run_task.nfs4_call_sync_sequence
0.00 ±223% +54158.3% 1.09 ± 31% perf-sched.wait_time.max.ms.__cond_resched.down_read.walk_component.link_path_walk.part
0.01 ±223% +1262.3% 0.17 ± 73% perf-sched.wait_time.max.ms.__cond_resched.jbd2_log_wait_commit.ext4_sync_file.ext4_buffered_write_iter.do_iter_readv_writev
0.11 ±203% +600.3% 0.78 ± 41% perf-sched.wait_time.max.ms.__cond_resched.kmem_cache_alloc_noprof.prepare_creds.nfsd_setuser.nfsd_setuser_and_check_port
0.02 ± 73% +2366.0% 0.44 ±161% perf-sched.wait_time.max.ms.__cond_resched.lock_sock_nested.tcp_recvmsg.inet6_recvmsg.sock_recvmsg
0.11 ±213% +698.2% 0.88 ± 6% perf-sched.wait_time.max.ms.__cond_resched.lock_sock_nested.tcp_sendmsg.sock_sendmsg.svc_tcp_sendmsg
0.01 ± 33% +451.2% 0.08 ± 74% perf-sched.wait_time.max.ms.__cond_resched.xs_stream_data_receive_workfn.process_one_work.worker_thread.kthread
27.53 ±107% +2217.8% 638.00 ±179% perf-sched.wait_time.max.ms.__lock_sock.lock_sock_nested.tcp_sock_set_cork.xs_tcp_send_request
0.46 ± 10% +218.9% 1.48 ± 71% perf-sched.wait_time.max.ms.jbd2_journal_wait_updates.jbd2_journal_commit_transaction.kjournald2.kthread
2.69 +11.4% 2.99 ± 6% perf-sched.wait_time.max.ms.schedule_timeout.__wait_for_common.wait_for_completion_state.kernel_clone
60.15 ±105% -91.7% 5.02 perf-sched.wait_time.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
3455 ± 22% -48.1% 1792 ± 25% perf-sched.wait_time.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
203.35 ± 94% -95.2% 9.67 ± 7% perf-sched.wait_time.max.ms.svc_recv.nfsd.kthread.ret_from_fork
***************************************************************************************************
lkp-ivb-2ep2: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory
=========================================================================================
compiler/cpufreq_governor/disk/filesize/fs2/fs/iterations/kconfig/nr_directories/nr_files_per_directory/nr_threads/rootfs/sync_method/tbox_group/test_size/testcase:
gcc-12/performance/1SSD/9B/nfsv4/btrfs/1x/x86_64-rhel-9.4/16d/256fpd/32t/debian-12-x86_64-20240206.cgz/fsyncBeforeClose/lkp-ivb-2ep2/400M/fsmark
commit:
66f9dac907 ("Revert "nfs: don't reuse partially completed requests in nfs_lock_and_join_requests"")
b6dea6c7fe ("nfs: pass flags to second superblock")
66f9dac9077c9c06 b6dea6c7fe2d8187050f882fe6f
---------------- ---------------------------
%stddev %change %stddev
\ | \
1.188e+09 +11.3% 1.322e+09 cpuidle..time
3929892 +91.6% 7529218 cpuidle..usage
69.88 +7.4% 75.03 iostat.cpu.idle
22.22 -34.0% 14.67 iostat.cpu.iowait
5.52 ± 3% +43.8% 7.94 iostat.cpu.system
786761 ± 9% +24.2% 977255 ± 8% numa-numastat.node0.local_node
804849 ± 9% +23.7% 995380 ± 7% numa-numastat.node0.numa_hit
916826 ± 8% +55.5% 1425647 ± 5% numa-numastat.node1.local_node
948417 ± 8% +53.7% 1457243 ± 4% numa-numastat.node1.numa_hit
23.86 -8.3 15.58 mpstat.cpu.all.iowait%
0.30 +0.0 0.33 mpstat.cpu.all.irq%
0.58 +0.3 0.93 mpstat.cpu.all.soft%
4.95 ± 3% +2.2 7.10 mpstat.cpu.all.sys%
11.40 ± 2% +23.4% 14.06 ± 2% mpstat.max_utilization_pct
1.358e+08 ± 2% +73.7% 2.359e+08 fsmark.app_overhead
4107 -15.9% 3456 fsmark.files_per_sec
6386 ± 9% -9.9% 5754 fsmark.time.maximum_resident_set_size
49.17 +128.5% 112.33 fsmark.time.percent_of_cpu_this_job_got
11.25 +185.3% 32.10 fsmark.time.system_time
575615 +300.3% 2304344 fsmark.time.voluntary_context_switches
13811 ± 2% -17.8% 11354 meminfo.Dirty
1073015 -12.6% 938340 meminfo.Inactive
1073015 -12.6% 938340 meminfo.Inactive(file)
219309 -30.5% 152451 meminfo.KReclaimable
219309 -30.5% 152451 meminfo.SReclaimable
387968 -19.8% 311283 meminfo.Slab
69.89 +7.4% 75.05 vmstat.cpu.id
22.21 -33.6% 14.75 vmstat.cpu.wa
12.73 ± 4% -39.9% 7.65 ± 6% vmstat.procs.b
4.99 ± 10% +21.9% 6.08 ± 5% vmstat.procs.r
216786 +92.3% 416945 vmstat.system.cs
53985 +16.2% 62753 vmstat.system.in
5358 ± 13% -19.4% 4316 ± 7% numa-meminfo.node0.Dirty
463057 ± 9% -25.2% 346146 ± 9% numa-meminfo.node0.Inactive
463057 ± 9% -25.2% 346146 ± 9% numa-meminfo.node0.Inactive(file)
8474 ± 8% -16.8% 7049 ± 6% numa-meminfo.node1.Dirty
90297 ± 23% -37.0% 56862 ± 7% numa-meminfo.node1.KReclaimable
90297 ± 23% -37.0% 56862 ± 7% numa-meminfo.node1.SReclaimable
169041 ± 12% -21.1% 133345 ± 2% numa-meminfo.node1.Slab
1339 ± 12% -19.4% 1079 ± 7% numa-vmstat.node0.nr_dirty
115964 ± 9% -25.6% 86318 ± 9% numa-vmstat.node0.nr_inactive_file
115964 ± 9% -25.6% 86318 ± 9% numa-vmstat.node0.nr_zone_inactive_file
1192 ± 12% -28.5% 852.16 ± 10% numa-vmstat.node0.nr_zone_write_pending
804755 ± 9% +23.5% 994145 ± 7% numa-vmstat.node0.numa_hit
786667 ± 9% +24.1% 976013 ± 8% numa-vmstat.node0.numa_local
338555 ± 7% +15.7% 391647 ± 5% numa-vmstat.node1.nr_dirtied
2120 ± 8% -16.7% 1766 ± 6% numa-vmstat.node1.nr_dirty
22647 ± 23% -37.3% 14193 ± 8% numa-vmstat.node1.nr_slab_reclaimable
338299 ± 8% +15.3% 390105 ± 5% numa-vmstat.node1.nr_written
2027 ± 10% -19.4% 1634 ± 8% numa-vmstat.node1.nr_zone_write_pending
947862 ± 8% +53.5% 1454904 ± 4% numa-vmstat.node1.numa_hit
916271 ± 8% +55.3% 1423309 ± 5% numa-vmstat.node1.numa_local
600332 +9.7% 658719 proc-vmstat.nr_dirtied
3457 -17.7% 2844 ± 2% proc-vmstat.nr_dirty
1175196 -2.8% 1141722 proc-vmstat.nr_file_pages
268553 -12.5% 234871 proc-vmstat.nr_inactive_file
54906 -30.5% 38145 proc-vmstat.nr_slab_reclaimable
42192 -5.9% 39710 proc-vmstat.nr_slab_unreclaimable
599793 +9.4% 656108 proc-vmstat.nr_written
268553 -12.5% 234871 proc-vmstat.nr_zone_inactive_file
3187 ± 3% -23.3% 2444 ± 5% proc-vmstat.nr_zone_write_pending
1754966 +39.9% 2454658 proc-vmstat.numa_hit
1709407 +41.7% 2421514 proc-vmstat.numa_local
2209705 +31.8% 2912839 proc-vmstat.pgalloc_normal
1655605 ± 3% +39.7% 2312933 proc-vmstat.pgfree
3264798 +14.5% 3739381 proc-vmstat.pgpgout
0.02 ± 97% +118.2% 0.04 ± 31% perf-sched.sch_delay.avg.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
2.40 ±192% -98.4% 0.04 ±144% perf-sched.sch_delay.avg.ms.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
0.11 ± 20% -64.1% 0.04 ± 91% perf-sched.sch_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.01 ±223% +588.9% 0.05 ± 36% perf-sched.sch_delay.max.ms.do_nanosleep.hrtimer_nanosleep.common_nsleep.__x64_sys_clock_nanosleep
0.03 ± 88% +232.1% 0.10 ± 36% perf-sched.sch_delay.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
96.58 ± 63% +711.2% 783.54 ± 44% perf-sched.total_wait_and_delay.max.ms
62.71 ± 53% +1124.4% 767.88 ± 44% perf-sched.total_wait_time.max.ms
0.95 ± 60% +370.2% 4.47 ± 35% perf-sched.wait_and_delay.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
6.05 ± 77% -96.3% 0.22 ±223% perf-sched.wait_and_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
0.87 ± 40% -82.8% 0.15 ±141% perf-sched.wait_and_delay.avg.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.67 ± 43% -90.2% 0.16 ±102% perf-sched.wait_and_delay.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
3.38 ± 13% +174.6% 9.29 ± 32% perf-sched.wait_and_delay.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
8.07 ±109% +1338.7% 116.15 ± 23% perf-sched.wait_and_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
4.33 ± 52% -92.3% 0.33 ±223% perf-sched.wait_and_delay.count.devkmsg_read.vfs_read.ksys_read.do_syscall_64
1.17 ±104% +957.1% 12.33 ± 57% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.do_poll.constprop.0.do_sys_poll
94.18 ± 68% +533.2% 596.36 ± 43% perf-sched.wait_and_delay.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
15.07 ±103% -97.1% 0.43 ±223% perf-sched.wait_and_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64
6.60 ± 99% -93.2% 0.45 ±141% perf-sched.wait_and_delay.max.ms.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
4.18 ± 8% +5513.8% 234.70 ± 49% perf-sched.wait_and_delay.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
61.50 ± 50% +957.2% 650.14 ± 46% perf-sched.wait_and_delay.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
0.48 ± 60% +777.3% 4.17 ± 35% perf-sched.wait_time.avg.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
1.61 ± 44% -82.4% 0.28 ± 17% perf-sched.wait_time.avg.ms.pipe_read.vfs_read.ksys_read.do_syscall_64
3.02 ± 18% +206.6% 9.26 ± 32% perf-sched.wait_time.avg.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
7.94 ±112% +1360.8% 116.00 ± 23% perf-sched.wait_time.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
47.09 ± 68% +1166.4% 596.35 ± 43% perf-sched.wait_time.max.ms.__cond_resched.__wait_for_common.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
0.03 ±105% +3091.2% 0.90 ±126% perf-sched.wait_time.max.ms.irqentry_exit_to_user_mode.asm_sysvec_reschedule_ipi.[unknown]
4.16 ± 8% +5536.1% 234.65 ± 49% perf-sched.wait_time.max.ms.schedule_timeout.rcu_gp_fqs_loop.rcu_gp_kthread.kthread
61.02 ± 51% +937.2% 632.92 ± 45% perf-sched.wait_time.max.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
1.32 +19.1% 1.57 perf-stat.i.MPKI
1.869e+09 +22.9% 2.298e+09 perf-stat.i.branch-instructions
5.44 -0.7 4.74 perf-stat.i.branch-miss-rate%
1.006e+08 +7.9% 1.086e+08 perf-stat.i.branch-misses
11843942 ± 2% +46.2% 17314122 perf-stat.i.cache-misses
1.886e+08 +46.1% 2.755e+08 perf-stat.i.cache-references
239152 +87.2% 447643 perf-stat.i.context-switches
1.58 +10.8% 1.75 perf-stat.i.cpi
1.418e+10 +35.9% 1.927e+10 perf-stat.i.cpu-cycles
1842 ± 2% +73.5% 3195 perf-stat.i.cpu-migrations
1202 ± 2% -7.5% 1112 perf-stat.i.cycles-between-cache-misses
9.107e+09 +21.9% 1.11e+10 perf-stat.i.instructions
0.65 -9.8% 0.58 perf-stat.i.ipc
5.03 +86.7% 9.40 perf-stat.i.metric.K/sec
5735 ± 4% -10.8% 5115 ± 3% perf-stat.i.minor-faults
5735 ± 4% -10.8% 5116 ± 3% perf-stat.i.page-faults
1.30 +19.9% 1.56 perf-stat.overall.MPKI
5.38 -0.7 4.73 perf-stat.overall.branch-miss-rate%
1.56 +11.5% 1.74 perf-stat.overall.cpi
1197 -7.1% 1113 perf-stat.overall.cycles-between-cache-misses
0.64 -10.3% 0.58 perf-stat.overall.ipc
1.799e+09 +23.6% 2.223e+09 perf-stat.ps.branch-instructions
96805423 +8.5% 1.051e+08 perf-stat.ps.branch-misses
11396718 ± 2% +47.0% 16750043 perf-stat.ps.cache-misses
1.815e+08 +46.9% 2.665e+08 perf-stat.ps.cache-references
230141 +88.2% 433076 perf-stat.ps.context-switches
1.364e+10 +36.6% 1.864e+10 perf-stat.ps.cpu-cycles
1772 ± 2% +74.4% 3091 perf-stat.ps.cpu-migrations
8.763e+09 +22.6% 1.074e+10 perf-stat.ps.instructions
5514 ± 4% -10.3% 4945 ± 3% perf-stat.ps.minor-faults
5514 ± 4% -10.3% 4945 ± 3% perf-stat.ps.page-faults
2.33e+11 +41.9% 3.305e+11 perf-stat.total.instructions
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
reply other threads:[~2024-11-30 10:44 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202411301633.3ed8df2-lkp@intel.com \
--to=oliver.sang@intel.com \
--cc=lilingfeng3@huawei.com \
--cc=linux-nfs@vger.kernel.org \
--cc=lkp@intel.com \
--cc=oe-lkp@lists.linux.dev \
--cc=trond.myklebust@hammerspace.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.