* [linux-next:master] [mm/filemap] 0b9c0aeba9: pts.graphics-magick.Swirl.iterations_per_minute 8.6% regression
@ 2026-06-17 14:26 kernel test robot
0 siblings, 0 replies; only message in thread
From: kernel test robot @ 2026-06-17 14:26 UTC (permalink / raw)
To: fujunjie
Cc: oe-lkp, lkp, Andrew Morton, Jan Kara, Vishal Moola,
Matthew Wilcox, Roman Gushchin, linux-fsdevel, linux-mm,
oliver.sang
Hello,
kernel test robot noticed a 8.6% regression of pts.graphics-magick.Swirl.iterations_per_minute on:
commit: 0b9c0aeba938aad9964f855df00bf929b83a484d ("mm/filemap: count only the faulting address as a mmap hit")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
[still regression on linux-next/master ec039126b7fac4e3af35ebccaa7c6f9b6875ba81]
testcase: pts
config: x86_64-rhel-9.4
compiler: gcc-14
test machine: 256 threads 2 sockets Intel(R) Xeon(R) 6767P CPU @ 2.4GHz (Granite Rapids) with 256G memory
parameters:
test: graphics-magick-2.2.0
option_a: Swirl
cpufreq_governor: performance
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202606172137.52c5fa5e-lkp@intel.com
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260617/202606172137.52c5fa5e-lkp@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/option_a/rootfs/tbox_group/test/testcase:
gcc-14/performance/x86_64-rhel-9.4/Swirl/debian-12-x86_64-phoronix/lkp-gnr-2sp3/graphics-magick-2.2.0/pts
commit:
95b8e43226 ("mm: use zone lock guard in __offline_isolated_pages()")
0b9c0aeba9 ("mm/filemap: count only the faulting address as a mmap hit")
95b8e432265f61bd 0b9c0aeba938aad9964f855df00
---------------- ---------------------------
%stddev %change %stddev
\ | \
814.83 -8.6% 745.00 pts.graphics-magick.Swirl.iterations_per_minute
55127 ± 5% -10.3% 49460 ± 3% pts.time.involuntary_context_switches
2153 ± 5% +5894.6% 129074 ± 6% pts.time.major_page_faults
12775652 -8.5% 11686096 pts.time.minor_page_faults
15099 -8.1% 13876 pts.time.percent_of_cpu_this_job_got
2124 -4.0% 2039 pts.time.system_time
9036 -9.2% 8207 pts.time.user_time
4500789 -4.4% 4301829 pts.time.voluntary_context_switches
12315 ± 4% +10.6% 13616 ± 4% meminfo.PageTables
1828588 ± 5% -10.9% 1629373 ± 2% numa-numastat.node1.numa_hit
1828671 ± 5% -10.9% 1629233 ± 2% numa-vmstat.node1.numa_hit
1.34 ± 3% +0.2 1.55 mpstat.cpu.all.iowait%
0.10 ± 3% +0.0 0.12 ± 3% mpstat.cpu.all.soft%
78.37 -6.7% 73.12 mpstat.max_utilization_pct
2.44 ± 66% +4.8 7.24 ± 58% perf-profile.calltrace.cycles-pp.sw_perf_event_destroy.__free_event.perf_event_release_kernel.perf_release.__fput
2.98 ± 59% +5.8 8.81 ± 33% perf-profile.calltrace.cycles-pp.__free_event.perf_event_release_kernel.perf_release.__fput.task_work_run
2.44 ± 66% +4.8 7.24 ± 49% perf-profile.children.cycles-pp.sw_perf_event_destroy
2.98 ± 59% +5.8 8.81 ± 33% perf-profile.children.cycles-pp.__free_event
3149 ± 2% +7.2% 3376 ± 2% proc-vmstat.nr_page_table_pages
443579 -1.9% 435060 proc-vmstat.nr_shmem
3584637 ± 3% -7.2% 3324877 proc-vmstat.numa_hit
3319585 ± 3% -7.8% 3060170 proc-vmstat.numa_local
3.533e+08 -13.0% 3.075e+08 proc-vmstat.pgalloc_normal
13271607 -7.4% 12294786 proc-vmstat.pgfault
3.527e+08 -13.0% 3.069e+08 proc-vmstat.pgfree
2086 ± 5% +6047.5% 128277 ± 6% proc-vmstat.pgmajfault
166303 -8.6% 152057 proc-vmstat.thp_fault_alloc
28.46 ± 2% +1.0 29.44 perf-stat.i.cache-miss-rate%
3164 ± 6% +24.3% 3933 ± 2% perf-stat.i.cpu-migrations
28.98 ± 7% +5995.2% 1766 ± 6% perf-stat.i.major-faults
0.44 +2.6% 0.46 perf-stat.overall.MPKI
0.01 +0.0 0.01 perf-stat.overall.branch-miss-rate%
1719 -1.6% 1692 perf-stat.overall.cycles-between-cache-misses
59.55 +1.9 61.43 perf-stat.overall.node-load-miss-rate%
3138 ± 6% +23.7% 3880 ± 2% perf-stat.ps.cpu-migrations
28.03 ± 7% +5980.2% 1704 ± 6% perf-stat.ps.major-faults
5.496e+13 -8.5% 5.028e+13 perf-stat.total.instructions
1.41 ± 36% +83.3% 2.58 ± 11% perf-sched.sch_delay.max.ms.perf_trace_sched_switch.do_wait.kernel_wait4.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.17 ± 34% -82.8% 0.20 ±169% perf-sched.sch_delay.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.affine_move_task.__set_cpus_allowed_ptr.__sched_setaffinity
7.34 +11.2% 8.15 perf-sched.total_wait_and_delay.average.ms
366839 -7.5% 339473 perf-sched.total_wait_and_delay.count.ms
7.32 +11.2% 8.14 perf-sched.total_wait_time.average.ms
3.58 +24.1% 4.45 ± 2% perf-sched.wait_and_delay.avg.ms.perf_trace_sched_switch.futex_do_wait.__futex_wait.futex_wait.do_futex
0.38 ± 2% +48.1% 0.57 ± 2% perf-sched.wait_and_delay.avg.ms.perf_trace_sched_switch.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
4.64 ±101% +192.3% 13.57 ± 23% perf-sched.wait_and_delay.avg.ms.perf_trace_sched_switch.preempt_schedule_thunk._raw_spin_unlock.__do_huge_pmd_anonymous_page.__handle_mm_fault
7.14 ± 45% +73.8% 12.41 ± 3% perf-sched.wait_and_delay.avg.ms.perf_trace_sched_switch.preempt_schedule_thunk.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault
339.44 ± 3% +18.1% 400.91 ± 3% perf-sched.wait_and_delay.avg.ms.perf_trace_sched_switch.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0
58862 -8.3% 53992 perf-sched.wait_and_delay.count.perf_trace_sched_switch.futex_do_wait.__futex_wait.futex_wait.do_futex
294758 -7.6% 272232 perf-sched.wait_and_delay.count.perf_trace_sched_switch.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
113.17 ± 4% -12.1% 99.50 ± 2% perf-sched.wait_and_delay.count.perf_trace_sched_switch.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0
7.56 ±100% +191.1% 22.02 ± 3% perf-sched.wait_and_delay.max.ms.perf_trace_sched_switch.preempt_schedule_thunk._raw_spin_unlock.__do_huge_pmd_anonymous_page.__handle_mm_fault
15.87 ± 3% +36.3% 21.64 perf-sched.wait_and_delay.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.cpu_stop_queue_work.stop_one_cpu.migrate_task_to
14.18 ± 45% +67.2% 23.71 ± 6% perf-sched.wait_and_delay.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault
3.55 +24.2% 4.41 ± 2% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.futex_do_wait.__futex_wait.futex_wait.do_futex
0.37 ± 2% +49.8% 0.56 ± 2% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.io_schedule.folio_wait_bit_common.filemap_fault.__do_fault
2.43 ± 14% +91.5% 4.66 ± 16% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.irqentry_exit.asm_exc_page_fault.[unknown].[unknown]
6.68 ± 50% +103.0% 13.55 ± 23% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.preempt_schedule_thunk._raw_spin_unlock.__do_huge_pmd_anonymous_page.__handle_mm_fault
8.24 ± 8% +50.0% 12.36 ± 3% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.preempt_schedule_thunk.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault
339.33 ± 3% +18.1% 400.81 ± 3% perf-sched.wait_time.avg.ms.perf_trace_sched_switch.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0
15.55 ± 3% +52.0% 23.65 ± 14% perf-sched.wait_time.max.ms.perf_trace_sched_switch.irqentry_exit.asm_exc_page_fault.[unknown].[unknown]
14.33 ± 6% +53.7% 22.03 ± 5% perf-sched.wait_time.max.ms.perf_trace_sched_switch.irqentry_exit.asm_sysvec_call_function_single.[unknown]
15.24 ± 5% +44.4% 22.01 ± 5% perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_irq.irqentry_exit.asm_sysvec_apic_timer_interrupt.do_user_addr_fault
12.27 ± 44% +79.4% 22.01 ± 3% perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_thunk._raw_spin_unlock.__do_huge_pmd_anonymous_page.__handle_mm_fault
15.86 ± 3% +36.3% 21.63 perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.cpu_stop_queue_work.stop_one_cpu.migrate_task_to
14.53 ± 3% +45.3% 21.10 ± 3% perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.do_read_fault.do_fault.__handle_mm_fault
15.90 ± 5% +41.6% 22.51 ± 2% perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.lock_vma_under_rcu.do_user_addr_fault.exc_page_fault
10.27 ± 61% +100.7% 20.61 ± 2% perf-sched.wait_time.max.ms.perf_trace_sched_switch.preempt_schedule_thunk.stop_two_cpus.migrate_swap.task_numa_migrate
4282234 -17.3% 3541405 sched_debug.cfs_rq:/.avg_vruntime.avg
4735201 ± 2% -13.9% 4076046 ± 4% sched_debug.cfs_rq:/.avg_vruntime.max
4115256 -17.0% 3417248 ± 2% sched_debug.cfs_rq:/.avg_vruntime.min
423.15 ± 2% -9.6% 382.71 sched_debug.cfs_rq:/.runnable_avg.avg
420.53 ± 2% -9.6% 380.35 sched_debug.cfs_rq:/.util_avg.avg
1190 ± 4% -9.6% 1077 ± 3% sched_debug.cfs_rq:/.util_avg.max
286.15 ± 7% -14.4% 244.91 ± 7% sched_debug.cfs_rq:/.util_est.avg
4282232 -17.3% 3541404 sched_debug.cfs_rq:/.zero_vruntime.avg
4735201 ± 2% -13.9% 4076017 ± 4% sched_debug.cfs_rq:/.zero_vruntime.max
4115256 -17.0% 3417248 ± 2% sched_debug.cfs_rq:/.zero_vruntime.min
4263217 -16.9% 3544690 sched_debug.cfs_rq:/init.scope.se->vruntime.avg
4263686 -16.8% 3545475 sched_debug.cfs_rq:/init.scope.se->vruntime.max
4262749 -16.9% 3543905 sched_debug.cfs_rq:/init.scope.se->vruntime.min
3220012 -14.8% 2744219 sched_debug.cfs_rq:/system.slice.avg_vruntime.avg
3545518 -10.9% 3160659 ± 3% sched_debug.cfs_rq:/system.slice.avg_vruntime.max
3158026 -15.2% 2679207 sched_debug.cfs_rq:/system.slice.avg_vruntime.min
448.40 ± 2% -9.4% 406.19 sched_debug.cfs_rq:/system.slice.runnable_avg.avg
448.42 ± 2% -9.4% 406.17 sched_debug.cfs_rq:/system.slice.se->avg.runnable_avg.avg
445.86 ± 2% -9.3% 404.18 sched_debug.cfs_rq:/system.slice.se->avg.util_avg.avg
1190 ± 4% -9.6% 1077 ± 3% sched_debug.cfs_rq:/system.slice.se->avg.util_avg.max
17612 -8.0% 16195 sched_debug.cfs_rq:/system.slice.se->sum_exec_runtime.avg
17072 -8.4% 15640 sched_debug.cfs_rq:/system.slice.se->sum_exec_runtime.min
4282329 -17.3% 3541763 sched_debug.cfs_rq:/system.slice.se->vruntime.avg
4734174 ± 2% -13.9% 4074191 ± 4% sched_debug.cfs_rq:/system.slice.se->vruntime.max
4115719 -17.0% 3417711 ± 2% sched_debug.cfs_rq:/system.slice.se->vruntime.min
445.87 ± 2% -9.3% 404.21 sched_debug.cfs_rq:/system.slice.util_avg.avg
1190 ± 4% -9.6% 1077 ± 3% sched_debug.cfs_rq:/system.slice.util_avg.max
3220012 -14.8% 2744218 sched_debug.cfs_rq:/system.slice.zero_vruntime.avg
3545518 -10.9% 3160649 ± 3% sched_debug.cfs_rq:/system.slice.zero_vruntime.max
3158026 -15.2% 2679207 sched_debug.cfs_rq:/system.slice.zero_vruntime.min
3201153 -14.7% 2731311 sched_debug.cfs_rq:/system.slice/containerd.service.se->vruntime.avg
3249048 -13.6% 2808484 ± 3% sched_debug.cfs_rq:/system.slice/containerd.service.se->vruntime.max
3153595 -14.8% 2685843 sched_debug.cfs_rq:/system.slice/containerd.service.se->vruntime.min
56.08 ± 17% -29.3% 39.67 ± 9% sched_debug.cfs_rq:/system.slice/containerd.service.tg_load_avg.avg
128.83 ± 17% -25.5% 96.00 ± 24% sched_debug.cfs_rq:/system.slice/containerd.service.tg_load_avg.max
25.42 ± 31% -34.4% 16.67 ± 30% sched_debug.cfs_rq:/system.slice/containerd.service.tg_load_avg.min
27.90 ± 11% -35.1% 18.10 ± 26% sched_debug.cfs_rq:/system.slice/containerd.service.tg_load_avg.stddev
17409 -8.0% 16022 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.avg_vruntime.avg
16929 -8.3% 15524 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.avg_vruntime.min
157.14 ± 19% -60.4% 62.15 ± 57% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_deadline.avg
17300 -23.2% 13285 ± 44% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_deadline.max
1631 ± 9% -45.1% 895.75 ± 47% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_deadline.stddev
157.13 ± 19% -60.5% 62.14 ± 57% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_vruntime.avg
17298 -23.2% 13283 ± 44% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_vruntime.max
1631 ± 9% -45.1% 895.61 ± 47% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.left_vruntime.stddev
504.25 ± 3% -8.3% 462.57 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.load_avg.avg
157.13 ± 19% -60.5% 62.14 ± 57% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.right_vruntime.avg
17298 -23.2% 13283 ± 44% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.right_vruntime.max
1631 ± 9% -45.1% 895.61 ± 47% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.right_vruntime.stddev
449.13 ± 2% -9.2% 407.68 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.runnable_avg.avg
448.89 ± 2% -9.3% 407.10 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->avg.runnable_avg.avg
446.27 ± 2% -9.2% 405.04 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->avg.util_avg.avg
1190 ± 4% -9.5% 1078 ± 3% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->avg.util_avg.max
17428 -8.0% 16038 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->sum_exec_runtime.avg
16954 -8.3% 15542 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->sum_exec_runtime.min
3220022 -14.8% 2744222 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->vruntime.avg
3545518 -10.9% 3160659 ± 3% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->vruntime.max
3158026 -15.2% 2679207 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.se->vruntime.min
117383 -8.5% 107420 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.tg_load_avg.avg
93409 ± 4% -12.8% 81492 ± 2% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.tg_load_avg.min
11146 ± 12% +23.7% 13782 ± 4% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.tg_load_avg.stddev
504.19 ± 2% -7.4% 467.07 ± 2% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.tg_load_avg_contrib.avg
446.38 ± 2% -9.2% 405.51 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.util_avg.avg
1191 ± 4% -9.5% 1078 ± 3% sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.util_avg.max
17409 -8.0% 16021 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.zero_vruntime.avg
16929 -8.3% 15524 sched_debug.cfs_rq:/system.slice/lkp-bootstrap.service.zero_vruntime.min
3219034 -15.3% 2727012 sched_debug.cfs_rq:/system.slice/redis-server.service.se->vruntime.avg
3233948 -15.3% 2739777 sched_debug.cfs_rq:/system.slice/redis-server.service.se->vruntime.max
3203580 -15.4% 2711785 sched_debug.cfs_rq:/system.slice/redis-server.service.se->vruntime.min
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2026-06-17 14:26 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-06-17 14:26 [linux-next:master] [mm/filemap] 0b9c0aeba9: pts.graphics-magick.Swirl.iterations_per_minute 8.6% regression kernel test robot
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.