linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [linus:master] [mm]  f822a9a81a: stress-ng.bigheap.realloc_calls_per_sec 37.3% regression
@ 2025-08-07  8:17 kernel test robot
  2025-08-07  8:27 ` Lorenzo Stoakes
  0 siblings, 1 reply; 23+ messages in thread
From: kernel test robot @ 2025-08-07  8:17 UTC (permalink / raw)
  To: Dev Jain
  Cc: oe-lkp, lkp, linux-kernel, Andrew Morton, Barry Song,
	Lorenzo Stoakes, Pedro Falcato, Anshuman Khandual, Bang Li,
	Baolin Wang, bibo mao, David Hildenbrand, Hugh Dickins,
	Ingo Molnar, Jann Horn, Lance Yang, Liam Howlett, Matthew Wilcox,
	Peter Xu, Qi Zheng, Ryan Roberts, Vlastimil Babka, Yang Shi,
	Zi Yan, linux-mm, oliver.sang



Hello,

kernel test robot noticed a 37.3% regression of stress-ng.bigheap.realloc_calls_per_sec on:


commit: f822a9a81a31311d67f260aea96005540b18ab07 ("mm: optimize mremap() by PTE batching")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

[still regression on      linus/master 186f3edfdd41f2ae87fc40a9ccba52a3bf930994]
[still regression on linux-next/master b9ddaa95fd283bce7041550ddbbe7e764c477110]

testcase: stress-ng
config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 192 threads 2 sockets Intel(R) Xeon(R) Platinum 8468V  CPU @ 2.4GHz (Sapphire Rapids) with 384G memory
parameters:

	nr_threads: 100%
	testtime: 60s
	test: bigheap
	cpufreq_governor: performance




If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202508071609.4e743d7c-lkp@intel.com


Details are as below:
-------------------------------------------------------------------------------------------------->


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20250807/202508071609.4e743d7c-lkp@intel.com

=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
  gcc-12/performance/x86_64-rhel-9.4/100%/debian-12-x86_64-20240206.cgz/igk-spr-2sp1/bigheap/stress-ng/60s

commit: 
  94dab12d86 ("mm: call pointers to ptes as ptep")
  f822a9a81a ("mm: optimize mremap() by PTE batching")

94dab12d86cf77ff f822a9a81a31311d67f260aea96 
---------------- --------------------------- 
         %stddev     %change         %stddev
             \          |                \  
     13777 ± 37%     +45.0%      19979 ± 27%  numa-vmstat.node1.nr_slab_reclaimable
    367205            +2.3%     375703        vmstat.system.in
     55106 ± 37%     +45.1%      79971 ± 27%  numa-meminfo.node1.KReclaimable
     55106 ± 37%     +45.1%      79971 ± 27%  numa-meminfo.node1.SReclaimable
    559381           -37.3%     350757        stress-ng.bigheap.realloc_calls_per_sec
     11468            +1.2%      11603        stress-ng.time.system_time
    296.25            +4.5%     309.70        stress-ng.time.user_time
      0.81 ±187%    -100.0%       0.00        perf-sched.sch_delay.avg.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
      9.36 ±165%    -100.0%       0.00        perf-sched.sch_delay.max.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
      0.81 ±187%    -100.0%       0.00        perf-sched.wait_time.avg.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
      9.36 ±165%    -100.0%       0.00        perf-sched.wait_time.max.ms.__cond_resched.zap_pte_range.zap_pmd_range.isra.0
      5.50 ± 17%    +390.9%      27.00 ± 56%  perf-c2c.DRAM.local
    388.50 ± 10%    +114.7%     834.17 ± 33%  perf-c2c.DRAM.remote
      1214 ± 13%    +107.3%       2517 ± 31%  perf-c2c.HITM.local
    135.00 ± 19%    +130.9%     311.67 ± 32%  perf-c2c.HITM.remote
      1349 ± 13%    +109.6%       2829 ± 31%  perf-c2c.HITM.total




Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki



^ permalink raw reply	[flat|nested] 23+ messages in thread

end of thread, other threads:[~2025-08-07 19:53 UTC | newest]

Thread overview: 23+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-08-07  8:17 [linus:master] [mm] f822a9a81a: stress-ng.bigheap.realloc_calls_per_sec 37.3% regression kernel test robot
2025-08-07  8:27 ` Lorenzo Stoakes
2025-08-07  8:56   ` Dev Jain
2025-08-07 10:21   ` David Hildenbrand
2025-08-07 16:06     ` Dev Jain
2025-08-07 16:10       ` Lorenzo Stoakes
2025-08-07 16:16         ` Lorenzo Stoakes
2025-08-07 17:04           ` Dev Jain
2025-08-07 17:07             ` Lorenzo Stoakes
2025-08-07 17:11               ` Dev Jain
2025-08-07 17:37   ` Jann Horn
2025-08-07 17:41     ` Lorenzo Stoakes
2025-08-07 17:46       ` Jann Horn
2025-08-07 17:50         ` Dev Jain
2025-08-07 17:53           ` Lorenzo Stoakes
2025-08-07 17:51         ` Lorenzo Stoakes
2025-08-07 18:01           ` David Hildenbrand
2025-08-07 18:04             ` Lorenzo Stoakes
2025-08-07 18:13               ` David Hildenbrand
2025-08-07 18:07             ` Jann Horn
2025-08-07 18:31               ` David Hildenbrand
2025-08-07 19:52                 ` Lorenzo Stoakes
2025-08-07 17:59       ` David Hildenbrand

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).