From: "andriy.shevchenko@linux.intel.com" <andriy.shevchenko@linux.intel.com>
To: "Stamatis, Ilias" <ilstam@amazon.co.uk>
Cc: "akpm@linux-foundation.org" <akpm@linux-foundation.org>,
"nadav.amit@gmail.com" <nadav.amit@gmail.com>,
"david@kernel.org" <david@kernel.org>,
"linux-mm@kvack.org" <linux-mm@kvack.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"huang.ying.caritas@gmail.com" <huang.ying.caritas@gmail.com>,
"bhe@redhat.com" <bhe@redhat.com>,
"nh-open-source@amazon.com" <nh-open-source@amazon.com>
Subject: Re: [PATCH] Reinstate "resource: avoid unnecessary lookups in find_next_iomem_res()"
Date: Mon, 24 Nov 2025 20:55:31 +0200 [thread overview]
Message-ID: <aSSqIzJWKTe9hs2f@smile.fi.intel.com> (raw)
In-Reply-To: <c7411175b332f3befb5bebb6a75c7b91f2c1dbbc.camel@amazon.co.uk>
On Mon, Nov 24, 2025 at 06:01:35PM +0000, Stamatis, Ilias wrote:
> On Mon, 2025-11-24 at 08:58 -0800, Andrew Morton wrote:
> > On Mon, 24 Nov 2025 16:53:49 +0000 Ilias Stamatis <ilstam@amazon.com> wrote:
> >
> > > Commit 97523a4edb7b ("kernel/resource: remove first_lvl / siblings_only
> > > logic") removed an optimization introduced by commit 756398750e11
> > > ("resource: avoid unnecessary lookups in find_next_iomem_res()"). That
> > > was not called out in the message of the first commit explicitly so it's
> > > not entirely clear whether removing the optimization happened
> > > inadvertently or not.
> > >
> > > As the original commit message of the optimization explains there is no
> > > point considering the children of a subtree in find_next_iomem_res() if
> > > the top level range does not match. Reinstating the optimization results
> > > in significant performance improvements in systems with very large iomem
> > > maps when mmaping /dev/mem.
> >
> > It would be great if we could quantify "significant performance
> > improvements"?
>
> Hi Andrew and Andy,
>
> You are right to call that out and apologies for leaving it vague.
>
> I've done my testing with older kernel versions in systems where `wc -l
> /proc/iomem` can return ~5k. In that environment I see mmaping parts of
> /dev/mem taking 700-1500μs without the optimisation and 10-50μs with the
> optimisation.
>
> The real-world use case we care about is hypervisor live update where having to
> do lots of these mmaps() serially can significantly affect the guest downtime
> if the cost is 20-30x.
Thanks for providing this information.
> > It also would be good to know which exact function(s) is a bottleneck.
>
> Perf tracing shows that ~95% of CPU time is spent in find_next_iomem_res(),
Have you investigated possibility to return that check directly into
the culprit?
> the full call stack being:
>
> find_next_iomem_res+0x3b ([kernel.kallsyms])
> walk_system_ram_range+0x98 ([kernel.kallsyms])
> pat_pagerange_is_ram+0x6e ([kernel.kallsyms])
> reserve_pfn_range+0x47 ([kernel.kallsyms])
> track_pfn_remap+0xb6 ([kernel.kallsyms])
> remap_pfn_range+0x3b ([kernel.kallsyms])
> mmap_mem+0x9e ([kernel.kallsyms])
> mm_struct_mmap_region+0x1f3 ([kernel.kallsyms])
> mmap_region+0xa3 ([kernel.kallsyms])
--
With Best Regards,
Andy Shevchenko
next prev parent reply other threads:[~2025-11-24 18:55 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-24 16:53 [PATCH] Reinstate "resource: avoid unnecessary lookups in find_next_iomem_res()" Ilias Stamatis
2025-11-24 16:58 ` Andrew Morton
2025-11-24 17:05 ` Andy Shevchenko
2025-11-24 18:01 ` Stamatis, Ilias
2025-11-24 18:55 ` andriy.shevchenko [this message]
2025-11-24 19:35 ` Stamatis, Ilias
2025-11-24 19:52 ` andriy.shevchenko
2025-11-24 23:30 ` Stamatis, Ilias
2025-11-25 6:50 ` andriy.shevchenko
2025-11-25 9:56 ` Stamatis, Ilias
2025-11-25 10:23 ` andriy.shevchenko
2025-11-25 14:23 ` Stamatis, Ilias
2025-11-25 18:30 ` andriy.shevchenko
2025-11-25 8:09 ` David Hildenbrand (Red Hat)
2025-11-25 8:18 ` David Hildenbrand (Red Hat)
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aSSqIzJWKTe9hs2f@smile.fi.intel.com \
--to=andriy.shevchenko@linux.intel.com \
--cc=akpm@linux-foundation.org \
--cc=bhe@redhat.com \
--cc=david@kernel.org \
--cc=huang.ying.caritas@gmail.com \
--cc=ilstam@amazon.co.uk \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=nadav.amit@gmail.com \
--cc=nh-open-source@amazon.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.