From: Minchan Kim <minchan@kernel.org>
To: John Hubbard <jhubbard@nvidia.com>
Cc: "Paul E. McKenney" <paulmck@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
linux-mm <linux-mm@kvack.org>,
LKML <linux-kernel@vger.kernel.org>,
John Dias <joaodias@google.com>,
David Hildenbrand <david@redhat.com>
Subject: Re: [PATCH v4] mm: fix is_pinnable_page against on cma page
Date: Wed, 11 May 2022 20:44:43 -0700 [thread overview]
Message-ID: <YnyCqwY4DxcZ/NjM@google.com> (raw)
In-Reply-To: <aafc0954-80df-dceb-03f2-2caea29cbbda@nvidia.com>
On Wed, May 11, 2022 at 07:18:56PM -0700, John Hubbard wrote:
> On 5/11/22 18:08, John Hubbard wrote:
> > On 5/11/22 18:03, Minchan Kim wrote:
> > > >
> > > > Or there might be some code path that really hates a READ_ONCE() in
> > > > that place.
> > >
> > > My worry about chaning __get_pfnblock_flags_mask is it's called
> > > multiple hot places in mm codes so I didn't want to add overhead
> > > to them.
> >
> > ...unless it really does generate the same code as is already there,
> > right? Let me check that real quick.
> >
>
> It does change the generated code slightly. I don't know if this will
> affect performance here or not. But just for completeness, here you go:
>
> free_one_page() originally has this (just showing the changed parts):
>
> mov 0x8(%rdx,%rax,8),%rbx
> and $0x3f,%ecx
> shr %cl,%rbx
> and $0x7,
>
>
> And after applying this diff:
>
> diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> index 0e42038382c1..df1f8e9a294f 100644
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -482,7 +482,7 @@ unsigned long __get_pfnblock_flags_mask(const struct
> page *page,
> word_bitidx = bitidx / BITS_PER_LONG;
> bitidx &= (BITS_PER_LONG-1);
>
> - word = bitmap[word_bitidx];
> + word = READ_ONCE(bitmap[word_bitidx]);
> return (word >> bitidx) & mask;
> }
>
>
> ...it now does an extra memory dereference:
>
> lea 0x8(%rdx,%rax,8),%rax
> and $0x3f,%ecx
> mov (%rax),%rbx
> shr %cl,%rbx
> and $0x7,%ebx
>
Thanks for checking, John.
I don't want to have the READ_ONCE in __get_pfnblock_flags_mask
atm even though it's an extra memory dereference for specific
architecutre and specific compiler unless other callsites *do*
need it.
We choose the choice(i.e., having __READ_ONCE in is_pinanble_
page) for *potential* cases(e.g., aggressive LTO for future)
and if it's an extra memory dereference atm, it would be multiple
instructions *potentailly* as having more change or different
compiler and/or arches now or later. It's out of scope in
this patch.
next prev parent reply other threads:[~2022-05-12 3:44 UTC|newest]
Thread overview: 46+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-05-10 21:17 [PATCH v4] mm: fix is_pinnable_page against on cma page Minchan Kim
2022-05-10 22:56 ` John Hubbard
2022-05-10 23:31 ` Minchan Kim
2022-05-10 23:58 ` John Hubbard
2022-05-11 0:09 ` Minchan Kim
2022-05-11 4:32 ` John Hubbard
2022-05-11 21:46 ` Minchan Kim
2022-05-11 22:25 ` John Hubbard
2022-05-11 22:37 ` Minchan Kim
2022-05-11 22:49 ` John Hubbard
2022-05-11 23:08 ` Minchan Kim
2022-05-11 23:13 ` John Hubbard
2022-05-11 23:15 ` Minchan Kim
2022-05-11 23:28 ` Minchan Kim
2022-05-11 23:33 ` John Hubbard
2022-05-11 23:45 ` Paul E. McKenney
2022-05-11 23:57 ` John Hubbard
2022-05-12 0:12 ` Paul E. McKenney
2022-05-12 0:12 ` John Hubbard
2022-05-12 0:22 ` Paul E. McKenney
2022-05-12 0:26 ` Minchan Kim
2022-05-12 0:34 ` John Hubbard
2022-05-12 0:49 ` Paul E. McKenney
2022-05-12 1:02 ` John Hubbard
2022-05-12 1:03 ` Minchan Kim
2022-05-12 1:08 ` John Hubbard
2022-05-12 2:18 ` John Hubbard
2022-05-12 3:44 ` Minchan Kim [this message]
2022-05-12 4:47 ` John Hubbard
2022-05-17 14:00 ` Jason Gunthorpe
2022-05-17 18:12 ` John Hubbard
2022-05-17 19:28 ` Jason Gunthorpe
2022-05-17 20:12 ` John Hubbard
2022-05-17 20:21 ` Paul E. McKenney
2022-05-23 16:33 ` Minchan Kim
2022-05-24 2:55 ` John Hubbard
2022-05-24 5:16 ` Minchan Kim
2022-05-24 6:22 ` John Hubbard
2022-05-24 14:19 ` Jason Gunthorpe
2022-05-24 15:43 ` Minchan Kim
2022-05-24 15:48 ` Jason Gunthorpe
2022-05-24 16:37 ` Paul E. McKenney
2022-05-24 16:59 ` Minchan Kim
2022-05-12 3:57 ` Paul E. McKenney
2022-05-12 1:03 ` Minchan Kim
2022-05-12 0:35 ` Paul E. McKenney
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YnyCqwY4DxcZ/NjM@google.com \
--to=minchan@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=david@redhat.com \
--cc=jhubbard@nvidia.com \
--cc=joaodias@google.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=paulmck@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.