From: Brendan Jackman <jackmanb@google.com>
To: Borislav Petkov <bp@alien8.de>, Brendan Jackman <jackmanb@google.com>
Cc: Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>,
Dave Hansen <dave.hansen@linux.intel.com>, <x86@kernel.org>,
"H. Peter Anvin" <hpa@zytor.com>,
Andy Lutomirski <luto@kernel.org>,
Peter Zijlstra <peterz@infradead.org>,
<linux-kernel@vger.kernel.org>
Subject: Re: [PATCH 2/4] x86/mm: harmonize return value of phys_pte_init()
Date: Fri, 28 Nov 2025 14:03:09 +0000 [thread overview]
Message-ID: <DEKDTYQLAD0T.3KGTCS0ZFI4DU@google.com> (raw)
In-Reply-To: <20251127143501.GAaShhlVTH5iQpCdPM@fat_crate.local>
On Thu Nov 27, 2025 at 2:35 PM UTC, Borislav Petkov wrote:
> On Fri, Oct 03, 2025 at 04:56:42PM +0000, Brendan Jackman wrote:
>> In the case that they encounter pre-existing mappings, all the other
>> phys_*_init()s include those pre-mapped PFNs in the returned value.
>> Excluding those PFNs only when they are mapped at 4K seems like an
>> error. So make it consistent.
>>
>> The other functions only include the existing mappings if the
>> page_size_mask would have allowed creating those mappings.
>> 4K pages can't be disabled by page_size_mask so that condition is not
>> needed here; paddr_last can be assigned unconditionally before checking
>> for existing mappings.
>>
>> Signed-off-by: Brendan Jackman <jackmanb@google.com>
>> ---
>> arch/x86/mm/init_64.c | 3 ++-
>> 1 file changed, 2 insertions(+), 1 deletion(-)
>>
>> diff --git a/arch/x86/mm/init_64.c b/arch/x86/mm/init_64.c
>> index 9e45b371a6234b41bd7177b81b5d432341ae7214..968a5092dbd7ee3e7007fa0c769eff7d7ecb0ba3 100644
>> --- a/arch/x86/mm/init_64.c
>> +++ b/arch/x86/mm/init_64.c
>> @@ -492,6 +492,8 @@ phys_pte_init(pte_t *pte_page, unsigned long paddr, unsigned long paddr_end,
>> continue;
>> }
>>
>> + paddr_last = paddr_next;
>> +
>> /*
>> * We will re-use the existing mapping.
>> * Xen for example has some special requirements, like mapping
>
> I don't understand: the other phys_*_init() things do:
>
> if (!XXX_none())
>
> ...
>
> paddr_last = paddr_next;
>
> while you've raised the assignment above that test.
Well they actually do this:
if (!p*_none()) {
if (!p*_leaf()) {
paddr_last = ...
continue;
}
if (page_size_mask & *) {
paddr_last = ...
continue;
}
}
if (page_size_mask & *) {
paddr_last = *
continue;
}
paddr_last = *
That is, they update paddr_last unconditionally. While before this
patch, phys_pte_init() skips the update in the !pte_non() case.
> Also "seems like an error" needs a lot more poking at because if it is an
> error, then its incarnation must be really nasty and subtle or it is not, and
> then we don't care. And it has been that way for a while now...
Before the patchset, the return value of kernel_physical_mapping_init()
means something like:
1. The last physical address that was mapped.
2. ... This includes addresses that were already mapped before the call
3. ... UNLESS that pre-existing mapping was 4K.
In patch 4/4 I'm claiming:
> The exact definition of this is pretty fiddly, but only when there is a mismatch
> between the alignment of the requested range and the page sizes allowed
> by page_size_mask, or when the range ends in a region that is not mapped
> according to e820.
Which would not be true given point 3 above. Without this
phys_pte_init() change, the return value of init_memory_mapping() is
fiddly even if you are allow arbitary page sizes and all the paddrs
you're trying to map definitely exist, because of the 4K special-case in
point 4. Instead of trying to justify why init_memory_mapping() doesn't
care even about that special-case, I just removed that special-case
because I think it was probably a bug anyway.
HOWEVER... with the wisdom of hindsight... this was a VERY obscure
and confusing way to go about writing the patchset. I apologise!
I think the right way to do this is to drop this patch (2/4) and
evaluate the remainder against the claim that init_memory_mapping()
doesn't care about the return value at all. So that would have to mean:
a. It only calls kernel_physical_mapping_init() for physical ranges that
exist.
b. It always uses a page_size_mask that matches the alignment of the
ranges it's passing.
c. It doesn't operate on ranges that already have mappings.
Am I making a bit more sense now...?
next prev parent reply other threads:[~2025-11-28 14:03 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-10-03 16:56 [PATCH 0/4] x86/mm: some cleanups for pagetable setup code Brendan Jackman
2025-10-03 16:56 ` [PATCH 1/4] x86/mm: delete disabled debug code Brendan Jackman
2025-11-27 13:39 ` [tip: x86/cleanups] x86/mm: Delete " tip-bot2 for Brendan Jackman
2025-10-03 16:56 ` [PATCH 2/4] x86/mm: harmonize return value of phys_pte_init() Brendan Jackman
2025-11-27 14:35 ` Borislav Petkov
2025-11-28 14:03 ` Brendan Jackman [this message]
2025-12-05 19:29 ` Dave Hansen
2025-12-07 2:39 ` Brendan Jackman
2025-10-03 16:56 ` [PATCH 3/4] x86/mm: drop unused return from pgtable setup functions Brendan Jackman
2025-10-03 16:56 ` [PATCH 4/4] x86/mm: simplify calculation of max_pfn_mapped Brendan Jackman
2025-10-21 13:06 ` [PATCH 0/4] x86/mm: some cleanups for pagetable setup code Brendan Jackman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=DEKDTYQLAD0T.3KGTCS0ZFI4DU@google.com \
--to=jackmanb@google.com \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=luto@kernel.org \
--cc=mingo@redhat.com \
--cc=peterz@infradead.org \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.