From: Pratyush Yadav <pratyush@kernel.org>
To: Rik van Riel <riel@surriel.com>
Cc: Pratyush Yadav <pratyush@kernel.org>,
Breno Leitao <leitao@debian.org>,
nao.horiguchi@gmail.com, linmiaohe@huawei.com,
david@kernel.org, lance.yang@linux.dev,
akpm@linux-foundation.org, baoquan.he@linux.dev,
rppt@kernel.org, kexec@lists.infradead.org, linux-mm@kvack.org,
rneu@meta.com, caggio@meta.com, kas@kernel.org
Subject: Re: mm/hwpoison: persist poisoned PFN list across kexec via KHO [RFC]
Date: Wed, 24 Jun 2026 17:17:02 +0200 [thread overview]
Message-ID: <2vxzjyroklxt.fsf@kernel.org> (raw)
In-Reply-To: <da460b26bef42152bf23d395f845503ad8eedc4d.camel@surriel.com> (Rik van Riel's message of "Wed, 24 Jun 2026 10:44:20 -0400")
On Wed, Jun 24 2026, Rik van Riel wrote:
> On Wed, 2026-06-24 at 15:40 +0200, Pratyush Yadav wrote:
>>
>> Also, what happens on cold reboot? If the HW does not remember bad
>> pages, won't the kernel be in the same position? How does it know the
>> bad pages on a cold boot?
>
> Some modern server hardware will simply unmap known
> bad pages from the physical page map, so they will
> not be exposed to the OS after a cold reboot.
>
> The hardware keeps a log of uncorrectable memory
> errors somewhere in memory, for example in the SEL.
>
>>
>>
>> >
>> > This PoC
>> > ========
>> >
>> > * Makes hardware-poisoned pages survive a kexec, using KHO (Kexec
>> > HandOver) to carry the poison list between kernels.
>> >
>> > * Producer: hooks num_poisoned_pages_inc()/_sub() - the single
>> > chokepoint for every poison/unpoison event - and records each
>> > poisoned PFN into a vmalloc array that KHO preserves across the
>> > kexec, described by a small versioned "hwpoison" subtree.
>>
>> More of an implementation detail, but with vmalloc array, what if you
>> have too many poisoned pages?
>> >
>
> If a very large amount of memory is broken, you
> should probably just repair the hardware.
"large" is relative. On a 2 TiB system, if you have 0.5% of pages
poisoned (I have no idea if that number is realistic), you have 10 GiB
of memory poisoned, or around 2.6 million pages. To store all their
PFNs, you need around 20 MiB of memory.
While not too large, it isn't trivial either.
I think static data structures like vmalloc are likely not the way to go
here especially when we have better things like KHO block or the KHO
radix tree.
Between those two, what is more efficient largely depends on how many
pages you'd typically see poisoned and what their locations tend to be.
That I think we can dive deeper into when we take a closer look at the
patches.
>
> Page poisoning is good for localized memory
> failures, but not for failures that extend across
> much of a memory chip.
--
Regards,
Pratyush Yadav
prev parent reply other threads:[~2026-06-24 15:17 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-24 10:39 mm/hwpoison: persist poisoned PFN list across kexec via KHO [RFC] Breno Leitao
2026-06-24 12:04 ` Kiryl Shutsemau
2026-06-24 13:46 ` Pratyush Yadav
2026-06-24 15:21 ` Breno Leitao
2026-06-24 15:34 ` Kiryl Shutsemau
2026-06-24 13:40 ` Pratyush Yadav
2026-06-24 14:44 ` Rik van Riel
2026-06-24 15:17 ` Pratyush Yadav [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2vxzjyroklxt.fsf@kernel.org \
--to=pratyush@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=baoquan.he@linux.dev \
--cc=caggio@meta.com \
--cc=david@kernel.org \
--cc=kas@kernel.org \
--cc=kexec@lists.infradead.org \
--cc=lance.yang@linux.dev \
--cc=leitao@debian.org \
--cc=linmiaohe@huawei.com \
--cc=linux-mm@kvack.org \
--cc=nao.horiguchi@gmail.com \
--cc=riel@surriel.com \
--cc=rneu@meta.com \
--cc=rppt@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox