From: Qu Wenruo <wqu@suse.com>
To: BP25 <bp25@posteo.net>, linux-btrfs@vger.kernel.org
Subject: Re: How to detect ram memory going bad?
Date: Thu, 20 Nov 2025 07:24:54 +1030 [thread overview]
Message-ID: <3fde2de2-d209-4028-adac-fb53e6a89dd4@suse.com> (raw)
In-Reply-To: <665d612165e1f21e681d3b1229bcd40f@posteo.net>
在 2025/11/20 01:01, BP25 写道:
> Hello I'm writing to this mailing list as suggested by the btrfs docs. I
> wanted to ask how to detect and mitigate ram memory going bad when using
> BTRFS? Because the 'Hardware Considerations' the BTRFS manual suggest in
> this scenario to run memtest; but this is probs more like right after
> installing new ram.
You should always do a stress test after hardware modification.
And it's always recommended using things like memtest86+ which is a raw
UEFI payload, with minimal memory reserved, so that almost all RAM can
be properly tested.
Memtester can be executed as a user space program but it can not test
the space reserved by the kernel.
Considering how small the space reserved by the kernel, it's still a
worthy solution, but it will still take a lot of memory (if you want to
test as many memory as possible).
For the timing:
If installing new ram, always run a memtest.
If overcloking/changing DIMM timming, always run a memtest.
Finally, backup is always recommended, no matter what.
> Is there any BTRFS tool, perhaps to run
> periodically, that can help me detect bad ram hence mitigate the
> consequences? There is a webpage called 'Will ZFS and non-ECC RAM kill
> your data?' where it's suggested that ZFS scrub effectively detects bad
> ram
I doubt, because bad ram can easily corrupt your metadata/data, and the
same bad content is written to disk.
Furthermore the checksum will still match (calculated using the bad
metadata/data), thus it's impossible to detect no matter what.
And bad RAM can happen at any random byte, if it's some core kernel
structure, you're doomed anyway.
> (when at least two copies of the same file and/or metadata are
> stored, and I wonder if there are other assumptions here...), but I'm
> new to btrfs and I wonder if the reasoning can be applied to btrfs as
> well, and how effective of a mitigation it would actually provide. OS is
> GNU (Guix) and I think can't use ECC because I suspect my X200
> motherboards wouldn't support it?
>
> Please CC: or BCC: me cause I'm not subscribed.
>
prev parent reply other threads:[~2025-11-19 20:55 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-19 14:31 How to detect ram memory going bad? BP25
2025-11-19 20:54 ` Qu Wenruo [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3fde2de2-d209-4028-adac-fb53e6a89dd4@suse.com \
--to=wqu@suse.com \
--cc=bp25@posteo.net \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox