Linux Btrfs filesystem development
 help / color / mirror / Atom feed
From: Qu Wenruo <wqu@suse.com>
To: BP25 <bp25@posteo.net>, linux-btrfs@vger.kernel.org
Subject: Re: How to detect ram memory going bad?
Date: Thu, 20 Nov 2025 07:24:54 +1030	[thread overview]
Message-ID: <3fde2de2-d209-4028-adac-fb53e6a89dd4@suse.com> (raw)
In-Reply-To: <665d612165e1f21e681d3b1229bcd40f@posteo.net>



在 2025/11/20 01:01, BP25 写道:
> Hello I'm writing to this mailing list as suggested by the btrfs docs. I 
> wanted to ask how to detect and mitigate ram memory going bad when using 
> BTRFS? Because the 'Hardware Considerations' the BTRFS manual suggest in 
> this scenario to run memtest; but this is probs more like right after 
> installing new ram.

You should always do a stress test after hardware modification.
And it's always recommended using things like memtest86+ which is a raw 
UEFI payload, with minimal memory reserved, so that almost all RAM can 
be properly tested.

Memtester can be executed as a user space program but it can not test 
the space reserved by the kernel.
Considering how small the space reserved by the kernel, it's still a 
worthy solution, but it will still take a lot of memory (if you want to 
test as many memory as possible).

For the timing:

If installing new ram, always run a memtest.
If overcloking/changing DIMM timming, always run a memtest.


Finally, backup is always recommended, no matter what.


> Is there any BTRFS tool, perhaps to run 
> periodically, that can help me detect bad ram hence mitigate the 
> consequences? There is a webpage called 'Will ZFS and non-ECC RAM kill 
> your data?' where it's suggested that ZFS scrub effectively detects bad 
> ram

I doubt, because bad ram can easily corrupt your metadata/data, and the 
same bad content is written to disk.

Furthermore the checksum will still match (calculated using the bad 
metadata/data), thus it's impossible to detect no matter what.


And bad RAM can happen at any random byte, if it's some core kernel 
structure, you're doomed anyway.

> (when at least two copies of the same file and/or metadata are 
> stored, and I wonder if there are other assumptions here...), but I'm 
> new to btrfs and I wonder if the reasoning can be applied to btrfs as 
> well, and how effective of a mitigation it would actually provide. OS is 
> GNU (Guix) and I think can't use ECC because I suspect my X200 
> motherboards wouldn't support it?
> 
> Please CC: or BCC: me cause I'm not subscribed.
> 


      reply	other threads:[~2025-11-19 20:55 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-11-19 14:31 How to detect ram memory going bad? BP25
2025-11-19 20:54 ` Qu Wenruo [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3fde2de2-d209-4028-adac-fb53e6a89dd4@suse.com \
    --to=wqu@suse.com \
    --cc=bp25@posteo.net \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox