From: Qu Wenruo <wqu@suse.com>
To: Peter Volkov <peter.volkov@gmail.com>, dsterba@suse.cz
Cc: linux-btrfs@vger.kernel.org
Subject: Re: BTRFS critical (device dm-0): corrupt node: root=256 block=1035372494848 slot=364, bad key order, current (8796143471049 108 0) next (50450969 1 0)
Date: Wed, 2 Oct 2024 07:42:26 +0930 [thread overview]
Message-ID: <094b3ff1-05f4-4557-80db-947a8224b671@suse.com> (raw)
In-Reply-To: <CAE+k_gJETiAtToyw9LoG3QWj-5Govupt9Shp9TFuqevSbt_RbA@mail.gmail.com>
在 2024/10/2 02:40, Peter Volkov 写道:
> On Tue, Oct 1, 2024 at 3:09 PM David Sterba <dsterba@suse.cz> wrote:
>> On Tue, Oct 01, 2024 at 02:15:51PM +0000, Peter Volkov wrote:
>>> Hi! I've been using this system with this kernel (6.10.10) for a few
>>> months already and today out of nowhere btrfs broke with this error
>>> message:
>>>
>>> [53923.816740] page dumped because: eb page dump
>>> [53923.816743] BTRFS critical (device dm-0): corrupt node: root=256
>>> block=1035372494848 slot=364, bad key order, current (8796143471049
>>> 108 0) next (50450969 1 0)
>>
>> Quite obvious memory bitflip:
>>
>> 8796143471049 = 0x8000301c9c9
>> 50450969 = 0x301d219
>>
>> The first one should probably be 0x301c9c9, but it's impossible to tell
>> how many other data/metadata could have been hit by this or another
>> memory bitflip so check can detect the things but not fix.
>
> Thank you David! Is my understanding correct, that btrfs catches
> memory problems,
> so this bitflip most probably means that my drive is failing?
In this particular case, it's your hardware memory, not the drive.
The error is happening at write time, so the metadata read from disk is
fine, thus not your driver returning some weird data.
Furthermore, it's pretty hard that a simple bitflip can pass the
internal checksums of the storage device, thus it's very unlikely it's
your drive.
So, please do a full memtest of your system before doing anything else.
And considering your fsck result is already bad, it's no doubt that some
bitflip has already corrupted extent tree, and I believe the csum tree
is also corrupted.
Thanks,
Qu
>
> --
> Peter.
>
next prev parent reply other threads:[~2024-10-01 22:12 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-10-01 14:15 BTRFS critical (device dm-0): corrupt node: root=256 block=1035372494848 slot=364, bad key order, current (8796143471049 108 0) next (50450969 1 0) Peter Volkov
2024-10-01 15:09 ` David Sterba
2024-10-01 17:10 ` Peter Volkov
2024-10-01 17:55 ` Matthew Warren
2024-10-01 22:12 ` Qu Wenruo [this message]
2024-10-04 8:01 ` Peter Volkov
2024-10-04 8:28 ` Qu Wenruo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=094b3ff1-05f4-4557-80db-947a8224b671@suse.com \
--to=wqu@suse.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
--cc=peter.volkov@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).