From: "Hendrik Friedel" <hendrik@friedels.name>
To: "Zygo Blaxell" <ce3g8jdj@umail.furryterror.org>
Cc: "Btrfs BTRFS" <linux-btrfs@vger.kernel.org>
Subject: Re[2]: parent transid verify failed: Fixed but re-appearing
Date: Wed, 21 Oct 2020 21:15:06 +0000 [thread overview]
Message-ID: <em33511ef4-7da1-4e7c-8b0c-8b8d7043164c@desktop-g0r648m> (raw)
In-Reply-To: <20201021193246.GE21815@hungrycats.org>
Hello Zygo,
thanks for your reply.
>To be clear, scrub is the first thing to try, it will try to walk all
>the trees on the filesystem and read all the blocks.
>
Understood. Also thanks for explaining the other two options.
I think, it would be really advisable to add some recipies in the btrfs
wiki - especially because btrfs check --repair is often not the right
option opposed to the equivalent on other file systems.
>> > Scrub iterates over all metadata pages and should hit ptvf if it's
>> > on disk.
>>
>> But apparently it did not?!
>
>...which indicates the problem is probably in memory, not on disk.
Which is good. On the other hand, that leaves the question why I have
re-occuring failures of this kind (apparently) in memory.
What are possible reasons?
>> So it looks like it is raid1 metadata
>
>That would mean either that recovery already happened (if the corruption
>was on disk and has been removed), or the problem never reached a disk
>(if it is only in memory).
Ok.
Good news.
But it's still there:
dduper --device /dev/sda1 --dir
/srv/dev-disk-by-label-DataPool1/dduper_test/testfiles -r --dry-run
parent transid verify failed on 8333716668416 wanted 357026 found 357028
parent transid verify failed on 8333716668416 wanted 357026 found 357028
parent transid verify failed on 8333716668416 wanted 357026 found 357028
>
>> > only one of your disks is silently dropping writes. In that case
>> > btrfs will recover from ptvf by replacing the missing block from the
>> > other drive. But there are no scrub errors or kernel messages related
>> > to this, so there aren't any symptoms of that happening, so it seems
>> > these ptvf are not coming from the disk.
>> And can this be confirmed somehow? Would be good to replace that disk
>> then...
>
>These normally appear in 'btrfs dev stats', although there are various
>coverage gaps with raid5/6 and (until recently) compressed data blocks.
I do not use raid5/6 and I think I do not use compressed data.
btrfs dev stats /dev/sda1
[/dev/sda1].write_io_errs 0
[/dev/sda1].read_io_errs 0
[/dev/sda1].flush_io_errs 0
[/dev/sda1].corruption_errs 0
[/dev/sda1].generation_errs 1
So, what does the generation_errs tell us?
>'btrfs scrub status -d' will give a per-drive error breakdown.
btrfs scrub status -d /dev/sda1
scrub status for c4a6a2c9-5cf0-49b8-812a-0784953f9ba3
scrub device /dev/sda1 (id 1) history
scrub started at Mon Oct 19 21:07:13 2020 and finished after
15:11:10
total bytes scrubbed: 6.56TiB with 0 errors
btrfs scrub status -d /dev/sdj1
scrub status for c4a6a2c9-5cf0-49b8-812a-0784953f9ba3
scrub device /dev/sdj1 (id 2) history
scrub started at Mon Oct 19 21:07:13 2020 and finished after
17:06:15
total bytes scrubbed: 6.56TiB with 0 errors
>I haven't seen spurious ptvf errors on my test machines since the 5.4.30s,
>but 5.4 has a lot of fixes that between-LTS kernels like 5.6 do not
>always get.
Ok, I am compiling 5.9.1 now.
Apart from switching to the latest Kernel - what next step do you
recommend?
I tend to run a scrub again. Is it possible/useful to make it more
verbose?
What else?
Greetings,
Hendrik
next prev parent reply other threads:[~2020-10-21 21:15 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-21 9:04 parent transid verify failed: Fixed but re-appearing Hendrik Friedel
2020-10-21 13:46 ` Zygo Blaxell
2020-10-21 18:19 ` Re[2]: " Hendrik Friedel
2020-10-21 19:32 ` Zygo Blaxell
2020-10-21 21:15 ` Hendrik Friedel [this message]
2020-10-21 21:22 ` Zygo Blaxell
2020-10-21 21:26 ` Re[2]: " Hendrik Friedel
2020-10-21 21:38 ` Zygo Blaxell
2020-10-22 19:30 ` Re[2]: " Hendrik Friedel
2020-11-03 18:19 ` Re[3]: " Hendrik Friedel
2020-11-03 19:40 ` Zygo Blaxell
2020-11-03 20:40 ` Re[2]: " Hendrik Friedel
2020-11-03 22:13 ` Amy Parker
2020-11-03 23:05 ` Zygo Blaxell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=em33511ef4-7da1-4e7c-8b0c-8b8d7043164c@desktop-g0r648m \
--to=hendrik@friedels.name \
--cc=ce3g8jdj@umail.furryterror.org \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).