linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Zygo Blaxell <ce3g8jdj@umail.furryterror.org>
To: Hendrik Friedel <hendrik@friedels.name>
Cc: Btrfs BTRFS <linux-btrfs@vger.kernel.org>
Subject: Re: parent transid verify failed: Fixed but re-appearing
Date: Wed, 21 Oct 2020 17:22:29 -0400	[thread overview]
Message-ID: <20201021212229.GF21815@hungrycats.org> (raw)
In-Reply-To: <em33511ef4-7da1-4e7c-8b0c-8b8d7043164c@desktop-g0r648m>

On Wed, Oct 21, 2020 at 09:15:06PM +0000, Hendrik Friedel wrote:
> > That would mean either that recovery already happened (if the corruption
> > was on disk and has been removed), or the problem never reached a disk
> > (if it is only in memory).
> Ok.
> Good news.
> 
> But it's still there:
> dduper --device /dev/sda1 --dir
> /srv/dev-disk-by-label-DataPool1/dduper_test/testfiles -r --dry-run
> parent transid verify failed on 8333716668416 wanted 357026 found 357028
> parent transid verify failed on 8333716668416 wanted 357026 found 357028
> parent transid verify failed on 8333716668416 wanted 357026 found 357028

> > >  > only one of your disks is silently dropping writes.  In that case
> > >  > btrfs will recover from ptvf by replacing the missing block from the
> > >  > other drive.  But there are no scrub errors or kernel messages related
> > >  > to this, so there aren't any symptoms of that happening, so it seems
> > >  > these ptvf are not coming from the disk.
> > >  And can this be confirmed somehow? Would be good to replace that disk
> > >  then...
> > 
> > These normally appear in 'btrfs dev stats', although there are various
> > coverage gaps with raid5/6 and (until recently) compressed data blocks.
> I do not use raid5/6 and I think I do not use compressed data.
> 
> btrfs dev stats /dev/sda1
> [/dev/sda1].write_io_errs    0
> [/dev/sda1].read_io_errs     0
> [/dev/sda1].flush_io_errs    0
> [/dev/sda1].corruption_errs  0
> [/dev/sda1].generation_errs  1
> 
> So, what does the generation_errs tell us?

Try btrfs dev stats on the filesystem mount point instead of the device.
We want to see if there are generation errors on both disks or just one.

This indicates there was a parent transid verify failure observed on
that disk in the past.

If there is one on the other drive too then the filesystem may be broken,
though I'm not sure how you're getting scrub to work in that case.

It's also possible you're hitting some other kernel bug, possibly a new
one.

> > 'btrfs scrub status -d' will give a per-drive error breakdown.
> 
> btrfs scrub status -d /dev/sda1
> scrub status for c4a6a2c9-5cf0-49b8-812a-0784953f9ba3
> scrub device /dev/sda1 (id 1) history
>         scrub started at Mon Oct 19 21:07:13 2020 and finished after
> 15:11:10
>         total bytes scrubbed: 6.56TiB with 0 errors
> 
> btrfs scrub status -d /dev/sdj1
> scrub status for c4a6a2c9-5cf0-49b8-812a-0784953f9ba3
> scrub device /dev/sdj1 (id 2) history
>         scrub started at Mon Oct 19 21:07:13 2020 and finished after
> 17:06:15
>         total bytes scrubbed: 6.56TiB with 0 errors
> 
> 
> > I haven't seen spurious ptvf errors on my test machines since the 5.4.30s,
> > but 5.4 has a lot of fixes that between-LTS kernels like 5.6 do not
> > always get.
> 
> Ok, I am compiling 5.9.1 now.
> 
> Apart from switching to the latest Kernel - what next step do you recommend?
> I tend to run a scrub again. Is it possible/useful to make it more verbose?
> What else?

scrub already reports pretty much everything it finds.  'btrfs scrub
start -Bd' will present a per-disk error count at the end.

> Greetings,
> Hendrik
> 
> 

  reply	other threads:[~2020-10-21 21:22 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-10-21  9:04 parent transid verify failed: Fixed but re-appearing Hendrik Friedel
2020-10-21 13:46 ` Zygo Blaxell
2020-10-21 18:19   ` Re[2]: " Hendrik Friedel
2020-10-21 19:32     ` Zygo Blaxell
2020-10-21 21:15       ` Re[2]: " Hendrik Friedel
2020-10-21 21:22         ` Zygo Blaxell [this message]
2020-10-21 21:26           ` Hendrik Friedel
2020-10-21 21:38             ` Zygo Blaxell
2020-10-22 19:30               ` Re[2]: " Hendrik Friedel
2020-11-03 18:19                 ` Re[3]: " Hendrik Friedel
2020-11-03 19:40                   ` Zygo Blaxell
2020-11-03 20:40                     ` Re[2]: " Hendrik Friedel
2020-11-03 22:13                       ` Amy Parker
2020-11-03 23:05                       ` Zygo Blaxell

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20201021212229.GF21815@hungrycats.org \
    --to=ce3g8jdj@umail.furryterror.org \
    --cc=hendrik@friedels.name \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).