* uncorrectable errors in Raid 10
@ 2017-11-19 19:31 Steffen Sindzinski
2017-11-20 2:03 ` Chris Murphy
0 siblings, 1 reply; 9+ messages in thread
From: Steffen Sindzinski @ 2017-11-19 19:31 UTC (permalink / raw)
To: linux-btrfs
Hello,
I have done a scrub on my Btrfs Raid 10 and have 2 uncorrectable errors.
In fact I cannot access 2 directories, even as root, permission is
denied and all directory attributes in ls -la are ????.
Before I have run this filesystem as Raid 1 with 3 disks without any
problems for more than a year. Scrubbed regularily. A month ago I added
a fouth HDD and balanced to Raid10. I am not sure if I did a scub
afterwards, but I usualy do. Now it found that errors. Smart status of
HDDs is healthy. The 2 directories were read-only for some years and not
even read in the last month.
What can I do now? Should I do a btrfs rescue? Which device, it is a
raid 10? Probably my files are OK, only the directories I cannot access.
How to recover the files?
Thanks in advance!
Steffen
Here is my data:
Linux bigbox 4.13.0-17-generic #20-Ubuntu SMP Mon Nov 6 10:04:08 UTC
2017 x86_64 x86_64 x86_64 GNU/Linux
btrfs-progs v4.12
Label: 'Videos' uuid: 4fafd0d4-7dd9-4dcc-9a33-5f1ad9555358
Total devices 4 FS bytes used 1.44TiB
devid 3 size 1.82TiB used 785.56GiB path /dev/sdc2
devid 4 size 1.82TiB used 785.56GiB path /dev/sde2
devid 5 size 1.82TiB used 785.56GiB path /dev/sdd2
devid 6 size 1.36TiB used 785.56GiB path /dev/sdf1
Data, RAID10: total=1.51TiB, used=1.43TiB
System, RAID10: total=128.00MiB, used=192.00KiB
Metadata, RAID10: total=21.00GiB, used=17.76GiB
GlobalReserve, single: total=512.00MiB, used=0.00B
% sudo btrfs scrub start -Bd /
scrub device /dev/sdc2 (id 3) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:27
total bytes scrubbed: 568.86GiB with 0 errors
scrub device /dev/sde2 (id 4) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:28
total bytes scrubbed: 702.88GiB with 1 errors
error details: verify=1
corrected errors: 0, uncorrectable errors: 1, unverified errors: 0
scrub device /dev/sdd2 (id 5) done
scrub started at Sun Nov 19 08:43:21 2017 and finished after 01:48:00
total bytes scrubbed: 737.74GiB with 1 errors
error details: verify=1
corrected errors: 0, uncorrectable errors: 1, unverified errors: 0
scrub device /dev/sdf1 (id 6) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:28
total bytes scrubbed: 506.02GiB with 0 errors
[ 4.985712] BTRFS: device label Videos devid 6 transid 772463 /dev/sdf1
[ 4.985882] BTRFS: device label Videos devid 4 transid 772463 /dev/sde2
[ 4.986541] BTRFS: device label Videos devid 5 transid 772463 /dev/sdd2
[ 4.986713] BTRFS: device label Videos devid 3 transid 772463 /dev/sdc2
[ 5.007986] BTRFS info (device sdc2): disk space caching is enabled
[ 5.007988] BTRFS info (device sdc2): has skinny extents
[ 5.149910] BTRFS info (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 3
[ 5.149916] BTRFS info (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 3
[ 25.482987] BTRFS info (device sdc2): use lzo compression
[ 25.482990] BTRFS info (device sdc2): disk space caching is enabled
[57830.611730] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[57830.611732] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[57830.611734] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 4
[57832.688689] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[57870.488081] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[57870.488083] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[57870.488085] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 4
[57870.500114] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[88979.005712] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[88979.005718] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[88979.005720] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 5
[88979.036670] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[89026.609561] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[89026.609563] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[89026.609566] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 5
[89026.617423] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[97765.699866] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[97765.699869] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[97765.699871] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 6
[97765.735887] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[97789.909036] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[97789.909039] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[97789.909041] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 6
[97789.918254] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[104087.660640] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[104087.660643] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[104087.660645] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 7
[104087.696248] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[104126.980483] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[104126.980486] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[104126.980488] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 7
[104126.995705] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
steffen@bigbox ~ % sudo btrfs device stats /
[/dev/sdc2].write_io_errs 0
[/dev/sdc2].read_io_errs 0
[/dev/sdc2].flush_io_errs 0
[/dev/sdc2].corruption_errs 0
[/dev/sdc2].generation_errs 0
[/dev/sde2].write_io_errs 0
[/dev/sde2].read_io_errs 0
[/dev/sde2].flush_io_errs 0
[/dev/sde2].corruption_errs 0
[/dev/sde2].generation_errs 7
[/dev/sdd2].write_io_errs 0
[/dev/sdd2].read_io_errs 0
[/dev/sdd2].flush_io_errs 0
[/dev/sdd2].corruption_errs 0
[/dev/sdd2].generation_errs 7
[/dev/sdf1].write_io_errs 0
[/dev/sdf1].read_io_errs 0
[/dev/sdf1].flush_io_errs 0
[/dev/sdf1].corruption_errs 0
[/dev/sdf1].generation_errs 0
^ permalink raw reply [flat|nested] 9+ messages in thread* Re: uncorrectable errors in Raid 10
2017-11-19 19:31 uncorrectable errors in Raid 10 Steffen Sindzinski
@ 2017-11-20 2:03 ` Chris Murphy
2017-11-20 19:28 ` Roy Sigurd Karlsbakk
[not found] ` <1c612777-77eb-1b83-101b-a9e0a53ee8be@gmail.com>
0 siblings, 2 replies; 9+ messages in thread
From: Chris Murphy @ 2017-11-20 2:03 UTC (permalink / raw)
To: Steffen Sindzinski; +Cc: Btrfs BTRFS
On Sun, Nov 19, 2017 at 12:31 PM, Steffen Sindzinski <stesind@gmail.com> wrote:
> [57830.611730] BTRFS warning (device sdc2): checksum/header error at logical
> 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf (level 0)
> in tree 318
> [57830.611732] BTRFS warning (device sdc2): checksum/header error at logical
> 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf (level 0)
> in tree 318
The same leaf is corrupt in the same physical sector on two devices.
I'm guessing the checksum was computed incorrectly and written twice,
affecting both copies. I doubt it's a device problem. It might be
useful to look through the archives specifically for checksum header
error, it's kinda interesting Btrfs knows the problem is specifically
there.
What do you get for:
btrfs-debut-tree -b 17478699876352 /dev/sdd2
I think the problem is isolated but you're probably best off to
freshen up backups now while you can. Yes you can use restore to get
data off the volume if you don't already have backups. And as for
btrfs check --repair, only do that once you have backups and you're
prepared to lose the file system.
--
Chris Murphy
^ permalink raw reply [flat|nested] 9+ messages in thread* Re: uncorrectable errors in Raid 10
2017-11-20 2:03 ` Chris Murphy
@ 2017-11-20 19:28 ` Roy Sigurd Karlsbakk
[not found] ` <1c612777-77eb-1b83-101b-a9e0a53ee8be@gmail.com>
1 sibling, 0 replies; 9+ messages in thread
From: Roy Sigurd Karlsbakk @ 2017-11-20 19:28 UTC (permalink / raw)
To: Chris Murphy; +Cc: Steffen Sindzinski, linux-btrfs
> On Sun, Nov 19, 2017 at 12:31 PM, Steffen Sindzinski <stesind@gmail.com> wrote:
>
>> [57830.611730] BTRFS warning (device sdc2): checksum/header error at logical
>> 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf (level 0)
>> in tree 318
>> [57830.611732] BTRFS warning (device sdc2): checksum/header error at logical
>> 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf (level 0)
>> in tree 318
>
> The same leaf is corrupt in the same physical sector on two devices.
> I'm guessing the checksum was computed incorrectly and written twice,
> affecting both copies. I doubt it's a device problem. It might be
> useful to look through the archives specifically for checksum header
> error, it's kinda interesting Btrfs knows the problem is specifically
> there.
Sounds reasonable. Perhaps a memory check on the machine would be a good idea_
roy
^ permalink raw reply [flat|nested] 9+ messages in thread[parent not found: <1c612777-77eb-1b83-101b-a9e0a53ee8be@gmail.com>]
* uncorrectable errors in Raid 10
@ 2017-11-19 16:21 Steffen Sindzinski
0 siblings, 0 replies; 9+ messages in thread
From: Steffen Sindzinski @ 2017-11-19 16:21 UTC (permalink / raw)
To: linux-btrfs
Hello,
I have done a scrub on my Btrfs Raid 10 and have 2 uncorrectable errors.
In fact I cannot access 2 directories, even as root, permission is
denied and all directory attributes in ls -la are ????.
Before I have run this filesystem as Raid 1 with 3 disks without any
problems for more than a year. Scrubbed regularily. A month ago I added
a fouth HDD and balanced to Raid10. I am not sure if I did a scub
afterwards, but I usualy do. Now it found that errors. Smart status of
HDDs is healthy. The 2 directories were read-only for some years and not
even read in the last month.
What can I do now? Should I do a btrfs rescue? Which device, it is a
raid 10? Probably my files are OK, only the directories I cannot access.
How to recover the files?
Thanks in advance!
Steffen
Here is my data:
Linux bigbox 4.13.0-17-generic #20-Ubuntu SMP Mon Nov 6 10:04:08 UTC
2017 x86_64 x86_64 x86_64 GNU/Linux
btrfs-progs v4.12
Label: 'Videos' uuid: 4fafd0d4-7dd9-4dcc-9a33-5f1ad9555358
Total devices 4 FS bytes used 1.44TiB
devid 3 size 1.82TiB used 785.56GiB path /dev/sdc2
devid 4 size 1.82TiB used 785.56GiB path /dev/sde2
devid 5 size 1.82TiB used 785.56GiB path /dev/sdd2
devid 6 size 1.36TiB used 785.56GiB path /dev/sdf1
Data, RAID10: total=1.51TiB, used=1.43TiB
System, RAID10: total=128.00MiB, used=192.00KiB
Metadata, RAID10: total=21.00GiB, used=17.76GiB
GlobalReserve, single: total=512.00MiB, used=0.00B
% sudo btrfs scrub start -Bd /
scrub device /dev/sdc2 (id 3) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:27
total bytes scrubbed: 568.86GiB with 0 errors
scrub device /dev/sde2 (id 4) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:28
total bytes scrubbed: 702.88GiB with 1 errors
error details: verify=1
corrected errors: 0, uncorrectable errors: 1, unverified errors: 0
scrub device /dev/sdd2 (id 5) done
scrub started at Sun Nov 19 08:43:21 2017 and finished after 01:48:00
total bytes scrubbed: 737.74GiB with 1 errors
error details: verify=1
corrected errors: 0, uncorrectable errors: 1, unverified errors: 0
scrub device /dev/sdf1 (id 6) canceled
scrub started at Sun Nov 19 08:43:21 2017 and was aborted after
01:58:28
total bytes scrubbed: 506.02GiB with 0 errors
[ 4.985712] BTRFS: device label Videos devid 6 transid 772463 /dev/sdf1
[ 4.985882] BTRFS: device label Videos devid 4 transid 772463 /dev/sde2
[ 4.986541] BTRFS: device label Videos devid 5 transid 772463 /dev/sdd2
[ 4.986713] BTRFS: device label Videos devid 3 transid 772463 /dev/sdc2
[ 5.007986] BTRFS info (device sdc2): disk space caching is enabled
[ 5.007988] BTRFS info (device sdc2): has skinny extents
[ 5.149910] BTRFS info (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 3
[ 5.149916] BTRFS info (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 3
[ 25.482987] BTRFS info (device sdc2): use lzo compression
[ 25.482990] BTRFS info (device sdc2): disk space caching is enabled
[57830.611730] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[57830.611732] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[57830.611734] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 4
[57832.688689] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[57870.488081] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[57870.488083] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[57870.488085] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 4
[57870.500114] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[88979.005712] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[88979.005718] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[88979.005720] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 5
[88979.036670] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[89026.609561] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[89026.609563] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[89026.609566] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 5
[89026.617423] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[97765.699866] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[97765.699869] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[97765.699871] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 6
[97765.735887] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[97789.909036] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[97789.909039] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[97789.909041] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 6
[97789.918254] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
[104087.660640] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[104087.660643] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sdd2, sector 338986176: metadata leaf
(level 0) in tree 318
[104087.660645] BTRFS error (device sdc2): bdev /dev/sdd2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 7
[104087.696248] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sdd2
[104126.980483] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[104126.980486] BTRFS warning (device sdc2): checksum/header error at
logical 17478699876352 on dev /dev/sde2, sector 348423360: metadata leaf
(level 0) in tree 318
[104126.980488] BTRFS error (device sdc2): bdev /dev/sde2 errs: wr 0, rd
0, flush 0, corrupt 0, gen 7
[104126.995705] BTRFS error (device sdc2): unable to fixup (regular)
error at logical 17478699876352 on dev /dev/sde2
steffen@bigbox ~ % sudo btrfs device stats /
[/dev/sdc2].write_io_errs 0
[/dev/sdc2].read_io_errs 0
[/dev/sdc2].flush_io_errs 0
[/dev/sdc2].corruption_errs 0
[/dev/sdc2].generation_errs 0
[/dev/sde2].write_io_errs 0
[/dev/sde2].read_io_errs 0
[/dev/sde2].flush_io_errs 0
[/dev/sde2].corruption_errs 0
[/dev/sde2].generation_errs 7
[/dev/sdd2].write_io_errs 0
[/dev/sdd2].read_io_errs 0
[/dev/sdd2].flush_io_errs 0
[/dev/sdd2].corruption_errs 0
[/dev/sdd2].generation_errs 7
[/dev/sdf1].write_io_errs 0
[/dev/sdf1].read_io_errs 0
[/dev/sdf1].flush_io_errs 0
[/dev/sdf1].corruption_errs 0
[/dev/sdf1].generation_errs 0
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2017-11-23 14:05 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-11-19 19:31 uncorrectable errors in Raid 10 Steffen Sindzinski
2017-11-20 2:03 ` Chris Murphy
2017-11-20 19:28 ` Roy Sigurd Karlsbakk
[not found] ` <1c612777-77eb-1b83-101b-a9e0a53ee8be@gmail.com>
[not found] ` <CAJCQCtRwO5E_xN-7u22uBQ57sYpUf8qQ7O0Abz2m+K8e9+DgdA@mail.gmail.com>
[not found] ` <384b4a21-e74f-c52e-c6e2-f7930f187c94@gmail.com>
2017-11-22 16:42 ` Chris Murphy
2017-11-23 12:02 ` Steffen Sindzinski
2017-11-23 12:14 ` Qu Wenruo
2017-11-23 14:05 ` Steffen Sindzinski
2017-11-23 0:41 ` Qu Wenruo
-- strict thread matches above, loose matches on Subject: below --
2017-11-19 16:21 Steffen Sindzinski
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).