From: "Austin S. Hemmelgarn" <ahferroin7@gmail.com>
To: Chris Murphy <lists@colorremedies.com>,
Nick Austin <nick@smartaustin.com>
Cc: Btrfs BTRFS <linux-btrfs@vger.kernel.org>
Subject: Re: Strange behavior when replacing device on BTRFS RAID 5 array.
Date: Mon, 27 Jun 2016 13:37:15 -0400 [thread overview]
Message-ID: <016f057b-b7a1-cccf-ca8a-cfe0e1d4341a@gmail.com> (raw)
In-Reply-To: <CAJCQCtTY-vsZi_QmQ7gSpzkQfC9oPfPU1uqvf8madwDRPsCDJw@mail.gmail.com>
On 2016-06-27 13:29, Chris Murphy wrote:
> On Sun, Jun 26, 2016 at 10:02 PM, Nick Austin <nick@smartaustin.com> wrote:
>> On Sun, Jun 26, 2016 at 8:57 PM, Nick Austin <nick@smartaustin.com> wrote:
>>> sudo btrfs fi show /mnt/newdata
>>> Label: '/var/data' uuid: e4a2eb77-956e-447a-875e-4f6595a5d3ec
>>> Total devices 4 FS bytes used 8.07TiB
>>> devid 1 size 5.46TiB used 2.70TiB path /dev/sdg
>>> devid 2 size 5.46TiB used 2.70TiB path /dev/sdl
>>> devid 3 size 5.46TiB used 2.70TiB path /dev/sdm
>>> devid 4 size 5.46TiB used 2.70TiB path /dev/sdx
>>
>> It looks like fi show has bad data:
>>
>> When I start heavy IO on the filesystem (running rsync -c to verify the data),
>> I notice zero IO on the bad drive I told btrfs to replace, and lots of IO to the
>> expected replacement.
>>
>> I guess some metadata is messed up somewhere?
>>
>> avg-cpu: %user %nice %system %iowait %steal %idle
>> 25.19 0.00 7.81 28.46 0.00 38.54
>>
>> Device: tps kB_read/s kB_wrtn/s kB_read kB_wrtn
>> sdg 437.00 75168.00 1792.00 75168 1792
>> sdl 443.00 76064.00 1792.00 76064 1792
>> sdm 438.00 75232.00 1472.00 75232 1472
>> sdw 443.00 75680.00 1856.00 75680 1856
>> sdx 0.00 0.00 0.00 0 0
>
> There's reported some bugs with 'btrfs replace' and raid56, but I
> don't know the exact nature of those bugs, when or how they manifest.
> It's recommended to fallback to use 'btrfs add' and then 'btrfs
> delete' but you have other issues going on also.
One other thing to mention, if the device is failing, _always_ add '-r'
to the replace command line. This will tell it to avoid reading from
the device being replaced (in raid1 or raid10 mode, it will pull from
the other mirror, in raid5/6 mode, it will recompute the block from
parity and compare to the stored checksums (which in turn means that
this _will_ be slower on raid5/6 than regular repalce)). Link resets
and other issues that cause devices to disappear become more common the
more damaged a disk is, so avoiding reading from it becomes more
important too, because just reading from a disk puts stress on it.
next prev parent reply other threads:[~2016-06-27 17:37 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-06-27 3:57 Strange behavior when replacing device on BTRFS RAID 5 array Nick Austin
2016-06-27 4:02 ` Nick Austin
2016-06-27 17:29 ` Chris Murphy
2016-06-27 17:37 ` Austin S. Hemmelgarn [this message]
2016-06-27 17:46 ` Chris Murphy
2016-06-27 22:29 ` Steven Haigh
2016-06-27 21:12 ` Duncan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=016f057b-b7a1-cccf-ca8a-cfe0e1d4341a@gmail.com \
--to=ahferroin7@gmail.com \
--cc=linux-btrfs@vger.kernel.org \
--cc=lists@colorremedies.com \
--cc=nick@smartaustin.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).