From: Bogo Mipps <bogo.mipps@gmail.com>
To: Peter Grandi <pg@lxra2.for.sabi.co.UK>,
Linux Raid <linux-raid@vger.kernel.org>
Subject: Re: Advice please re failed Raid6
Date: Sun, 23 Jul 2017 12:13:47 +1200 [thread overview]
Message-ID: <8a03a92a-305e-806a-7af0-b1564eba7338@gmail.com> (raw)
In-Reply-To: <22897.52696.164078.23536@tree.ty.sabi.co.uk>
On 07/21/2017 09:48 PM, Peter Grandi wrote:
>>> Tried different order: sde, sdc, sdd and blkid worked.
>
> It is not clear what "blkid worked" means here. It should have
> reported an 'ext4' filesystem.
>
>>> Added sdb as you suggested.
>
> I actually wrote: "try a different order or 3-way subset of
> 'sd[bcde]'." Perhaps "3-way subset" was not clear. Only when the
> right subset in the right order were found adding a fourth
> member was worth it.
>
> Also it matter enormously whether "Added sdb" was done after
> recreating the set with four members with 'missing' or just 3.
> It is not clear what you have done.
>
> Also I had written: "not clear to me whether the 'mdadm' daemon
> instance triggered a 'check' or a 'repair'" and you seem to have
> not looked into that.
>
> Also I had written: "I hope that you disabled that in the
> meantime" and it is not clear whether you have done so.
>
> Also I had written: "Trigger a 'check' and see if the set is
> consistent", and I have no idea whether that happened and what
> the result was.
>
> Your actions and reports seem to be somewhat lackadaisical and
> distracted as to what is a quite subtle situation.
>
>>> Currently rebuilding.
>
> Adding back 'sdb' and rebuilding: you can leave that to the
> point where you have found the right order. Also before adding
> 'sdb' you would have used 'wipefs'/'mdadm --zero' it, I hope.
>
>> Peter, here is where I come unstuck. Where to from here?
>> Raid6 has rebuilt, apparently successfully, but I can't mount.
>
> It's difficult to say, because it is not clear what is going on,
> because if the right order of members is (sdb sde sdc sdd) the
> original output of 'mdadm --examine' is not consistent with that.
>
> The issue here continues to be what is the right order of the
> devices as members, and I am not sure that you know which
> devices are which. I don't know how accurate are your reports
> as to what happened and as to what you are doing.
>
>> [29458.547989] disk 0, o:1, dev:sde
>> [29458.547995] disk 1, o:1, dev:sdc
>> [29458.548001] disk 2, o:1, dev:sdd
>> [29458.548007] disk 3, o:1, dev:sdb
>
> To me it seems pretty unlikely that 'sdb' would be member 3, but
> again given your conflicting information as to past and current
> actions, I cannot guess what is really going on.
>
> But then your situation should be pretty easy: according to your
> reports, you have a set of 4 devices in RAID6, which means that
> any 2 devices of the 4 are sufficient to make the set work. The
> only problem is knowing in which positions.
>
> For the first stripe, the first 512KiB on each drive, the layout
> will be:
>
> member 0: the first 512KiB of the 'ext4', with the superblock.
> member 1: the second 512KiB of the 'ext4', with a distinctive layout.
> member 2: 512KiB of P (XOR parity), looking like gibberish.
> member 3: 512KiB of Q (syndrome), looking like gibberish.
>
> It might be interesting to see the output of:
>
> for D in c d e
> do
> echo
> echo "*** $D"
> blkid /dev/sd$D
> dd bs=512K count=1 if=/dev/sd$D | file -
> dd bs=512K count=1 if=/dev/sd$D | strings -a
> done
Peter, thank you for your detailed response. Much appreciated. My major
regret is not coming to this list earlier. I only discovered, far too
late, that I should have taken expert advice before I attempted any
remedial work. Too much erroneous information flying around the 'net.
I will now carefully follow your suggestions as above and report back in
a couple of days. The data on this Raid set is irreplaceable, and I want
to do everything I can to regain access.
Regards.
prev parent reply other threads:[~2017-07-23 0:13 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-07-15 23:40 Advice please re failed Raid6 Bogo Mipps
2017-07-16 0:58 ` Roman Mamedov
2017-07-17 0:19 ` Peter Grandi
2017-07-19 1:52 ` Bogo Mipps
2017-07-19 12:36 ` Peter Grandi
2017-07-20 3:59 ` Bogo Mipps
[not found] ` <cf9aac00-91b3-3cb5-bceb-df5d7113b933@gmail.com>
2017-07-21 0:44 ` Bogo Mipps
2017-07-21 9:48 ` Peter Grandi
2017-07-23 0:13 ` Bogo Mipps [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8a03a92a-305e-806a-7af0-b1564eba7338@gmail.com \
--to=bogo.mipps@gmail.com \
--cc=linux-raid@vger.kernel.org \
--cc=pg@lxra2.for.sabi.co.UK \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox