Linux RAID subsystem development
 help / color / mirror / Atom feed
From: pg@lxraid2.for.sabi.co.UK (Peter Grandi)
To: Linux RAID <linux-raid@vger.kernel.org>
Subject: Re: Advice please re failed Raid6
Date: Mon, 17 Jul 2017 01:19:21 +0100	[thread overview]
Message-ID: <22892.649.826246.644975@tree.ty.sabi.co.uk> (raw)
In-Reply-To: <9dca5b7a-b60e-0e93-41fd-49d092d8b27b@gmail.com>

> mdadm --create --assume-clean [ ... ]

That's a very dangerous recovery method that you need to get
exactly right or it will cause trouble.

Also it should be used in very rare cases, not routinely to
recover from one missing disk.

> root@keruru:/var/log# mdadm --examine /dev/sd[bedc] >> raid.status
> root@keruru:/var/log# cat raid.status 
> /dev/sdb:
>      Array UUID : b1e6af5d:e5848ebe:63727445:2ab99719
>      Array Size : 3906765824 (3725.78 GiB 4000.53 GB)
>   Used Dev Size : 3906765824 (1862.89 GiB 2000.26 GB)
>     Data Offset : 262144 sectors
>     Device UUID : 79e4933f:dfe5923f:5ba03ae7:3efe38eb
>          Events : 119
>      Chunk Size : 512K
>    Device Role : Active device 0
>    Array State : AAAA ('A' == active, '.' == missing)
> /dev/sdc:
>      Array UUID : b1e6af5d:e5848ebe:63727445:2ab99719
>      Array Size : 3906765824 (3725.78 GiB 4000.53 GB)
>   Used Dev Size : 3906765824 (1862.89 GiB 2000.26 GB)
>     Data Offset : 262144 sectors
>     Device UUID : f1e1a946:711886a6:2604780f:8eba4a2d
>          Events : 119
>      Chunk Size : 512K
>    Device Role : Active device 1
> /dev/sdd:
>      Array UUID : b1e6af5d:e5848ebe:63727445:2ab99719
>      Array Size : 3906765824 (3725.78 GiB 4000.53 GB)
>   Used Dev Size : 3906765824 (1862.89 GiB 2000.26 GB)
>     Data Offset : 262144 sectors
>     Device UUID : cf3bc8a7:9feed87d:945d8e77:08f7f32d
>          Events : 119
>      Chunk Size : 512K
>    Device Role : Active device 2
> /dev/sde:
>      Array UUID : b1e6af5d:e5848ebe:63727445:2ab99719
>      Array Size : 3906765824 (3725.78 GiB 4000.53 GB)
>   Used Dev Size : 3906765824 (1862.89 GiB 2000.26 GB)
>     Data Offset : 262144 sectors
>          Events : 119
>      Chunk Size : 512K
>    Device Role : Active device 3

That AAAA meant that the array was fine. The important field of
'--examine' confirm that. The "dirty degraded array" meant most
likely some slight event count difference, usually one just
forces assembly in that case.

The line that worries a bit is this:

  Jul 5 21:06:18 keruru mdadm[2497]: RebuildFinished event
  detected on md  device /dev/md0, component device mismatches
  found: 1847058224 (on raid  level 6)

That seems to indicate that pretty much every block was a
mismatch. Which would have happened if you put in a blank drive
and then used '--create --assume-clean' instead of '--assemble
--force'. But '--assume-clean" explicitly skips a rebuild, and I
wonder whether you omitted that yoiu have triggered a "repair"
in 'sync_action'. Also the message is reported by 'mdadm', and
it may be that 'mdadm' was running in daemon mode and triggering
a periodic "repair". I can't remember the defaults.

HOWEVER there is a very subtle detail: the order of the devices
from '--examine' is: 0: 'sdb', 1: 'sdc', 2: 'sdd', 3: 'sde' but
you recreated the set in a different order. The order of the
devices does not matter if they have MD superblocks,.but here
you are using '--create' to make new superblocks, and they order
must match exactly the original order.

> root@keruru:/var/log# mdadm --create --assume-clean --level=6 --raid-devices=4 --size=1953382912 /dev/md0 /dev/sdb /dev/sde /dev/sdd /dev/sdc

Probably the best thing you can do is to rerun this with members
"missing /dev/sdc /dev/sdd /dev/sde". and then use 'blkid /dev/md0'
to check whether the data in it is recognized again. If so add
'/dev/sdb'. I did a quick test here of something close to that
and it worked...

  parent reply	other threads:[~2017-07-17  0:19 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-07-15 23:40 Advice please re failed Raid6 Bogo Mipps
2017-07-16  0:58 ` Roman Mamedov
2017-07-17  0:19 ` Peter Grandi [this message]
2017-07-19  1:52   ` Bogo Mipps
2017-07-19 12:36     ` Peter Grandi
2017-07-20  3:59       ` Bogo Mipps
     [not found]       ` <cf9aac00-91b3-3cb5-bceb-df5d7113b933@gmail.com>
2017-07-21  0:44         ` Bogo Mipps
2017-07-21  9:48           ` Peter Grandi
2017-07-23  0:13             ` Bogo Mipps

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=22892.649.826246.644975@tree.ty.sabi.co.uk \
    --to=pg@lxraid2.for.sabi.co.uk \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox