linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Phil Turmel <philip@turmel.org>
To: Mark Kolama <indigo8472@gmail.com>, linux-raid@vger.kernel.org
Subject: Re: 4 disks outage in RAID6
Date: Mon, 22 Dec 2014 11:23:59 -0500	[thread overview]
Message-ID: <5498459F.1050600@turmel.org> (raw)
In-Reply-To: <CAMvZ8g5dv-eXrgTAr58ib06LM+NhzbY7hkmFxaH7V71iL8Eg+w@mail.gmail.com>

Good morning Mark,

On 12/20/2014 12:03 PM, Mark Kolama wrote:
> Dear List,
> 
> due to a controller failure, a raid6 with 16 drives
> lost 4 drives at once. The failure was noticed a few days later.
> 
> a examine output of all 16 drive is listed at
> http://pastebin.com/4WH9xp7K

Ok.  In the future, paste these in your email.  kernel.org has a
generous size limit and this sort of stuff should stay in the archives.
 As long as posters trim replies appropriately, it's not a problem.

> As you can see the event count differs on 4 drives with
> about 150 comparing to the other 12 drives.
> 
> I have already tried:
> mdadm --assemble --scan:
> assembled from 12 drives - not enough to start the array.
> 
> Then i tried:
> mdadm --assemble /dev/md0 /dev/sda1 /dev/sdb1 /dev/sdc1 /dev/sdd1
> /dev/sde1 /dev/sdf1 /dev/sdg1 /dev/sdh1 /dev/sdi1 /dev/sdj1 /dev/sdk1
> /dev/sdl1 /dev/sdm1 /dev/sdn1 /dev/sdo1 /dev/sdp1 --force

This should have worked, unless the array wasn't stopped first.  You
didn't show the actual response from mdadm, so we don't know.

There have also been bugs in various assembly features, so a report of
your kernel version and your mdadm version would be appropriate.

> /proc/mdstat after that:
> 
> Personalities : [raid6] [raid5] [raid4]
> md0 : inactive sda1[0](S) sdp1[15](S) sdo1[14](S) sdn1[13](S)
> sdm1[12](S) sdl1[11](S) sdk1[10](S) sdj1[9](S) sdi1[8](S) sdh1[7](S)
> sdg1[6](S) sdf1[5](S) sde1[4](S) sdd1[3](S) sdc1[2](S) sdb1[1](S)
>       62353932288 blocks super 1.2
> 
> No success either.

Try --force again, like so:

mdadm --stop /dev/md0

mdadm -Avf /dev/md0 /dev/sd[a-p]1

Show all of the output.  Also show the tail of dmesg where this
operation happens.

> So the next try would be recreating the array ?

Absolutely not.

Phil


      reply	other threads:[~2014-12-22 16:23 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-12-20 17:03 4 disks outage in RAID6 Mark Kolama
2014-12-22 16:23 ` Phil Turmel [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5498459F.1050600@turmel.org \
    --to=philip@turmel.org \
    --cc=indigo8472@gmail.com \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).