Lost raid 5 volume

All of lore.kernel.org
 help / color / mirror / Atom feed

* Lost raid 5 volume
@ 2014-12-13  0:48 Neil .
  2014-12-14  0:06 ` Emery Guevremont
  0 siblings, 1 reply; 5+ messages in thread
From: Neil . @ 2014-12-13  0:48 UTC (permalink / raw)
  To: linux-raid

I am looking for some help in trying to recover a raid 5 volume. Is
this the right place? What data should I provide to get the ball
rolling?

The back story is its 4 x 6tb drives in raid 5. Was created on a WD
EX4 NAS box. The NAS reported 2 simultaneous drive failures on wake up
from standby mode. On reboot both drives appear fine. No SMART errors
etc but the raid 5 volume is gone.

It's the second time the NAS box has lost data like this. Previous
time it was a mix of 2 and 3 tb drives with a 6 tb raid 5 and a 1 tb
mirror. mdadm on the nas is from 2006 (don't have the exact version to
hand).

I was thinking to put the drives into a PC, boot to a recent distro on
usb stick and see if I can recover the data there. Then setup it up
more or less permanently (and ditch the NAS). Not sure of the best way
to proceed.

I'd be really grateful for any assistance or guidance.

Cheers

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Lost raid 5 volume
  2014-12-13  0:48 Lost raid 5 volume Neil .
@ 2014-12-14  0:06 ` Emery Guevremont
  2014-12-14  4:30   ` Neil .
  0 siblings, 1 reply; 5+ messages in thread
From: Emery Guevremont @ 2014-12-14  0:06 UTC (permalink / raw)
  To: Neil .; +Cc: linux-raid

Stop your raid with mdadm -S /dev/md0 or whatever your raid device name is.

Probably the safest thing to do is clone your drives with ddrescue.
You might also want to view your S.M.A.R.T. log with smarctl -a
/dev/sda or which ever is your device name.

After, what you'd need to start doing is taking a backup of mdadm
--examine /dev/sda1 or which ever partition is your partition used for
raid. This will give us info on you md superblock. Post this info and
from there we'll be able to see how everything is setup and have a
better idea of your current situation.

On Fri, Dec 12, 2014 at 7:48 PM, Neil . <neil.perrie@gmail.com> wrote:
> I am looking for some help in trying to recover a raid 5 volume. Is
> this the right place? What data should I provide to get the ball
> rolling?
>
> The back story is its 4 x 6tb drives in raid 5. Was created on a WD
> EX4 NAS box. The NAS reported 2 simultaneous drive failures on wake up
> from standby mode. On reboot both drives appear fine. No SMART errors
> etc but the raid 5 volume is gone.
>
> It's the second time the NAS box has lost data like this. Previous
> time it was a mix of 2 and 3 tb drives with a 6 tb raid 5 and a 1 tb
> mirror. mdadm on the nas is from 2006 (don't have the exact version to
> hand).
>
> I was thinking to put the drives into a PC, boot to a recent distro on
> usb stick and see if I can recover the data there. Then setup it up
> more or less permanently (and ditch the NAS). Not sure of the best way
> to proceed.
>
> I'd be really grateful for any assistance or guidance.
>
> Cheers
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Lost raid 5 volume
  2014-12-14  0:06 ` Emery Guevremont
@ 2014-12-14  4:30   ` Neil .
  2014-12-14 16:23     ` Phil Turmel
       [not found]     ` <CAB_L8sYNbdDnWCDwiyAmH+73J67o6dBy0G3QBsYTsPA-my6atg@mail.gmail.com>
  0 siblings, 2 replies; 5+ messages in thread
From: Neil . @ 2014-12-14  4:30 UTC (permalink / raw)
  To: Emery Guevremont; +Cc: linux-raid

smartctl says Overall health...: passed for all drives.

This is interesting(?)

 mdadm --examine for sda2 and b2 list:
 Array State as : AAAA

But sdc2 and d2 list it as ..AA

root@sysresccd /mnt % mdadm --examine /dev/sda2
/dev/sda2:
          Magic : a92b4efc
        Version : 1.0
    Feature Map : 0x1
     Array UUID : cf0bf1b9:f57b96be:8c749fcb:cea10311
           Name : 'localhost.localdomain':1
  Creation Time : Sat Oct 25 16:11:56 2014
     Raid Level : raid5
   Raid Devices : 4

 Avail Dev Size : 11712653168 (5585.03 GiB 5996.88 GB)
     Array Size : 17568979392 (16755.08 GiB 17990.63 GB)
  Used Dev Size : 11712652928 (5585.03 GiB 5996.88 GB)
   Super Offset : 11712653296 sectors
   Unused Space : before=0 sectors, after=368 sectors
          State : clean
    Device UUID : 0953e84c:a25760b0:28a20bab:bd1dc41b

Internal Bitmap : 2 sectors from superblock
    Update Time : Sat Dec  6 14:00:02 2014
       Checksum : dfe27505 - correct
         Events : 3

         Layout : left-symmetric
     Chunk Size : 64K

   Device Role : Active device 0
   Array State : AAAA ('A' == active, '.' == missing, 'R' == replacing)
root@sysresccd /mnt % mdadm --examine /dev/sdb2
/dev/sdb2:
          Magic : a92b4efc
        Version : 1.0
    Feature Map : 0x1
     Array UUID : cf0bf1b9:f57b96be:8c749fcb:cea10311
           Name : 'localhost.localdomain':1
  Creation Time : Sat Oct 25 16:11:56 2014
     Raid Level : raid5
   Raid Devices : 4

 Avail Dev Size : 11712653168 (5585.03 GiB 5996.88 GB)
     Array Size : 17568979392 (16755.08 GiB 17990.63 GB)
  Used Dev Size : 11712652928 (5585.03 GiB 5996.88 GB)
   Super Offset : 11712653296 sectors
   Unused Space : before=0 sectors, after=368 sectors
          State : clean
    Device UUID : d96a56d4:c5ac346a:24765692:501f6f22

Internal Bitmap : 2 sectors from superblock
    Update Time : Sat Dec  6 14:00:02 2014
       Checksum : f1ab789 - correct
         Events : 3

         Layout : left-symmetric
     Chunk Size : 64K

   Device Role : Active device 1
   Array State : AAAA ('A' == active, '.' == missing, 'R' == replacing)


root@sysresccd /mnt % mdadm --examine /dev/sdc2
/dev/sdc2:
          Magic : a92b4efc
        Version : 1.0
    Feature Map : 0x1
     Array UUID : cf0bf1b9:f57b96be:8c749fcb:cea10311
           Name : 'localhost.localdomain':1
  Creation Time : Sat Oct 25 16:11:56 2014
     Raid Level : raid5
   Raid Devices : 4

 Avail Dev Size : 11712653168 (5585.03 GiB 5996.88 GB)
     Array Size : 17568979392 (16755.08 GiB 17990.63 GB)
  Used Dev Size : 11712652928 (5585.03 GiB 5996.88 GB)
   Super Offset : 11712653296 sectors
   Unused Space : before=0 sectors, after=368 sectors
          State : clean
    Device UUID : 17d61cf2:2c0c4765:cb4c478c:4828aefc

Internal Bitmap : 2 sectors from superblock
    Update Time : Sun Dec  7 11:18:06 2014
       Checksum : fc228d5e - correct
         Events : 8

         Layout : left-symmetric
     Chunk Size : 64K

   Device Role : Active device 2
   Array State : ..AA ('A' == active, '.' == missing, 'R' == replacing)


root@sysresccd /mnt % mdadm --examine /dev/sdd2
/dev/sdd2:
          Magic : a92b4efc
        Version : 1.0
    Feature Map : 0x1
     Array UUID : cf0bf1b9:f57b96be:8c749fcb:cea10311
           Name : 'localhost.localdomain':1
  Creation Time : Sat Oct 25 16:11:56 2014
     Raid Level : raid5
   Raid Devices : 4

 Avail Dev Size : 11712653168 (5585.03 GiB 5996.88 GB)
     Array Size : 17568979392 (16755.08 GiB 17990.63 GB)
  Used Dev Size : 11712652928 (5585.03 GiB 5996.88 GB)
   Super Offset : 11712653296 sectors
   Unused Space : before=0 sectors, after=368 sectors
          State : clean
    Device UUID : 306bde8a:3beebe54:52e6acd2:f7681367

Internal Bitmap : 2 sectors from superblock
    Update Time : Sun Dec  7 11:18:06 2014
       Checksum : 3526df7d - correct
         Events : 8

         Layout : left-symmetric
     Chunk Size : 64K

   Device Role : Active device 3
   Array State : ..AA ('A' == active, '.' == missing, 'R' == replacing)






On 14 December 2014 at 00:06, Emery Guevremont
<emery.guevremont@gmail.com> wrote:
> Stop your raid with mdadm -S /dev/md0 or whatever your raid device name is.
>
> Probably the safest thing to do is clone your drives with ddrescue.
> You might also want to view your S.M.A.R.T. log with smarctl -a
> /dev/sda or which ever is your device name.
>
> After, what you'd need to start doing is taking a backup of mdadm
> --examine /dev/sda1 or which ever partition is your partition used for
> raid. This will give us info on you md superblock. Post this info and
> from there we'll be able to see how everything is setup and have a
> better idea of your current situation.
>
> On Fri, Dec 12, 2014 at 7:48 PM, Neil . <neil.perrie@gmail.com> wrote:
>> I am looking for some help in trying to recover a raid 5 volume. Is
>> this the right place? What data should I provide to get the ball
>> rolling?
>>
-snip-

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Lost raid 5 volume
  2014-12-14  4:30   ` Neil .
@ 2014-12-14 16:23     ` Phil Turmel
       [not found]     ` <CAB_L8sYNbdDnWCDwiyAmH+73J67o6dBy0G3QBsYTsPA-my6atg@mail.gmail.com>
  1 sibling, 0 replies; 5+ messages in thread
From: Phil Turmel @ 2014-12-14 16:23 UTC (permalink / raw)
  To: Neil ., Emery Guevremont; +Cc: linux-raid

Good morning Neil,

[Etiquette on kernel.org is to trim replies and either bottom-post or
interleave.]

On 12/13/2014 11:30 PM, Neil . wrote:
> smartctl says Overall health...: passed for all drives.

This is good, but not relevant.  Lots of desktop drives will report this
even though they are part of the problem.  Please show the complete
output of "smartctl -x" for the two troublesome drives.  Maybe also for
the other two.

>     Update Time : Sat Dec  6 14:00:02 2014
>          Events : 3

>     Update Time : Sat Dec  6 14:00:02 2014
>          Events : 3

>     Update Time : Sun Dec  7 11:18:06 2014
>          Events : 8

>     Update Time : Sun Dec  7 11:18:06 2014
>          Events : 8

This looks strange.  It suggests that the two drives failed well before
your reboot.  If you have dmesg from Saturday afternoon, that might be
enlightening.

However, as a raid 5, you are stuck unless you include at least one of
the two stale drives.  The correct tool for this is forced assembly:

mdadm --assemble --force --verbose /dev/mdX /dev/sd[abcd]2

If it fails, show its output.

Whether it fails or not, you need to investigate why the drives were
dropped.  Simultaneous drops suggest a hardware problem.

Phil

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Lost raid 5 volume
       [not found]             ` <CAB_L8saPqvR8HjqBFUvK+10H4Nou_yOrby5iPFkeiSiNj=RSww@mail.gmail.com>
@ 2014-12-15 13:37               ` Neil .
  0 siblings, 0 replies; 5+ messages in thread
From: Neil . @ 2014-12-15 13:37 UTC (permalink / raw)
  To: Emery Guevremont; +Cc: linux-raid

On 15 December 2014 at 11:30, Emery Guevremont
<emery.guevremont@gmail.com> wrote:
> it's a good sign.
>
> mont -o ro,noload /dev/md0 /mnt
>
> and backup.

Emery! Phil! Thank you so much!

It mounted and I can get to the data. Really cool.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2014-12-15 13:37 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-12-13  0:48 Lost raid 5 volume Neil .
2014-12-14  0:06 ` Emery Guevremont
2014-12-14  4:30   ` Neil .
2014-12-14 16:23     ` Phil Turmel
     [not found]     ` <CAB_L8sYNbdDnWCDwiyAmH+73J67o6dBy0G3QBsYTsPA-my6atg@mail.gmail.com>
     [not found]       ` <CADhQzurRYoJJcM2yephWeyyEdTy8XsRROZP400i7UdYPeYXTzA@mail.gmail.com>
     [not found]         ` <CAB_L8sbo3SDUjch-kqFi3wZduQ2tcTSvs4MSabSLYodg7duahw@mail.gmail.com>
     [not found]           ` <CADhQzuos53RXQZpxepWG7v6dAc9EMseDYx4PbuXjj+4nSnCkJQ@mail.gmail.com>
     [not found]             ` <CAB_L8saPqvR8HjqBFUvK+10H4Nou_yOrby5iPFkeiSiNj=RSww@mail.gmail.com>
2014-12-15 13:37               ` Neil .

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.