* differnet UUIDs and no of spares :(
@ 2009-01-14 17:26 Eamonn Hamilton
2009-01-14 22:50 ` Bill Davidsen
0 siblings, 1 reply; 5+ messages in thread
From: Eamonn Hamilton @ 2009-01-14 17:26 UTC (permalink / raw)
To: linux-raid
Hi Guys,
I'm looking at a server with a bunch of disks that had a raid 5 with two
spares, however, one of the spares failed, the system then started
rebuilding on the other and it crashed during the rebuild.
I'm now left in the following situation :
for a in a b c d e f g h i; do mdadm --examine --scan /dev/sd${a}; done
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
spares=1
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
spares=1
ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
The system complains because of the different uuids, and refuses to
recreate the array.
Is it basically stuffed, or is there something I can do to recover the
2TB filesystem that's on there ?
Cheers,
Eamonn
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: differnet UUIDs and no of spares :(
2009-01-14 17:26 differnet UUIDs and no of spares :( Eamonn Hamilton
@ 2009-01-14 22:50 ` Bill Davidsen
2009-01-14 23:15 ` Eamonn Hamilton
2009-01-21 11:06 ` eamonn
0 siblings, 2 replies; 5+ messages in thread
From: Bill Davidsen @ 2009-01-14 22:50 UTC (permalink / raw)
To: Eamonn Hamilton; +Cc: linux-raid
Eamonn Hamilton wrote:
> Hi Guys,
>
> I'm looking at a server with a bunch of disks that had a raid 5 with two
> spares, however, one of the spares failed, the system then started
> rebuilding on the other and it crashed during the rebuild.
>
> I'm now left in the following situation :
>
>
> for a in a b c d e f g h i; do mdadm --examine --scan /dev/sd${a}; done
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
> spares=1
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
> spares=1
> ARRAY /dev/md0 level=raid5 num-devices=7 UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
>
>
> The system complains because of the different uuids, and refuses to
> recreate the array.
>
> Is it basically stuffed, or is there something I can do to recover the
> 2TB filesystem that's on there ?
>
What can you tell us about how that happened? When (if ever) was it
running, how was it created, etc, etc.
You could probably try some things like trying to start it read-only
using --force, but don't do that yet, if you get it wrong you WILL be
likely to lose data.
--
Bill Davidsen <davidsen@tmr.com>
"Woe unto the statesman who makes war without a reason that will still
be valid when the war is over..." Otto von Bismark
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: differnet UUIDs and no of spares :(
2009-01-14 22:50 ` Bill Davidsen
@ 2009-01-14 23:15 ` Eamonn Hamilton
2009-01-21 11:06 ` eamonn
1 sibling, 0 replies; 5+ messages in thread
From: Eamonn Hamilton @ 2009-01-14 23:15 UTC (permalink / raw)
To: Bill Davidsen; +Cc: linux-raid
Hi Bill,
>
> What can you tell us about how that happened? When (if ever) was it
> running, how was it created, etc, etc.
It was running stably for a long time, with the raid5 being expanded
from 1TB to 2. It recently had two spare drives added to it, and again
was running fine, however a new kernel on the box provoked some nasty
behaviour with the sata_nv driver.
This left the raid not starting at boot-up, as devmapper had claimed one
of the spares as it's own, so I started the array without that spare,
which provoked a re-sync for reasons I'm not clear on. Having cleared
the devmapper mapping on the other spare, it was added back in as a
spare to the array. 15 minutes into the resync, however, the biox fell
over, whether as a result of the array resync or something else I dont
know, there's nothing suspicious in the logs. Which brings us to where
we are today :(
>
> You could probably try some things like trying to start it read-only
> using --force, but don't do that yet, if you get it wrong you WILL be
> likely to lose data.
>
I was tempted to change the UUID on the nonstarting members, but the
number of spares being different gave me pause.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: differnet UUIDs and no of spares :(
2009-01-14 22:50 ` Bill Davidsen
2009-01-14 23:15 ` Eamonn Hamilton
@ 2009-01-21 11:06 ` eamonn
2009-01-21 18:04 ` Bryon Roche
1 sibling, 1 reply; 5+ messages in thread
From: eamonn @ 2009-01-21 11:06 UTC (permalink / raw)
To: Bill Davidsen; +Cc: Eamonn Hamilton, linux-raid
On Wed, January 14, 2009 10:50 pm, Bill Davidsen wrote:
> Eamonn Hamilton wrote:
>
>> Hi Guys,
>>
>>
>> I'm looking at a server with a bunch of disks that had a raid 5 with
>> two spares, however, one of the spares failed, the system then started
>> rebuilding on the other and it crashed during the rebuild.
>>
>> I'm now left in the following situation :
>>
>>
>>
>> for a in a b c d e f g h i; do mdadm --examine --scan /dev/sd${a}; done
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:5feed4c5:31c51eb2
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
>> spares=1 ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
>> ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
>> spares=1 ARRAY /dev/md0 level=raid5 num-devices=7
>> UUID=e1e75e8b:f9f387cd:8d12a2d2:3188faf0
>>
>>
>>
>> The system complains because of the different uuids, and refuses to
>> recreate the array.
>>
>> Is it basically stuffed, or is there something I can do to recover the
>> 2TB filesystem that's on there ?
>>
>>
>
> What can you tell us about how that happened? When (if ever) was it
> running, how was it created, etc, etc.
>
> You could probably try some things like trying to start it read-only
> using --force, but don't do that yet, if you get it wrong you WILL be
> likely to lose data.
>
> --
> Bill Davidsen <davidsen@tmr.com>
> "Woe unto the statesman who makes war without a reason that will still
> be valid when the war is over..." Otto von Bismark
>
>
>
I don't suppose anybody else has any ideas? I've been holding off
attempting anything in the hope of somebody handing me a silver bullet,
but failing that ... ;)
Given that I still have the logs showing the order in which the drives
were assembled in these various phases, if I force a re-assembly in a
particular order, is there anything in the meta-data which would cause the
array to auto-magically continue where it left off?
Or should I simply take the afore-mentioned silver bullet, bite it and
recreate the whole lot as a RAID6, followed by the usual mad scrabbling
though old hard drives and tapes to try and get most of the content back?
Thanks in advance for any help,
Eamonn
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: differnet UUIDs and no of spares :(
2009-01-21 11:06 ` eamonn
@ 2009-01-21 18:04 ` Bryon Roche
0 siblings, 0 replies; 5+ messages in thread
From: Bryon Roche @ 2009-01-21 18:04 UTC (permalink / raw)
To: linux-raid
On Wed, 21 Jan 2009 11:06:10 +0000, eamonn wrote:
> I don't suppose anybody else has any ideas? I've been holding off
> attempting anything in the hope of somebody handing me a silver bullet,
> but failing that ... ;)
No silver bullets. your best chance is to use the logs like you mention
to do a forced read-only assemble. For this reason, it's also usually a
good idea to save the arguments you used for mdadm --create somewhere,
especially if you made the array with a non-default layout. At least
this will let you stream the data of the array onto something else.
A forced assembly necessarily destroys the superblock and any metadata
that could be used to resume the build. Your best bet here will be to
make sure you got the device order correct, probably in accordance to the
oldest list. This assumes that you didn't start rebuilding onto any
disks in such a way as to change that order.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2009-01-21 18:04 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-01-14 17:26 differnet UUIDs and no of spares :( Eamonn Hamilton
2009-01-14 22:50 ` Bill Davidsen
2009-01-14 23:15 ` Eamonn Hamilton
2009-01-21 11:06 ` eamonn
2009-01-21 18:04 ` Bryon Roche
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).