* Unable to restart reshape
@ 2011-10-30 14:57 Michael Busby
2011-10-30 15:34 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 14:57 UTC (permalink / raw)
To: linux-raid
I have a system the was doing a reshape from RAID5 to 6, the system
had to be powered off this morning and moved, upon restarting the
server i issued the following command to continue the reshape
mdadm -A /dev/md0 --backup-file=/home/md.backup
i get back to following error
mdadm: Failed to restore critical section for reshape, sorry.
any idea why?
before shutting down cat /proc/mdstat showed
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
[raid4] [raid10]
md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
[6/5] [UUUUU_]
[==============>......] reshape = 70.8% (1384415232/1953513984)
finish=3658.6min speed=2592K/sec
but now it shows
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
[raid4] [raid10]
md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
9767572240 blocks super 1.0
i am totally confused, it seems to have lost a drive from the raid,
and the number of blocks is incorrect
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 14:57 Unable to restart reshape Michael Busby
@ 2011-10-30 15:34 ` Michael Busby
2011-10-30 15:57 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 15:34 UTC (permalink / raw)
To: linux-raid
On 30 October 2011 14:57, Michael Busby <michael.a.busby@gmail.com> wrote:
> I have a system the was doing a reshape from RAID5 to 6, the system
> had to be powered off this morning and moved, upon restarting the
> server i issued the following command to continue the reshape
>
> mdadm -A /dev/md0 --backup-file=/home/md.backup
>
> i get back to following error
>
> mdadm: Failed to restore critical section for reshape, sorry.
>
> any idea why?
>
> before shutting down cat /proc/mdstat showed
>
> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> [raid4] [raid10]
> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
> [6/5] [UUUUU_]
> [==============>......] reshape = 70.8% (1384415232/1953513984)
> finish=3658.6min speed=2592K/sec
>
> but now it shows
>
> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> [raid4] [raid10]
> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
> 9767572240 blocks super 1.0
>
> i am totally confused, it seems to have lost a drive from the raid,
> and the number of blocks is incorrect
>
issuing the following
mdadm -Avv --backup-file=/home/md.backup /dev/md0
returns
mdadm: looking for devices for /dev/md0
mdadm: cannot open device /dev/sda5: Device or resource busy
mdadm: /dev/sda5 has wrong uuid.
mdadm: no RAID superblock on /dev/sda2
mdadm: /dev/sda2 has wrong uuid.
mdadm: cannot open device /dev/sda1: Device or resource busy
mdadm: /dev/sda1 has wrong uuid.
mdadm: cannot open device /dev/sda: Device or resource busy
mdadm: /dev/sda has wrong uuid.
mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
mdadm:/dev/md0 has an active reshape - checking if critical section
needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 15:34 ` Michael Busby
@ 2011-10-30 15:57 ` Michael Busby
2011-10-30 16:04 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 15:57 UTC (permalink / raw)
To: linux-raid
On 30 October 2011 15:34, Michael Busby <michael.a.busby@gmail.com> wrote:
> On 30 October 2011 14:57, Michael Busby <michael.a.busby@gmail.com> wrote:
>> I have a system the was doing a reshape from RAID5 to 6, the system
>> had to be powered off this morning and moved, upon restarting the
>> server i issued the following command to continue the reshape
>>
>> mdadm -A /dev/md0 --backup-file=/home/md.backup
>>
>> i get back to following error
>>
>> mdadm: Failed to restore critical section for reshape, sorry.
>>
>> any idea why?
>>
>> before shutting down cat /proc/mdstat showed
>>
>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>> [raid4] [raid10]
>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>> [6/5] [UUUUU_]
>> [==============>......] reshape = 70.8% (1384415232/1953513984)
>> finish=3658.6min speed=2592K/sec
>>
>> but now it shows
>>
>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>> [raid4] [raid10]
>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
>> 9767572240 blocks super 1.0
>>
>> i am totally confused, it seems to have lost a drive from the raid,
>> and the number of blocks is incorrect
>>
>
> issuing the following
>
> mdadm -Avv --backup-file=/home/md.backup /dev/md0
>
> returns
>
>
> mdadm: looking for devices for /dev/md0
> mdadm: cannot open device /dev/sda5: Device or resource busy
> mdadm: /dev/sda5 has wrong uuid.
> mdadm: no RAID superblock on /dev/sda2
> mdadm: /dev/sda2 has wrong uuid.
> mdadm: cannot open device /dev/sda1: Device or resource busy
> mdadm: /dev/sda1 has wrong uuid.
> mdadm: cannot open device /dev/sda: Device or resource busy
> mdadm: /dev/sda has wrong uuid.
> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
> mdadm:/dev/md0 has an active reshape - checking if critical section
> needs to be restored
> mdadm: backup-metadata found on /home/md.backup but is not needed
> mdadm: Failed to find backup of critical section
> mdadm: Failed to restore critical section for reshape, sorry.
>
seem the above was trying at use the wrong disks to assemble, so using
the following
mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm:/dev/md0 has an active reshape - checking if critical section
needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 15:57 ` Michael Busby
@ 2011-10-30 16:04 ` Michael Busby
2011-10-30 16:22 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 16:04 UTC (permalink / raw)
To: linux-raid
>>> I have a system the was doing a reshape from RAID5 to 6, the system
>>> had to be powered off this morning and moved, upon restarting the
>>> server i issued the following command to continue the reshape
>>>
>>> mdadm -A /dev/md0 --backup-file=/home/md.backup
>>>
>>> i get back to following error
>>>
>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>
>>> any idea why?
>>>
>>> before shutting down cat /proc/mdstat showed
>>>
>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>> [raid4] [raid10]
>>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
>>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>>> [6/5] [UUUUU_]
>>> [==============>......] reshape = 70.8% (1384415232/1953513984)
>>> finish=3658.6min speed=2592K/sec
>>>
>>> but now it shows
>>>
>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>> [raid4] [raid10]
>>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
>>> 9767572240 blocks super 1.0
>>>
>>> i am totally confused, it seems to have lost a drive from the raid,
>>> and the number of blocks is incorrect
>>>
>>
>> issuing the following
>>
>> mdadm -Avv --backup-file=/home/md.backup /dev/md0
>>
>> returns
>>
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: cannot open device /dev/sda5: Device or resource busy
>> mdadm: /dev/sda5 has wrong uuid.
>> mdadm: no RAID superblock on /dev/sda2
>> mdadm: /dev/sda2 has wrong uuid.
>> mdadm: cannot open device /dev/sda1: Device or resource busy
>> mdadm: /dev/sda1 has wrong uuid.
>> mdadm: cannot open device /dev/sda: Device or resource busy
>> mdadm: /dev/sda has wrong uuid.
>> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: backup-metadata found on /home/md.backup but is not needed
>> mdadm: Failed to find backup of critical section
>> mdadm: Failed to restore critical section for reshape, sorry.
>>
>
> seem the above was trying at use the wrong disks to assemble, so using
> the following
>
> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>
> mdadm: looking for devices for /dev/md0
> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
> mdadm:/dev/md0 has an active reshape - checking if critical section
> needs to be restored
> mdadm: backup-metadata found on /home/md.backup but is not needed
> mdadm: Failed to find backup of critical section
> mdadm: Failed to restore critical section for reshape, sorry.
>
have now upgraded to mdadm 3.2.2
and get a little more info
mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm: device 6 in /dev/md0 has wrong state in superblock, but /dev/sdb seems ok
mdadm:/dev/md0 has an active reshape - checking if critical section
needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 16:04 ` Michael Busby
@ 2011-10-30 16:22 ` Michael Busby
2011-10-30 22:02 ` Alexander Kühn
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 16:22 UTC (permalink / raw)
To: linux-raid
>>>> I have a system the was doing a reshape from RAID5 to 6, the system
>>>> had to be powered off this morning and moved, upon restarting the
>>>> server i issued the following command to continue the reshape
>>>>
>>>> mdadm -A /dev/md0 --backup-file=/home/md.backup
>>>>
>>>> i get back to following error
>>>>
>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>
>>>> any idea why?
>>>>
>>>> before shutting down cat /proc/mdstat showed
>>>>
>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>> [raid4] [raid10]
>>>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
>>>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>>>> [6/5] [UUUUU_]
>>>> [==============>......] reshape = 70.8% (1384415232/1953513984)
>>>> finish=3658.6min speed=2592K/sec
>>>>
>>>> but now it shows
>>>>
>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>> [raid4] [raid10]
>>>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
>>>> 9767572240 blocks super 1.0
>>>>
>>>> i am totally confused, it seems to have lost a drive from the raid,
>>>> and the number of blocks is incorrect
>>>>
>>>
>>> issuing the following
>>>
>>> mdadm -Avv --backup-file=/home/md.backup /dev/md0
>>>
>>> returns
>>>
>>>
>>> mdadm: looking for devices for /dev/md0
>>> mdadm: cannot open device /dev/sda5: Device or resource busy
>>> mdadm: /dev/sda5 has wrong uuid.
>>> mdadm: no RAID superblock on /dev/sda2
>>> mdadm: /dev/sda2 has wrong uuid.
>>> mdadm: cannot open device /dev/sda1: Device or resource busy
>>> mdadm: /dev/sda1 has wrong uuid.
>>> mdadm: cannot open device /dev/sda: Device or resource busy
>>> mdadm: /dev/sda has wrong uuid.
>>> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>> needs to be restored
>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>> mdadm: Failed to find backup of critical section
>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>
>>
>> seem the above was trying at use the wrong disks to assemble, so using
>> the following
>>
>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: backup-metadata found on /home/md.backup but is not needed
>> mdadm: Failed to find backup of critical section
>> mdadm: Failed to restore critical section for reshape, sorry.
>>
>
> have now upgraded to mdadm 3.2.2
>
> and get a little more info
>
> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>
> mdadm: looking for devices for /dev/md0
> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
> mdadm: device 6 in /dev/md0 has wrong state in superblock, but /dev/sdb seems ok
> mdadm:/dev/md0 has an active reshape - checking if critical section
> needs to be restored
> mdadm: backup-metadata found on /home/md.backup but is not needed
> mdadm: Failed to find backup of critical section
> mdadm: Failed to restore critical section for reshape, sorry.
>
Ok, i dont know if this is the right thing to have done
~# mdadm -Avv --force /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm: clearing FAULTY flag for device 1 in /dev/md0 for /dev/sdb
mdadm: Marking array /dev/md0 as 'clean'
mdadm:/dev/md0 has an active reshape - checking if critical section
needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
~# mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm:/dev/md0 has an active reshape - checking if critical section
needs to be restored
mdadm: restoring critical section
mdadm: added /dev/sdd to /dev/md0 as 1
mdadm: added /dev/sde to /dev/md0 as 2
mdadm: added /dev/sdc to /dev/md0 as 3
mdadm: added /dev/sda to /dev/md0 as 4
mdadm: no uptodate device for slot 5 of /dev/md0
mdadm: added /dev/sdb to /dev/md0 as -1
mdadm: added /dev/sdf to /dev/md0 as 0
mdadm: /dev/md0 has been started with 4 drives (out of 6) and 1 spare.
~# cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
[raid4] [raid10]
md0 : active raid6 sdf[0] sdb[6](S) sdc[3] sde[2] sdd[1]
7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
[6/4] [UUUU__]
[==============>......] reshape = 74.3% (1452929024/1953513984)
finish=2545.2min speed=3276K/sec
unused devices: <none>
so looks like its carrying on now but with 4 disks and a spare, maybe
i can add the other disk once the reshape has finished
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 16:22 ` Michael Busby
@ 2011-10-30 22:02 ` Alexander Kühn
2011-10-30 22:15 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Alexander Kühn @ 2011-10-30 22:02 UTC (permalink / raw)
To: Michael Busby; +Cc: linux-raid
Zitat von Michael Busby <michael.a.busby@gmail.com>:
>>>>> I have a system the was doing a reshape from RAID5 to 6, the system
>>>>> had to be powered off this morning and moved, upon restarting the
>>>>> server i issued the following command to continue the reshape
>>>>>
>>>>> mdadm -A /dev/md0 --backup-file=/home/md.backup
>>>>>
>>>>> i get back to following error
>>>>>
>>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>>
>>>>> any idea why?
>>>>>
>>>>> before shutting down cat /proc/mdstat showed
>>>>>
>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>>> [raid4] [raid10]
>>>>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
>>>>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>>>>> [6/5] [UUUUU_]
>>>>> [==============>......] reshape = 70.8% (1384415232/1953513984)
>>>>> finish=3658.6min speed=2592K/sec
>>>>>
>>>>> but now it shows
>>>>>
>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>>> [raid4] [raid10]
>>>>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
>>>>> 9767572240 blocks super 1.0
>>>>>
>>>>> i am totally confused, it seems to have lost a drive from the raid,
>>>>> and the number of blocks is incorrect
>>>>>
>>>>
>>>> issuing the following
>>>>
>>>> mdadm -Avv --backup-file=/home/md.backup /dev/md0
>>>>
>>>> returns
>>>>
>>>>
>>>> mdadm: looking for devices for /dev/md0
>>>> mdadm: cannot open device /dev/sda5: Device or resource busy
>>>> mdadm: /dev/sda5 has wrong uuid.
>>>> mdadm: no RAID superblock on /dev/sda2
>>>> mdadm: /dev/sda2 has wrong uuid.
>>>> mdadm: cannot open device /dev/sda1: Device or resource busy
>>>> mdadm: /dev/sda1 has wrong uuid.
>>>> mdadm: cannot open device /dev/sda: Device or resource busy
>>>> mdadm: /dev/sda has wrong uuid.
>>>> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
>>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
>>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
>>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
>>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
>>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
>>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>>> needs to be restored
>>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>>> mdadm: Failed to find backup of critical section
>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>
>>>
>>> seem the above was trying at use the wrong disks to assemble, so using
>>> the following
>>>
>>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>>
>>> mdadm: looking for devices for /dev/md0
>>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>> needs to be restored
>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>> mdadm: Failed to find backup of critical section
>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>
>>
>> have now upgraded to mdadm 3.2.2
>>
>> and get a little more info
>>
>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>> mdadm: device 6 in /dev/md0 has wrong state in superblock, but
>> /dev/sdb seems ok
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: backup-metadata found on /home/md.backup but is not needed
>> mdadm: Failed to find backup of critical section
>> mdadm: Failed to restore critical section for reshape, sorry.
>>
>
>
> Ok, i dont know if this is the right thing to have done
>
> ~# mdadm -Avv --force /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>
> mdadm: looking for devices for /dev/md0
> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
> mdadm: clearing FAULTY flag for device 1 in /dev/md0 for /dev/sdb
> mdadm: Marking array /dev/md0 as 'clean'
> mdadm:/dev/md0 has an active reshape - checking if critical section
> needs to be restored
> mdadm: backup-metadata found on /home/md.backup but is not needed
> mdadm: Failed to find backup of critical section
> mdadm: Failed to restore critical section for reshape, sorry.
>
>
> ~# mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>
> mdadm: looking for devices for /dev/md0
> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
> mdadm:/dev/md0 has an active reshape - checking if critical section
> needs to be restored
> mdadm: restoring critical section
> mdadm: added /dev/sdd to /dev/md0 as 1
> mdadm: added /dev/sde to /dev/md0 as 2
> mdadm: added /dev/sdc to /dev/md0 as 3
> mdadm: added /dev/sda to /dev/md0 as 4
> mdadm: no uptodate device for slot 5 of /dev/md0
> mdadm: added /dev/sdb to /dev/md0 as -1
> mdadm: added /dev/sdf to /dev/md0 as 0
> mdadm: /dev/md0 has been started with 4 drives (out of 6) and 1 spare.
>
> ~# cat /proc/mdstat
>
> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
> [raid4] [raid10]
> md0 : active raid6 sdf[0] sdb[6](S) sdc[3] sde[2] sdd[1]
> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
> [6/4] [UUUU__]
> [==============>......] reshape = 74.3% (1452929024/1953513984)
> finish=2545.2min speed=3276K/sec
>
> unused devices: <none>
>
> so looks like its carrying on now but with 4 disks and a spare, maybe
> i can add the other disk once the reshape has finished
It generally helps to include/examine "mdadm -E /dev/sdX" of all
devices involved in your mail(s) and also "mdadm -Q --detail /dev/md0".
After the reshape is done it will automatically rebuild using the
spare. Then you can have a close look which of your devices arent
used, clear the metadate from the device and add it as well to regain
full redundancy. You'll have plenty hours of fun watching
/proc/mdstat. ;)
Alex.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Unable to restart reshape
2011-10-30 22:02 ` Alexander Kühn
@ 2011-10-30 22:15 ` Michael Busby
0 siblings, 0 replies; 7+ messages in thread
From: Michael Busby @ 2011-10-30 22:15 UTC (permalink / raw)
To: Alexander Kühn; +Cc: linux-raid
>>>>>> I have a system the was doing a reshape from RAID5 to 6, the system
>>>>>> had to be powered off this morning and moved, upon restarting the
>>>>>> server i issued the following command to continue the reshape
>>>>>>
>>>>>> mdadm -A /dev/md0 --backup-file=/home/md.backup
>>>>>>
>>>>>> i get back to following error
>>>>>>
>>>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>>>
>>>>>> any idea why?
>>>>>>
>>>>>> before shutting down cat /proc/mdstat showed
>>>>>>
>>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>>>> [raid4] [raid10]
>>>>>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
>>>>>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>>>>>> [6/5] [UUUUU_]
>>>>>> [==============>......] reshape = 70.8% (1384415232/1953513984)
>>>>>> finish=3658.6min speed=2592K/sec
>>>>>>
>>>>>> but now it shows
>>>>>>
>>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>>>>>> [raid4] [raid10]
>>>>>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
>>>>>> 9767572240 blocks super 1.0
>>>>>>
>>>>>> i am totally confused, it seems to have lost a drive from the raid,
>>>>>> and the number of blocks is incorrect
>>>>>>
>>>>>
>>>>> issuing the following
>>>>>
>>>>> mdadm -Avv --backup-file=/home/md.backup /dev/md0
>>>>>
>>>>> returns
>>>>>
>>>>>
>>>>> mdadm: looking for devices for /dev/md0
>>>>> mdadm: cannot open device /dev/sda5: Device or resource busy
>>>>> mdadm: /dev/sda5 has wrong uuid.
>>>>> mdadm: no RAID superblock on /dev/sda2
>>>>> mdadm: /dev/sda2 has wrong uuid.
>>>>> mdadm: cannot open device /dev/sda1: Device or resource busy
>>>>> mdadm: /dev/sda1 has wrong uuid.
>>>>> mdadm: cannot open device /dev/sda: Device or resource busy
>>>>> mdadm: /dev/sda has wrong uuid.
>>>>> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
>>>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
>>>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
>>>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
>>>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
>>>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
>>>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>>>> needs to be restored
>>>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>>>> mdadm: Failed to find backup of critical section
>>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>>
>>>>
>>>> seem the above was trying at use the wrong disks to assemble, so using
>>>> the following
>>>>
>>>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>>>
>>>> mdadm: looking for devices for /dev/md0
>>>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>>> needs to be restored
>>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>>> mdadm: Failed to find backup of critical section
>>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>>
>>>
>>> have now upgraded to mdadm 3.2.2
>>>
>>> and get a little more info
>>>
>>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>>
>>> mdadm: looking for devices for /dev/md0
>>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>>> mdadm: device 6 in /dev/md0 has wrong state in superblock, but /dev/sdb
>>> seems ok
>>> mdadm:/dev/md0 has an active reshape - checking if critical section
>>> needs to be restored
>>> mdadm: backup-metadata found on /home/md.backup but is not needed
>>> mdadm: Failed to find backup of critical section
>>> mdadm: Failed to restore critical section for reshape, sorry.
>>>
>>
>>
>> Ok, i dont know if this is the right thing to have done
>>
>> ~# mdadm -Avv --force /dev/md0 --backup-file=/home/md.backup
>> /dev/sd[abcdef]
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>> mdadm: clearing FAULTY flag for device 1 in /dev/md0 for /dev/sdb
>> mdadm: Marking array /dev/md0 as 'clean'
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: backup-metadata found on /home/md.backup but is not needed
>> mdadm: Failed to find backup of critical section
>> mdadm: Failed to restore critical section for reshape, sorry.
>>
>>
>> ~# mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: restoring critical section
>> mdadm: added /dev/sdd to /dev/md0 as 1
>> mdadm: added /dev/sde to /dev/md0 as 2
>> mdadm: added /dev/sdc to /dev/md0 as 3
>> mdadm: added /dev/sda to /dev/md0 as 4
>> mdadm: no uptodate device for slot 5 of /dev/md0
>> mdadm: added /dev/sdb to /dev/md0 as -1
>> mdadm: added /dev/sdf to /dev/md0 as 0
>> mdadm: /dev/md0 has been started with 4 drives (out of 6) and 1 spare.
>>
>> ~# cat /proc/mdstat
>>
>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>> [raid4] [raid10]
>> md0 : active raid6 sdf[0] sdb[6](S) sdc[3] sde[2] sdd[1]
>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>> [6/4] [UUUU__]
>> [==============>......] reshape = 74.3% (1452929024/1953513984)
>> finish=2545.2min speed=3276K/sec
>>
>> unused devices: <none>
>>
>> so looks like its carrying on now but with 4 disks and a spare, maybe
>> i can add the other disk once the reshape has finished
>
> It generally helps to include/examine "mdadm -E /dev/sdX" of all devices
> involved in your mail(s) and also "mdadm -Q --detail /dev/md0".
> After the reshape is done it will automatically rebuild using the spare.
> Then you can have a close look which of your devices arent used, clear the
> metadate from the device and add it as well to regain full redundancy.
> You'll have plenty hours of fun watching /proc/mdstat. ;)
> Alex.
>
Thanks for the response Alex, the reshape has got about 2400mins left
to run and no idea how long the rebuild will take..
I will check out those commands once i am back up and running, i am
fairly new to mdadm so still finding out all the useful commands when
trouble shooting issues, thanks for pointing these out to me
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2011-10-30 22:15 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-10-30 14:57 Unable to restart reshape Michael Busby
2011-10-30 15:34 ` Michael Busby
2011-10-30 15:57 ` Michael Busby
2011-10-30 16:04 ` Michael Busby
2011-10-30 16:22 ` Michael Busby
2011-10-30 22:02 ` Alexander Kühn
2011-10-30 22:15 ` Michael Busby
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).