* Unable to restart reshape
@ 2011-10-30 14:57 Michael Busby
2011-10-30 15:34 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 14:57 UTC (permalink / raw)
To: linux-raid
I have a system that was doing a reshape from RAID5 to RAID6. The system
had to be powered off this morning and moved. After restarting the
server I issued the following command to continue the reshape:
mdadm -A /dev/md0 --backup-file=/home/md.backup
I get back the following error:
mdadm: Failed to restore critical section for reshape, sorry.
Any idea why?
Before shutting down, cat /proc/mdstat showed:
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1]
      7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18 [6/5] [UUUUU_]
      [==============>......]  reshape = 70.8% (1384415232/1953513984) finish=3658.6min speed=2592K/sec
But now it shows:
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0]
      9767572240 blocks super 1.0
I am totally confused: it seems to have lost a drive from the array,
and the number of blocks looks incorrect.
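The figures in the two /proc/mdstat snapshots can be cross-checked with plain integer arithmetic. The sketch below is only an illustration: the helper names (reshape_tenths, eta_minutes, raid6_blocks) are made up for this example, and the reading that the inactive array's block count is the sum of the raw sizes of the five listed members is an assumption, although the numbers divide evenly in a way that supports it.

```shell
#!/bin/sh
# Figures copied from the /proc/mdstat output in this message.
reshape_done=1384415232     # blocks already reshaped
reshape_total=1953513984    # blocks per device to reshape
speed_k=2592                # reported speed in K/sec

# Progress in tenths of a percent (integer arithmetic, truncated).
reshape_tenths() {
    echo $(( $1 * 1000 / $2 ))
}

# Rough minutes remaining, assuming the speed stays constant.
eta_minutes() {
    echo $(( ($2 - $1) / $3 / 60 ))
}

# Usable blocks of an n-device RAID6: two devices' worth holds parity.
raid6_blocks() {
    echo $(( ($1 - 2) * $2 ))
}

t=$(reshape_tenths "$reshape_done" "$reshape_total")
echo "reshape: $((t / 10)).$((t % 10))%"
echo "eta: about $(eta_minutes "$reshape_done" "$reshape_total" "$speed_k") min"
echo "raid6 usable: $(raid6_blocks 6 "$reshape_total") blocks"
# Assumption: the inactive line sums the raw sizes of its 5 listed members.
echo "inactive total / 5 members: $(( 9767572240 / 5 )) blocks each"
```

Under these assumptions the active figure (7814055936) is four data devices' worth of blocks, while the inactive figure (9767572240) is five whole members added together, so the larger number would not mean the array changed size.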
* Re: Unable to restart reshape
2011-10-30 14:57 Unable to restart reshape Michael Busby
@ 2011-10-30 15:34 ` Michael Busby
2011-10-30 15:57 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 15:34 UTC (permalink / raw)
To: linux-raid
Issuing the following
mdadm -Avv --backup-file=/home/md.backup /dev/md0
returns
mdadm: looking for devices for /dev/md0
mdadm: cannot open device /dev/sda5: Device or resource busy
mdadm: /dev/sda5 has wrong uuid.
mdadm: no RAID superblock on /dev/sda2
mdadm: /dev/sda2 has wrong uuid.
mdadm: cannot open device /dev/sda1: Device or resource busy
mdadm: /dev/sda1 has wrong uuid.
mdadm: cannot open device /dev/sda: Device or resource busy
mdadm: /dev/sda has wrong uuid.
mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 0.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3.
mdadm:/dev/md0 has an active reshape - checking if critical section needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: Unable to restart reshape
2011-10-30 15:34 ` Michael Busby
@ 2011-10-30 15:57 ` Michael Busby
2011-10-30 16:04 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 15:57 UTC (permalink / raw)
To: linux-raid
It seems the above was trying to use the wrong disks to assemble, so I
used the following instead:
mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm:/dev/md0 has an active reshape - checking if critical section needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
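The "identified as a member" lines are easier to read when sorted by slot. The following is a small sketch, not part of mdadm: slot_map is a hypothetical helper that only parses text in the format quoted in this message, and the sample input is copied from that output.

```shell
#!/bin/sh
# Turn "mdadm: /dev/X is identified as a member of /dev/mdN, slot S."
# lines into "S /dev/X" pairs, sorted numerically by slot.
slot_map() {
    sed -n 's|^mdadm: \(/dev/[^ ]*\) is identified as a member of [^,]*, slot \(-\{0,1\}[0-9][0-9]*\)\.$|\2 \1|p' | sort -n
}

# Sample input copied from the verbose assemble output in this message.
slot_map <<'EOF'
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
EOF
```

For this input the sorted list runs from slot -1 (the spare, /dev/sdb) through slot 4 (/dev/sda), with no device claiming slot 5 of the six-device array.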
* Re: Unable to restart reshape
2011-10-30 15:57 ` Michael Busby
@ 2011-10-30 16:04 ` Michael Busby
2011-10-30 16:22 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 16:04 UTC (permalink / raw)
To: linux-raid
I have now upgraded to mdadm 3.2.2 and get a little more info:
mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm: device 6 in /dev/md0 has wrong state in superblock, but /dev/sdb seems ok
mdadm:/dev/md0 has an active reshape - checking if critical section needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
* Re: Unable to restart reshape
2011-10-30 16:04 ` Michael Busby
@ 2011-10-30 16:22 ` Michael Busby
2011-10-30 22:02 ` Alexander Kühn
0 siblings, 1 reply; 7+ messages in thread
From: Michael Busby @ 2011-10-30 16:22 UTC (permalink / raw)
To: linux-raid
OK, I don't know if this was the right thing to do:
~# mdadm -Avv --force /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm: clearing FAULTY flag for device 1 in /dev/md0 for /dev/sdb
mdadm: Marking array /dev/md0 as 'clean'
mdadm:/dev/md0 has an active reshape - checking if critical section needs to be restored
mdadm: backup-metadata found on /home/md.backup but is not needed
mdadm: Failed to find backup of critical section
mdadm: Failed to restore critical section for reshape, sorry.
~# mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
mdadm: looking for devices for /dev/md0
mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
mdadm:/dev/md0 has an active reshape - checking if critical section needs to be restored
mdadm: restoring critical section
mdadm: added /dev/sdd to /dev/md0 as 1
mdadm: added /dev/sde to /dev/md0 as 2
mdadm: added /dev/sdc to /dev/md0 as 3
mdadm: added /dev/sda to /dev/md0 as 4
mdadm: no uptodate device for slot 5 of /dev/md0
mdadm: added /dev/sdb to /dev/md0 as -1
mdadm: added /dev/sdf to /dev/md0 as 0
mdadm: /dev/md0 has been started with 4 drives (out of 6) and 1 spare.
~# cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md0 : active raid6 sdf[0] sdb[6](S) sdc[3] sde[2] sdd[1]
      7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18 [6/4] [UUUU__]
      [==============>......]  reshape = 74.3% (1452929024/1953513984) finish=2545.2min speed=3276K/sec
unused devices: <none>
So it looks like it's carrying on now, but with 4 disks and a spare.
Maybe I can add the other disk once the reshape has finished.
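The bracketed status in /proc/mdstat shows how much redundancy is left: each "U" is an active member and each "_" is a missing one, and RAID6 can run with at most two members missing. A minimal sketch of that check follows; count_missing and raid6_margin are hypothetical helper names used only for this example.

```shell
#!/bin/sh
# Count the "_" characters in a status string such as "[UUUU__]".
count_missing() {
    printf '%s' "$1" | tr -cd '_' | wc -c | tr -d ' '
}

# RAID6 survives two missing members; report how many more it could lose.
raid6_margin() {
    missing=$(count_missing "$1")
    echo $(( 2 - missing ))
}

# Status strings copied from the mdstat output before and after the move.
for status in '[UUUUU_]' '[UUUU__]'; do
    echo "$status: $(count_missing "$status") missing, can lose $(raid6_margin "$status") more"
done
```

With [UUUU__] the margin is zero, so the array has no redundancy left until the spare has been rebuilt into it.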
* Re: Unable to restart reshape
2011-10-30 16:22 ` Michael Busby
@ 2011-10-30 22:02 ` Alexander Kühn
2011-10-30 22:15 ` Michael Busby
0 siblings, 1 reply; 7+ messages in thread
From: Alexander Kühn @ 2011-10-30 22:02 UTC (permalink / raw)
To: Michael Busby; +Cc: linux-raid
Quoting Michael Busby <michael.a.busby@gmail.com>:
> so looks like its carrying on now but with 4 disks and a spare, maybe
> i can add the other disk once the reshape has finished
It generally helps to examine the output of "mdadm -E /dev/sdX" for all
devices involved and include it in your mail(s), along with
"mdadm -Q --detail /dev/md0".
After the reshape is done it will automatically rebuild using the spare.
Then you can have a close look at which of your devices aren't used,
clear the metadata from that device and add it as well to regain full
redundancy.
You'll have plenty of hours of fun watching /proc/mdstat. ;)
Alex.
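The advice above could be turned into a checklist along the following lines. This is a dry-run sketch only: it prints commands rather than running them, the function names are invented for this example, /dev/sdX is the placeholder from the message, and using --zero-superblock for "clear the metadata" is an assumption about what was meant. Zeroing a superblock is destructive, so the device should be double-checked first.

```shell
#!/bin/sh
# Print (never execute) the diagnostic commands for an array and its members.
diag_commands() {
    array=$1
    shift
    for dev in "$@"; do
        echo "mdadm -E $dev"
    done
    echo "mdadm -Q --detail $array"
}

# Print the commands to wipe an unused member's metadata and add it back.
# Assumption: --zero-superblock is the intended way to clear the metadata.
readd_commands() {
    echo "mdadm --zero-superblock $2"
    echo "mdadm $1 --add $2"
}

diag_commands /dev/md0 /dev/sda /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf
# Only after the reshape and the spare rebuild have both finished:
readd_commands /dev/md0 /dev/sdX
```

Printing the commands first leaves room to compare the "mdadm -E" output of each device before anything is written to disk.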
* Re: Unable to restart reshape 2011-10-30 22:02 ` Alexander Kühn @ 2011-10-30 22:15 ` Michael Busby 0 siblings, 0 replies; 7+ messages in thread From: Michael Busby @ 2011-10-30 22:15 UTC (permalink / raw) To: Alexander Kühn; +Cc: linux-raid >>>>>> I have a system the was doing a reshape from RAID5 to 6, the system >>>>>> had to be powered off this morning and moved, upon restarting the >>>>>> server i issued the following command to continue the reshape >>>>>> >>>>>> mdadm -A /dev/md0 --backup-file=/home/md.backup >>>>>> >>>>>> i get back to following error >>>>>> >>>>>> mdadm: Failed to restore critical section for reshape, sorry. >>>>>> >>>>>> any idea why? >>>>>> >>>>>> before shutting down cat /proc/mdstat showed >>>>>> >>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] >>>>>> [raid4] [raid10] >>>>>> md0 : active raid6 sdf[0] sdb[6](S) sda[4] sdc[3] sde[2] sdd[1] >>>>>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18 >>>>>> [6/5] [UUUUU_] >>>>>> [==============>......] reshape = 70.8% (1384415232/1953513984) >>>>>> finish=3658.6min speed=2592K/sec >>>>>> >>>>>> but now it shows >>>>>> >>>>>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] >>>>>> [raid4] [raid10] >>>>>> md0 : inactive sdc[3] sdb[6](S) sde[2] sdd[1] sdf[0] >>>>>> 9767572240 blocks super 1.0 >>>>>> >>>>>> i am totally confused, it seems to have lost a drive from the raid, >>>>>> and the number of blocks is incorrect >>>>>> >>>>> >>>>> issuing the following >>>>> >>>>> mdadm -Avv --backup-file=/home/md.backup /dev/md0 >>>>> >>>>> returns >>>>> >>>>> >>>>> mdadm: looking for devices for /dev/md0 >>>>> mdadm: cannot open device /dev/sda5: Device or resource busy >>>>> mdadm: /dev/sda5 has wrong uuid. >>>>> mdadm: no RAID superblock on /dev/sda2 >>>>> mdadm: /dev/sda2 has wrong uuid. >>>>> mdadm: cannot open device /dev/sda1: Device or resource busy >>>>> mdadm: /dev/sda1 has wrong uuid. 
>>>>> mdadm: cannot open device /dev/sda: Device or resource busy >>>>> mdadm: /dev/sda has wrong uuid. >>>>> mdadm: /dev/sdg is identified as a member of /dev/md0, slot -1. >>>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 4. >>>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 2. >>>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 0. >>>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 1. >>>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot 3. >>>>> mdadm:/dev/md0 has an active reshape - checking if critical section >>>>> needs to be restored >>>>> mdadm: backup-metadata found on /home/md.backup but is not needed >>>>> mdadm: Failed to find backup of critical section >>>>> mdadm: Failed to restore critical section for reshape, sorry. >>>>> >>>> >>>> seem the above was trying at use the wrong disks to assemble, so using >>>> the following >>>> >>>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef] >>>> >>>> mdadm: looking for devices for /dev/md0 >>>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4. >>>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1. >>>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3. >>>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1. >>>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2. >>>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0. >>>> mdadm:/dev/md0 has an active reshape - checking if critical section >>>> needs to be restored >>>> mdadm: backup-metadata found on /home/md.backup but is not needed >>>> mdadm: Failed to find backup of critical section >>>> mdadm: Failed to restore critical section for reshape, sorry. 
>>>> >>> >>> have now upgraded to mdadm 3.2.2 >>> >>> and get a little more info >>> >>> mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef] >>> >>> mdadm: looking for devices for /dev/md0 >>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4. >>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1. >>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3. >>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1. >>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2. >>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0. >>> mdadm: device 6 in /dev/md0 has wrong state in superblock, but /dev/sdb >>> seems ok >>> mdadm:/dev/md0 has an active reshape - checking if critical section >>> needs to be restored >>> mdadm: backup-metadata found on /home/md.backup but is not needed >>> mdadm: Failed to find backup of critical section >>> mdadm: Failed to restore critical section for reshape, sorry. >>> >> >> >> Ok, i dont know if this is the right thing to have done >> >> ~# mdadm -Avv --force /dev/md0 --backup-file=/home/md.backup >> /dev/sd[abcdef] >> >> mdadm: looking for devices for /dev/md0 >> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4. >> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1. >> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3. >> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1. >> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2. >> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0. >> mdadm: clearing FAULTY flag for device 1 in /dev/md0 for /dev/sdb >> mdadm: Marking array /dev/md0 as 'clean' >> mdadm:/dev/md0 has an active reshape - checking if critical section >> needs to be restored >> mdadm: backup-metadata found on /home/md.backup but is not needed >> mdadm: Failed to find backup of critical section >> mdadm: Failed to restore critical section for reshape, sorry. 
>>
>>
>> ~# mdadm -Avv /dev/md0 --backup-file=/home/md.backup /dev/sd[abcdef]
>>
>> mdadm: looking for devices for /dev/md0
>> mdadm: /dev/sda is identified as a member of /dev/md0, slot 4.
>> mdadm: /dev/sdb is identified as a member of /dev/md0, slot -1.
>> mdadm: /dev/sdc is identified as a member of /dev/md0, slot 3.
>> mdadm: /dev/sdd is identified as a member of /dev/md0, slot 1.
>> mdadm: /dev/sde is identified as a member of /dev/md0, slot 2.
>> mdadm: /dev/sdf is identified as a member of /dev/md0, slot 0.
>> mdadm:/dev/md0 has an active reshape - checking if critical section
>> needs to be restored
>> mdadm: restoring critical section
>> mdadm: added /dev/sdd to /dev/md0 as 1
>> mdadm: added /dev/sde to /dev/md0 as 2
>> mdadm: added /dev/sdc to /dev/md0 as 3
>> mdadm: added /dev/sda to /dev/md0 as 4
>> mdadm: no uptodate device for slot 5 of /dev/md0
>> mdadm: added /dev/sdb to /dev/md0 as -1
>> mdadm: added /dev/sdf to /dev/md0 as 0
>> mdadm: /dev/md0 has been started with 4 drives (out of 6) and 1 spare.
>>
>> ~# cat /proc/mdstat
>>
>> Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5]
>> [raid4] [raid10]
>> md0 : active raid6 sdf[0] sdb[6](S) sdc[3] sde[2] sdd[1]
>> 7814055936 blocks super 1.0 level 6, 512k chunk, algorithm 18
>> [6/4] [UUUU__]
>> [==============>......] reshape = 74.3% (1452929024/1953513984)
>> finish=2545.2min speed=3276K/sec
>>
>> unused devices: <none>
>>
>> So it looks like it's carrying on now, but with 4 disks and a spare.
>> Maybe I can add the other disk once the reshape has finished.
>
> It generally helps to include/examine "mdadm -E /dev/sdX" of all devices
> involved in your mail(s) and also "mdadm -Q --detail /dev/md0".
> After the reshape is done it will automatically rebuild using the spare.
> Then you can have a close look at which of your devices aren't used, clear
> the metadata from the device and add it as well to regain full redundancy.
> You'll have plenty of hours of fun watching /proc/mdstat. ;)
> Alex.
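For anyone following the advice above, here is a rough sketch of how the suggested diagnostics could be gathered in one go. It is only an illustration: the `run` helper, the `RUN`, `ARRAY` and `MEMBERS` variables and the function names are made up for this example, the device list is simply the /dev/sd[abcdef] set used earlier in the thread, and /dev/sdX stands for whichever device turns out to be unused. By default it only prints the commands.

```shell
#!/bin/sh
# Sketch only: by default every command is printed, not executed.
# Set RUN=1 (as root, with mdadm installed) to run them for real.
ARRAY=${ARRAY:-/dev/md0}   # array name used in this thread
MEMBERS=${MEMBERS:-"/dev/sda /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf"}

# Hypothetical helper: print the command, or execute it when RUN=1.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

# Per-device superblocks plus the array-level view, as suggested above.
collect_diagnostics() {
    for dev in $MEMBERS; do
        run mdadm -E "$dev"
    done
    run mdadm -Q --detail "$ARRAY"
}

# After the reshape and rebuild have finished: wipe the stale metadata on
# the unused device and add it back. $1 is whichever device turns out to
# be unused; this is destructive for that device, so check it twice.
readd_device() {
    run mdadm --zero-superblock "$1"
    run mdadm "$ARRAY" --add "$1"
}

collect_diagnostics
```

Calling `readd_device /dev/sdX` in the default dry-run mode just prints the two commands, which is a cheap way to double-check the device name before setting RUN=1.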
>
Thanks for the response, Alex. The reshape has about 2400 minutes left
to run, and I have no idea how long the rebuild will take.

I will check out those commands once I am back up and running. I am
fairly new to mdadm, so I am still finding out all the useful commands
for troubleshooting issues. Thanks for pointing these out to me.
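As a side note on the "minutes left" figure: the progress line in /proc/mdstat carries enough information to recompute it. The sketch below assumes the two counters in parentheses are 1K blocks and that speed is in K/sec, which is consistent with the numbers quoted in this thread; the function names are invented for the example, and the sample line is copied from the mdstat output above (joined onto one line).

```shell
#!/bin/sh
# Sample status line copied from the /proc/mdstat output quoted in this thread.
sample='[==============>......] reshape = 74.3% (1452929024/1953513984) finish=2545.2min speed=3276K/sec'

# Print "done total" block counts from a reshape/recovery status line.
reshape_counts() {
    printf '%s\n' "$1" | sed -n 's/.*(\([0-9][0-9]*\)\/\([0-9][0-9]*\)).*/\1 \2/p'
}

# Print progress in tenths of a percent, using integer arithmetic only.
reshape_permille() {
    set -- $(reshape_counts "$1")
    echo $(( $1 * 1000 / $2 ))
}

# Print whole minutes left: remaining blocks / speed (K/sec) / 60.
reshape_eta_minutes() {
    line=$1
    set -- $(reshape_counts "$line")
    speed=$(printf '%s\n' "$line" | sed -n 's/.*speed=\([0-9][0-9]*\)K\/sec.*/\1/p')
    echo $(( ($2 - $1) / $speed / 60 ))
}

pm=$(reshape_permille "$sample")
printf 'progress: %d.%d%%\n' $((pm / 10)) $((pm % 10))
printf 'estimated minutes left: %d\n' "$(reshape_eta_minutes "$sample")"
```

For the sample line this prints a progress of 74.3% and 2546 minutes left, close to the finish=2545.2min that the kernel reported; the small difference comes from the rounded speed value. To use it on a live system, pass the matching line from /proc/mdstat instead of `$sample`.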
end of thread, other threads:[~2011-10-30 22:15 UTC | newest]

Thread overview: 7+ messages
2011-10-30 14:57 Unable to restart reshape Michael Busby
2011-10-30 15:34 ` Michael Busby
2011-10-30 15:57 ` Michael Busby
2011-10-30 16:04 ` Michael Busby
2011-10-30 16:22 ` Michael Busby
2011-10-30 22:02 ` Alexander Kühn
2011-10-30 22:15 ` Michael Busby