From: NeilBrown <neilb@suse.de>
To: George Spelvin <linux@horizon.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: want-replacement got stuck?
Date: Thu, 22 Nov 2012 13:10:02 +1100 [thread overview]
Message-ID: <20121122131002.4944ce0d@notabene.brown> (raw)
In-Reply-To: <20121120221145.9905.qmail@science.horizon.com>
[-- Attachment #1: Type: text/plain, Size: 2621 bytes --]
On 20 Nov 2012 17:11:45 -0500 "George Spelvin" <linux@horizon.com> wrote:
> I have a RAID10 array with 4 active + 1 spare.
> Kernel is 3.6.5, x86-64 but running 32-bit unserland.
>
> After a recent failure on sdd2, the spare sdc2 was
> activated and things looked something like (manual edit,
> may not be perfectly faithful):
>
> md5 : active raid10 sdd2[4](F) sdb2[1] sde2[2] sdc2[3] sda2[0]
> 725591552 blocks 256K chunks 2 near-copies [4/4] [UUUU]
> bitmap: 50/173 pages [200KB], 2048KB chunk
>
> smartctl -A showed 1 pending sector, but badblocks didn't
> find it, so I decided to play with moving things back:
>
> # badblocks -s -v /dev/sdd2
> # mdadm /dev/md5 -r /dev/sdd2 -a /dev/sdd2
> # echo want_replacement > /sys/block/md5/md/dev-sdc2/state
>
> This ran for a while, but now it has stopped, with the following
> configuration:
>
> md5 : active raid10 sdd2[3](R) sdb2[1] sde2[2] sdc2[4](F) sda2[0]
> 725591552 blocks 256K chunks 2 near-copies [4/4] [UUU_]
> bitmap: 50/173 pages [200KB], 2048KB chunk
>
> # [530]# cat /sys/block/md5/md/dev-sd?2/state
> in_sync
> in_sync
> faulty,want_replacement
> in_sync,replacement
> in_sync
>
> I'm not quite sure how to interpret this state, and why it is showing
> "4/4" good drives but [UUU_].
"4/4" means the array is not degraded.
[UUU_] means that the drive in slot 3 is faulty.
The way this can happen without the array being degraded is that the
replacement is fully in-sync.
What has happened is the replacement finished perfectly and the want-replace
device was marked as faulty, but when md tried to remove that faulty device
it found that it was still active. Some request that has previously been
sent hadn't completed yet. So it couldn't remove it immediately.
Unfortunately it doesn't retry in any great hurry .. or possibly at all.
I'll have to look in to that and figure out the best fix.
...
> It appears to have completed:
> Nov 20 18:40:01 science kernel: md: md5: recovery done.
> Nov 20 18:40:01 science kernel: RAID10 conf printout:
> Nov 20 18:40:01 science kernel: --- wd:4 rd:4
> Nov 20 18:40:01 science kernel: disk 0, wo:0, o:1, dev:sda2
> Nov 20 18:40:01 science kernel: disk 1, wo:0, o:1, dev:sdb2
> Nov 20 18:40:01 science kernel: disk 2, wo:0, o:1, dev:sde2
> Nov 20 18:40:01 science kernel: disk 3, wo:1, o:0, dev:sdc2
>
> But as mentioned, the RAID state is a bit odd. sdc2 is still in the
> array and sdd2 is not.
Yes, it completed. The "conf printout" doesn't mention replacement devices
yet. I guess it should..
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
prev parent reply other threads:[~2012-11-22 2:10 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-20 22:11 want-replacement got stuck? George Spelvin
2012-11-21 16:33 ` George Spelvin
2012-11-21 16:41 ` Roman Mamedov
2012-11-21 18:08 ` George Spelvin
2012-11-21 19:21 ` joystick
2012-11-21 21:19 ` George Spelvin
2012-11-21 22:56 ` joystick
2012-11-22 3:25 ` George Spelvin
2012-11-22 4:22 ` NeilBrown
2012-11-22 5:27 ` George Spelvin
2012-11-22 5:39 ` George Spelvin
2012-11-22 5:47 ` NeilBrown
2012-11-22 6:45 ` George Spelvin
2012-11-22 11:30 ` George Spelvin
2012-11-22 2:15 ` NeilBrown
2012-11-22 2:10 ` NeilBrown [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20121122131002.4944ce0d@notabene.brown \
--to=neilb@suse.de \
--cc=linux-raid@vger.kernel.org \
--cc=linux@horizon.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).