From: NeilBrown <neilb@suse.de>
To: Albert Pauw <albert.pauw@gmail.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: More ddf container woes
Date: Tue, 15 Mar 2011 15:43:25 +1100 [thread overview]
Message-ID: <20110315154325.2e50c4ad@notabene.brown> (raw)
In-Reply-To: <4D7DD921.2060806@gmail.com>
On Mon, 14 Mar 2011 10:00:17 +0100 Albert Pauw <albert.pauw@gmail.com> wrote:
> Hi Neil,
>
> thanks, yes I noticed with the new git stuff some problems are fixed now.
>
> I noticed one more thing:
>
> When I look at the end of the output of the "mdadm -E /dev/md127" output I
> see it mentions the amount of phyiscal disks. When I fail a disk it is
> marked as
> "active/Offline, Failed" which is good. When I remove it, the amount of
> physical
> disks reported by the "mdadm -E" command stays the same, the RefNo is still
> there, the Size is still there, the Device file is removed and the state
> is still
> "active/Offline, Failed". The whole entry should be removed and the
> amount ofen
> physical disks lowered by one.
Well... maybe. Probably.
The DDF spec "requires" that there be an entry in the "physical disks"
table for every disk that is connected to the controller - whether failed
or not.
That makes some sense when you think about a hardware-RAID controller.
But how does that apply when DDF is running on a host system rather than
a RAID controller??
Maybe we should only remove them when they are physically unplugged??
There would probably be value in thinking through all of this a lot more
but for now I have arranged to remove any failed device that it not
part of an array (even a failed part).
You can find all of this in my git tree. I decided to back-port the
code from devel-3.2 which deletes devices from the DDF when you remove
them from a container - so you should find the code in the 'master'
branch works as well as that in 'devel-3.2'.
I would appreciate any more testing results that you come up with.
>
> When I re-add the failed disk (but NOT zeroed the superblock) the state
> is still
> "active/Offline, Failed" but reused for resynching a failed RAID set.
>
> Assuming that the failed state of a disk is also recorded in the
> superblock on the disk
> three different possibilities are likely when adding a disk:
>
> - A clean new disk, a new superblock is created with a new RefNo
> - A failed disk is added, use the failed state and RefNo
> - A good disk is added, possibly from a good RAID set, use this
> superblock with the
> RefNo and status. Make it possible to reassemble the RAID set when all
> the disks
> are added.
It currently seems to preserve the 'failed' state. While that may
not be ideal, it is not clearly 'wrong' and can be worked-around
by zeroing the metadata.
So I plan to leave it as it is for the moment.
I hope to put a bit of time in to sorting out some of these more subtle
issues next week - so you could well see progress in the future ...
especially if you have a brilliant idea about how it *should* work and manage
to convince me :-)
>
> Thanks for the fixes so far,
And thank you for your testing.
NeilBrown
next prev parent reply other threads:[~2011-03-15 4:43 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-02-19 11:13 mdadm ddf questions Albert Pauw
2011-02-22 7:41 ` Albert Pauw
2011-02-23 6:17 ` NeilBrown
2011-02-25 17:53 ` Albert Pauw
2011-03-02 22:31 ` NeilBrown
2011-03-10 8:34 ` More ddf container woes Albert Pauw
2011-03-11 11:50 ` Albert Pauw
2011-03-14 8:02 ` NeilBrown
2011-03-14 9:00 ` Albert Pauw
2011-03-15 4:43 ` NeilBrown [this message]
2011-03-15 19:07 ` Albert Pauw
2011-03-02 22:26 ` mdadm ddf questions NeilBrown
2011-03-02 22:11 ` NeilBrown
2011-03-04 7:52 ` Albert Pauw
-- strict thread matches above, loose matches on Subject: below --
2011-03-23 19:18 More ddf container woes Albert Pauw
2011-03-23 22:08 ` NeilBrown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110315154325.2e50c4ad@notabene.brown \
--to=neilb@suse.de \
--cc=albert.pauw@gmail.com \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).