From: Michael Tokarev <mjt@tls.msk.ru>
To: Neil Brown <neilb@suse.de>
Cc: Alexandre Oliva <aoliva@redhat.com>,
linux-raid <linux-raid@vger.kernel.org>,
linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: modifying degraded raid 1 then re-adding other members is bad
Date: Tue, 08 Aug 2006 15:19:08 +0400 [thread overview]
Message-ID: <44D8732C.2060207@tls.msk.ru> (raw)
In-Reply-To: <17624.29070.246605.213021@cse.unsw.edu.au>
Neil Brown wrote:
> On Tuesday August 8, aoliva@redhat.com wrote:
>> Assume I have a fully-functional raid 1 between two disks, one
>> hot-pluggable and the other fixed.
>>
>> If I unplug the hot-pluggable disk and reboot, the array will come up
>> degraded, as intended.
>>
>> If I then modify a lot of the data in the raid device (say it's my
>> root fs and I'm running daily Fedora development updates :-), which
>> modifies only the fixed disk, and then plug the hot-pluggable disk in
>> and re-add its members, it appears that it comes up without resyncing
>> and, well, major filesystem corruption ensues.
>>
>> Is this a known issue, or should I try to gather more info about it?
>
> Looks a lot like
> http://bugzilla.kernel.org/show_bug.cgi?id=6965
>
> Attached are two patches. One against -mm and one against -linus.
>
> They are below.
>
> Please confirm if the appropriate one help.
>
> NeilBrown
>
> (-mm)
>
> Avoid backward event updates in md superblock when degraded.
>
> If we
> - shut down a clean array,
> - restart with one (or more) drive(s) missing
> - make some changes
> - pause, so that they array gets marked 'clean',
> the event count on the superblock of included drives
> will be the same as that of the removed drives.
> So adding the removed drive back in will cause it
> to be included with no resync.
>
> To avoid this, we only update the eventcount backwards when the array
> is not degraded. In this case there can (should) be no non-connected
> drives that we can get confused with, and this is the particular case
> where updating-backwards is valuable.
Why we're updating it BACKWARD in the first place?
Also, why, when we adding something to the array, the event counter is
checked -- should it resync regardless?
Thanks.
/mjt
next prev parent reply other threads:[~2006-08-08 11:19 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-08-08 9:38 modifying degraded raid 1 then re-adding other members is bad Alexandre Oliva
2006-08-08 11:12 ` Neil Brown
2006-08-08 11:19 ` Michael Tokarev [this message]
2006-08-08 19:17 ` Krzysztof Halasa
2006-08-08 22:33 ` Neil Brown
2006-08-08 22:30 ` Neil Brown
2006-08-09 6:35 ` Jan Engelhardt
2006-08-09 23:18 ` Neil Brown
2006-08-09 9:01 ` Helge Hafting
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=44D8732C.2060207@tls.msk.ru \
--to=mjt@tls.msk.ru \
--cc=aoliva@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.