From: Nigel Cunningham <nigel@tuxonice.net>
To: NeilBrown <neilb@suse.de>
Cc: linux-raid@vger.kernel.org,
TuxOnIce users' list <tuxonice-users@lists.tuxonice.net>
Subject: Re: Repeatable md OOPS on suspend, 2.6.39.4 and 3.0.3
Date: Thu, 15 Sep 2011 14:18:08 +1000 [thread overview]
Message-ID: <4E717C80.20305@tuxonice.net> (raw)
In-Reply-To: <20110915053139.09dd6ae1@notabene.brown>
Hi.
On 15/09/11 13:31, NeilBrown wrote:
> On Thu, 15 Sep 2011 09:32:10 +1000 Nigel Cunningham <nigel@tuxonice.net>
> wrote:
>
>> Hi.
>>
>> Please try/review the attached patch.
>>
>> The problem is that TuxOnIce adds a BUG_ON() to catch non-TuxOnIce I/O
>> during hibernation, as a method of seeking to stop on-disk data getting
>> corrupted by the writing of data that has potentially been overwritten
>> by the atomic copy.
>>
>> Stopping the md devices from being marked readonly is the right thing to
>> do - if we don't resume, we want recovery to be run. If we do resume,
>> they should still be in the pre-hibernate state.
>>
>> Regards,
>>
>> Nigel
>
> This doesn't feel like the right approach to me.
>
> I think the 'md' device *should* be marked 'clean' when it is clean to
> avoid unnecessary resyncs.
I must be missing something. In raid terminology, what does 'clean'
mean? Googling gives me lots of references to flyspray :) I thought it
meant the filesystems contained therein were cleanly unmounted (which it
isn't in this case). Just 'cleanly shutdown'?
> It would almost certainly make sense to have a way to tell md 'hibernate
> wrote to your device so things might have changed - you should check'.
> Then md could look at the metadata and refresh any in-memory information
> such as device failures and event counts.
> After all if a device fails while writing out the hibernation image, we want
> the hibernation to succeed (I assume) and we want md to know that the device
> is failed when it wakes back up, and currently it won't. So we really need
> that notification anyway.
Now that I understand and agree with.
Regards,
Nigel
--
Evolution (n): A hypothetical process whereby improbable
events occur with alarming frequency, order arises from chaos, and
no one is given credit.
next prev parent reply other threads:[~2011-09-15 4:18 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <CALxPEfw-XdcosHemovkxj9aOwaYxUaQ6dj=yMaKCmoy2vkkUkg@mail.gmail.com>
[not found] ` <loom.20110901T032036-733@post.gmane.org>
[not found] ` <CALxPEfwJ99bvmSXkqSKutAYbcD9OLcyMcKQQO6G+Yxi9RQ1DFA@mail.gmail.com>
[not found] ` <CALxPEfyQrwWh1AWY-fwwCeMzDEeK390Eof9PwrsN6C11Wnt9=A@mail.gmail.com>
[not found] ` <loom.20110906T201030-97@post.gmane.org>
2011-09-09 12:55 ` Repeatable md OOPS on suspend, 2.6.39.4 and 3.0.3 Nix
2011-09-14 23:32 ` Nigel Cunningham
2011-09-15 3:31 ` [TuxOnIce-users] " NeilBrown
2011-09-15 4:18 ` Nigel Cunningham [this message]
2011-09-15 5:38 ` Nix
2011-09-15 5:34 ` Nix
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4E717C80.20305@tuxonice.net \
--to=nigel@tuxonice.net \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@suse.de \
--cc=tuxonice-users@lists.tuxonice.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.