From: "Rafael J. Wysocki" <rjw@sisk.pl>
To: linux-pm@lists.linux-foundation.org
Cc: linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: Massive ext4 filesystem corruption after a failed s2disk/ram cycle
Date: Tue, 6 Oct 2009 23:58:32 +0200 [thread overview]
Message-ID: <200910062358.32417.rjw@sisk.pl> (raw)
In-Reply-To: <1254863215.11577.23.camel@maxim-laptop>
On Tuesday 06 October 2009, Maxim Levitsky wrote:
> Hi,
>
> Just prior to 2.6.32 cycle I tried -next tree and noticed that after a
> failed s2ram (here it works only once, and I test once in a whileto see
> if fixed accidentally) I got a minor filesystem corruption. I am sorry I
> didn't report that back then.
>
> Now I have installed 2.6.32-rc2 (well -rc1...) and things were sort of
> ok, I have even thought that hibernation is once again stable
> (somewhere in the not that distinct past the hibernation which used to
> work, began to fail randomly on resume)
>
> Few days ago, I got a read-only filesystem again, an fsck, few more
> corrupted files..., It should have had rung the bell for me (I have
> still used hibernation, trying to understand why it fails sometimes)
>
> Yesterday, however, I have decided to fix that once and for all, and for
> that I have set up a loop + rtc wakealarm to make it cycle through
> hibernation.
>
> Needless to say I didn't run that loop more that maybe 3 cycles (and no
> failures), but noticed that rtc clock is dead on resume.
>
> I sort of fixed that (this is hpet emulation that strikes again), I will
> post when I test the fix (trivial), because when I had rebooted the
> system into the modified kernel, I got that readonly filesystem again,
> and this time the damage had spread over lots of files.
> (I have even lost most of dpkg database..., many programs,
> libraries,..., settings)
>
> Yet, thanks to Linux flexibility, after a day, and some study of
> nautilus source, I had the system recovered fully.
> (Now am doing backups.....)
>
> But I don't want that to happen again...
>
> Another clue that I have seen was that ext4 driver reported that it
> aborts journal replay.
>
> I know that for now there is not much you can do, but just to let you
> know that something is there...
>
> What is especially interesting is that there were no s2ram'disk faulure
> preceding the corruption, but my theory is that corruption wasn't
> detected for a while from last failure, probably giving such bad
> consequences.
>
> You do sync file-systems before entering the hibernation, don't you?
Yes, a sync is there, but it is not effective on some filesystems.
Thanks,
Rafael
WARNING: multiple messages have this Message-ID (diff)
From: "Rafael J. Wysocki" <rjw@sisk.pl>
To: linux-pm@lists.linux-foundation.org
Cc: Maxim Levitsky <maximlevitsky@gmail.com>,
"linux-kernel" <linux-kernel@vger.kernel.org>
Subject: Re: [linux-pm] Massive ext4 filesystem corruption after a failed s2disk/ram cycle
Date: Tue, 6 Oct 2009 23:58:32 +0200 [thread overview]
Message-ID: <200910062358.32417.rjw@sisk.pl> (raw)
In-Reply-To: <1254863215.11577.23.camel@maxim-laptop>
On Tuesday 06 October 2009, Maxim Levitsky wrote:
> Hi,
>
> Just prior to 2.6.32 cycle I tried -next tree and noticed that after a
> failed s2ram (here it works only once, and I test once in a whileto see
> if fixed accidentally) I got a minor filesystem corruption. I am sorry I
> didn't report that back then.
>
> Now I have installed 2.6.32-rc2 (well -rc1...) and things were sort of
> ok, I have even thought that hibernation is once again stable
> (somewhere in the not that distinct past the hibernation which used to
> work, began to fail randomly on resume)
>
> Few days ago, I got a read-only filesystem again, an fsck, few more
> corrupted files..., It should have had rung the bell for me (I have
> still used hibernation, trying to understand why it fails sometimes)
>
> Yesterday, however, I have decided to fix that once and for all, and for
> that I have set up a loop + rtc wakealarm to make it cycle through
> hibernation.
>
> Needless to say I didn't run that loop more that maybe 3 cycles (and no
> failures), but noticed that rtc clock is dead on resume.
>
> I sort of fixed that (this is hpet emulation that strikes again), I will
> post when I test the fix (trivial), because when I had rebooted the
> system into the modified kernel, I got that readonly filesystem again,
> and this time the damage had spread over lots of files.
> (I have even lost most of dpkg database..., many programs,
> libraries,..., settings)
>
> Yet, thanks to Linux flexibility, after a day, and some study of
> nautilus source, I had the system recovered fully.
> (Now am doing backups.....)
>
> But I don't want that to happen again...
>
> Another clue that I have seen was that ext4 driver reported that it
> aborts journal replay.
>
> I know that for now there is not much you can do, but just to let you
> know that something is there...
>
> What is especially interesting is that there were no s2ram'disk faulure
> preceding the corruption, but my theory is that corruption wasn't
> detected for a while from last failure, probably giving such bad
> consequences.
>
> You do sync file-systems before entering the hibernation, don't you?
Yes, a sync is there, but it is not effective on some filesystems.
Thanks,
Rafael
next prev parent reply other threads:[~2009-10-06 21:58 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-10-06 21:06 Massive ext4 filesystem corruption after a failed s2disk/ram cycle Maxim Levitsky
2009-10-06 21:42 ` Theodore Tso
2009-10-06 21:42 ` Theodore Tso
2009-10-06 23:02 ` Maxim Levitsky
2009-10-06 23:02 ` Maxim Levitsky
2009-10-07 14:25 ` Jindrich Makovicka
2009-10-06 21:58 ` Rafael J. Wysocki [this message]
2009-10-06 21:58 ` [linux-pm] " Rafael J. Wysocki
2009-10-06 22:53 ` Henrique de Moraes Holschuh
2009-10-06 22:53 ` [linux-pm] " Henrique de Moraes Holschuh
2009-10-06 23:02 ` Rafael J. Wysocki
2009-10-06 23:02 ` [linux-pm] " Rafael J. Wysocki
2009-10-07 1:29 ` Henrique de Moraes Holschuh
2009-10-07 1:29 ` [linux-pm] " Henrique de Moraes Holschuh
2009-10-07 2:14 ` Daniel Pittman
2009-10-07 16:16 ` Christoph Hellwig
2009-10-07 16:16 ` [linux-pm] " Christoph Hellwig
2009-11-04 2:18 ` KOSAKI Motohiro
2009-11-04 2:18 ` [linux-pm] " KOSAKI Motohiro
2009-11-05 9:56 ` Henrique de Moraes Holschuh
2009-11-07 22:22 ` Thomas Fjellstrom
2009-11-05 9:56 ` Henrique de Moraes Holschuh
2009-11-08 8:29 ` Dave Chinner
2009-11-08 8:29 ` [linux-pm] " Dave Chinner
2009-11-08 16:49 ` Christoph Hellwig
2009-11-09 9:42 ` Henrique de Moraes Holschuh
2009-11-09 9:42 ` Henrique de Moraes Holschuh
2009-11-08 16:49 ` Christoph Hellwig
2009-10-07 2:14 ` Daniel Pittman
2009-10-10 3:26 ` ext4 filesystem corruption Maxim Levitsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=200910062358.32417.rjw@sisk.pl \
--to=rjw@sisk.pl \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@lists.linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.