From: Niel Lambrechts <niel.lambrechts@gmail.com>
To: Tejun Heo <tj@kernel.org>
Cc: Alan Cox <alan@lxorguk.ukuu.org.uk>,
"linux.kernel" <linux-kernel@vger.kernel.org>,
Theodore Tso <tytso@mit.edu>
Subject: Re: 2.6.29 regression: ATA bus errors on resume
Date: Tue, 26 May 2009 07:43:47 +0200
Message-ID: <4A1B8193.1010703@gmail.com>
In-Reply-To: <4A1B76EB.9040500@kernel.org>

On 05/26/2009 06:58 AM, Tejun Heo wrote:
> Hello, Niel.
>
> Niel Lambrechts wrote:
>
>> I've tested all of the kernels I have again since 2.6.29.4 also came out
>> just recently. I did a hibernate/resume for each in the console, then
>> repeated the same in X, then continued to the next kernel.
>>
>> The 2.6.29.4 log is much larger, since some other badness happened there
>> - there is a large kernel trace in there as my first X hibernation
>> attempt failed and came back to X after a few seconds. The system seemed
>> functional, it did not keep generating kernel messages - when I then
>> retried a hibernate it worked, along with the resume. Another unrelated
>> bug perhaps?
>>
>> As for "hard resetting link" messages, they seemed to always happen
>> under X the times I tried it.
>>
>> Kernel      EXT4 errors?  Console: ata1 reset?  Console: ata2 reset?  X: ata1 reset?  X: ata2 reset?
>> 2.6.28.10   No            no                    yes                   yes             no
>> 2.6.29.4*   No            no                    no                    no              no
>> 2.6.29.4**  No            -                     -                     yes             no
>> 2.6.30-rc6  Yes           -                     -                     yes             no
>> 2.6.30-rc6  No            no                    no                    yes             no
>>
>> *  Xorg hibernation attempt failed.
>> ** Second Xorg hibernation attempt (no extra reboot).
>>
>> I also did a side by side comparison of the messages I have for
>> 2.6.30-rc6, the one with EXT4 errors I reported on yesterday, and
>> another one that worked just fine tonight. I stripped all time-stamps
>> and some pulseaudio messages from the bad one and attached them here,
>> and also saved the full messages for each kernel to
>> http://bugzilla.kernel.org/show_bug.cgi?id=13017 .
>>
>> Since analysing the code-path is still a bit beyond me, I'll leave you
>> with a little summary of the differences I notice.
>>
>> A = 2.6.30-rc6 (EXT4 clean)
>> B = 2.6.30-rc6 (EXT4 errors triggered)
>>
> Duplicate PHY events are likely to be dependent on timing and
> non-deterministic. The ext4 corrupting or not depends on whether a
> request with failfast set was in-flight at the time of the second PHY
> event, which again is dependent on timing. At any rate, this looks
> like a problem of ext4 (or something between ext4 and the driver). It
> either shouldn't issue failfast command or should take appropriate
> recovery action if it does. It would be really nice if you can give a
> shot at ext3.

Urgh. My root file system is mounted with extents enabled, so I would
have to re-install entirely.

I'm wondering why no one else is complaining, or whether the problem is
limited to ICH9M/M-E controllers with EXT4 or a certain type of
hard drive. The laptop is a Lenovo W500 (fairly similar to the T500), so
maybe not many people with this type of controller are using EXT4 yet.

Anyhow, I think Theodore may have already ruled this out as an EXT4
problem (I copied him initially), so I'm not sure what to do now. It will
take some strong will (and even more time) for me to re-install with
EXT3. I just shouldn't have to, dammit. :-p

Regards,
Niel