From: Kasper Sandberg <lkml@metanurb.dk>
To: Gene Heskett <gene.heskett@gmail.com>
Cc: Mikael Pettersson <mikpe@it.uu.se>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Linux ide Mailing list <linux-ide@vger.kernel.org>
Subject: Re: Problem with ata layer in 2.6.24
Date: Tue, 29 Jan 2008 05:23:36 +0100 [thread overview]
Message-ID: <1201580616.12795.2.camel@localhost> (raw)
In-Reply-To: <200801281135.14555.gene.heskett@gmail.com>
On Mon, 2008-01-28 at 11:35 -0500, Gene Heskett wrote:
> On Monday 28 January 2008, Mikael Pettersson wrote:
> >Gene Heskett writes:
> > > On Monday 28 January 2008, Peter Zijlstra wrote:
> > > >On Mon, 2008-01-28 at 09:17 +0100, Mikael Pettersson wrote:
> > > >> 1. Wrong mailing list; use linux-ide (@vger) instead.
> > > >
> > > >What, and keep all us other interested people in the dark?
> > >
> > > As a test, I tried rebooting to the latest fedora kernel and found it
> > > kills X, so I'm back to the second to last fedora version ATM, and the
> > > third 'smartctl -t lng /dev/sda' in 24 hours is running now. The first
> > > two completed with no errors.
> > >
> > > I've added the linux-ide list to refresh those people of the problem,
> > > the logs are being spammed by this message stanza:
> > >
> > > Jan 28 04:46:25 coyote kernel: [26550.290016] ata1.00: exception Emask
> > > 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen Jan 28 04:46:25 coyote kernel:
> > > [26550.290028] ata1.00: cmd 35/00:58:c9:9c:0a/00:01:00:00:00/e0 tag 0 dma
> > > 176128 out Jan 28 04:46:25 coyote kernel: [26550.290029] res
> > > 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Jan 28 04:46:25
> > > coyote kernel: [26550.290032] ata1.00: status: { DRDY } Jan 28 04:46:25
> > > coyote kernel: [26550.290060] ata1: soft resetting link Jan 28 04:46:25
> > > coyote kernel: [26550.452301] ata1.00: configured for UDMA/100 Jan 28
> > > 04:46:25 coyote kernel: [26550.452318] ata1: EH complete
> > > Jan 28 04:46:25 coyote kernel: [26550.455898] sd 0:0:0:0: [sda] 390721968
> > > 512-byte hardware sectors (200050 MB) Jan 28 04:46:25 coyote kernel:
> > > [26550.456151] sd 0:0:0:0: [sda] Write Protect is off Jan 28 04:46:25
> > > coyote kernel: [26550.456403] sd 0:0:0:0: [sda] Write cache: enabled,
> > > read cache: enabled, doesn't support DPO or FUA
> >
> >It's not obvious from this incomplete dmesg log what HW or driver
> >is behind ata1, but if the 2.6.24-rc7 kernel matches the 2.6.24 one,
> >
> >it should be pata_amd driving a WDC disk:
> > > [ 30.702887] pata_amd 0000:00:09.0: version 0.3.10
> > > [ 30.703052] PCI: Setting latency timer of device 0000:00:09.0 to 64
> > > [ 30.703188] scsi0 : pata_amd
> > > [ 30.709313] scsi1 : pata_amd
> > > [ 30.710076] ata1: PATA max UDMA/133 cmd 0x1f0 ctl 0x3f6 bmdma 0xf000
> > > irq 14 [ 30.710079] ata2: PATA max UDMA/133 cmd 0x170 ctl 0x376 bmdma
> > > 0xf008 irq 15 [ 30.864753] ata1.00: ATA-6: WDC WD2000JB-00EVA0,
> > > 15.05R15, max UDMA/100 [ 30.864756] ata1.00: 390721968 sectors, multi
> > > 16: LBA48
> > > [ 30.871629] ata1.00: configured for UDMA/100
> >
> >Unfortunately we also see:
> > > [ 48.285456] nvidia: module license 'NVIDIA' taints kernel.
> > > [ 48.549725] ACPI: PCI Interrupt 0000:02:00.0[A] -> Link [APC4] -> GSI
> > > 19 (level, high) -> IRQ 20 [ 48.550149] NVRM: loading NVIDIA UNIX x86
> > > Kernel Module 169.07 Thu Dec 13 18:42:56 PST 2007
> >
> >We have no way of debugging that module, so please try 2.6.24 without it.
>
> Sorry, I can't do this and have a working machine. The nv driver has suffered
> bit rot or something since the FC2 days when it COULD run a 19" crt at
> 1600x1200, and will not drive this 20" wide screen lcd 1680x1050 monitor at
> more than 800x600, which is absolutely butt ugly fuzzy, looking like a jpg
> compressed to 10%. The system is not usable on a day to basis without the
> nvidia driver.
>
> Fix the nv driver so it will run this screen at its native resolution and I'll
> be glad to run it even if it won't run google earth, which I do use from time
> to time. Now, if in all the hits you can get from google on this, currently
> 14,800 just for 'exception Emask', apparently caused by a timeout, if 100% of
> the complainers are running nvidia drivers also, then I see a legit
I can invalidate this theory...
i helped a guy on irc debug this problem, and he had ati. I tried having
him stop using fglrx, and go to r300.. same problem, and same problem
even with vesa.. :)
also, i have this on my fileserver with .20, which doesent even run X,
or module support in kernel :)
> complaint. Again, fix the nv driver so it will run my screen & I'll be glad
> to switch. I can see the reason, sure, but the machine must be capable of
> doing its common day to day stuff, while using that driver, like running kde
> for kmail, and browsers that work.
>
> >If the problems persist, please try to capture a complete log from the
> >failing kernel -- the interesting bits are everything from initial boot
> >up to and including the first few errors. You may need to increase the
> >kernel's log buffer size if the log gets truncated (CONFIG_LOG_BUF_SHIFT).
>
> If by log you mean /var/log/messages, I have several megabytes of those.
> If you mean a live dmesg capture taken right now, its attached. It contains
> several of these at the bottom. I long ago made the kernel log buffer
> bigger, cuz it couldn't even show the start immediately after the boot, and
> even the dump to syslog was truncated.
>
> >There are no pata_amd changes from 2.6.24-rc7 to 2.6.24 final.
>
> That is what I was afraid of. I've done some limited grepping in that branch
> of the kernel tree, and cannot seem to locate where this EH handler is being
> invoked from.
>
> There is 2 lines of interest in the dmesg:
>
> [ 0.000000] Nvidia board detected. Ignoring ACPI timer override.
> [ 0.000000] If you got timer trouble try acpi_use_timer_override
>
> But I have NDI what it means, kernel argument/xconfig option?
>
> I've also done some googling, and it appears this problem is fairly widespread
> since the switchover to libata was encouraged. A stock fedora F8 kernel
> suffers the same freezes and eventually locks up, but does it without the
> error messages being logged, it just freezes, feeling identical to this in
> the minutes before the total freeze. I've tried 2 of those too, but the
> newest one won't even run X.
>
next prev parent reply other threads:[~2008-01-29 4:24 UTC|newest]
Thread overview: 98+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-01-28 2:22 Problem with ata layer in 2.6.24 Gene Heskett
2008-01-28 3:19 ` Kasper Sandberg
2008-01-28 8:17 ` Mikael Pettersson
2008-01-28 12:03 ` Peter Zijlstra
2008-01-28 12:26 ` Mikael Pettersson
2008-01-28 12:45 ` Ingo Molnar
2008-01-28 12:54 ` Gene Heskett
2008-01-28 13:19 ` Gene Heskett
2008-01-28 13:57 ` Mikael Pettersson
2008-01-28 16:35 ` Gene Heskett
2008-01-28 16:50 ` Calvin Walton
2008-01-28 17:20 ` Zan Lynx
2008-01-28 17:30 ` Gene Heskett
2008-01-28 17:44 ` Gene Heskett
2008-01-28 17:59 ` Daniel Barkalow
2008-01-28 18:23 ` Richard Heck
2008-01-28 18:53 ` Andrey Borzenkov
2008-01-28 19:09 ` Gene Heskett
2008-01-28 19:21 ` Andrey Borzenkov
2008-01-28 19:31 ` Gene Heskett
2008-01-28 20:00 ` Richard Heck
2008-01-28 20:01 ` Daniel Barkalow
2008-01-29 0:05 ` Gene Heskett
2008-01-29 0:34 ` Daniel Barkalow
2008-01-29 1:31 ` Gene Heskett
2008-01-29 1:51 ` Daniel Barkalow
2008-01-29 4:48 ` Michal Jaegermann
2008-01-29 12:12 ` Alan Cox
2008-01-29 14:30 ` Gene Heskett
2008-01-29 14:51 ` Gene Heskett
2008-01-29 15:47 ` Alan Cox
2008-01-29 16:32 ` Gene Heskett
2008-01-29 16:48 ` Mikael Pettersson
2008-01-29 17:04 ` Gene Heskett
2008-01-29 17:38 ` Daniel Barkalow
2008-01-29 17:44 ` Alan Cox
2008-01-29 18:12 ` Daniel Barkalow
2008-01-29 17:59 ` Gene Heskett
2008-01-29 18:54 ` Alan Cox
2008-01-29 22:41 ` Gene Heskett
2008-01-29 22:48 ` Alan Cox
2008-01-30 0:19 ` rgheck
2008-01-30 0:19 ` Mark Lord
2008-01-29 17:06 ` rgheck
2008-01-29 17:12 ` Alan Cox
2008-01-29 17:24 ` rgheck
2008-01-29 17:40 ` Alan Cox
2008-01-29 18:11 ` Mark Lord
2008-01-29 18:28 ` rgheck
2008-01-29 18:32 ` Mark Lord
2008-01-29 18:14 ` Daniel Barkalow
2008-01-29 18:46 ` Alan Cox
2008-01-29 19:14 ` Daniel Barkalow
2008-01-29 19:34 ` Alan Cox
2008-01-28 16:56 ` Gene Heskett
2008-01-28 18:20 ` Mark Lord
2008-01-28 18:59 ` Gene Heskett
2008-01-28 20:43 ` Mark Lord
2008-01-29 0:06 ` Gene Heskett
2008-01-29 3:16 ` Mark Lord
2008-01-29 4:07 ` Gene Heskett
2008-01-28 17:06 ` Dave Neuer
2008-01-29 4:23 ` Kasper Sandberg [this message]
2008-01-29 4:49 ` Gene Heskett
2008-01-29 5:01 ` Kasper Sandberg
2008-02-02 7:13 ` Tejun Heo
2008-01-28 14:44 ` Richard Heck
2008-01-28 17:01 ` Gene Heskett
2008-01-28 18:38 ` Mark Lord
2008-01-28 20:01 ` Alan Cox
2008-01-28 20:29 ` Mark Lord
2008-01-28 18:54 ` Mark Lord
2008-01-28 19:01 ` Mark Lord
2008-01-28 19:04 ` Gene Heskett
2008-01-28 20:22 ` Mark Lord
2008-01-28 20:32 ` Mark Lord
2008-01-29 0:10 ` Gene Heskett
2008-01-28 19:08 ` Jeff Garzik
2008-01-28 19:13 ` Gene Heskett
2008-01-29 6:41 ` Florian Attenberger
2008-01-29 15:04 ` Gene Heskett
2008-01-29 16:12 ` Mark Lord
2008-01-29 16:36 ` Gene Heskett
2008-01-29 18:09 ` Mark Lord
2008-01-29 16:50 ` rgheck
2008-01-29 16:58 ` Jeff Garzik
2008-01-29 17:12 ` Gene Heskett
2008-01-29 17:32 ` Jeff Garzik
2008-01-29 17:53 ` Gene Heskett
[not found] <fa.YoSRdik0niRWE1jgfb9yTQPim5A@ifi.uio.no>
[not found] ` <fa.fCSKKy5aVmh9IomaWmc+Y8P16+c@ifi.uio.no>
[not found] ` <fa.ebGd1717PgKMzHUmaQOLoLkFdsI@ifi.uio.no>
2008-01-29 0:19 ` Robert Hancock
2008-01-29 0:55 ` Gene Heskett
2008-01-29 1:31 ` Robert Hancock
2008-01-29 1:51 ` Gene Heskett
2008-01-29 2:20 ` Gene Heskett
2008-01-29 3:21 ` Mark Lord
-- strict thread matches above, loose matches on Subject: below --
2008-01-29 17:42 Adam Turk
2008-01-29 17:55 ` Gene Heskett
2008-01-29 22:57 ` Adam Turk
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1201580616.12795.2.camel@localhost \
--to=lkml@metanurb.dk \
--cc=a.p.zijlstra@chello.nl \
--cc=gene.heskett@gmail.com \
--cc=linux-ide@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mikpe@it.uu.se \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.