From: "Simon Garner" <sgarner@expio.co.nz>
To: "Andi Kleen" <ak@colin2.muc.de>
Cc: <linux-kernel@vger.kernel.org>
Subject: Re: MSI K8D-Master - GART error 3
Date: Mon, 11 Aug 2003 10:43:57 +1200 [thread overview]
Message-ID: <003e01c35f91$08227b40$0401a8c0@SIMON> (raw)
In-Reply-To: 20030805134241.GA63394@colin2.muc.de
On Wednesday, August 06, 2003 1:42 AM [GMT+1200=NZT],
Andi Kleen <ak@colin2.muc.de> wrote:
>
> Ok that's the very old MCE code that incorrectly enabled the
> northbridge machine check. Don't use that or use mce=off. However I
> still think it's a driver bug in your case. If it was the shakey GART
> MCE itself you would get a panic because it's a unrecoverable MCE.
> More likely the driver is accessing PCI DMA mappings after they got
> unmapped, which is a serious bug, but somehow not serious enough that
> the northbridge triggers the MCE.
>
> I was confused by your statement that the SuSE 8.2 beta9 kernel
> generated that. It didn't because it doesn't contain that old code.
>
> What does a modern kernel like the SuSE one or a x86-64.org kernel
> generate exactly?
>
I have reinstalled SuSE now, and I apologise as I was only partially
correct. I do get errors, but they are slightly different from RH. They
appear to be saying the same thing, though. Every 30 seconds I get:
Aug 11 10:37:06 terra kernel: Northbridge status 9405c00000000a13
Aug 11 10:37:06 terra kernel: ECC syndrome bits b
Aug 11 10:37:06 terra kernel: extended error ecc error
Aug 11 10:37:06 terra kernel: link number 0
Aug 11 10:37:06 terra kernel: corrected ecc error
Aug 11 10:37:06 terra kernel: error address valid
Aug 11 10:37:06 terra kernel: error enable
Aug 11 10:37:06 terra kernel: previous error lost
Aug 11 10:37:06 terra kernel: error address 00000000003e4710
Aug 11 10:37:36 terra kernel: Northbridge status 9405c00000000813
Aug 11 10:37:36 terra kernel: ECC syndrome bits b
Aug 11 10:37:36 terra kernel: extended error ecc error
Aug 11 10:37:36 terra kernel: link number 0
Aug 11 10:37:36 terra kernel: corrected ecc error
Aug 11 10:37:36 terra kernel: error address valid
Aug 11 10:37:36 terra kernel: error enable
Aug 11 10:37:36 terra kernel: previous error lost
Aug 11 10:37:36 terra kernel: error address 00000000003c4220
These suggest it's just reporting ECC corrections. Why would it do this
exactly every 30 seconds? (or is that just the reporting interval?)
# uname -a
Linux terra 2.4.19-SMP #1 SMP Wed Jun 25 21:37:18 UTC 2003 x86_64
unknown unknown GNU/Linux
thanks for the help,
-Simon
next prev parent reply other threads:[~2003-08-10 22:45 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <gC1o.2gU.5@gated-at.bofh.it>
2003-08-05 0:11 ` MSI K8D-Master - GART error 3 Andi Kleen
2003-08-05 0:45 ` Simon Garner
2003-08-05 13:42 ` Andi Kleen
2003-08-10 22:43 ` Simon Garner [this message]
2003-08-10 22:56 ` Andi Kleen
2003-08-12 23:22 ` Simon Garner
2003-08-04 1:05 Simon Garner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='003e01c35f91$08227b40$0401a8c0@SIMON' \
--to=sgarner@expio.co.nz \
--cc=ak@colin2.muc.de \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.