From: Rogier Wolff <R.E.Wolff@BitWizard.nl>
To: Chris Rankin <rankincj@yahoo.com>
Cc: linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: APIC error on SMP machine
Date: Wed, 1 Oct 2003 09:47:52 +0200 [thread overview]
Message-ID: <20031001074752.GC30137@bitwizard.nl> (raw)
In-Reply-To: <3F79F8BB.2080905@yahoo.com>
On Tue, Sep 30, 2003 at 10:42:19PM +0100, Chris Rankin wrote:
> Linux-2.4.22-SMP, 1 GB RAM, devfs, gcc-3.2.3.
>
> Hi,
>
> Today, my dual PIII (Coppermine) refused to boot, and wrote a large number
> of these messages to the serial console instead:
>
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> APIC error on CPU1: 04(04)
> Can anyone tell me what these might mean, please? The kernel source implies
> that it's a "Send accept error", but this doesn't help me in an "Ah, I can
> fix that!" sense.
I rewrote that code to make it spit out those messages that you
see. That however doesn't mean I know what I'm doing....
The APIC chip has a bit register that indicates errors. The kernel,
reads the register, stores it, and that should clear the error. Just to
be sure, we read it again, and store the result. Then we print the two
results.
In your case, the APIC seems to have a problem, and it doesn't go away
when we read the register, as it should.
On my "BP6" motherboard, I often see 04(08) errors: The error changes
after I read it once.
The code was printing the whole bitflag shebang before reading it again,
allowing the system to generate another error in the meanwhile, and
hanging the machine. To prevent this, I modified it to just print the
raw bits, trusting that you'd be knowledgable enough to grep through the
kernel sources to find the definitions of the bits. That proved true.
And as expected, you (just like me) don't know what to do with the
definition of that bit anyway.
On the BP6 it seems that the APIC bus is a bit noisy. So we get
transmission errors on that bus, allowing for a variety of errors on the
recieving end. In your case, the errors seem to end up happening faster
than the machine can handle :-(
Roger.
--
** R.E.Wolff@BitWizard.nl ** http://www.BitWizard.nl/ ** +31-15-2600998 **
*-- BitWizard writes Linux device drivers for any device you may have! --*
**** "Linux is like a wigwam - no windows, no gates, apache inside!" ****
next prev parent reply other threads:[~2003-10-01 7:48 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2003-09-30 21:42 APIC error on SMP machine Chris Rankin
2003-10-01 1:52 ` James Cleverdon
2003-10-01 10:14 ` Chris Rankin
2003-10-01 7:47 ` Rogier Wolff [this message]
-- strict thread matches above, loose matches on Subject: below --
2003-10-01 13:08 Matt_Domsch
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20031001074752.GC30137@bitwizard.nl \
--to=r.e.wolff@bitwizard.nl \
--cc=linux-kernel@vger.kernel.org \
--cc=rankincj@yahoo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox