* CPU Context corruption @ 2004-09-11 11:19 Nigel Kukard 2004-09-11 12:40 ` Alan Cox 0 siblings, 1 reply; 4+ messages in thread From: Nigel Kukard @ 2004-09-11 11:19 UTC (permalink / raw) To: linux-kernel What does this error mean? CPU 0: Machine Check Exception: 0000000000000004 Bank 0: 820000001040080F I have a Matsonic 9097c motherboard, 2.4Ghz prescott celeron cpu. This error seems to be random. We have replaced the motherboard & cpu to no avail. -Nigel ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: CPU Context corruption 2004-09-11 11:19 CPU Context corruption Nigel Kukard @ 2004-09-11 12:40 ` Alan Cox 2004-09-12 6:36 ` Willy Tarreau 0 siblings, 1 reply; 4+ messages in thread From: Alan Cox @ 2004-09-11 12:40 UTC (permalink / raw) To: Nigel Kukard; +Cc: Linux Kernel Mailing List On Sad, 2004-09-11 at 12:19, Nigel Kukard wrote: > What does this error mean? > > > CPU 0: Machine Check Exception: 0000000000000004 > Bank 0: 820000001040080F > > > I have a Matsonic 9097c motherboard, 2.4Ghz prescott celeron cpu. This > error seems to be random. We have replaced the motherboard & cpu to no > avail. It normally indicates a hardware problem. The precise meaning of all the bits is in the Intel chip docs (volume 3). If you've swapped the mainboard/cpu it might just be bad RAM. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: CPU Context corruption 2004-09-11 12:40 ` Alan Cox @ 2004-09-12 6:36 ` Willy Tarreau 2004-09-12 7:57 ` Nigel Kukard 0 siblings, 1 reply; 4+ messages in thread From: Willy Tarreau @ 2004-09-12 6:36 UTC (permalink / raw) To: Alan Cox; +Cc: Nigel Kukard, Linux Kernel Mailing List On Sat, Sep 11, 2004 at 01:40:55PM +0100, Alan Cox wrote: > On Sad, 2004-09-11 at 12:19, Nigel Kukard wrote: > > What does this error mean? > > > > > > CPU 0: Machine Check Exception: 0000000000000004 > > Bank 0: 820000001040080F > > > > > > I have a Matsonic 9097c motherboard, 2.4Ghz prescott celeron cpu. This > > error seems to be random. We have replaced the motherboard & cpu to no > > avail. > > It normally indicates a hardware problem. The precise meaning of all the > bits is in the Intel chip docs (volume 3). If you've swapped the > mainboard/cpu it might just be bad RAM. He can also get precise info with Dave Jones' parsemce tool : http://www.kernel.org/pub/linux/kernel/people/davej/tools/ It currently says : Status: (4) Machine Check in progress. Restart IP invalid. parsebank(0): 820000001040080f @ 0 External tag parity error CPU state corrupt. Restart not possible Bus and interconnect error Participation: Local processor originated request Timeout: Request did not timeout Request: Generic error Transaction type : Invalid Memory/IO : Other Since it says it's neither memory nor I/O, I think it might be related to a PCI parity error with some card, either during transfers or config access. Regards, Willy ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: CPU Context corruption 2004-09-12 6:36 ` Willy Tarreau @ 2004-09-12 7:57 ` Nigel Kukard 0 siblings, 0 replies; 4+ messages in thread From: Nigel Kukard @ 2004-09-12 7:57 UTC (permalink / raw) To: Willy Tarreau; +Cc: Alan Cox, Linux Kernel Mailing List, daniel [-- Attachment #1: Type: text/plain, Size: 2148 bytes --] > > > CPU 0: Machine Check Exception: 0000000000000004 > > > Bank 0: 820000001040080F > > > > It normally indicates a hardware problem. The precise meaning of all the > > bits is in the Intel chip docs (volume 3). If you've swapped the > > mainboard/cpu it might just be bad RAM. > > He can also get precise info with Dave Jones' parsemce tool : > > http://www.kernel.org/pub/linux/kernel/people/davej/tools/ > yep, i found this a few hours ago... and got the same result as below > It currently says : > > Status: (4) Machine Check in progress. > Restart IP invalid. > parsebank(0): 820000001040080f @ 0 > External tag parity error > CPU state corrupt. Restart not possible > Bus and interconnect error > Participation: Local processor originated request > Timeout: Request did not timeout > Request: Generic error > Transaction type : Invalid > Memory/IO : Other > > Since it says it's neither memory nor I/O, I think it might be related to > a PCI parity error with some card, either during transfers or config access. > Thats the interesting part, I have 2 cards plugged in... 1 x SIS6326 AGP graphics card, and 1 x Connexant PCI modem card. If I put the modem card into the left most pci slot, I get a bios IRQ routing error. I've replaced the AGP card with another make, same problem, I've had my supplier replace the motherboard, same problem. Replaced the CPU, same problem. I'm going to take the system back to the supplier tomorrow and ask him to sort it out.... I'm not going to tell him anything apart from "It does not work", its useless explaining to him the above if he not going to understand a word of it. Could this be an incompatibility issue between the CPU and the mobo? Regards Nigel -- Nigel Kukard, PhD CompSc (Chief Executive Officer) Linux Based Systems Design (Non-Profit) Web: www.lbsd.net Email: nkukard@lbsd.net Tel: (+27) 023 349 8000 Cell: (+27) 082 333 3723 Fax: (+27) 023 349 1395 Support: 086 747 7600 Address: LIGT House, 2 Klipdrift Rd, Rawsonville Linux Systems Design & Technology Solutions [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2004-09-12 7:57 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2004-09-11 11:19 CPU Context corruption Nigel Kukard 2004-09-11 12:40 ` Alan Cox 2004-09-12 6:36 ` Willy Tarreau 2004-09-12 7:57 ` Nigel Kukard
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox