* Machine Check Exception !
@ 2005-12-27 2:52 Legend W.
2005-12-27 6:46 ` Nauman Tahir
2005-12-31 1:04 ` Alan Cox
0 siblings, 2 replies; 14+ messages in thread
From: Legend W. @ 2005-12-27 2:52 UTC (permalink / raw)
To: linux-kernel
Hello,
I get the following message under 2.4.21 from RedHat:
CPU 3: Machine Check Exception: 0000000000000004
<Bank 0: b20000001040080f
and the box is dead.
When I run parsemce, it says:
Status: (4) Machine Check in progress.
Restart IP invalid.
parsebank(0): b20000001040080f @ 3
External tag parity error
CPU state corrupt. Restart not possible
Error enabled in control register
Error not corrected.
Bus and interconnect error
Participation: Local processor originated request
Timeout: Request did not timeout
Request: Generic error
Transaction type : Invalid
Memory/IO : Other
Can anybody please enlighten me what this means or what a possible
problem behind might be?
Thank you in advance
PS: my box has dual Xeon 2.8G CPU
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception !
2005-12-27 2:52 Machine Check Exception ! Legend W.
@ 2005-12-27 6:46 ` Nauman Tahir
2005-12-27 17:51 ` Lee Revell
2005-12-31 1:04 ` Alan Cox
1 sibling, 1 reply; 14+ messages in thread
From: Nauman Tahir @ 2005-12-27 6:46 UTC (permalink / raw)
To: Legend W.; +Cc: linux-kernel
On 12/27/05, Legend W. <mrwangxc@gmail.com> wrote:
> Hello,
>
> I get the following message under 2.4.21 from RedHat:
>
> CPU 3: Machine Check Exception: 0000000000000004
> <Bank 0: b20000001040080f
>
> and the box is dead.
>
> When I run parsemce, it says:
> Status: (4) Machine Check in progress.
> Restart IP invalid.
> parsebank(0): b20000001040080f @ 3
> External tag parity error
> CPU state corrupt. Restart not possible
> Error enabled in control register
> Error not corrected.
> Bus and interconnect error
> Participation: Local processor originated request
> Timeout: Request did not timeout
> Request: Generic error
> Transaction type : Invalid
> Memory/IO : Other
>
> Can anybody please enlighten me what this means or what a possible
> problem behind might be?
>
> Thank you in advance
>
> PS: my box has dual Xeon 2.8G CPU
If you want to make your machine run anyway, add "nomce" to the kernel
command line of the relevant GRUB entry.
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception !
2005-12-27 6:46 ` Nauman Tahir
@ 2005-12-27 17:51 ` Lee Revell
0 siblings, 0 replies; 14+ messages in thread
From: Lee Revell @ 2005-12-27 17:51 UTC (permalink / raw)
To: Nauman Tahir; +Cc: Legend W., linux-kernel
On Tue, 2005-12-27 at 11:46 +0500, Nauman Tahir wrote:
> On 12/27/05, Legend W. <mrwangxc@gmail.com> wrote:
> > Hello,
> >
> > I get the following message under 2.4.21 from RedHat:
> >
> > CPU 3: Machine Check Exception: 0000000000000004
> > <Bank 0: b20000001040080f
> >
> > and the box is dead.
> >
> > When I run parsemce, it says:
> > Status: (4) Machine Check in progress.
> > Restart IP invalid.
> > parsebank(0): b20000001040080f @ 3
> > External tag parity error
> > CPU state corrupt. Restart not possible
> > Error enabled in control register
> > Error not corrected.
> > Bus and interconnect error
> > Participation: Local processor originated request
> > Timeout: Request did not timeout
> > Request: Generic error
> > Transaction type : Invalid
> > Memory/IO : Other
> >
> > Can anybody please enlighten me what this means or what a possible
> > problem behind might be?
> >
> > Thank you in advance
> >
> > PS: my box has dual Xeon 2.8G CPU
>
> If you want to make your machine run anyway, add "nomce" to the kernel
> command line of the relevant GRUB entry.
This is a terrible idea. MCEs indicate some kind of hardware problem;
it would be idiotic to just ignore that.
Figure out the hardware problem and fix it (bad RAM, overheating, poorly
seated card, etc).
Lee
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception !
2005-12-27 2:52 Machine Check Exception ! Legend W.
2005-12-27 6:46 ` Nauman Tahir
@ 2005-12-31 1:04 ` Alan Cox
1 sibling, 0 replies; 14+ messages in thread
From: Alan Cox @ 2005-12-31 1:04 UTC (permalink / raw)
To: Legend W.; +Cc: linux-kernel
On Tue, 2005-12-27 at 10:52 +0800, Legend W. wrote:
> parsebank(0): b20000001040080f @ 3
> External tag parity error
> CPU state corrupt. Restart not possible
> Error enabled in control register
> Error not corrected.
> Bus and interconnect error
> Participation: Local processor originated request
> Timeout: Request did not timeout
> Request: Generic error
> Transaction type : Invalid
> Memory/IO : Other
>
> Can anybody please enlighten me what this means or what a possible
> problem behind might be?
Executive summary - your hardware is broken. In this case it's reporting
a parity error on external tag bits - presumably cache tag bits. "Contact
your system vendor for advice" as they say 8)
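As a rough cross-check of the parsemce output quoted above, the
architectural flag bits of the bank status word can be pulled apart by
hand. The following is a minimal Python sketch, assuming the bit layout
of IA32_MCi_STATUS and of the "bus and interconnect" compound error code
as described in the Intel SDM. The function names are made up for
illustration, and the model-specific bits (which is where "External tag
parity error" comes from) are not interpreted.

```python
# Sketch only: bit positions assumed from the Intel SDM machine-check
# architecture chapter. decode_bank_status / decode_bus_error are
# hypothetical helper names, not part of parsemce or the kernel.

FLAGS = {
    63: "VAL   (register contents valid)",
    62: "OVER  (an earlier error was overwritten)",
    61: "UC    (error not corrected)",
    60: "EN    (error enabled in control register)",
    59: "MISCV (MISC register valid)",
    58: "ADDRV (ADDR register valid)",
    57: "PCC   (processor context corrupt)",
}


def decode_bank_status(status):
    """Return the set flags and the two 16-bit error code fields."""
    set_flags = [
        name
        for bit, name in sorted(FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    return {
        "flags": set_flags,
        "mca_error_code": status & 0xFFFF,               # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


def decode_bus_error(code):
    """Split a bus/interconnect code laid out as 000F 1PPT RRRR IILL.

    Returns None if the code is not of that form. Bit 12 (F) is ignored.
    """
    if code & 0xE800 != 0x0800:
        return None
    return {
        "participation": (code >> 9) & 0x3,  # PP, 0 = local processor originated
        "timeout": bool((code >> 8) & 1),    # T
        "request": (code >> 4) & 0xF,        # RRRR, 0 = generic error
        "memory_io": (code >> 2) & 0x3,      # II, 3 = other
        "level": code & 0x3,                 # LL
    }


result = decode_bank_status(0xB20000001040080F)
for flag in result["flags"]:
    print(flag)
print(hex(result["mca_error_code"]), decode_bus_error(result["mca_error_code"]))
```

For b20000001040080f this reports VAL, UC, EN and PCC as set, which lines
up with "Error not corrected", "Error enabled in control register" and
"CPU state corrupt"; the low 16 bits are 0x080f, a bus and interconnect
code with no timeout and a generic request.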
^ permalink raw reply [flat|nested] 14+ messages in thread
* RE: Machine Check Exception ?
@ 2002-01-25 18:37 Dana Lacoste
2002-01-25 18:48 ` Dave Jones
0 siblings, 1 reply; 14+ messages in thread
From: Dana Lacoste @ 2002-01-25 18:37 UTC (permalink / raw)
To: 'Stephan von Krawczynski'; +Cc: linux-kernel
I used to get these all the time as well (with a very
similar hardware setup) and although I have never
identified exactly what was wrong (still using 2.2.x)
I don't get them any more after doing this :
1 - switched from IDE to SCSI
2 - changed RAM vendors (yes, this was unpleasant)
and, most significantly :
3 - made sure the BIOS had the correct microcode update
for the CPU. The one it had was out of date, and
changing to the latest from Intel solved a LOT of
instability issues.
> -----Original Message-----
> From: Stephan von Krawczynski [mailto:skraw@ithnet.com]
> Sent: January 25, 2002 11:48
> To: Marcel Kunath
> Cc: linux-kernel@vger.kernel.org
> Subject: Re: Machine Check Exception ?
>
>
> On Fri, 25 Jan 2002 07:37:24 -0500 (EST)
> "Marcel Kunath" <kunathma@pilot.msu.edu> wrote:
>
> > What's the mobo?
>
> OK, here we go:
>
> diehard:~ # lspci
> 00:00.0 Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX
> Host bridge (rev 03)
> 00:01.0 PCI bridge: Intel Corporation 440BX/ZX - 82443BX/ZX
> AGP bridge (rev 03)
> 00:04.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
> 00:04.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01)
> 00:04.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01)
> 00:04.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
> 00:09.0 SCSI storage controller: Symbios Logic Inc. (formerly
> NCR) 53c810 (rev 23)
> 00:0a.0 PCI bridge: Intel Corporation 80960RP [i960 RP
> Microprocessor/Bridge] (rev 05)
> 00:0a.1 RAID bus controller: Mylex Corporation DAC960PX (rev 05)
> 00:0b.0 Ethernet controller: 3Com Corporation 3c905C-TX [Fast
> Etherlink] (rev 74)
> 01:00.0 VGA compatible controller: S3 Inc. 86c368 [Trio
> 3D/2X] (rev 02)
>
> Regards,
> Stephan
^ permalink raw reply [flat|nested] 14+ messages in thread
* RE: Machine Check Exception ?
2002-01-25 18:37 Machine Check Exception ? Dana Lacoste
@ 2002-01-25 18:48 ` Dave Jones
0 siblings, 0 replies; 14+ messages in thread
From: Dave Jones @ 2002-01-25 18:48 UTC (permalink / raw)
To: Dana Lacoste; +Cc: 'Stephan von Krawczynski', linux-kernel
On Fri, 25 Jan 2002, Dana Lacoste wrote:
> I don't get them any more after doing this :
> 1 - switched from IDE to SCSI
> 2 - changed RAM vendors (yes, this was unpleasant)
> and, most significantly :
> 3 - made sure the BIOS had the correct microcode update
> for the CPU. the one it had was out of date, and
> changing to the latest from Intel solved a LOT of
> instability issues....
Flaky RAM tends to be one of the more common triggers
of these exceptions, so (2) above was more than likely your
cause as opposed to (3). (1) seems incredibly unlikely
unless it had an adverse effect on power drain.
--
| Dave Jones. http://www.codemonkey.org.uk
| SuSE Labs
^ permalink raw reply [flat|nested] 14+ messages in thread
* Machine Check Exception ?
@ 2002-01-25 10:47 Stephan von Krawczynski
2002-01-25 12:37 ` Marcel Kunath
` (2 more replies)
0 siblings, 3 replies; 14+ messages in thread
From: Stephan von Krawczynski @ 2002-01-25 10:47 UTC (permalink / raw)
To: linux-kernel
Hello,
I get the following message under 2.2.19:
Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
and the box is dead.
Can anybody please enlighten me what this means or what a possible
problem behind might be?
Thank you in advance
Stephan
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 10:47 Stephan von Krawczynski
@ 2002-01-25 12:37 ` Marcel Kunath
2002-01-25 16:36 ` Stephan von Krawczynski
2002-01-25 16:47 ` Stephan von Krawczynski
2002-01-25 12:40 ` Denis Oliver Kropp
2002-01-25 13:17 ` Dave Jones
2 siblings, 2 replies; 14+ messages in thread
From: Marcel Kunath @ 2002-01-25 12:37 UTC (permalink / raw)
To: Stephan von Krawczynski; +Cc: linux-kernel
What's the mobo? What do you mean by "the box is dead"? Dead, or stalled on
boot?
mk
> Hello,
>
> I get the following message under 2.2.19:
>
> Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
> diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
>
> and the box is dead.
> Can anybody please enlighten me what this means or what a possible
> problem behind might be?
>
> Thank you in advance
> Stephan
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 12:37 ` Marcel Kunath
@ 2002-01-25 16:36 ` Stephan von Krawczynski
2002-01-25 17:22 ` Dave Jones
2002-01-25 16:47 ` Stephan von Krawczynski
1 sibling, 1 reply; 14+ messages in thread
From: Stephan von Krawczynski @ 2002-01-25 16:36 UTC (permalink / raw)
To: Marcel Kunath; +Cc: linux-kernel
On Fri, 25 Jan 2002 07:37:24 -0500 (EST)
"Marcel Kunath" <kunathma@pilot.msu.edu> wrote:
> What's the mobo?
I will answer later; the machine is remote, so I have to check. It is UP
(uniprocessor), btw.
> What do you mean by "the box is dead"? Dead, or stalled on
> boot?
It boots and runs for quite a while (weeks), then suddenly freezes and
shows this message.
It happens only very rarely.
Does the number have any meaning, or is it a goof?
Regards,
Stephan
> > Hello,
> >
> > I get the following message under 2.2.19:
> >
> > Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
> > diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
> >
> > and the box is dead.
> > Can anybody please enlighten me what this means or what a possible
> > problem behind might be?
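On the question of whether the number means anything: on these kernels
the value printed after "Machine Check Exception:" appears to be the
global machine-check status register (MCG_STATUS). Below is a minimal
Python sketch of decoding its three architectural low bits, assuming the
layout given in the Intel SDM; the helper name is hypothetical.

```python
# Sketch only: assumes the printed value is MCG_STATUS and that bits 0..2
# have the meanings given in the Intel SDM. decode_mcg_status is a
# made-up name for this example.

MCG_STATUS_BITS = {
    0: "RIPV",  # restart IP valid: execution can resume at the saved IP
    1: "EIPV",  # error IP valid: the saved IP points at the faulting instruction
    2: "MCIP",  # machine check in progress
}


def decode_mcg_status(value):
    """Map each known flag name to True/False for the given register value."""
    return {name: bool((value >> bit) & 1) for bit, name in MCG_STATUS_BITS.items()}


print(decode_mcg_status(0x0000000000000004))
```

Under those assumptions 0000000000000004 has only MCIP set: a machine
check is in progress and the restart IP is not valid, so the CPU cannot
simply resume. The value on its own does not say which component failed;
that information is in the per-bank status registers.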
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 12:37 ` Marcel Kunath
2002-01-25 16:36 ` Stephan von Krawczynski
@ 2002-01-25 16:47 ` Stephan von Krawczynski
1 sibling, 0 replies; 14+ messages in thread
From: Stephan von Krawczynski @ 2002-01-25 16:47 UTC (permalink / raw)
To: Marcel Kunath; +Cc: linux-kernel
On Fri, 25 Jan 2002 07:37:24 -0500 (EST)
"Marcel Kunath" <kunathma@pilot.msu.edu> wrote:
> What's the mobo?
OK, here we go:
diehard:~ # lspci
00:00.0 Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX Host bridge (rev 03)
00:01.0 PCI bridge: Intel Corporation 440BX/ZX - 82443BX/ZX AGP bridge (rev 03)
00:04.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
00:04.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01)
00:04.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01)
00:04.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
00:09.0 SCSI storage controller: Symbios Logic Inc. (formerly NCR) 53c810 (rev 23)
00:0a.0 PCI bridge: Intel Corporation 80960RP [i960 RP Microprocessor/Bridge] (rev 05)
00:0a.1 RAID bus controller: Mylex Corporation DAC960PX (rev 05)
00:0b.0 Ethernet controller: 3Com Corporation 3c905C-TX [Fast Etherlink] (rev 74)
01:00.0 VGA compatible controller: S3 Inc. 86c368 [Trio 3D/2X] (rev 02)
Regards,
Stephan
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 10:47 Stephan von Krawczynski
2002-01-25 12:37 ` Marcel Kunath
@ 2002-01-25 12:40 ` Denis Oliver Kropp
2002-01-25 13:17 ` Dave Jones
2 siblings, 0 replies; 14+ messages in thread
From: Denis Oliver Kropp @ 2002-01-25 12:40 UTC (permalink / raw)
To: Stephan von Krawczynski; +Cc: linux-kernel
Quoting Stephan von Krawczynski (skraw@ithnet.com):
> Hello,
>
> I get the following message under 2.2.19:
>
> Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
> diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
Hi,
I had the same error sometimes, during heavy load (compiling).
I replaced a memory module with another one and it hasn't crashed since,
so far at least ;)
--
Best regards,
Denis Oliver Kropp
.------------------------------------------.
| DirectFB - Hardware accelerated graphics |
| http://www.directfb.org/ |
"------------------------------------------"
convergence integrated media GmbH
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 10:47 Stephan von Krawczynski
2002-01-25 12:37 ` Marcel Kunath
2002-01-25 12:40 ` Denis Oliver Kropp
@ 2002-01-25 13:17 ` Dave Jones
2002-01-25 13:26 ` Denis Oliver Kropp
2 siblings, 1 reply; 14+ messages in thread
From: Dave Jones @ 2002-01-25 13:17 UTC (permalink / raw)
To: Stephan von Krawczynski; +Cc: linux-kernel
On Fri, Jan 25, 2002 at 11:47:18AM +0100, Stephan von Krawczynski wrote:
> Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
> diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
>
> and the box is dead.
> Can anybody please enlighten me what this means or what a possible
> problem behind might be?
Typically a hardware problem. Some older systems generate them
spuriously though, which is why we have a "nomce" boot option.
--
| Dave Jones. http://www.codemonkey.org.uk
| SuSE Labs
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Machine Check Exception ?
2002-01-25 13:17 ` Dave Jones
@ 2002-01-25 13:26 ` Denis Oliver Kropp
0 siblings, 0 replies; 14+ messages in thread
From: Denis Oliver Kropp @ 2002-01-25 13:26 UTC (permalink / raw)
To: Dave Jones, Stephan von Krawczynski, linux-kernel
Quoting Dave Jones (davej@suse.de):
> On Fri, Jan 25, 2002 at 11:47:18AM +0100, Stephan von Krawczynski wrote:
> > Message from syslogd@diehard at Thu Jan 24 14:44:49 2002 ...
> > diehard kernel: CPU 0: Machine Check Exception: 0000000000000004
> >
> > and the box is dead.
> > Can anybody please enlighten me what this means or what a possible
> > problem behind might be?
>
> Typically a hardware problem. Some older systems generate them
> spuriously though, which is why we have a "nomce" boot option.
My system here is a P3 800 Coppermine with Infineon RAM.
After removing that module the error no longer occurred. Linux 2.4.17.
--
Best regards,
Denis Oliver Kropp
.------------------------------------------.
| DirectFB - Hardware accelerated graphics |
| http://www.directfb.org/ |
"------------------------------------------"
convergence integrated media GmbH
^ permalink raw reply [flat|nested] 14+ messages in thread
Thread overview: 14+ messages
2005-12-27 2:52 Machine Check Exception ! Legend W.
2005-12-27 6:46 ` Nauman Tahir
2005-12-27 17:51 ` Lee Revell
2005-12-31 1:04 ` Alan Cox
-- strict thread matches above, loose matches on Subject: below --
2002-01-25 18:37 Machine Check Exception ? Dana Lacoste
2002-01-25 18:48 ` Dave Jones
2002-01-25 10:47 Stephan von Krawczynski
2002-01-25 12:37 ` Marcel Kunath
2002-01-25 16:36 ` Stephan von Krawczynski
2002-01-25 17:22 ` Dave Jones
2002-01-25 16:47 ` Stephan von Krawczynski
2002-01-25 12:40 ` Denis Oliver Kropp
2002-01-25 13:17 ` Dave Jones
2002-01-25 13:26 ` Denis Oliver Kropp