From: Robert Hancock <hancockr@shaw.ca>
To: Handle X <xhandle@gmail.com>
Cc: Vikas Kedia <kedia.vikas@gmail.com>, linux-kernel@vger.kernel.org
Subject: Re: Can we ignore errors in mcelog if the server is running fine
Date: Fri, 28 Jul 2006 02:34:41 -0600
Message-ID: <44C9CC21.9040609@shaw.ca>
In-Reply-To: <fa.9M8mPetEI5HZ8L2RMGPhKPm3gJA@ifi.uio.no>

Handle X wrote:
> On 7/27/06, Robert Hancock <hancockr@shaw.ca> wrote:
>> Vikas Kedia wrote:
>> > The server seems to be running fine. A. can I ignore the following
>> > mcelog errors ? B. If not what should i do to stop the server from
>> > reporting mcelog errors.
>>
>> Looks like data cache ECC errors, meaning the CPU 0 is faulty.
>> Eventually if it's not replaced there will likely be some uncorrectable
>> errors and the system will likely crash.
> 
> I am facing similar, but different errors.
> 
> [root@turyxsrv ~]# mcelog
> MCE 0
> HARDWARE ERROR. This is *NOT* a software problem!
> Please contact your hardware vendor
> CPU 1 4 northbridge TSC 89a560bb249
> ADDR 1dfa49690
>  Northbridge Chipkill ECC error
>  Chipkill ECC syndrome = 2021
>       bit46 = corrected ecc error
>  bus error 'local node response, request didn't time out
>      generic read mem transaction
>      memory access, level generic'
> STATUS 9410c00020080a13 MCGSTATUS 0

> Repeats whenever I do any kind of operations...
> How severe is ChipKill errors? Should I consider throwing away CPU 1
> and get another one.

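That sounds to me more like some of the RAM attached to CPU 1 is bad,
rather than the CPU itself: the log shows a corrected Chipkill ECC error
on a memory read, reported by the northbridge.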
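
The STATUS word in the log above can also be decoded by hand to check
whether the error was corrected. A rough Python sketch, assuming the
standard x86 machine-check MCi_STATUS flag layout (VAL, OVER, UC, EN,
MISCV, ADDRV, PCC in bits 63..57); the meaning of bit 46 is taken from the
mcelog output itself, and the name decode_status is only illustrative:

```python
# STATUS value copied from the mcelog output quoted above.
STATUS = 0x9410C00020080A13

# Assumed layout: architectural MCi_STATUS flag bits on x86.
FLAGS = {
    "VAL": 63,    # register contains a valid error
    "OVER": 62,   # an earlier error was overwritten
    "UC": 61,     # error was NOT corrected by hardware
    "EN": 60,     # reporting for this error was enabled
    "MISCV": 59,  # MISC register holds valid data
    "ADDRV": 58,  # ADDR register holds a valid address
    "PCC": 57,    # processor context may be corrupt
}


def decode_status(status):
    """Return the flag bits and the low 16-bit error code of a STATUS word."""
    fields = {name: bool((status >> bit) & 1) for name, bit in FLAGS.items()}
    # Bit 46 is labelled "corrected ecc error" by mcelog in the log above.
    fields["CECC"] = bool((status >> 46) & 1)
    fields["error_code"] = status & 0xFFFF
    return fields


decoded = decode_status(STATUS)
for name, value in decoded.items():
    print(name, hex(value) if name == "error_code" else value)
```

For this value UC and PCC are clear while bit 46 and ADDRV are set, which
is consistent with a corrected error at the address given in the ADDR line.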

-- 
Robert Hancock      Saskatoon, SK, Canada
To email, remove "nospam" from hancockr@nospamshaw.ca
Home Page: http://www.roberthancock.com/

