public inbox for linux-ia64@vger.kernel.org
From: David Mosberger <davidm@napali.hpl.hp.com>
To: Matthias Fouquet-Lapar <mfl@kernel.paris.sgi.com>
Cc: davidm@hpl.hp.com, Andi Kleen <ak@suse.de>,
	davidm@napali.hpl.hp.com, iod00d@hp.com,
	ishii.hironobu@jp.fujitsu.com, linux-kernel@vger.kernel.org,
	linux-ia64@vger.kernel.org
Subject: Re: [RFC/PATCH, 1/4] readX_check() performance evaluation
Date: Thu, 29 Jan 2004 19:28:01 +0000
Message-ID: <16409.24257.589224.818006@napali.hpl.hp.com>
In-Reply-To: <200401290823.i0T8NTDi024477@mtv-vpn-hw-mfl-2.corp.sgi.com>

>>>>> On Thu, 29 Jan 2004 09:23:20 +0100 (CET), Matthias Fouquet-Lapar <mfl@kernel.paris.sgi.com> said:

  Matthias> We have done a rather large study with DIMMs that had SBEs
  Matthias> and have found no evidence that a SBE turns into a UCE,
  Matthias> i.e. the fact that a SBE is reported, is no indication
  Matthias> that the device might fail soon.

  Matthias> As a matter of fact the soft error rate increases as
  Matthias> parts use smaller process technologies and lower supply
  Matthias> voltages. Cosmic rays are one source of soft
  Matthias> errors. Another source is alpha particles emitted by the
  Matthias> solder.

Ehh, wait a second: you're saying that your study proved that if the
device isn't failing, it isn't failing. ;-) Of course you'll get noise
and perhaps even lots of it due to cosmic rays but this doesn't say
anything about the error pattern you see when a device _is_ failing
(e.g., due to overheating, over-clocking, or wrong voltage).  Or did
your study cover the cases where a system is operated in an
"out-of-spec" situation?

  Matthias> Still I think it's important to log SBEs, but you probably
  Matthias> will need a threshold in case you hit a hard SBE. Also
  Matthias> scrubbing the memory location (and re-reading the location
  Matthias> to check if the error was transient or not) might be a good
  Matthias> idea if the memory controller supports this.  If it is a
  Matthias> true, hard SBE it should be reported. It also might be a
  Matthias> good idea to mark the page, so it does not get
  Matthias> re-allocated.

Yes.  And once I finally received Andi's earlier mails (guess I have
to thank MyDoom for that... ;-( ), it was clear that nobody argued for
turning off the error reporting.  The issue was only whether or not to
log a message via printk() (which, in this case, clearly isn't a good
idea).  So I think we're all in violent agreement.

	--david
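
The SBE handling policy discussed in this message (count every SBE,
scrub and re-read to tell a transient error from a hard one, report
hard errors only up to a threshold, and mark the page so it is not
re-allocated) could be sketched roughly as below. This is a user-space
simulation, not kernel code and not anything proposed in the thread:
the page table, the names handle_sbe(), inject_sbe(), scrub() and the
SBE_LOG_THRESHOLD value are all hypothetical, and a real implementation
would depend on what the memory controller actually supports.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

#define SIM_PAGES 4          /* hypothetical: size of the simulated memory */
#define SBE_LOG_THRESHOLD 3  /* hypothetical: max reports per page */

enum sbe_kind { SBE_TRANSIENT, SBE_HARD };

struct page_state {
    bool stuck_bit;      /* simulated fault: the error survives a scrub */
    bool error_latched;  /* simulated ECC state: an SBE is pending */
    unsigned sbe_count;  /* every SBE seen on this page */
    unsigned logged;     /* reports actually emitted for this page */
    bool retired;        /* marked so the page is not re-allocated */
};

static struct page_state pages[SIM_PAGES];

/* Test helper: make the simulated hardware raise an SBE on a page. */
static void inject_sbe(size_t pfn, bool hard)
{
    assert(pfn < SIM_PAGES);
    pages[pfn].error_latched = true;
    pages[pfn].stuck_bit = hard;
}

/* Simulated scrub: writing back corrected data clears the error
 * unless the cell is genuinely broken. */
static void scrub(struct page_state *p)
{
    p->error_latched = p->stuck_bit;
}

/* Handle one SBE: count it, scrub, re-read, classify. Only hard
 * errors are reported, and only up to the per-page threshold, so a
 * stuck bit cannot flood the log. */
static enum sbe_kind handle_sbe(size_t pfn)
{
    assert(pfn < SIM_PAGES);
    struct page_state *p = &pages[pfn];

    p->sbe_count++;
    scrub(p);

    /* Re-read after the scrub: still failing means a hard SBE. */
    if (!p->error_latched)
        return SBE_TRANSIENT;

    p->retired = true;
    if (p->logged < SBE_LOG_THRESHOLD) {
        p->logged++;
        fprintf(stderr, "hard SBE on page %zu (count %u)\n",
                pfn, p->sbe_count);
    }
    return SBE_HARD;
}
```

Calling inject_sbe(1, true) followed by handle_sbe(1) repeatedly keeps
incrementing the page's sbe_count but stops emitting reports once the
threshold is reached, which is the rate-limiting behaviour the
threshold is meant to provide.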

