public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Andi Kleen <ak@muc.de>
To: Alan Cox <alan@lxorguk.ukuu.org.uk>
Cc: Doug Thompson <norsk5@yahoo.com>,
	akpm@osdl.org, mm-commits@vger.kernel.org, norsk5@xmission.com,
	linux-kernel@vger.kernel.org
Subject: Re: + edac-new-opteron-athlon64-memory-controller-driver.patch added to -mm tree
Date: 6 Jul 2006 00:04:25 +0200
Date: Thu, 6 Jul 2006 00:04:25 +0200	[thread overview]
Message-ID: <20060705220425.GB83806@muc.de> (raw)
In-Reply-To: <1152137302.6533.28.camel@localhost.localdomain>

On Wed, Jul 05, 2006 at 11:08:21PM +0100, Alan Cox wrote:
> Ar Maw, 2006-07-04 am 13:34 +0200, ysgrifennodd Andi Kleen:
> > > > Giving a consistent sysfs interface is a bit harder, but I suppose one 
> > > > could change the code to provide pseudo banks for enable/disable too.
> > > > However that would be system specific again, so a default "all on/all off" 
> > > > policy might be quite ok.
> > > 
> > > I think we need the basic consistent sysfs case. Whether that is
> > 
> > What should i do?

s/i/it/ of course.

Basically what I asked for is what you think that sysfs interface
should do.

You want a single error / no error knob? 

The problem is that anything more detailed requires knowledge of the
specific hardware.

The single knob on standard MCE would be 

for i in /sys/devices/system/machinecheck/*/bank*
	echo 0 > $i
done

(or 0xfffffffffffffffff to turn everything on) 

What else?


What we identified as missing is a unified way for all hardware
to report how many errors and on which DIMMs. I think I can easily
add that to mcelog (it would already report it, but in a CPU
specific format) 

> 
> Well personally I would favour the MCE logging stuff staying in because
> its clearly small, compact and enough for many users, and the EDAC stuff
> hooking that feed somehow so that people who want the detail and the

As far as I can figure out there is no more detail offered by it at least
for K8.  All the information that is given by the Northbridge is in the MCE
and the rest for the DIMM topology is in SMBIOS (or could be read from user 
space if really needed) 

I went through a similar development myself BTW. When I wrote
the first Opteron machine check handler for 2.4 I also coded
access to the PCI device and read the registers there.
But later i realized that it's useless because the CPU shadows
all these registers into the regular machine check MSRs. So you
can just get it with a portable handler from there. When I redid
the handler i threw it all out.

Now you seem to want to add it in again ... 

Regarding non K8 x86-64 it would need more research, but I hop
they also dump everything into the MSRs.

> 
> As to filtering and control of the banks - that can always be done by
> filtering what is handed down from the MCE code if I understand it right
> so can be left in the EDAC side.

I think that should be done in user space.

-Andi

  reply	other threads:[~2006-07-05 22:04 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20060701150430.GA38488@muc.de>
     [not found] ` <20060703172633.50366.qmail@web50109.mail.yahoo.com>
     [not found]   ` <20060703184836.GA46236@muc.de>
     [not found]     ` <1151962114.16528.18.camel@localhost.localdomain>
2006-07-04  9:23       ` + edac-new-opteron-athlon64-memory-controller-driver.patch added to -mm tree Andi Kleen
2006-07-04 10:09         ` Alan Cox
2006-07-04 11:34           ` Andi Kleen
2006-07-05 22:08             ` Alan Cox
2006-07-05 22:04               ` Andi Kleen [this message]
2006-07-06  6:12                 ` Eric W. Biederman
2006-07-06 13:01                   ` Andi Kleen
2006-07-06 15:31                     ` Eric W. Biederman
2006-07-06 16:51                       ` Andi Kleen
2006-07-06 17:46                         ` Eric W. Biederman
2006-07-06 18:08                           ` Andi Kleen
2006-07-06 18:34                             ` Alan Cox
2006-07-06 18:27                               ` Andi Kleen
2006-07-06 19:09                                 ` Eric W. Biederman
2006-07-06 19:18                                   ` Andi Kleen
2006-07-06 19:43                                     ` Eric W. Biederman
2006-07-06 18:43                             ` Eric W. Biederman
2006-07-05 17:39         ` Doug Thompson
2006-07-05 19:39           ` Andi Kleen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20060705220425.GB83806@muc.de \
    --to=ak@muc.de \
    --cc=akpm@osdl.org \
    --cc=alan@lxorguk.ukuu.org.uk \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mm-commits@vger.kernel.org \
    --cc=norsk5@xmission.com \
    --cc=norsk5@yahoo.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox