From: ebiederm@xmission.com (Eric W. Biederman)
To: Ishikawa <ishikawa@yk.rim.or.jp>
Cc: linux-kernel@vger.kernel.org
Subject: Re: Q: boot time memory region recognition and clearing.
Date: 14 Jul 2002 14:26:37 -0600 [thread overview]
Message-ID: <m1u1n2oy2q.fsf@frodo.biederman.org> (raw)
In-Reply-To: <3D319AEA.10F6611D@yk.rim.or.jp>
Ishikawa <ishikawa@yk.rim.or.jp> writes:
> "Eric W. Biederman" wrote:
>
> > Note. The hardware ECC support in memtest86 3.0
> > is limited, so I would check to make certain your chipset
> > is supported..
> >
>
> Well, I read controller.c source file, and
> found that amd751 is supported according to the info
> in AMD751 documentation.
> (This is the same documentatin I used when I hacked
> early ecc.c source file.)
>
> I ran memtest v3 on the motherboard, and
> it didn't find any errors when I ran it overnight.
I wasn't able to verify it so the AMD751 was disabled by default.
>
> > Your objective is misguided. Even with a bios that
> > is slightly buggy in initializing the ECC bits, what you want is
> > scrubbing. Then if the error disappears after 5 minutes of uptime
> > you can ignore it.
>
> I see. Then the problem would boil down whether
> AMD751 supports scrubbingat the hardware level.
> It reports that it saw correctable single bit error,
> and I take it to mean that the chip itself
> has fixed the single bit error.
> Am I too optimistic to expect this?
The single bit error has been corrected in only one direction,
in the data going to the cpu. The data remaining in memory
was not corrected.
> The problem, though, is this.
> According to the AMD751 documentation, we
> can clear the memory error info in the chip register,
> by writing to a certain location of the PCI space.
> However, when one error get reported on my motherboard,
> it would NOT go away and I think there is something amiss
> about this.
That the location is frequently read and there is no hardware
scrubbing (writing the correct value back to ram).
> I am not sure if the AMD doc is correct about this now.
> from controller.c of memtest86.
>
> /* Clear the error status */
> pci_conf_write(ctrl.bus, ctrl.dev, ctrl.fn, 0x58, 2, 0);
>
> >From locally hacked ecc.c:
> /*
> * clear error flag bits that were set by writing 0 to
> them
> * we hope the error was a fluke or something :)
> */
> /* int value = eccstat & 0xFCFF; */
> /* pci_write_config_word(bridge, 0x58, value); */
> pci_write_config_byte(bridge, 0x58, 0x0);
> pci_write_config_byte(bridge, 0x59, 0x0);
Both of which match.
> BTW, I tried both the byte and word write to 0x58, but
> it never seemd to clear the error status.
> (It is possible that the error is real, but again
> the error is reported always in the first bank even if
> I rotate the memry sticks...)
Sounds like a questionable bit of documentation.
> > And if it comes back you know you really have
> > something to worry about. At least for single bit errors this should
> > fix the whole problem with something that is useful for other
> > purposes.
> >
> > Eric
>
> Thank you for your feedback. I noticed that you
> contributed to linuxBIOS and memtest86.
> Please keep the good work going!
Hopefully we can get the problems well enough understood in the community
that we can actually get some of this code fixed up and working well.
Eric
next prev parent reply other threads:[~2002-07-14 20:35 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2002-07-13 14:19 Q: boot time memory region recognition and clearing Chiaki
2002-07-14 8:20 ` Eric W. Biederman
2002-07-14 15:38 ` Ishikawa
2002-07-14 20:26 ` Eric W. Biederman [this message]
-- strict thread matches above, loose matches on Subject: below --
2002-07-18 2:08 Chiaki
2002-07-13 23:27 Chiaki
2002-07-13 10:08 Ishikawa
2002-07-13 14:15 ` Alan Cox
2002-07-13 15:09 ` Rogier Wolff
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=m1u1n2oy2q.fsf@frodo.biederman.org \
--to=ebiederm@xmission.com \
--cc=ishikawa@yk.rim.or.jp \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox