* [PATCH] edac: i5100 ack error detection register after each read
@ 2011-12-09 16:12 Niklas Söderlund
2011-12-09 16:22 ` Borislav Petkov
0 siblings, 1 reply; 5+ messages in thread
From: Niklas Söderlund @ 2011-12-09 16:12 UTC (permalink / raw)
To: lucas.demarchi, borislav.petkov; +Cc: linux-kernel, Niklas Söderlund
If I only ack the detection register after a error have been detected
I'm unable to reliably detect errors. I have verified this behavior
using both an error injection DIMM and software to inject errors.
I can't find any documentation supporting this behavior in Intel 5100
Memory Controller Hub Chipset, see 1. So this is all based on
experimentation.
[1] Intel® 5100 Memory Controller Hub Chipset
http://www.intel.com/content/dam/doc/datasheet/5100-
memory-controller-hub-chipset-datasheet.pdf
Signed-off-by: Niklas Söderlund <niklas.soderlund@ericsson.com>
---
drivers/edac/i5100_edac.c | 11 ++++-------
1 files changed, 4 insertions(+), 7 deletions(-)
diff --git a/drivers/edac/i5100_edac.c b/drivers/edac/i5100_edac.c
index bcbdeec..ec728e9 100644
--- a/drivers/edac/i5100_edac.c
+++ b/drivers/edac/i5100_edac.c
@@ -535,23 +535,20 @@ static void i5100_read_log(struct mem_ctl_info *mci, int chan,
static void i5100_check_error(struct mem_ctl_info *mci)
{
struct i5100_priv *priv = mci->pvt_info;
- u32 dw;
-
+ u32 dw, dw2;
pci_read_config_dword(priv->mc, I5100_FERR_NF_MEM, &dw);
if (i5100_ferr_nf_mem_any(dw)) {
- u32 dw2;
pci_read_config_dword(priv->mc, I5100_NERR_NF_MEM, &dw2);
- if (dw2)
- pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM,
- dw2);
- pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
i5100_read_log(mci, i5100_ferr_nf_mem_chan_indx(dw),
i5100_ferr_nf_mem_any(dw),
i5100_nerr_nf_mem_any(dw2));
+
+ pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM, dw2);
}
+ pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
}
/* The i5100 chipset will scrub the entire memory once, then
--
1.7.7.3
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] edac: i5100 ack error detection register after each read
2011-12-09 16:12 [PATCH] edac: i5100 ack error detection register after each read Niklas Söderlund
@ 2011-12-09 16:22 ` Borislav Petkov
2012-02-21 10:26 ` Niklas Söderlund
0 siblings, 1 reply; 5+ messages in thread
From: Borislav Petkov @ 2011-12-09 16:22 UTC (permalink / raw)
To: Niklas Söderlund; +Cc: lucas.demarchi, linux-kernel, Tony Luck
Adding Tony.
On Fri, Dec 09, 2011 at 05:12:15PM +0100, Niklas Söderlund wrote:
> If I only ack the detection register after a error have been detected
> I'm unable to reliably detect errors. I have verified this behavior
> using both an error injection DIMM and software to inject errors.
>
> I can't find any documentation supporting this behavior in Intel 5100
> Memory Controller Hub Chipset, see 1. So this is all based on
> experimentation.
>
> [1] Intel® 5100 Memory Controller Hub Chipset
> http://www.intel.com/content/dam/doc/datasheet/5100-
> memory-controller-hub-chipset-datasheet.pdf
>
> Signed-off-by: Niklas Söderlund <niklas.soderlund@ericsson.com>
> ---
> drivers/edac/i5100_edac.c | 11 ++++-------
> 1 files changed, 4 insertions(+), 7 deletions(-)
>
> diff --git a/drivers/edac/i5100_edac.c b/drivers/edac/i5100_edac.c
> index bcbdeec..ec728e9 100644
> --- a/drivers/edac/i5100_edac.c
> +++ b/drivers/edac/i5100_edac.c
> @@ -535,23 +535,20 @@ static void i5100_read_log(struct mem_ctl_info *mci, int chan,
> static void i5100_check_error(struct mem_ctl_info *mci)
> {
> struct i5100_priv *priv = mci->pvt_info;
> - u32 dw;
> -
> + u32 dw, dw2;
>
> pci_read_config_dword(priv->mc, I5100_FERR_NF_MEM, &dw);
> if (i5100_ferr_nf_mem_any(dw)) {
> - u32 dw2;
>
> pci_read_config_dword(priv->mc, I5100_NERR_NF_MEM, &dw2);
> - if (dw2)
> - pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM,
> - dw2);
> - pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>
> i5100_read_log(mci, i5100_ferr_nf_mem_chan_indx(dw),
> i5100_ferr_nf_mem_any(dw),
> i5100_nerr_nf_mem_any(dw2));
> +
> + pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM, dw2);
> }
> + pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
> }
>
> /* The i5100 chipset will scrub the entire memory once, then
> --
> 1.7.7.3
>
>
--
Regards/Gruss,
Boris.
Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach
GM: Alberto Bozzo
Reg: Dornach, Landkreis Muenchen
HRB Nr. 43632 WEEE Registernr: 129 19551
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] edac: i5100 ack error detection register after each read
2011-12-09 16:22 ` Borislav Petkov
@ 2012-02-21 10:26 ` Niklas Söderlund
2012-02-22 0:58 ` Luck, Tony
0 siblings, 1 reply; 5+ messages in thread
From: Niklas Söderlund @ 2012-02-21 10:26 UTC (permalink / raw)
To: Tony Luck
Cc: Borislav Petkov, lucas.demarchi@profusion.mobi,
linux-kernel@vger.kernel.org
Hi Tony,
Is there any interest in this patch?
On 12/09/2011 05:22 PM, Borislav Petkov wrote:
> Adding Tony.
>
> On Fri, Dec 09, 2011 at 05:12:15PM +0100, Niklas Söderlund wrote:
>> If I only ack the detection register after a error have been detected
>> I'm unable to reliably detect errors. I have verified this behavior
>> using both an error injection DIMM and software to inject errors.
>>
>> I can't find any documentation supporting this behavior in Intel 5100
>> Memory Controller Hub Chipset, see 1. So this is all based on
>> experimentation.
>>
>> [1] Intel® 5100 Memory Controller Hub Chipset
>> http://www.intel.com/content/dam/doc/datasheet/5100-
>> memory-controller-hub-chipset-datasheet.pdf
>>
>> Signed-off-by: Niklas Söderlund<niklas.soderlund@ericsson.com>
>> ---
>> drivers/edac/i5100_edac.c | 11 ++++-------
>> 1 files changed, 4 insertions(+), 7 deletions(-)
>>
>> diff --git a/drivers/edac/i5100_edac.c b/drivers/edac/i5100_edac.c
>> index bcbdeec..ec728e9 100644
>> --- a/drivers/edac/i5100_edac.c
>> +++ b/drivers/edac/i5100_edac.c
>> @@ -535,23 +535,20 @@ static void i5100_read_log(struct mem_ctl_info *mci, int chan,
>> static void i5100_check_error(struct mem_ctl_info *mci)
>> {
>> struct i5100_priv *priv = mci->pvt_info;
>> - u32 dw;
>> -
>> + u32 dw, dw2;
>>
>> pci_read_config_dword(priv->mc, I5100_FERR_NF_MEM,&dw);
>> if (i5100_ferr_nf_mem_any(dw)) {
>> - u32 dw2;
>>
>> pci_read_config_dword(priv->mc, I5100_NERR_NF_MEM,&dw2);
>> - if (dw2)
>> - pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM,
>> - dw2);
>> - pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>>
>> i5100_read_log(mci, i5100_ferr_nf_mem_chan_indx(dw),
>> i5100_ferr_nf_mem_any(dw),
>> i5100_nerr_nf_mem_any(dw2));
>> +
>> + pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM, dw2);
>> }
>> + pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>> }
>>
>> /* The i5100 chipset will scrub the entire memory once, then
>> --
>> 1.7.7.3
>>
>>
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: [PATCH] edac: i5100 ack error detection register after each read
2012-02-21 10:26 ` Niklas Söderlund
@ 2012-02-22 0:58 ` Luck, Tony
2012-02-22 12:13 ` Mauro Carvalho Chehab
0 siblings, 1 reply; 5+ messages in thread
From: Luck, Tony @ 2012-02-22 0:58 UTC (permalink / raw)
To: Mauro Carvalho Chehab
Cc: Borislav Petkov, lucas.demarchi@profusion.mobi,
linux-kernel@vger.kernel.org, Niklas Söderlund
[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="utf-8", Size: 2634 bytes --]
Mauro: Since this has been tested, do you care that the documentation
doesn't appear to describe this behavior?
-Tony
-----Original Message-----
From: Niklas Söderlund [mailto:niklas.soderlund@ericsson.com]
Sent: Tuesday, February 21, 2012 2:27 AM
To: Luck, Tony
Cc: Borislav Petkov; lucas.demarchi@profusion.mobi; linux-kernel@vger.kernel.org
Subject: Re: [PATCH] edac: i5100 ack error detection register after each read
Hi Tony,
Is there any interest in this patch?
On 12/09/2011 05:22 PM, Borislav Petkov wrote:
> Adding Tony.
>
> On Fri, Dec 09, 2011 at 05:12:15PM +0100, Niklas Söderlund wrote:
>> If I only ack the detection register after a error have been detected
>> I'm unable to reliably detect errors. I have verified this behavior
>> using both an error injection DIMM and software to inject errors.
>>
>> I can't find any documentation supporting this behavior in Intel 5100
>> Memory Controller Hub Chipset, see 1. So this is all based on
>> experimentation.
>>
>> [1] Intel® 5100 Memory Controller Hub Chipset
>> http://www.intel.com/content/dam/doc/datasheet/5100-
>> memory-controller-hub-chipset-datasheet.pdf
>>
>> Signed-off-by: Niklas Söderlund<niklas.soderlund@ericsson.com>
>> ---
>> drivers/edac/i5100_edac.c | 11 ++++-------
>> 1 files changed, 4 insertions(+), 7 deletions(-)
>>
>> diff --git a/drivers/edac/i5100_edac.c b/drivers/edac/i5100_edac.c
>> index bcbdeec..ec728e9 100644
>> --- a/drivers/edac/i5100_edac.c
>> +++ b/drivers/edac/i5100_edac.c
>> @@ -535,23 +535,20 @@ static void i5100_read_log(struct mem_ctl_info *mci, int chan,
>> static void i5100_check_error(struct mem_ctl_info *mci)
>> {
>> struct i5100_priv *priv = mci->pvt_info;
>> - u32 dw;
>> -
>> + u32 dw, dw2;
>>
>> pci_read_config_dword(priv->mc, I5100_FERR_NF_MEM,&dw);
>> if (i5100_ferr_nf_mem_any(dw)) {
>> - u32 dw2;
>>
>> pci_read_config_dword(priv->mc, I5100_NERR_NF_MEM,&dw2);
>> - if (dw2)
>> - pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM,
>> - dw2);
>> - pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>>
>> i5100_read_log(mci, i5100_ferr_nf_mem_chan_indx(dw),
>> i5100_ferr_nf_mem_any(dw),
>> i5100_nerr_nf_mem_any(dw2));
>> +
>> + pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM, dw2);
>> }
>> + pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>> }
>>
>> /* The i5100 chipset will scrub the entire memory once, then
>> --
>> 1.7.7.3
>>
>>
>
ÿôèº{.nÇ+·®+%Ëÿ±éݶ\x17¥wÿº{.nÇ+·¥{±þG«éÿ{ayº\x1dÊÚë,j\a¢f£¢·hïêÿêçz_è®\x03(éÝ¢j"ú\x1a¶^[m§ÿÿ¾\a«þG«éÿ¢¸?¨èÚ&£ø§~á¶iOæ¬z·vØ^\x14\x04\x1a¶^[m§ÿÿÃ\fÿ¶ìÿ¢¸?I¥
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] edac: i5100 ack error detection register after each read
2012-02-22 0:58 ` Luck, Tony
@ 2012-02-22 12:13 ` Mauro Carvalho Chehab
0 siblings, 0 replies; 5+ messages in thread
From: Mauro Carvalho Chehab @ 2012-02-22 12:13 UTC (permalink / raw)
To: Luck, Tony
Cc: Borislav Petkov, lucas.demarchi@profusion.mobi,
linux-kernel@vger.kernel.org, Niklas Söderlund
Hi Tony,
Em 21-02-2012 22:58, Luck, Tony escreveu:
> Mauro: Since this has been tested, do you care that the documentation
> doesn't appear to describe this behavior?
On some chipsets, the documentation allows more than one way to read it.
Yet, the common behavior to ack for an error on those Intel 3xxxx/5xxx/7xxx
memory controllers is to do:
pci_read_config_dword(priv->mc, reg, &dw);
pci_write_config_dword(priv->mc, reg, dw);
Which is exactly what Niklas patch is doing, so it looks fine for me.
Acked-by: Mauro Carvalho Chehab <mchehab@redhat.com>
>
> -Tony
>
> -----Original Message-----
> From: Niklas Söderlund [mailto:niklas.soderlund@ericsson.com]
> Sent: Tuesday, February 21, 2012 2:27 AM
> To: Luck, Tony
> Cc: Borislav Petkov; lucas.demarchi@profusion.mobi; linux-kernel@vger.kernel.org
> Subject: Re: [PATCH] edac: i5100 ack error detection register after each read
>
> Hi Tony,
>
> Is there any interest in this patch?
>
> On 12/09/2011 05:22 PM, Borislav Petkov wrote:
>> Adding Tony.
>>
>> On Fri, Dec 09, 2011 at 05:12:15PM +0100, Niklas Söderlund wrote:
>>> If I only ack the detection register after a error have been detected
>>> I'm unable to reliably detect errors. I have verified this behavior
>>> using both an error injection DIMM and software to inject errors.
>>>
>>> I can't find any documentation supporting this behavior in Intel 5100
>>> Memory Controller Hub Chipset, see 1. So this is all based on
>>> experimentation.
>>>
>>> [1] Intel® 5100 Memory Controller Hub Chipset
>>> http://www.intel.com/content/dam/doc/datasheet/5100-
>>> memory-controller-hub-chipset-datasheet.pdf
>>>
>>> Signed-off-by: Niklas Söderlund<niklas.soderlund@ericsson.com>
>>> ---
>>> drivers/edac/i5100_edac.c | 11 ++++-------
>>> 1 files changed, 4 insertions(+), 7 deletions(-)
>>>
>>> diff --git a/drivers/edac/i5100_edac.c b/drivers/edac/i5100_edac.c
>>> index bcbdeec..ec728e9 100644
>>> --- a/drivers/edac/i5100_edac.c
>>> +++ b/drivers/edac/i5100_edac.c
>>> @@ -535,23 +535,20 @@ static void i5100_read_log(struct mem_ctl_info *mci, int chan,
>>> static void i5100_check_error(struct mem_ctl_info *mci)
>>> {
>>> struct i5100_priv *priv = mci->pvt_info;
>>> - u32 dw;
>>> -
>>> + u32 dw, dw2;
>>>
>>> pci_read_config_dword(priv->mc, I5100_FERR_NF_MEM,&dw);
>>> if (i5100_ferr_nf_mem_any(dw)) {
>>> - u32 dw2;
>>>
>>> pci_read_config_dword(priv->mc, I5100_NERR_NF_MEM,&dw2);
>>> - if (dw2)
>>> - pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM,
>>> - dw2);
>>> - pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>>>
>>> i5100_read_log(mci, i5100_ferr_nf_mem_chan_indx(dw),
>>> i5100_ferr_nf_mem_any(dw),
>>> i5100_nerr_nf_mem_any(dw2));
>>> +
>>> + pci_write_config_dword(priv->mc, I5100_NERR_NF_MEM, dw2);
>>> }
>>> + pci_write_config_dword(priv->mc, I5100_FERR_NF_MEM, dw);
>>> }
>>>
>>> /* The i5100 chipset will scrub the entire memory once, then
>>> --
>>> 1.7.7.3
>>>
>>>
>>
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2012-02-22 12:13 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-12-09 16:12 [PATCH] edac: i5100 ack error detection register after each read Niklas Söderlund
2011-12-09 16:22 ` Borislav Petkov
2012-02-21 10:26 ` Niklas Söderlund
2012-02-22 0:58 ` Luck, Tony
2012-02-22 12:13 ` Mauro Carvalho Chehab
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox