* [lm-sensors] FB-DIMM temperature
@ 2007-01-11 15:34 Rudolf Marek
2007-01-11 15:57 ` Vadim Zeitlin
` (4 more replies)
0 siblings, 5 replies; 6+ messages in thread
From: Rudolf Marek @ 2007-01-11 15:34 UTC (permalink / raw)
To: lm-sensors
Hello,
Thanks for the improvements.
>
> #include <sys/mman.h>
> #include <sys/stat.h>
> #include <fcntl.h>
> #include <stdio.h>
>
> int main(void)
> {
> int i;
> unsigned char *addr;
> int fd = open("/dev/mem", O_RDONLY);
>
> if ( fd = -1 ) {
> perror("failed to open /dev/mem");
> return 1;
> }
>
> addr = mmap(0, 128*1024*1024, PROT_READ, MAP_PRIVATE, fd, 0xFE000000);
>
> if ( addr = MAP_FAILED ) {
> perror("mmap failed");
> return 1;
> }
>
> /* we want function 3, 4 channels, 16AMB/channel */
> for ( i = 0; i < 64; i++ ) {
> int idx = i*2048;
> if ( addr[idx] != 0xff || addr[idx+1] != 0xff ) {
> printf("Intel sig at %i: %02x %02x\n",
> i, addr[idx], addr[idx+1]);
We should see the 86 80
> /* offset 85h function 3 */
> printf("Current temp is: %fC\n", addr[idx+(256*3)+0x85]/2.);
> }
> }
>
> munmap(addr, 128*1024*1024);
>
> return 0;
> }
>
> and here are the results:
>
> # ./fbdt
> Intel sig at 0: 1d 11
> Current temp is: 71.500000C
> Intel sig at 16: 1d 11
> Current temp is: 0.000000C
> Intel sig at 32: 1d 11
> Current temp is: 61.000000C
> Intel sig at 48: 1d 11
> Current temp is: 0.000000C
>
Hmm there is something wrong. Please provide output of lspci -vvv
and lspci -xxx
Do you have the windows software somewhere to download?
Rudolf
^ permalink raw reply [flat|nested] 6+ messages in thread
* [lm-sensors] FB-DIMM temperature
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
@ 2007-01-11 15:57 ` Vadim Zeitlin
2007-01-11 16:57 ` Rudolf Marek
` (3 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: Vadim Zeitlin @ 2007-01-11 15:57 UTC (permalink / raw)
To: lm-sensors
On Thu, 11 Jan 2007 16:34:44 +0100 Rudolf Marek <r.marek at assembler.cz> wrote:
RM> > printf("Intel sig at %i: %02x %02x\n",
RM> > i, addr[idx], addr[idx+1]);
RM>
RM> We should see the 86 80
Ah, this is why it's called "Intel sig", I see...
RM> > and here are the results:
RM> >
RM> > # ./fbdt
RM> > Intel sig at 0: 1d 11
RM> > Current temp is: 71.500000C
RM> > Intel sig at 16: 1d 11
RM> > Current temp is: 0.000000C
RM> > Intel sig at 32: 1d 11
RM> > Current temp is: 61.000000C
RM> > Intel sig at 48: 1d 11
RM> > Current temp is: 0.000000C
RM> >
RM>
RM> Hmm there is something wrong.
FWIW, I've opened the box to try to cool the DIMMs a bit and now the
results are
# ./fbdt
Intel sig at 0: 1d 11
Current temp is: 65.000000C
Intel sig at 16: 1d 11
Current temp is: 0.000000C
Intel sig at 32: 1d 11
Current temp is: 52.000000C
Intel sig at 48: 1d 11
Current temp is: 54.000000C
so not only the values did diminish but there is also 54 instead of 0 for
the last one. So I wonder: could it be that the DIMMs really overheat and
the temp of the 2nd one is just out of the range?
RM> Please provide output of lspci -vvv and lspci -xxx
The files are relatively big so instead of posting them here I put them on
http://www.tt-solutions.com/vz/hardware/tyan2696/pci.ls
http://www.tt-solutions.com/vz/hardware/tyan2696/pci.dump
Please let me know if you need anything else. I'm afraid I don't really see
anything helpful in lspci output on my own.
RM> Do you have the windows software somewhere to download?
No, not really, I use SpeedFan (http://www.almico.com/speedfan.php) but I
don't know if it's capable of reading FB-DIMMs temperatures and, as I said,
I don't really want to install Windows on this machine just to try it out.
Thanks!
VZ
^ permalink raw reply [flat|nested] 6+ messages in thread
* [lm-sensors] FB-DIMM temperature
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
2007-01-11 15:57 ` Vadim Zeitlin
@ 2007-01-11 16:57 ` Rudolf Marek
2007-01-11 17:13 ` Vadim Zeitlin
` (2 subsequent siblings)
4 siblings, 0 replies; 6+ messages in thread
From: Rudolf Marek @ 2007-01-11 16:57 UTC (permalink / raw)
To: lm-sensors
Vadim Zeitlin wrote:
> On Thu, 11 Jan 2007 16:34:44 +0100 Rudolf Marek <r.marek at assembler.cz> wrote:
>
> RM> > printf("Intel sig at %i: %02x %02x\n",
> RM> > i, addr[idx], addr[idx+1]);
> RM>
> RM> We should see the 86 80
>
> Ah, this is why it's called "Intel sig", I see...
The 0x111d signature is company called IDT. They manufacture IDTAMB0480 which
may be used there. Question is if the register layout is same as in intel AMB
http://www.intel.com/design/chipsets/datashts/313072.htm
> RM> > and here are the results:
> RM> >
> RM> > # ./fbdt
> RM> > Intel sig at 0: 1d 11
> RM> > Current temp is: 71.500000C
> RM> > Intel sig at 16: 1d 11
> RM> > Current temp is: 0.000000C
> RM> > Intel sig at 32: 1d 11
> RM> > Current temp is: 61.000000C
> RM> > Intel sig at 48: 1d 11
> RM> > Current temp is: 0.000000C
> RM> >
> RM>
> RM> Hmm there is something wrong.
>
> FWIW, I've opened the box to try to cool the DIMMs a bit and now the
> results are
>
> # ./fbdt
> Intel sig at 0: 1d 11
> Current temp is: 65.000000C
> Intel sig at 16: 1d 11
> Current temp is: 0.000000C
> Intel sig at 32: 1d 11
> Current temp is: 52.000000C
> Intel sig at 48: 1d 11
> Current temp is: 54.000000C
>
> so not only the values did diminish but there is also 54 instead of 0 for
> the last one. So I wonder: could it be that the DIMMs really overheat and
> the temp of the 2nd one is just out of the range?
Yes it might. Please check the intel datasheet for overheat flags etc etc. It
might work for you. I asked our friend in Tyan for a help with that.
>
> RM> Please provide output of lspci -vvv and lspci -xxx
>
> The files are relatively big so instead of posting them here I put them on
>
> http://www.tt-solutions.com/vz/hardware/tyan2696/pci.ls
> http://www.tt-solutions.com/vz/hardware/tyan2696/pci.dump
>
> Please let me know if you need anything else. I'm afraid I don't really see
> anything helpful in lspci output on my own.
I was interrested in base address and it is correct.
00:10.0 Host bridge: Intel Corporation 5000 Series Chipset Error Reporting
Registers (rev 31)
00: 86 80 f0 25 00 00 00 00 31 00 00 06 00 00 80 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 f1 10 96 26
30: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
40: 00 00 ff 00 00 08 00 00 00 00 00 fe 00 00 00 00
^^^^^^^^^^^^^^^
Same as we use in our prog ;)
50: 00 00 02 00 00 00 04 04 00 10 11 01 00 00 13 11
60: 00 12 09 0d 00 e0 00 00 ff ff ff ff 00 00 00 00
70: 8c c0 e2 0f 00 00 00 00 8c c0 e2 0f 00 00 00 00
80: 01 00 80 00 00 00 00 00 00 00 00 00 00 00 00 00
90: 04 01 80 00 00 00 00 00 00 00 00 00 00 00 00 00
a0: 00 00 00 00 00 00 00 00 00 00 00 00 02 06 80 00
b0: 00 00 00 00 00 00 00 00 00 00 00 00 08 07 80 00
c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
d0: 00 00 00 00 00 00 00 00 00 00 00 00 11 01 00 00
e0: 00 00 00 00 00 00 00 80 00 00 00 00 00 00 00 00
f0: 5c 7f 48 00 06 a1 58 01 f4 08 d8 03 80 00 00 00
> RM> Do you have the windows software somewhere to download?
>
> No, not really, I use SpeedFan (http://www.almico.com/speedfan.php) but I
> don't know if it's capable of reading FB-DIMMs temperatures and, as I said,
> I don't really want to install Windows on this machine just to try it out.
I think they do not have support for this - it fairly new technology.
If you have time, maybe you may investigate the overtemperature bits. I think I
may have some time for this next week...
Rudolf
^ permalink raw reply [flat|nested] 6+ messages in thread
* [lm-sensors] FB-DIMM temperature
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
2007-01-11 15:57 ` Vadim Zeitlin
2007-01-11 16:57 ` Rudolf Marek
@ 2007-01-11 17:13 ` Vadim Zeitlin
2007-01-11 19:56 ` Vadim Zeitlin
2007-01-11 21:50 ` Rudolf Marek
4 siblings, 0 replies; 6+ messages in thread
From: Vadim Zeitlin @ 2007-01-11 17:13 UTC (permalink / raw)
To: lm-sensors
On Thu, 11 Jan 2007 17:57:38 +0100 Rudolf Marek <r.marek at assembler.cz> wrote:
RM> The 0x111d signature is company called IDT. They manufacture IDTAMB0480
RM> which may be used there.
FWIW the FB-DIMMs are 1GB PC5300 modules manufactured by Samsung but I
don't have the model number at hand (but I could look it up).
RM> Question is if the register layout is same as in intel AMB
RM>
RM> http://www.intel.com/design/chipsets/datashts/313072.htm
Thanks for the link! I didn't find anything as detailed about IDTAMB0480
though, all I could find was this:
http://www.idt.com/products/files/6186383/AMB0480_product_brief.pdf
Astonishingly, at least to me, the max temperature for these things seems
to be 124C: see page 12 of the above link or section 3.3.1 at page 25 of
http://download.intel.com/design/chipsets/designex/31306603.pdf
Do I really understand this correctly? Or should it be divided by 2?
I'll read Intel datasheets a bit more tonight, I'm sure they're ideal
bed-side reading ;-)
Regards,
VZ
^ permalink raw reply [flat|nested] 6+ messages in thread
* [lm-sensors] FB-DIMM temperature
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
` (2 preceding siblings ...)
2007-01-11 17:13 ` Vadim Zeitlin
@ 2007-01-11 19:56 ` Vadim Zeitlin
2007-01-11 21:50 ` Rudolf Marek
4 siblings, 0 replies; 6+ messages in thread
From: Vadim Zeitlin @ 2007-01-11 19:56 UTC (permalink / raw)
To: lm-sensors
On Thu, 11 Jan 2007 17:57:38 +0100 Rudolf Marek <r.marek at assembler.cz> wrote:
RM> > FWIW, I've opened the box to try to cool the DIMMs a bit and now the
RM> > results are
RM> >
RM> > # ./fbdt
RM> > Intel sig at 0: 1d 11
RM> > Current temp is: 65.000000C
RM> > Intel sig at 16: 1d 11
RM> > Current temp is: 0.000000C
RM> > Intel sig at 32: 1d 11
RM> > Current temp is: 52.000000C
RM> > Intel sig at 48: 1d 11
RM> > Current temp is: 54.000000C
RM> >
RM> > so not only the values did diminish but there is also 54 instead of 0 for
RM> > the last one.
I think I know at least why the temperature is sometimes 0: it's a timing
issue, apparently you can't read the sensors too quickly. If I insert
usleep(1000) between access to the signature bytes and the temperature value,
I can reliably read non zero values. I also looked at the values of the other
registers described in the section 14.5.3 of Intel datasheet, here is the
full output:
Sig at 0: 1d 11
Device id: 80 04
TEMPLO: 106.00C
TEMPMID: 109.00C
TEMPHI: 124.00C
TEMPSTAT: 0x1
TEMP: 68.50C
Sig at 16: 1d 11
Device id: 80 04
TEMPLO: 106.00C
TEMPMID: 109.00C
TEMPHI: 124.00C
TEMPSTAT: 0x1
TEMP: 68.00C
Sig at 32: 1d 11
Device id: 80 04
TEMPLO: 106.00C
TEMPMID: 109.00C
TEMPHI: 124.00C
TEMPSTAT: 0x1
TEMP: 59.00C
Sig at 48: 1d 11
Device id: 80 04
TEMPLO: 106.00C
TEMPMID: 109.00C
TEMPHI: 124.00C
TEMPSTAT: 0x1
TEMP: 63.50C
So, assuming I interpret TEMPLO/MID/HI correctly (I also divided their
values by 2 to get the temperature in degrees Celsius), it really does look
that these DIMMs are supposed to run quite hot.
Please let me know if I'm wrong (before I fry them :-),
VZ
^ permalink raw reply [flat|nested] 6+ messages in thread
* [lm-sensors] FB-DIMM temperature
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
` (3 preceding siblings ...)
2007-01-11 19:56 ` Vadim Zeitlin
@ 2007-01-11 21:50 ` Rudolf Marek
4 siblings, 0 replies; 6+ messages in thread
From: Rudolf Marek @ 2007-01-11 21:50 UTC (permalink / raw)
To: lm-sensors
Hello Vadim,
> I think I know at least why the temperature is sometimes 0: it's a timing
> issue, apparently you can't read the sensors too quickly. If I insert
> usleep(1000) between access to the signature bytes and the temperature value,
> I can reliably read non zero values.
Ah, that is not too convenient :/ Maybe we need to read only 4 bytes at once. I
must check the Northbridge datasheet for this. Also it might be worth to check
the 14.5.3.4 UPDATED: Update Temp Diff Bit maybe it is zero when this bit is set.
>I also looked at the values of the other
> registers described in the section 14.5.3 of Intel datasheet, here is the
> full output:
I have some news too. The registers must match by definition with the Intel
public document.
>
> Sig at 0: 1d 11
> Device id: 80 04
> TEMPLO: 106.00C
> TEMPMID: 109.00C
> TEMPHI: 124.00C
> TEMPSTAT: 0x1
> TEMP: 68.50C
> Sig at 16: 1d 11
> Device id: 80 04
> TEMPLO: 106.00C
> TEMPMID: 109.00C
> TEMPHI: 124.00C
> TEMPSTAT: 0x1
> TEMP: 68.00C
> Sig at 32: 1d 11
> Device id: 80 04
> TEMPLO: 106.00C
> TEMPMID: 109.00C
> TEMPHI: 124.00C
> TEMPSTAT: 0x1
> TEMP: 59.00C
> Sig at 48: 1d 11
> Device id: 80 04
> TEMPLO: 106.00C
> TEMPMID: 109.00C
> TEMPHI: 124.00C
> TEMPSTAT: 0x1
> TEMP: 63.50C
>
> So, assuming I interpret TEMPLO/MID/HI correctly (I also divided their
> values by 2 to get the temperature in degrees Celsius), it really does look
> that these DIMMs are supposed to run quite hot.
> Please let me know if I'm wrong (before I fry them :-),
No this are thresholds for the action, when system become too hot. Your temps
are around 70C now, so your system is safe.
No need to panic ;)
Rudolf
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2007-01-11 21:50 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-01-11 15:34 [lm-sensors] FB-DIMM temperature Rudolf Marek
2007-01-11 15:57 ` Vadim Zeitlin
2007-01-11 16:57 ` Rudolf Marek
2007-01-11 17:13 ` Vadim Zeitlin
2007-01-11 19:56 ` Vadim Zeitlin
2007-01-11 21:50 ` Rudolf Marek
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.