public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
@ 2006-10-12 22:19 Jeremy Fitzhardinge
  2006-10-13  4:53 ` Len Brown
  2006-10-13 14:44 ` Pavel Machek
  0 siblings, 2 replies; 6+ messages in thread
From: Jeremy Fitzhardinge @ 2006-10-12 22:19 UTC (permalink / raw)
  To: Brown, Len, Pallipadi, Venkatesh
  Cc: acpi-devel, cpufreq, Linux Kernel Mailing List

I have a Thinkpad X60 with an Intel Core Duo T2400.  In 
/proc/acpi/thermal_zone, I'm getting two subdirectories, each with their 
own set of files:

/proc/acpi/thermal_zone/THM0/cooling_mode:
<setting not supported>
cooling mode:   critical

/proc/acpi/thermal_zone/THM0/polling_frequency:
<polling disabled>

/proc/acpi/thermal_zone/THM0/state:
state:                   ok

/proc/acpi/thermal_zone/THM0/temperature:
temperature:             53 C

/proc/acpi/thermal_zone/THM0/trip_points:
critical (S5):           127 C


/proc/acpi/thermal_zone/THM1/cooling_mode:
<setting not supported>
cooling mode:   passive

/proc/acpi/thermal_zone/THM1/polling_frequency:
<polling disabled>

/proc/acpi/thermal_zone/THM1/state:
state:                   ok

/proc/acpi/thermal_zone/THM1/temperature:
temperature:             53 C

/proc/acpi/thermal_zone/THM1/trip_points:
critical (S5):           97 C
passive:                 93 C: tc1=5 tc2=4 tsp=600 devices=0xf7eaa264 0xf7eaa244 


The interesting thing is that the two sets of files are not consistent - 
sometimes they don't even show the same temperature.

The reason I'm interested in this is that I think it's behind some of my 
cpufreq problems.  Sometimes the kernel decides that I just can't raise 
the max frequency above 1GHz, because its been thermally limited (I've 
put printks in to confirm that its the ACPI thermal limit on the policy 
notifier chain which is limiting the max speed).  It seems to me that 
having a thermal zone for each core is a BIOS bug, since they're really 
the same chip, but the THM1 entries should be ignored.  I don't believe 
the CPU has ever approached either 97 C, let alone 127; while I put it 
under a fair amount of load, it is sitting on a desktop with no airflow 
obstructions, so if it really is overheating it suggests a serious 
design problem with the hardware.

But I'm just speculating; I'm not really sure what all this means.  Any 
clues?

Thanks,
    J

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
       [not found] <fa.P/oAhFV0AVrh8PKSKzP+xVGih2s@ifi.uio.no>
@ 2006-10-13  3:57 ` Robert Hancock
  2006-10-13  4:28   ` Jeremy Fitzhardinge
  0 siblings, 1 reply; 6+ messages in thread
From: Robert Hancock @ 2006-10-13  3:57 UTC (permalink / raw)
  To: Jeremy Fitzhardinge, linux-kernel; +Cc: Brown, Len, Pallipadi, Venkatesh

Jeremy Fitzhardinge wrote:
> I have a Thinkpad X60 with an Intel Core Duo T2400.  In 
> /proc/acpi/thermal_zone, I'm getting two subdirectories, each with their 
> own set of files:
> 

So your machine has two thermal zones..

> The interesting thing is that the two sets of files are not consistent - 
> sometimes they don't even show the same temperature.

I would expect they wouldn't, otherwise there would be no reason for the 
BIOS people to set up two thermal zones..

> 
> The reason I'm interested in this is that I think it's behind some of my 
> cpufreq problems.  Sometimes the kernel decides that I just can't raise 
> the max frequency above 1GHz, because its been thermally limited (I've 
> put printks in to confirm that its the ACPI thermal limit on the policy 
> notifier chain which is limiting the max speed).  It seems to me that 
> having a thermal zone for each core is a BIOS bug, since they're really 
> the same chip, but the THM1 entries should be ignored.  I don't believe 

How do you know they are one for each core? ACPI thermal zones can be 
anywhere in the machine that needs OS-controlled cooling. Could be the 
CPU heatsink, voltage regulator, or someplace else.

> the CPU has ever approached either 97 C, let alone 127; while I put it 
> under a fair amount of load, it is sitting on a desktop with no airflow 
> obstructions, so if it really is overheating it suggests a serious 
> design problem with the hardware.
> 
> But I'm just speculating; I'm not really sure what all this means.  Any 
> clues?

I think we need more information to decide what is going on here.. what 
temperatures are registering in the thermal zones when the CPU clock is 
being limited?

-- 
Robert Hancock      Saskatoon, SK, Canada
To email, remove "nospam" from hancockr@nospamshaw.ca
Home Page: http://www.roberthancock.com/


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
  2006-10-13  3:57 ` Robert Hancock
@ 2006-10-13  4:28   ` Jeremy Fitzhardinge
  2006-10-13 18:10     ` Tomasz Torcz
  0 siblings, 1 reply; 6+ messages in thread
From: Jeremy Fitzhardinge @ 2006-10-13  4:28 UTC (permalink / raw)
  To: Robert Hancock; +Cc: linux-kernel, Brown, Len, Pallipadi, Venkatesh

Robert Hancock wrote:
> I would expect they wouldn't, otherwise there would be no reason for 
> the BIOS people to set up two thermal zones..

Ah, OK.  I misunderstood what thermal zones are.

> How do you know they are one for each core? ACPI thermal zones can be 
> anywhere in the machine that needs OS-controlled cooling. Could be the 
> CPU heatsink, voltage regulator, or someplace else.

Right, bad assumption on my part.  Is there any way to find out what 
they might correspond to?  /proc/acpi/ibm/thermal has a bunch of 
temperature-like numbers in them; I guess there should be some 
correlation between those and the thermal zones.

> I think we need more information to decide what is going on here.. 
> what temperatures are registering in the thermal zones when the CPU 
> clock is being limited?

I'll gather a bit more info.

    J

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
  2006-10-12 22:19 Strange entries in /proc/acpi/thermal_zone for Thinkpad X60 Jeremy Fitzhardinge
@ 2006-10-13  4:53 ` Len Brown
  2006-10-13 14:44 ` Pavel Machek
  1 sibling, 0 replies; 6+ messages in thread
From: Len Brown @ 2006-10-13  4:53 UTC (permalink / raw)
  To: cpufreq
  Cc: Jeremy Fitzhardinge, Pallipadi, Venkatesh, linux-acpi,
	Linux Kernel Mailing List

On Thursday 12 October 2006 18:19, Jeremy Fitzhardinge wrote:
> I have a Thinkpad X60 with an Intel Core Duo T2400.  In 
> /proc/acpi/thermal_zone, I'm getting two subdirectories, each with their 
> own set of files:
> 
> /proc/acpi/thermal_zone/THM0/cooling_mode:
> <setting not supported>
> cooling mode:   critical
> 
> /proc/acpi/thermal_zone/THM0/polling_frequency:
> <polling disabled>
> 
> /proc/acpi/thermal_zone/THM0/state:
> state:                   ok
> 
> /proc/acpi/thermal_zone/THM0/temperature:
> temperature:             53 C
> 
> /proc/acpi/thermal_zone/THM0/trip_points:
> critical (S5):           127 C


This means that if THM0 reaches 127, your system will shut down.
You don't have much control over this one -- but you could probably
lower the temperature  to do a critical shut-down earlier with something like this:

echo -n "126:125:124:123:122" >trip_points
note that the 1st is the critical one, and the others are hot:passive:active:active place holders.

> 
> /proc/acpi/thermal_zone/THM1/cooling_mode:
> <setting not supported>
> cooling mode:   passive
> 
> /proc/acpi/thermal_zone/THM1/polling_frequency:
> <polling disabled>
> 
> /proc/acpi/thermal_zone/THM1/state:
> state:                   ok
> 
> /proc/acpi/thermal_zone/THM1/temperature:
> temperature:             53 C
> 
> /proc/acpi/thermal_zone/THM1/trip_points:
> critical (S5):           97 C
> passive:                 93 C: tc1=5 tc2=4 tsp=600 devices=0xf7eaa264 0xf7eaa244 

You are not given the opportunity to set the active trip points here.
Looks like you have just a passive trip point at 93 and we would
expect to throttle when we go above 93.
Presumably some other method should be kicking in the fans before
this passive point is reached.

The theory is that...
If the fans kicked in earlier than you liked, you should be able to lower
the passive trip point to below that temperature to make the system
throttle before the fans kick in.

But probably the root cause of your issue is that the fans are _not_ kicking in...

for grins you can probably raise the passive point with something like this

# echo -n "98:97:96:53:45" > trip_points

but it seems that you are doing passive cooling way before  you
get anywhere near 93, so that is the mystery.

-Len

> 
> 
> The interesting thing is that the two sets of files are not consistent - 
> sometimes they don't even show the same temperature.
> 
> The reason I'm interested in this is that I think it's behind some of my 
> cpufreq problems.  Sometimes the kernel decides that I just can't raise 
> the max frequency above 1GHz, because its been thermally limited (I've 
> put printks in to confirm that its the ACPI thermal limit on the policy 
> notifier chain which is limiting the max speed).  It seems to me that 
> having a thermal zone for each core is a BIOS bug, since they're really 
> the same chip, but the THM1 entries should be ignored.  I don't believe 
> the CPU has ever approached either 97 C, let alone 127; while I put it 
> under a fair amount of load, it is sitting on a desktop with no airflow 
> obstructions, so if it really is overheating it suggests a serious 
> design problem with the hardware.
> 
> But I'm just speculating; I'm not really sure what all this means.  Any 
> clues?
> 
> Thanks,
>     J
> 
> _______________________________________________
> Cpufreq mailing list
> Cpufreq@lists.linux.org.uk
> http://lists.linux.org.uk/mailman/listinfo/cpufreq
> 

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
  2006-10-12 22:19 Strange entries in /proc/acpi/thermal_zone for Thinkpad X60 Jeremy Fitzhardinge
  2006-10-13  4:53 ` Len Brown
@ 2006-10-13 14:44 ` Pavel Machek
  1 sibling, 0 replies; 6+ messages in thread
From: Pavel Machek @ 2006-10-13 14:44 UTC (permalink / raw)
  To: Jeremy Fitzhardinge
  Cc: Brown, Len, Pallipadi, Venkatesh, acpi-devel, cpufreq,
	Linux Kernel Mailing List

Hi!

> I have a Thinkpad X60 with an Intel Core Duo T2400.  In 
> /proc/acpi/thermal_zone, I'm getting two subdirectories, 
> each with their own set of files:

Looks okay to me. One thermal zone is cpu temperature, and second is
temperature of something else.

> The interesting thing is that the two sets of files are 
> not consistent - sometimes they don't even show the same 
> temperature.

You have two (actually you have more, see tp_smapi) physical
thermometers.

> The reason I'm interested in this is that I think it's 
> behind some of my cpufreq problems.  Sometimes the 
> kernel decides that I just can't raise the max frequency 
> above 1GHz, because its been thermally limited (I've put 
> printks in to confirm that its the ACPI thermal limit on 
> the policy notifier chain which is limiting the max 
> speed).  It seems to me that having a thermal zone for 
> each core is a BIOS bug, since they're really the same 
> chip, but the THM1 entries should be ignored.  I don't 

THM1 does not seem to be cpu temperature.

							Pavel
-- 
Thanks for all the (sleeping) penguins.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Strange entries in /proc/acpi/thermal_zone for Thinkpad X60
  2006-10-13  4:28   ` Jeremy Fitzhardinge
@ 2006-10-13 18:10     ` Tomasz Torcz
  0 siblings, 0 replies; 6+ messages in thread
From: Tomasz Torcz @ 2006-10-13 18:10 UTC (permalink / raw)
  To: Jeremy Fitzhardinge
  Cc: Robert Hancock, linux-kernel, Brown, Len, Pallipadi, Venkatesh

[-- Attachment #1: Type: text/plain, Size: 1017 bytes --]

On Thu, Oct 12, 2006 at 09:28:12PM -0700, Jeremy Fitzhardinge wrote:
> Robert Hancock wrote:
> >I would expect they wouldn't, otherwise there would be no reason for 
> >the BIOS people to set up two thermal zones..
> 
> Ah, OK.  I misunderstood what thermal zones are.
> 
> >How do you know they are one for each core? ACPI thermal zones can be 
> >anywhere in the machine that needs OS-controlled cooling. Could be the 
> >CPU heatsink, voltage regulator, or someplace else.
> 
> Right, bad assumption on my part.  Is there any way to find out what 
> they might correspond to?  /proc/acpi/ibm/thermal has a bunch of 
> temperature-like numbers in them; I guess there should be some 
> correlation between those and the thermal zones.

  There are many temperature sensors in Thinkpads. There's even map of
them somewhere on http://www.thinkwiki.org.

-- 
Tomasz Torcz                "Funeral in the morning, IDE hacking
zdzichu@irc.-nie.spam-.pl    in the afternoon and evening." - Alan Cox


[-- Attachment #2: Type: application/pgp-signature, Size: 229 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2006-10-13 18:10 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-10-12 22:19 Strange entries in /proc/acpi/thermal_zone for Thinkpad X60 Jeremy Fitzhardinge
2006-10-13  4:53 ` Len Brown
2006-10-13 14:44 ` Pavel Machek
     [not found] <fa.P/oAhFV0AVrh8PKSKzP+xVGih2s@ifi.uio.no>
2006-10-13  3:57 ` Robert Hancock
2006-10-13  4:28   ` Jeremy Fitzhardinge
2006-10-13 18:10     ` Tomasz Torcz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox