sched_clock -> check_tsc_unstable -> tsc_read_c3

public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed

* sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!?
@ 2005-10-13 20:01 Lee Revell
  2005-10-13 20:11 ` john stultz
  0 siblings, 1 reply; 4+ messages in thread
From: Lee Revell @ 2005-10-13 20:01 UTC (permalink / raw)
  To: linux-kernel; +Cc: john stultz, Ingo Molnar

Looking at the latency traces it appears that sched_clock could be
optimized a bit:

evolutio-16296 0D.h4   32us : activate_task (try_to_wake_up)
evolutio-16296 0D.h4   33us : sched_clock (activate_task)
evolutio-16296 0D.h4   33us : check_tsc_unstable (sched_clock)
evolutio-16296 0D.h4   34us : tsc_read_c3_time (sched_clock)
evolutio-16296 0D.h4   35us : recalc_task_prio (activate_task)

check_tsc_unstable and tsc_read_c3_time appear to be new.  Here they
are:

     49 /* Code to mark and check if the TSC is unstable
     50  * due to cpufreq or due to unsynced TSCs
     51  */
     52 static int tsc_unstable;
     53 int check_tsc_unstable(void)
     54 {
     55         return tsc_unstable;
     56 }

     73 u64 tsc_read_c3_time(void)
     74 {
     75         return tsc_c3_offset;
     76 }

Shouldn't these be inlined or something?  I know it's only a few
microseconds, but it seems like excessive function call overhead to me.
I don't use power management and the TSC is stable on this machine.  Why
do we have to call these simple accessor functions over and over?

Lee

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!?
  2005-10-13 20:01 sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!? Lee Revell
@ 2005-10-13 20:11 ` john stultz
  2005-10-13 20:13   ` Lee Revell
  0 siblings, 1 reply; 4+ messages in thread
From: john stultz @ 2005-10-13 20:11 UTC (permalink / raw)
  To: Lee Revell; +Cc: linux-kernel, Ingo Molnar

On Thu, 2005-10-13 at 16:01 -0400, Lee Revell wrote:
> Looking at the latency traces it appears that sched_clock could be
> optimized a bit:
> 
> evolutio-16296 0D.h4   32us : activate_task (try_to_wake_up)
> evolutio-16296 0D.h4   33us : sched_clock (activate_task)
> evolutio-16296 0D.h4   33us : check_tsc_unstable (sched_clock)
> evolutio-16296 0D.h4   34us : tsc_read_c3_time (sched_clock)
> evolutio-16296 0D.h4   35us : recalc_task_prio (activate_task)
> 
> check_tsc_unstable and tsc_read_c3_time appear to be new.  Here they
> are:
> 
>      49 /* Code to mark and check if the TSC is unstable
>      50  * due to cpufreq or due to unsynced TSCs
>      51  */
>      52 static int tsc_unstable;
>      53 int check_tsc_unstable(void)
>      54 {
>      55         return tsc_unstable;
>      56 }
> 
>      73 u64 tsc_read_c3_time(void)
>      74 {
>      75         return tsc_c3_offset;
>      76 }
> 
> Shouldn't these be inlined or something?  I know it's only a few
> microseconds, but it seems like excessive function call overhead to me.

Yea, you're right about the inlining. Although I'm not sure why those
functions should take microseconds to execute. That's very strange.

> I don't use power management and the TSC is stable on this machine.  Why
> do we have to call these simple accessor functions over and over?

Basically its there to detect if the cpu goes into a low power mode and
halts the TSC. 

They probably could be #defined if C3 really cannot triggered, but some
check will be necessary otherwise.

The arch specific bits in my patches have gotten less of my attention
recently, but hopefully with the arch generic code getting pretty solid
and fast I'll be back to focusing on it.

thanks
-john



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!?
  2005-10-13 20:11 ` john stultz
@ 2005-10-13 20:13   ` Lee Revell
  2005-10-14  4:35     ` Ingo Molnar
  0 siblings, 1 reply; 4+ messages in thread
From: Lee Revell @ 2005-10-13 20:13 UTC (permalink / raw)
  To: john stultz; +Cc: linux-kernel, Ingo Molnar

On Thu, 2005-10-13 at 13:11 -0700, john stultz wrote:
> Yea, you're right about the inlining. Although I'm not sure why those
> functions should take microseconds to execute. That's very strange. 

Latency tracing overhead plus a slow (600MHz) machine?

Lee


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!?
  2005-10-13 20:13   ` Lee Revell
@ 2005-10-14  4:35     ` Ingo Molnar
  0 siblings, 0 replies; 4+ messages in thread
From: Ingo Molnar @ 2005-10-14  4:35 UTC (permalink / raw)
  To: Lee Revell; +Cc: john stultz, linux-kernel

* Lee Revell <rlrevell@joe-job.com> wrote:

> On Thu, 2005-10-13 at 13:11 -0700, john stultz wrote:
> > Yea, you're right about the inlining. Although I'm not sure why those
> > functions should take microseconds to execute. That's very strange. 
> 
> Latency tracing overhead plus a slow (600MHz) machine?

yeah. The micro-timings of latency tracing can be misleading. Function 
calls are very fast on most CPUs (even on a 600MHz one), but with the 
latency tracer generating one trace entry per function call, there's 
considerable added overhead.

we could in theory calibrate the tracing overhead and subtract it from 
cycle readings [i've done this in a previous mcount() based tracer 
implementation, years ago], but that would make the latency trace 
timestamps less useful as a global time reference.

	Ingo

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2005-10-14  4:35 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-10-13 20:01 sched_clock -> check_tsc_unstable -> tsc_read_c3_time ?!? Lee Revell
2005-10-13 20:11 ` john stultz
2005-10-13 20:13   ` Lee Revell
2005-10-14  4:35     ` Ingo Molnar

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox