All of lore.kernel.org
 help / color / mirror / Atom feed
* 2.6.19-rt14 slowdown compared to 2.6.19
@ 2006-12-22 21:39 Chen, Tim C
  2006-12-23  0:38 ` Daniel Walker
                   ` (3 more replies)
  0 siblings, 4 replies; 9+ messages in thread
From: Chen, Tim C @ 2006-12-22 21:39 UTC (permalink / raw)
  To: mingo; +Cc: linux-kernel, Siddha, Suresh B

Ingo,
 
We did some benchmarking on 2.6.19-rt14, compared it with 2.6.19 
kernel and noticed several slowdowns.  The test machine is a 2 socket
woodcrest machine with your default configuration.
 
Netperf TCP Streaming was slower by 40% ( 1 server and 1 client 
each bound to separate cpu cores on different socket, network
loopback mode was used).  

Volanomark was slower by 80% (Server and Clients communicate with 
network loopback mode. Idle time goes from 1% to 60%)

Re-Aim7 was slower by 40% (idle time goes from 0% to 20%)

Wonder if you have any suggestions on what could cause the slowdown.  
We've tried disabling CONFIG_NO_HZ and it didn't help much.

Thanks.

Tim

^ permalink raw reply	[flat|nested] 9+ messages in thread
* RE: 2.6.19-rt14 slowdown compared to 2.6.19
@ 2006-12-27  0:51 Chen, Tim C
  0 siblings, 0 replies; 9+ messages in thread
From: Chen, Tim C @ 2006-12-27  0:51 UTC (permalink / raw)
  To: Ingo Molnar; +Cc: linux-kernel, Siddha, Suresh B, Peter Zijlstra

Ingo Molnar wrote:
> 
> cool - thanks for the feedback! Running the 64-bit kernel, right?
> 

Yes, 64-bit kernel was used.

> 
> while some slowdown is to be expected, did in each case idle time
> increase significantly? 

Volanomark and Re-Aim7 ran close to 0% idle time for 2.6.19 kernel.
Idle time
increase significantly for Volanomark (to 60% idle) and Re-Aim7 (to 20%
idle) 
with the rt kernel.  For netperf, the system was 60% idle for 
both 2.6.19 and rt kernel and changes in idle time was not significant.

> If yes then this is the effect of lock
> contention. Lock contention effects are 'magnified' by PREEMPT_RT. For
> example if you run 128 threads workload that all use the same lock
> then 
> the -rt kernel can act as if it were a 128-way box (!). This way by
> running -rt you'll see scalability problems alot sooner than on real
> hardware. In other words: PREEMPT_RT in essence simulates the
> scalability behavior of up to an infinite amount of CPUs. (with the
> exception of cachemiss emulation ;) [the effect is not this precise,
> but 
> that's the rough trend]

Turning off PREEMPT_RT for 2.6.20-rc2-rt0 kernel
restored most the performance of Volanaomark
and Re-Aim7.  Idle time is close to 0%.  So the benchmarks
with large number of threads are affected more by PREEMPT_RT.

For netperf TCP streaming, the performance improved from 40% down to 20%
down from 2.6.20-rc2 kernel.  There is only a server and a client
process
for netperf.  The underlying reason for the change in performance
is probably different.

> 
> If you'd like to profile this yourself then the lowest-cost way of
> profiling lock contention on -rt is to use the yum kernel and run the
> attached trace-it-lock-prof.c code on the box while your workload is
> in 'steady state' (and is showing those extended idle times):
> 
>   ./trace-it-lock-prof > trace.txt
> 

Thanks for the pointer.  Will let you know of any relevant traces.

Thanks.

Tim

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2007-01-02 18:07 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-12-22 21:39 2.6.19-rt14 slowdown compared to 2.6.19 Chen, Tim C
2006-12-23  0:38 ` Daniel Walker
2006-12-23  2:43 ` K.R. Foley
2006-12-23  3:00 ` K.R. Foley
2006-12-23 10:59 ` Ingo Molnar
2006-12-29 23:52   ` Chen, Tim C
2006-12-30 10:04     ` Ingo Molnar
2007-01-02 18:07       ` Chen, Tim C
  -- strict thread matches above, loose matches on Subject: below --
2006-12-27  0:51 Chen, Tim C

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.