public inbox for linux-kernel@vger.kernel.org
* [PATCH 0/7] sched_domain balancing via tasklet V2
@ 2006-10-28  2:41 Christoph Lameter
From: Christoph Lameter @ 2006-10-28  2:41 UTC (permalink / raw)
  To: akpm
  Cc: Peter Williams, linux-kernel, Nick Piggin, Christoph Lameter,
	Siddha, Suresh B, Ingo Molnar, KAMEZAWA Hiroyuki

This patchset moves potentially expensive load balancing out of the scheduler
tick (where we run with interrupts disabled) into a tasklet that is triggered
from scheduler_tick() when necessary. Load balancing then runs with interrupts
enabled. This eliminates interrupt holdoff times and avoids a potential machine
livelock, for example when load balancing is performed over a large number of
processors and many of the nodes are under heavy load, which may delay the
fetching of cachelines. We currently have systems with up to 1024 processors
and may go up to 4096 soon. Similar issues were seen on a Fujitsu system in
the past.

Beyond these large systems, the issue also highlights the general problem
of interrupt holdoff during scheduler load balancing.

Moving the load balancing into a tasklet also allows some cleanup in
scheduler_tick(). It becomes easier to read, and the determination of the
state for load balancing can be moved out of scheduler_tick().

scheduler_tick() processing is optimized further because we no longer
check all the sched domains on each tick. Instead, every load balancing
run determines the time of the next load balancing, and scheduler_tick()
only checks against this single value.
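
As a rough user-space illustration of this scheme (not code from the
patches), the sketch below keeps one next_balance value. The tick path
compares against it, and only the balancing path walks the domains. The
struct, the function names and the use of plain tick counters are
assumptions made for the example; the real patches operate on the kernel's
sched domains and jiffies.

```c
#include <stddef.h>

/* Hypothetical, simplified stand-in for a sched domain. */
struct domain {
	unsigned long last_balance;	/* tick of the last balancing */
	unsigned long interval;		/* ticks between balancing runs */
};

/* Wraparound-safe test for "a is at or after b". */
static int tick_after_eq(unsigned long a, unsigned long b)
{
	return (long)(a - b) >= 0;
}

/* Fast path, run on every tick: a single comparison. */
static int tick_needs_balancing(unsigned long now, unsigned long next_balance)
{
	return tick_after_eq(now, next_balance);
}

/*
 * Slow path, run from the deferred context: handle every domain that
 * is due and return the earliest time at which any domain is due next.
 * Requires n >= 1. *balanced counts the domains that were handled.
 */
static unsigned long run_balancing(struct domain *d, size_t n,
				   unsigned long now, unsigned int *balanced)
{
	unsigned long next = 0;
	size_t i;

	for (i = 0; i < n; i++) {
		unsigned long due = d[i].last_balance + d[i].interval;

		if (tick_after_eq(now, due)) {
			/* Advance from last_balance, not from now. */
			d[i].last_balance += d[i].interval;
			(*balanced)++;
			due = d[i].last_balance + d[i].interval;
		}
		if (i == 0 || !tick_after_eq(due, next))
			next = due;
	}
	return next;
}
```

With two domains of intervals 4 and 10 that both start at tick 0, the first
call returns 4, so the tick path does nothing but one comparison until then.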

Another optimization is that the staggering of the individual load
balance operations is no longer performed during load balancing but
is shifted to the setup of the sched domains.
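
One way such setup-time staggering could look is sketched below. This is an
assumption for illustration, not the code of the patch: each CPU's initial
last_balance is offset by a fraction of a second proportional to its CPU
number, so that the CPUs do not all become due on the same tick. The
function names and parameters are hypothetical.

```c
/* Offset for a CPU: spread nr_cpus CPUs evenly over one second of ticks. */
static unsigned long stagger_offset(unsigned int cpu, unsigned int nr_cpus,
				    unsigned long ticks_per_sec)
{
	return ticks_per_sec * cpu / nr_cpus;
}

/*
 * Done once at domain setup: give every CPU a different starting point
 * so that no per-tick offset arithmetic is needed later on.
 */
static void init_last_balance(unsigned long *last_balance,
			      unsigned int nr_cpus, unsigned long now,
			      unsigned long ticks_per_sec)
{
	unsigned int cpu;

	for (cpu = 0; cpu < nr_cpus; cpu++)
		last_balance[cpu] = now +
			stagger_offset(cpu, nr_cpus, ticks_per_sec);
}
```

Since the offset is baked into the starting point once, the balancing path
can keep advancing last_balance by the interval and the stagger is
preserved.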

For the earlier discussion see:
http://marc.theaimsgroup.com/?t=116119187800002&r=1&w=2
V1: http://marc.theaimsgroup.com/?l=linux-kernel&m=116171494001548&w=2


V1->V2:
- Keep last_balance and calculate the next balancing from that start
  point.
- Move more code into time_slice calculation and rename time_slice()
  to task_running_tick().
- Separate out the wake_priority_sleeper optimization as a first patch.




Thread overview: 10+ messages
2006-10-28  2:41 [PATCH 0/7] sched_domain balancing via tasklet V2 Christoph Lameter
2006-10-28  2:41 ` [PATCH 1/7] Avoid taking rq lock in wake_priority_sleeper Christoph Lameter
2006-10-28  2:41 ` [PATCH 2/7] Disable interrupts for locking in load_balance() Christoph Lameter
2006-10-28  2:41 ` [PATCH 3/7] Extract load calculation from rebalance_tick Christoph Lameter
2006-10-28  2:41 ` [PATCH 4/7] Stagger load balancing in build_sched_domains Christoph Lameter
2006-10-28  2:41 ` [PATCH 5/7] Move idle stat calculation into rebalance_tick() Christoph Lameter
2006-10-28 17:57   ` Siddha, Suresh B
2006-10-29  1:05     ` Christoph Lameter
2006-10-28  2:41 ` [PATCH 6/7] Use tasklet to call balancing Christoph Lameter
2006-10-28  2:41 ` [PATCH 7/7] Call tasklet less frequently Christoph Lameter
