From: Ingo Molnar <mingo@elte.hu>
To: Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com>
Cc: Linux Kernel <linux-kernel@vger.kernel.org>,
Suresh B Siddha <suresh.b.siddha@intel.com>,
Venkatesh Pallipadi <venkatesh.pallipadi@intel.com>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Arjan van de Ven <arjan@infradead.org>,
Dipankar Sarma <dipankar@in.ibm.com>,
Balbir Singh <balbir@linux.vnet.ibm.com>,
Vatsa <vatsa@linux.vnet.ibm.com>,
Gautham R Shenoy <ego@in.ibm.com>,
Andi Kleen <andi@firstfloor.org>,
Gregory Haskins <gregory.haskins@gmail.com>,
Mike Galbraith <efault@gmx.de>,
Thomas Gleixner <tglx@linutronix.de>,
Arun Bharadwaj <arun@linux.vnet.ibm.com>
Subject: Re: [RFC PATCH v1 0/3] Saving power by cpu evacuation using sched_mc=n
Date: Mon, 27 Apr 2009 07:53:47 +0200 [thread overview]
Message-ID: <20090427055347.GA20739@elte.hu> (raw)
In-Reply-To: <20090427054325.GB6440@dirshya.in.ibm.com>
* Vaidyanathan Srinivasan <svaidy@linux.vnet.ibm.com> wrote:
> > > --------------------------------------------------------
> > > sched_mc No Cores Performance AvgPower
> > > used Records/sec (Watts)
> > > --------------------------------------------------------
> > > 0 8 1.00x 1.00y
> > > 1 8 1.02x 1.01y
> > > 2 8 0.83x 1.01y
> > > 3 7 0.86x 0.97y
> > > 4 6 0.76x 0.92y
> > > 5 4 0.72x 0.82y
> > > --------------------------------------------------------
> >
> > Looks like we want the kernel default to be sched_mc=1 ?
>
> Hi Ingo,
>
> Yes, sched_mc wins for a simple cpu bound workload like this. But
> the challenge is that the best settings depends on the workload
> and the system configuration. This leads me to think that the
> default setting should be left with the distros where we can
> factor in various parameters and choose the right default from
> user space.
>
>
> > Regarding the values for 2...5 - is the AvgPower column time
> > normalized or workload normalized?
>
> The AvgPower is time normalised, just the power value divided by
> the baseline at sched_mc=0.
>
> > If it's time normalized then it appears there's no power win
> > here at all: we'd be better off by throttling the workload
> > directly (by injecting sleeps or something like that), right?
>
> Yes, there is no power win when comparing with peak benchmark
> throughput in this case. However more complex workload setup may
> not show similar characteristics because they are not dependent
> only on CPU bandwidth for their peak performance.
>
> * Reduction in cpu bandwidth may not directly translate to performance
> reduction on complex workloads
> * Even if there is degradation, the system may still meet the design
> objectives. 20-30% increase in response time over a 1 second
> nominal value may be acceptable in most cases
But ... we could probably get a _better_ (near linear) slowdown by
injecting wait cycles into the workload.
I.e. we should only touch balancing if there's a _genuine_ power
saving: i.e. less power is used for the same throughput.
The numbers in the table show a plain slowdown: doing fewer
transactions means less power used. But that is trivial to achieve
for a CPU-bound workload: throttle the workload. I.e. inject less
work, save power.
And if we want to throttle 'transparently', from the kernel, we
should do it not via an artificial open-ended scale of
sched_mc=2,3,4,5... - we should do it via a _percentage_ value.
I.e. a system setting that says "at most utilize the system 80% of
its peak capacity". That can be implemented by the kernel injecting
small delays or by intentionally not scheduling on certain CPUs (but
not delaying tasks - forcing them to other cpus in essence).
Ingo
next prev parent reply other threads:[~2009-04-27 5:54 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-04-26 20:46 [RFC PATCH v1 0/3] Saving power by cpu evacuation using sched_mc=n Vaidyanathan Srinivasan
2009-04-26 20:46 ` [RFC PATCH v1 1/3] sched: add more levels of sched_mc Vaidyanathan Srinivasan
2009-04-26 20:46 ` [RFC PATCH v1 2/3] sched: threshold helper functions Vaidyanathan Srinivasan
2009-04-26 20:47 ` [RFC PATCH v1 3/3] sched: loadbalancer hacks for forced packing of tasks Vaidyanathan Srinivasan
2009-04-27 3:52 ` [RFC PATCH v1 0/3] Saving power by cpu evacuation using sched_mc=n Ingo Molnar
2009-04-27 5:43 ` Vaidyanathan Srinivasan
2009-04-27 5:53 ` Ingo Molnar [this message]
2009-04-27 6:39 ` Vaidyanathan Srinivasan
2009-04-27 7:01 ` Balbir Singh
2009-04-27 5:54 ` Dipankar Sarma
2009-04-27 10:09 ` Peter Zijlstra
2009-04-27 14:20 ` Vaidyanathan Srinivasan
2009-04-28 8:33 ` Peter Zijlstra
2009-04-28 8:52 ` Ingo Molnar
2009-04-28 16:15 ` Vaidyanathan Srinivasan
2009-04-28 16:11 ` Vaidyanathan Srinivasan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090427055347.GA20739@elte.hu \
--to=mingo@elte.hu \
--cc=a.p.zijlstra@chello.nl \
--cc=andi@firstfloor.org \
--cc=arjan@infradead.org \
--cc=arun@linux.vnet.ibm.com \
--cc=balbir@linux.vnet.ibm.com \
--cc=dipankar@in.ibm.com \
--cc=efault@gmx.de \
--cc=ego@in.ibm.com \
--cc=gregory.haskins@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=suresh.b.siddha@intel.com \
--cc=svaidy@linux.vnet.ibm.com \
--cc=tglx@linutronix.de \
--cc=vatsa@linux.vnet.ibm.com \
--cc=venkatesh.pallipadi@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.