From: David Xu
Subject: Re: Re: performance of credit2 on hybrid workload
Date: Wed, 8 Jun 2011 17:43:18 -0400
To: George Dunlap
Cc: xen-devel@lists.xensource.com
List-Id: xen-devel@lists.xenproject.org

Hi George,

Thanks for your reply. I have been considering the same two ideas you describe: adding another parameter that indicates the required latency, and letting the scheduler determine the latency characteristics of a VM automatically.

First, adding another parameter and letting users set its value in advance sounds similar to SEDF. But the configuration process can be hard and inflexible when the workloads inside a VM are complex, so in my opinion a task-aware scheduler is better. Still, manual configuration would help us verify the effectiveness of the new parameter.

On the other hand, as you described, it is not easy to make the scheduler determine the latency characteristics of a VM automatically and accurately from the information we can get in the hypervisor, for instance delayed interrupts. So the key point for me is to find and implement a scheduling helper that indicates which VM should be scheduled soon. For example, for TCP traffic we could implement a tool similar to a packet sniffer that captures packets and analyzes their header information to infer the type of workload. The analysis result could then help the scheduler make its decision.
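To make the sniffer-helper idea concrete, here is a toy sketch. The classification heuristic, the thresholds, and all names are illustrative assumptions of mine, not anything Xen or an existing tool provides:

```python
def classify_flow(pkt_sizes, mean_gap_ms):
    """Guess a flow's workload type from captured header information.

    Heuristic (an assumption for illustration): small packets arriving
    frequently suggest request/response, latency-sensitive traffic;
    large packets suggest bulk transfer, i.e. throughput-oriented.
    """
    mean_size = sum(pkt_sizes) / len(pkt_sizes)  # bytes
    if mean_size < 256 and mean_gap_ms < 5.0:
        return "latency-sensitive"
    if mean_size > 1200:
        return "throughput"
    return "unknown"

# A burst of small packets close together looks latency-sensitive;
# back-to-back MTU-sized packets look like a bulk transfer.
print(classify_flow([64, 80, 72], mean_gap_ms=1.0))
print(classify_flow([1460, 1460, 1460], mean_gap_ms=0.1))
```

A real helper would of course have to parse actual TCP/IP headers (ports, flags, payload length) rather than receive pre-digested sizes, and feed its verdict to the scheduler through some channel; this only illustrates the classification step.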
In fact, not all I/O-intensive workloads require low latency; some only require high throughput. Of course, scheduling latency significantly impacts throughput as well (you handled this problem with the boost mechanism, to some extent). What I want is to reduce the latency only of the VMs that require low latency, while postponing the others and using other techniques, such as packet offloading, to compensate for their loss and improve their throughput.

This is just my rough idea and there are many open problems. I hope we can discuss often and share our results. Thanks very much.

Regards,
Cong

2011/6/8 George Dunlap <George.Dunlap@eu.citrix.com>:
> On Tue, Jun 7, 2011 at 8:28 PM, David Xu wrote:
> > Hi George,
> > Could you share some ideas about how to address the "mixed workload"
> > problem, where a single VM does both cpu-intensive and
> > latency-sensitive workloads, even though you haven't implemented it
> > yet? I am also working on it; maybe I can try some methods and give
> > you feedback. Thanks.
>
> Well, the main thing to remember is that you can't give the VM any
> *more* time. The amount of time it's allowed is defined by the
> scheduler parameters (and the other VMs running). So all you can do
> is change *when* the VM gets the time. What you want the scheduler
> to do is give the VM shorter timeslices *so that* it can get time more
> frequently.
>
> For example, the credit1 scheduler will let a VM burn through 30ms of
> credit. That means if its "fair share" is (say) 50%, then it has to
> wait at least 30ms before being allowed to run again in order to
> maintain fairness. If its "fair share" is 33%, then the VM has to
> wait at least 60ms. If the scheduler were to preempt it after 5ms,
> then the VM would only have to be delayed for 5ms or 10ms,
> respectively; and if it were preempted after 1ms, it would only have
> to be delayed 1ms or 2ms.
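The fairness arithmetic in the quoted paragraph above can be checked with a quick sketch (a simplification of the real credit accounting, which is more involved):

```python
def min_delay_ms(timeslice_ms, fair_share):
    """Minimum wait after burning `timeslice_ms` of credit, so that the
    VM's long-run CPU usage stays at `fair_share`:
        run / (run + wait) = fair_share
        =>  wait = run * (1/fair_share - 1)
    """
    return timeslice_ms * (1.0 / fair_share - 1.0)

print(min_delay_ms(30, 0.50))   # 30ms slice at 50% share -> wait 30ms
print(min_delay_ms(30, 1 / 3))  # 30ms slice at 33% share -> wait ~60ms
print(min_delay_ms(5, 0.50))    # 5ms slice  at 50% share -> wait 5ms
print(min_delay_ms(1, 1 / 3))   # 1ms slice  at 33% share -> wait ~2ms
```

This matches the numbers in the example: shrinking the timeslice shrinks the forced delay proportionally, without changing the VM's total share.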
>
> So the real key to giving a VM with a mixed workload better latency
> characteristics is not to wake it up sooner, but to preempt it sooner.
>
> The problem is, of course, that preempting workloads which are *not*
> latency-sensitive too soon adds scheduling overhead and reduces cache
> effectiveness. So the question becomes: how do I know how long to let
> a VM run for?
>
> One solution would be to introduce a scheduling parameter that tells
> the scheduler how long to set the preemption timer for. Then if an
> administrator knows he's running a mixed-workload VM, he can shorten
> it; or if he knows he's running a cpu-cruncher, he can make it
> longer. This would also be useful in verifying the logic of "shorter
> timeslices -> less latency for mixed workloads"; i.e., we could vary
> this number and see the effects.
>
> One issue with adding this to the credit1 scheduler is that there are
> only 3 priorities (BOOST, UNDER, and OVER), and scheduling is
> round-robin within each priority. It's a known issue with round-robin
> scheduling that tasks which yield (or are preempted soon) are
> discriminated against compared to tasks which use up their full
> timeslice (or are preempted later). So the results may not be
> representative.
>
> The next step would be to try to get the scheduler to determine the
> latency characteristics of a VM automatically. The key observation
> here is that most of the time, latency-sensitive operations are
> initiated with an interrupt; or, to put it the other way, a pending
> interrupt generally means that there is a latency-sensitive operation
> waiting to happen. My idea was to have the scheduler look at the
> historical rate of interrupts and determine a preemption timeslice
> based on it, such that on average the VM's credit would be enough for
> it to run just when the next interrupt arrived for it to handle.
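The interrupt-rate idea in the quoted paragraph above could be sketched roughly as follows. This is my own illustrative sketch, not Xen code; all names and constants are hypothetical:

```python
from collections import deque

class LatencyEstimator:
    """Derive a per-VM preemption timeslice from its recent interrupt
    rate, so credit is replenished about when the next interrupt
    arrives. Everything here is an assumption for illustration."""

    def __init__(self, window=16, default_ms=30.0, floor_ms=1.0):
        self.gaps = deque(maxlen=window)  # recent inter-interrupt gaps (ms)
        self.last = None
        self.default_ms = default_ms      # cpu-bound VMs keep the long slice
        self.floor_ms = floor_ms          # bound the preemption overhead

    def on_interrupt(self, now_ms):
        """Record the gap since the previous interrupt for this VM."""
        if self.last is not None:
            self.gaps.append(now_ms - self.last)
        self.last = now_ms

    def timeslice_ms(self, fair_share):
        """Pick a slice so the forced post-slice wait is at most the
        mean interrupt gap: from wait = slice * (1/share - 1) we get
        slice = gap * share / (1 - share)."""
        if not self.gaps:
            return self.default_ms        # no interrupts: looks cpu-bound
        mean_gap = sum(self.gaps) / len(self.gaps)
        slice_ms = mean_gap * fair_share / (1.0 - fair_share)
        return max(self.floor_ms, min(self.default_ms, slice_ms))

# A VM taking an interrupt every 10ms with a 50% fair share gets a
# 10ms slice; a VM with no interrupt history keeps the default 30ms.
est = LatencyEstimator()
for t in (0, 10, 20, 30):
    est.on_interrupt(t)
print(est.timeslice_ms(0.5))
print(LatencyEstimator().timeslice_ms(0.5))
```

As the quoted text notes, a driver that switches to polling mode would stop generating interrupts and this estimator would then treat the VM as cpu-bound, which is exactly the open problem George raises below.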
>
> It occurs to me now that after a certain point, interrupts themselves
> become inefficient and drivers sometimes go into "polling" mode,
> which would look to the scheduler the same as cpu-bound. Hmm...
> bears thinking about. :-)
>
> Anyway, that's where I got in my thinking on this. Let me know what
> you think. :-)
>
> -George