From mboxrd@z Thu Jan  1 00:00:00 1970
From: George Dunlap <george.dunlap@eu.citrix.com>
Subject: Re: Xen credit scheduler question
Date: Thu, 15 Nov 2012 19:52:53 +0000
Message-ID: <50A54815.9010402@eu.citrix.com>
References: <c58a9d3a-99e4-42ac-86c9-fbec600dee14@default>
	<50A53479.5050901@eu.citrix.com>
	<27449f60-0433-4e5f-b1fb-06914b84c6f1@default>
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============5647842968013421696=="
Return-path: <xen-devel-bounces@lists.xen.org>
In-Reply-To: <27449f60-0433-4e5f-b1fb-06914b84c6f1@default>
List-Unsubscribe: <http://lists.xen.org/cgi-bin/mailman/options/xen-devel>,
	<mailto:xen-devel-request@lists.xen.org?subject=unsubscribe>
List-Post: <mailto:xen-devel@lists.xen.org>
List-Help: <mailto:xen-devel-request@lists.xen.org?subject=help>
List-Subscribe: <http://lists.xen.org/cgi-bin/mailman/listinfo/xen-devel>,
	<mailto:xen-devel-request@lists.xen.org?subject=subscribe>
Sender: xen-devel-bounces@lists.xen.org
Errors-To: xen-devel-bounces@lists.xen.org
To: Michael Palmeter <michael.palmeter@oracle.com>
Cc: Ashok Aletty <ashok.aletty@oracle.com>, Dario Faggioli <raistlin@linux.it>, "xen-devel@lists.xen.org" <xen-devel@lists.xen.org>
List-Id: xen-devel@lists.xenproject.org

--===============5647842968013421696==
Content-Type: multipart/alternative;
	boundary="------------090604080201040700060707"

--------------090604080201040700060707
Content-Type: text/plain; charset="windows-1252"; format=flowed
Content-Transfer-Encoding: 8bit

On 15/11/12 19:03, Michael Palmeter wrote:
>
> Thank you for your answer, George.
>
> The origin of my question is more of a business concern than a 
> technical one.  Many software products are licensed based on a cost 
> per processor core.  It is desirable to sometimes allow customers to 
> pay a fraction of software license costs in exchange for running that 
> software using only a commensurate fraction of available compute power 
> (capacity sub-licensing).  If the cap is a means of making a vCPU 
> more-or-less deterministic (in terms of its effective computational 
> capacity) then that would be useful as a programmatic means of 
> enabling capacity sub-licensing.  My example below was based on a case 
> where I have a customer that would like to use ‘cap’ to constrain 
> their single vCPU VM to only ½ of a core worth of compute capacity 
> (logically 1/32 of the compute power) in exchange for only paying 1/32 
> of the license cost for the physical server.
>

Right -- I've seen the "limit cpu power for licensing purposes" thing 
before, but I think that only went down to cores, not sub-core.

> Below you answered:
>
> “You can use ‘cap’ to make the VM in question get 50% of logical vcpu 
> time, which on an idle system will give it 0.5 of the capacity of a 
> physical core (if we don't consider Intel's Turbo Boost technology).  
> But if the system becomes busy, it will get less than 0.5 of the 
> processing capacity of a physical core.”
>
> Are you saying that cap would be able to CONSTRAIN a vCPU to an 
> effective compute capacity equal to 50% of a physical core, but it 
> does not GUARANTEE effective compute capacity equal to 50% of a 
> physical core?
>

Theoretically, a cap at 50 will give your single-vcpu VM 50% of the time 
of one hyperthread.

So if C is the "typical throughput of a single non-hyperthreaded core 
running at standard frequency", and we factor out Turbo Boost, then there 
are two cases to consider:

* The other thread is idle.  In this case, the VM will get 0.5C.
* The other thread is busy.  In this case, assuming a 0.7 throughput 
factor for a thread whose sibling is busy, the VM will get 
0.5 * (0.7 * C), or about 0.35C.

So the total computing power available to the VM should be <= 0.5C 
(satisfying the licensing requirements), but on a busy system it may be 
significantly less than 0.5C (perhaps not so satisfying to the owner of 
the VM).
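
As a rough sketch of the arithmetic above (the function name is made up 
for illustration, and the 0.7 default is just the assumed hyperthread 
factor from the second case, not a measured value):

```python
def effective_capacity(cap_percent, sibling_busy, ht_factor=0.7):
    """Return VM throughput as a fraction of C (one full core).

    cap_percent  -- the scheduler cap, e.g. 50 for 50% of one hyperthread
    sibling_busy -- True if the other hyperthread on the core is busy
    ht_factor    -- assumed per-thread throughput when both threads run
    """
    share = cap_percent / 100.0
    if sibling_busy:
        return share * ht_factor
    return share


if __name__ == "__main__":
    print(f"sibling idle: {effective_capacity(50, False):.2f}C")
    print(f"sibling busy: {effective_capacity(50, True):.2f}C")
```

Either way the result never exceeds cap/100 of C, which is the property 
the licensing side cares about.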

I don't think it should be terribly difficult to put a simple "shared 
hyperthread" multiplier on the credit burned -- if someone at Oracle 
wanted to help implement this, we'd be happy to point you in the right 
direction. :-)
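
To make the idea a bit more concrete, here is one possible reading of 
such a multiplier, as a toy calculation only.  Nothing like this exists 
in the scheduler today; the function names, the "1 credit = 1 ms at full 
speed" unit and the 0.7 factor are all assumptions for illustration:

```python
def credit_burned(run_ms, sibling_busy, ht_factor=0.7):
    """Credit charged for run_ms of wall-clock run time (hypothetical).

    With the multiplier, time spent sharing the core with a busy sibling
    is charged at ht_factor instead of at the full rate.
    """
    return run_ms * (ht_factor if sibling_busy else 1.0)


def runtime_for_credit(credit_ms, sibling_busy, ht_factor=0.7):
    """Wall-clock ms a vcpu can run before credit_ms of credit is used up."""
    return credit_ms / (ht_factor if sibling_busy else 1.0)


if __name__ == "__main__":
    credit = 15.0  # 50% cap over one 30ms accounting period
    run = runtime_for_credit(credit, sibling_busy=True)
    work = run * 0.7  # throughput while sharing the core
    print(f"runs {run:.1f}ms, does {work:.1f}ms worth of full-speed work")
```

Under these assumptions a capped vcpu with a busy sibling runs for longer 
but ends up with the same 0.5C worth of work as in the idle case.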

If you have Turbo Boost, then (as I understand it) the CPU can raise the 
clock speed of the processor when threads or cores are idle; the 
Wikipedia article seems to think some processors can raise the clock 
speed to as much as 1.6x the baseline frequency.  That would throw a bit 
of a wrench in the works, as you might end up with 0.5 * 1.6 * C = 0.8C > 
0.5C; however, looking at Intel's website, it looks like only 2- and 
4-core processors have Turbo Boost, so maybe on 8-core processors we can 
punt on that thorny issue for a little while yet. :-)
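
For what it's worth, the worst-case arithmetic looks like this.  This is 
only a sketch: the 1.6 ratio is the figure quoted above, not something I 
have measured, and the helper names are invented.  The second function 
shows what cap you would need if you wanted the worst case to stay 
within the licensed fraction anyway:

```python
import math


def worst_case_capacity(cap_percent, turbo_ratio):
    """Upper bound on throughput (fraction of C) if the core runs boosted."""
    return (cap_percent / 100.0) * turbo_ratio


def cap_for_license(license_fraction, turbo_ratio):
    """Largest whole-number cap whose boosted worst case stays within
    license_fraction of C (assumes the boost applies the whole time)."""
    return math.floor(100.0 * license_fraction / turbo_ratio)


if __name__ == "__main__":
    print(worst_case_capacity(50, 1.6))
    print(cap_for_license(0.5, 1.6))
```

The obvious downside is that such a cap is pessimistic whenever the 
processor is not actually boosting.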

> Can you offer any guidance regarding real-world scheduler overhead 
> (when cap>0 is used) and precision (how variable is actual compute 
> power for a vCPU with a cap of 100%, for example)?
>

I have not done extensive testing with the cap; I mainly know the 
mechanism by which it works.  There is no extra accounting done in the 
scheduler for having a cap: all vcpus are assigned credit every 30ms 
according to their weight and cap.  The difference is that if a 
non-capped vcpu uses up its credits, it is allowed to go negative; 
whereas a capped vcpu will be paused until it receives more credits.  So 
there should be no extra hypervisor overhead from using a cap.
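
Here is a toy model of that mechanism, just to show the difference 
between the capped and non-capped case.  It is not the real scheduler 
code: it assumes a single always-runnable vcpu on an otherwise idle cpu, 
treats one credit as 1ms of run time, and the fair_share_ms parameter is 
a made-up stand-in for the weight-based allocation:

```python
PERIOD_MS = 30  # accounting period mentioned above


def simulate(cap_percent, periods=10, fair_share_ms=15):
    """Return the ms of cpu time an always-runnable vcpu gets.

    cap_percent == 0 means "no cap".
    """
    credit = 0
    ran_ms = 0
    for _ in range(periods):
        # Credit is handed out once per accounting period.
        if cap_percent:
            credit += PERIOD_MS * cap_percent // 100
        else:
            credit += fair_share_ms
        for _ in range(PERIOD_MS):
            if cap_percent and credit <= 0:
                continue  # capped and out of credit: paused this ms
            # A non-capped vcpu keeps running and simply goes negative.
            # (This toy does not bound how negative it can get.)
            credit -= 1
            ran_ms += 1
    return ran_ms


if __name__ == "__main__":
    for cap in (0, 50, 100):
        print(f"cap={cap}: ran {simulate(cap)}ms of {10 * PERIOD_MS}ms")
```

Note that the capped and non-capped paths do the same bookkeeping; the 
only difference is the one check on whether the vcpu is allowed to run.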

The cap fundamentally works by locking out a vcpu for very small amounts 
of time within the 30ms accounting window.  But this same effect might 
happen just by having other VMs competing for the cpu; so in theory it 
shouldn't be any riskier than virtualizing in the first place.

Executive summary: Factoring out Turbo Boost, "cap" should be able to 
set a sub-core upper bound on processing power.  But on a busy system, 
the VM may get less than that upper bound.

However, scheduling is a very complex and dynamic system, and like 
economics, very simple changes can have unpredictable results.  So it's 
probably a good idea to do some testing before recommending it to 
customers. :-)

BTW, are you familiar with Xen's cpupool functionality?  The guys at 
Fujitsu wrote it so that a provider could rent a fixed number of cores 
to a customer, who could then run as many VMs on those cores as they 
wanted.  I think licensing restrictions had something to do with that as 
well.  More about that here, if you're interested:
  http://blog.xen.org/index.php/2012/04/23/xen-4-2-cpupools/

  -George

--------------090604080201040700060707--


--===============5647842968013421696==--