All of lore.kernel.org
 help / color / mirror / Atom feed
* Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
@ 2006-04-07 18:25 Matt Ayres
  2006-04-08  7:56 ` Keir Fraser
  0 siblings, 1 reply; 9+ messages in thread
From: Matt Ayres @ 2006-04-07 18:25 UTC (permalink / raw)
  To: xen-devel@lists.xensource.com

Ok, so we all know that guest network I/O is slow when the system CPU's 
are being utilized extensively whether it be from dom0 or from other 
guests.  Lots of people have written about this and I can post concrete 
tests if required.

I'm just looking for one of the Xen developers to acknowledge that they 
have been able to replicate the problem and it is indeed being worked on 
or will be sometime in the near future.  No one has acknowledged any of 
the previous threads on either list so I want to make sure it is an 
outstanding issue that is not being overlooked.

Thank you,
Matt Ayres

(Just ask for my tests and I'll paste the output / file a bug, but I 
think it's been an established bug and didn't want to make this e-mail 
too verbose.)

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-07 18:25 Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers Matt Ayres
@ 2006-04-08  7:56 ` Keir Fraser
  2006-04-10 10:09   ` Pasi Kärkkäinen
  2006-04-13 13:52   ` Matt Ayres
  0 siblings, 2 replies; 9+ messages in thread
From: Keir Fraser @ 2006-04-08  7:56 UTC (permalink / raw)
  To: Matt Ayres; +Cc: xen-devel@lists.xensource.com


On 7 Apr 2006, at 19:25, Matt Ayres wrote:

> Ok, so we all know that guest network I/O is slow when the system 
> CPU's are being utilized extensively whether it be from dom0 or from 
> other guests.  Lots of people have written about this and I can post 
> concrete tests if required.
>
> I'm just looking for one of the Xen developers to acknowledge that 
> they have been able to replicate the problem and it is indeed being 
> worked on or will be sometime in the near future.  No one has 
> acknowledged any of the previous threads on either list so I want to 
> make sure it is an outstanding issue that is not being overlooked.

It depends on the setup but poor scheduling is the main reason for poor 
network performance, usually. SEDF seems to have some problems with 
real-time domains (like domain0 with its default scheduling parameters) 
and gives them all the CPU they want -- this is obviously going to be 
bad if a client domain is scheduled on the same CPU. Since UDP has no 
flow control, dom0 can keep itself busy generating or forwarding UDP 
packets to the domU that get dropped continually in netback driver. 
DomU will hardly ever get scheduled. Even in the case of TCP, any drops 
will be interpreted as congestion and transmit rate will be cut.

Basically I think the SEDF scheduler needs cleaning up: probably by 
removing the mass of confusing conditionally compiled options and then 
focusing on the remaining code that is actually compiled in. Another 
option is to try specifying the BVT scheduler and see if that works 
better. Or try setting dom0 to have non-real-time guarantees. Or give 
dom0 its own hyperthread on an Intel system (strongly recommended if 
it's possible).

Apart from that, if you really are genuinely loading up CPUs with 
CPU-intensive workloads, and expecting them also to be able to process 
a significant amount of network traffic then something has to give. You 
can only run CPUs at 100%.

   -- Keir

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-08  7:56 ` Keir Fraser
@ 2006-04-10 10:09   ` Pasi Kärkkäinen
  2006-04-13 13:52   ` Matt Ayres
  1 sibling, 0 replies; 9+ messages in thread
From: Pasi Kärkkäinen @ 2006-04-10 10:09 UTC (permalink / raw)
  To: Keir Fraser; +Cc: Matt Ayres, xen-devel@lists.xensource.com

On Sat, Apr 08, 2006 at 08:56:19AM +0100, Keir Fraser wrote:
> 
> On 7 Apr 2006, at 19:25, Matt Ayres wrote:
> 
> >Ok, so we all know that guest network I/O is slow when the system 
> >CPU's are being utilized extensively whether it be from dom0 or from 
> >other guests.  Lots of people have written about this and I can post 
> >concrete tests if required.
> >
> >I'm just looking for one of the Xen developers to acknowledge that 
> >they have been able to replicate the problem and it is indeed being 
> >worked on or will be sometime in the near future.  No one has 
> >acknowledged any of the previous threads on either list so I want to 
> >make sure it is an outstanding issue that is not being overlooked.
> 
> It depends on the setup but poor scheduling is the main reason for poor 
> network performance, usually. SEDF seems to have some problems with 
> real-time domains (like domain0 with its default scheduling parameters) 
> and gives them all the CPU they want -- this is obviously going to be 
> bad if a client domain is scheduled on the same CPU. Since UDP has no 
> flow control, dom0 can keep itself busy generating or forwarding UDP 
> packets to the domU that get dropped continually in netback driver. 
> DomU will hardly ever get scheduled. Even in the case of TCP, any drops 
> will be interpreted as congestion and transmit rate will be cut.
> 
> Basically I think the SEDF scheduler needs cleaning up: probably by 
> removing the mass of confusing conditionally compiled options and then 
> focusing on the remaining code that is actually compiled in. Another 
> option is to try specifying the BVT scheduler and see if that works 
> better. Or try setting dom0 to have non-real-time guarantees. Or give 
> dom0 its own hyperthread on an Intel system (strongly recommended if 
> it's possible).
> 

Has anyone already tried this? I'd like to know if dedicating own
hyperthread for dom0 helps to fix these network performance problems..


> Apart from that, if you really are genuinely loading up CPUs with 
> CPU-intensive workloads, and expecting them also to be able to process 
> a significant amount of network traffic then something has to give. You 
> can only run CPUs at 100%.
> 
>   -- Keir
> 
> 

-- Pasi
       
                                   ^
                                .     .
                                 Linux
                              /    -    \
                             Choice.of.the
                           .Next.Generation.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-08  7:56 ` Keir Fraser
  2006-04-10 10:09   ` Pasi Kärkkäinen
@ 2006-04-13 13:52   ` Matt Ayres
  2006-04-19 17:15     ` Pasi Kärkkäinen
  1 sibling, 1 reply; 9+ messages in thread
From: Matt Ayres @ 2006-04-13 13:52 UTC (permalink / raw)
  To: Keir Fraser; +Cc: xen-devel@lists.xensource.com



Keir Fraser wrote:
> 
> On 7 Apr 2006, at 19:25, Matt Ayres wrote:
> 
>> Ok, so we all know that guest network I/O is slow when the system 
>> CPU's are being utilized extensively whether it be from dom0 or from 
>> other guests.  Lots of people have written about this and I can post 
>> concrete tests if required.
>>
>> I'm just looking for one of the Xen developers to acknowledge that 
>> they have been able to replicate the problem and it is indeed being 
>> worked on or will be sometime in the near future.  No one has 
>> acknowledged any of the previous threads on either list so I want to 
>> make sure it is an outstanding issue that is not being overlooked.
> 
> It depends on the setup but poor scheduling is the main reason for poor 
> network performance, usually. SEDF seems to have some problems with 
> real-time domains (like domain0 with its default scheduling parameters) 
> and gives them all the CPU they want -- this is obviously going to be 
> bad if a client domain is scheduled on the same CPU. 

You hit it right here.  I did some thinking and informal tests and came 
to a conclusion.  The SEDF is the "new kid" on the block and it also the 
default, hence everyone is using it.  In many cases (such as mine) 
people are just using SEDF with the weight.  Also, extratime seems to be 
broken (according to Stephen in an old post) and doesn't work well with 
heavy I/O.  It especially doesn't do well when dom0 does anything else 
but provide block and network device access, even when it is tuned in 
proportion to the other VM weights.

Another argument is that the SEDF scheduler is just TOO good at what it 
does, in that case it needs some work done to be more flexible.  Users 
should consider and test both schedulers before making a decision on 
which to use, there is no clear "winner".

Why am I replying? I did my tests.  BVT is nowhere near as strict as 
SEDF in it's "while 1" tests as far as allocating CPU to domains, but it 
seems to do a good enough job of providing a proportional share based on 
weight (duh) in a real world production environment.  It also fixed my 
network throughput problem.

Thanks,
Matt

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-13 13:52   ` Matt Ayres
@ 2006-04-19 17:15     ` Pasi Kärkkäinen
  2006-04-19 19:46       ` Anand Gupta
  2006-04-19 20:14       ` Anand Gupta
  0 siblings, 2 replies; 9+ messages in thread
From: Pasi Kärkkäinen @ 2006-04-19 17:15 UTC (permalink / raw)
  To: Matt Ayres; +Cc: xen-devel@lists.xensource.com

On Thu, Apr 13, 2006 at 09:52:19AM -0400, Matt Ayres wrote:
> 
> 
> Keir Fraser wrote:
> >
> >On 7 Apr 2006, at 19:25, Matt Ayres wrote:
> >
> >>Ok, so we all know that guest network I/O is slow when the system 
> >>CPU's are being utilized extensively whether it be from dom0 or from 
> >>other guests.  Lots of people have written about this and I can post 
> >>concrete tests if required.
> >>
> >>I'm just looking for one of the Xen developers to acknowledge that 
> >>they have been able to replicate the problem and it is indeed being 
> >>worked on or will be sometime in the near future.  No one has 
> >>acknowledged any of the previous threads on either list so I want to 
> >>make sure it is an outstanding issue that is not being overlooked.
> >
> >It depends on the setup but poor scheduling is the main reason for poor 
> >network performance, usually. SEDF seems to have some problems with 
> >real-time domains (like domain0 with its default scheduling parameters) 
> >and gives them all the CPU they want -- this is obviously going to be 
> >bad if a client domain is scheduled on the same CPU. 
> 
> You hit it right here.  I did some thinking and informal tests and came 
> to a conclusion.  The SEDF is the "new kid" on the block and it also the 
> default, hence everyone is using it.  In many cases (such as mine) 
> people are just using SEDF with the weight.  Also, extratime seems to be 
> broken (according to Stephen in an old post) and doesn't work well with 
> heavy I/O.  It especially doesn't do well when dom0 does anything else 
> but provide block and network device access, even when it is tuned in 
> proportion to the other VM weights.
> 
> Another argument is that the SEDF scheduler is just TOO good at what it 
> does, in that case it needs some work done to be more flexible.  Users 
> should consider and test both schedulers before making a decision on 
> which to use, there is no clear "winner".
> 
> Why am I replying? I did my tests.  BVT is nowhere near as strict as 
> SEDF in it's "while 1" tests as far as allocating CPU to domains, but it 
> seems to do a good enough job of providing a proportional share based on 
> weight (duh) in a real world production environment.  It also fixed my 
> network throughput problem.
> 
> Thanks,
> Matt
> 

It seems BVT was the recommended CPU scheduler in Xen 2. I think I'll have
to try it too.. I hope it will fix the network throughput problems I'm
seeing in Xen 3.

Are there any downsides in using BVT scheduler in Xen 3.0 ? Why was the
default changed from BVT -> SEDF ?

-- Pasi

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-19 17:15     ` Pasi Kärkkäinen
@ 2006-04-19 19:46       ` Anand Gupta
  2006-04-19 19:55         ` Rob Gardner
  2006-04-19 20:14       ` Anand Gupta
  1 sibling, 1 reply; 9+ messages in thread
From: Anand Gupta @ 2006-04-19 19:46 UTC (permalink / raw)
  To: Pasi Kärkkäinen; +Cc: Matt Ayres, xen-devel@lists.xensource.com


[-- Attachment #1.1: Type: text/plain, Size: 2995 bytes --]

How does one change from SEDF -> BVT ?

On 4/19/06, Pasi Kärkkäinen <pasik@iki.fi> wrote:
>
> On Thu, Apr 13, 2006 at 09:52:19AM -0400, Matt Ayres wrote:
> >
> >
> > Keir Fraser wrote:
> > >
> > >On 7 Apr 2006, at 19:25, Matt Ayres wrote:
> > >
> > >>Ok, so we all know that guest network I/O is slow when the system
> > >>CPU's are being utilized extensively whether it be from dom0 or from
> > >>other guests.  Lots of people have written about this and I can post
> > >>concrete tests if required.
> > >>
> > >>I'm just looking for one of the Xen developers to acknowledge that
> > >>they have been able to replicate the problem and it is indeed being
> > >>worked on or will be sometime in the near future.  No one has
> > >>acknowledged any of the previous threads on either list so I want to
> > >>make sure it is an outstanding issue that is not being overlooked.
> > >
> > >It depends on the setup but poor scheduling is the main reason for poor
> > >network performance, usually. SEDF seems to have some problems with
> > >real-time domains (like domain0 with its default scheduling parameters)
> > >and gives them all the CPU they want -- this is obviously going to be
> > >bad if a client domain is scheduled on the same CPU.
> >
> > You hit it right here.  I did some thinking and informal tests and came
> > to a conclusion.  The SEDF is the "new kid" on the block and it also the
> > default, hence everyone is using it.  In many cases (such as mine)
> > people are just using SEDF with the weight.  Also, extratime seems to be
> > broken (according to Stephen in an old post) and doesn't work well with
> > heavy I/O.  It especially doesn't do well when dom0 does anything else
> > but provide block and network device access, even when it is tuned in
> > proportion to the other VM weights.
> >
> > Another argument is that the SEDF scheduler is just TOO good at what it
> > does, in that case it needs some work done to be more flexible.  Users
> > should consider and test both schedulers before making a decision on
> > which to use, there is no clear "winner".
> >
> > Why am I replying? I did my tests.  BVT is nowhere near as strict as
> > SEDF in it's "while 1" tests as far as allocating CPU to domains, but it
> > seems to do a good enough job of providing a proportional share based on
> > weight (duh) in a real world production environment.  It also fixed my
> > network throughput problem.
> >
> > Thanks,
> > Matt
> >
>
> It seems BVT was the recommended CPU scheduler in Xen 2. I think I'll have
> to try it too.. I hope it will fix the network throughput problems I'm
> seeing in Xen 3.
>
> Are there any downsides in using BVT scheduler in Xen 3.0 ? Why was the
> default changed from BVT -> SEDF ?
>
> -- Pasi
>
> _______________________________________________
> Xen-devel mailing list
> Xen-devel@lists.xensource.com
> http://lists.xensource.com/xen-devel
>



--
regards,

Anand Gupta

[-- Attachment #1.2: Type: text/html, Size: 3769 bytes --]

[-- Attachment #2: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-19 19:46       ` Anand Gupta
@ 2006-04-19 19:55         ` Rob Gardner
  2006-04-19 20:12           ` Anand Gupta
  0 siblings, 1 reply; 9+ messages in thread
From: Rob Gardner @ 2006-04-19 19:55 UTC (permalink / raw)
  To: Anand Gupta; +Cc: Matt Ayres, xen-devel@lists.xensource.com

Anand Gupta wrote:
> How does one change from SEDF -> BVT ?
>

Put sched=bvt on the xen boot command line in grub.conf

Rob

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-19 19:55         ` Rob Gardner
@ 2006-04-19 20:12           ` Anand Gupta
  0 siblings, 0 replies; 9+ messages in thread
From: Anand Gupta @ 2006-04-19 20:12 UTC (permalink / raw)
  To: Rob Gardner; +Cc: Matt Ayres, xen-devel@lists.xensource.com


[-- Attachment #1.1: Type: text/plain, Size: 252 bytes --]

Thanks.

On 4/20/06, Rob Gardner <rob.gardner@hp.com> wrote:
>
> Anand Gupta wrote:
> > How does one change from SEDF -> BVT ?
> >
>
> Put sched=bvt on the xen boot command line in grub.conf
>
> Rob
>
>
>


--
regards,

Anand Gupta

[-- Attachment #1.2: Type: text/html, Size: 556 bytes --]

[-- Attachment #2: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers
  2006-04-19 17:15     ` Pasi Kärkkäinen
  2006-04-19 19:46       ` Anand Gupta
@ 2006-04-19 20:14       ` Anand Gupta
  1 sibling, 0 replies; 9+ messages in thread
From: Anand Gupta @ 2006-04-19 20:14 UTC (permalink / raw)
  To: Pasi Kärkkäinen; +Cc: xen-devel@lists.xensource.com


[-- Attachment #1.1: Type: text/plain, Size: 334 bytes --]

On 4/19/06, Pasi Kärkkäinen <pasik@iki.fi> wrote:
>
> Are there any downsides in using BVT scheduler in Xen 3.0 ? Why was the
> default changed from BVT -> SEDF ?
>

I would like to know that as well. What i have seen from the mailing lists,
SEDF is the most talked about and recommended one...

--
regards,

Anand Gupta

[-- Attachment #1.2: Type: text/html, Size: 615 bytes --]

[-- Attachment #2: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2006-04-19 20:14 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-04-07 18:25 Slow guest network I/O when CPU is pegged - Looking for acknowledgement from developers Matt Ayres
2006-04-08  7:56 ` Keir Fraser
2006-04-10 10:09   ` Pasi Kärkkäinen
2006-04-13 13:52   ` Matt Ayres
2006-04-19 17:15     ` Pasi Kärkkäinen
2006-04-19 19:46       ` Anand Gupta
2006-04-19 19:55         ` Rob Gardner
2006-04-19 20:12           ` Anand Gupta
2006-04-19 20:14       ` Anand Gupta

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.