All of lore.kernel.org
 help / color / mirror / Atom feed
* help with bugs
@ 2005-08-04 15:04 Ian Pratt
  2005-08-04 19:53 ` Anthony Liguori
                   ` (3 more replies)
  0 siblings, 4 replies; 12+ messages in thread
From: Ian Pratt @ 2005-08-04 15:04 UTC (permalink / raw)
  To: xen-devel



I'd like to appeal for some help tracking down a couple of bugs that
we're struggling to reproduce:

 BUG62  eth0 -> veth0 in network script can loose network
 BUG130 time running fast bug
 BUG76  shared irq's fail under high load

These are all pretty serious and it would be good to get fixed before
3.0-testing-r1 

If you can make them exhibit frequently on your system it would be
useful to know.

Thanks,
Ian

^ permalink raw reply	[flat|nested] 12+ messages in thread
* RE: Re: help with bugs
@ 2005-08-04 22:00 Ian Pratt
  0 siblings, 0 replies; 12+ messages in thread
From: Ian Pratt @ 2005-08-04 22:00 UTC (permalink / raw)
  To: Nivedita Singhvi, Sean Dague; +Cc: xen-devel

> I didn't see your original mail, so not sure if you listed 
> any others - but was workin on 103 which seems to have gone 
> awayin current code (we're trying to narrow it to the NET 
> GRANT code (not yet confirmed).

NET_GRANT is currently disabled by default because bugs were revealed
when we tried to enable it. Steve Hand is looking into this.

Best,
Ian

^ permalink raw reply	[flat|nested] 12+ messages in thread
* RE: Re: help with bugs
@ 2005-08-04 23:48 Ian Pratt
  2005-08-05 12:23 ` Sean Dague
  0 siblings, 1 reply; 12+ messages in thread
From: Ian Pratt @ 2005-08-04 23:48 UTC (permalink / raw)
  To: Sean Dague, xen-devel

> On Thu, Aug 04, 2005 at 10:48:50PM +0100, Ian Pratt wrote:
> > > >  BUG62  eth0 -> veth0 in network script can loose network
> > > I can make this bug come and go at will based on which of the
> > > 2 network interfaces are part of the bridge.  I added that 
> > > information into the bugzilla bug, hopefully that helps.
> > 
> > Are you changing the default 'netdev' at the top of the 
> network script?
> 
> No actually, I guess I'm not even clear why veth0 exists, as 
> everything works quite nicely for me without it functioning.

If you're running services in dom0 that are used by other domains you
are liable to get head-of-line blocking or even deadlock of the domU's
networking unless you use veth0:  all of the domU's skb's could end up
getting queued in dom0 socket buffers.

veth0 avoids this by copying packets destined for dom0 and giving the
buffer back to the domU. 

Ian

^ permalink raw reply	[flat|nested] 12+ messages in thread
* RE: Re: help with bugs
@ 2005-08-05 14:09 Ian Pratt
  0 siblings, 0 replies; 12+ messages in thread
From: Ian Pratt @ 2005-08-05 14:09 UTC (permalink / raw)
  To: Sean Dague; +Cc: xen-devel

 
> > If you're running services in dom0 that are used by other 
> domains you 
> > are liable to get head-of-line blocking or even deadlock of 
> the domU's 
> > networking unless you use veth0:  all of the domU's skb's 
> could end up 
> > getting queued in dom0 socket buffers.
> > 
> > veth0 avoids this by copying packets destined for dom0 and 
> giving the 
> > buffer back to the domU.
> 
> Is there a test case for this?  I've been running some 
> services in dom0 and apparently running without veth0 for 
> quite some time.  It would be good to have a test that shows 
> this problem.

I haven't tried this, but I suspect it would work:

 * run a ttcp receiver in dom0 with a very large socket buffer size
 * connect to it with a ttcp transmitter in domU (again, large sock
buffer)
 * with data in flight, ^Z the dom0 receiver

I'd expect to see the ntworking of the domU (and potentially other
domU's) start to run very slowly or even grind to a halt. You may need
multiple parallel tcp connections to trigger this. Using UDP makes it
happen much more easily.

If you're using veth0 you shouldn't have the problem.

[There are plans for making the backend buffer management more dynamic
that would mitigate the effect on other domU's, but this wouldn't
completely obviate the need for veth0 as a single domU could still end
up with all of its buffers being held by dom0.
There's a partial fix for TCP (not UDP) whereby we have dom0 release
it's mapping of the domU buffer as soon as its sent the TCP ACK, rather
than when it finally frees the skb when the client reads it. 

Possibly the cleanest option would be to add a hook in the local receive
path that would enable us to copy&unmap any packets destined for local
delivery. Anyhow, not for 3.0.0 ...]

Ian

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2005-08-05 14:09 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-08-04 15:04 help with bugs Ian Pratt
2005-08-04 19:53 ` Anthony Liguori
2005-08-04 20:18 ` Sean Dague
2005-08-04 20:49   ` Nivedita Singhvi
2005-08-04 20:54     ` Jerone Young
2005-08-04 20:59       ` Nivedita Singhvi
2005-08-04 20:49 ` David F Barrera
2005-08-05  8:29 ` Gerd Knorr
  -- strict thread matches above, loose matches on Subject: below --
2005-08-04 22:00 Ian Pratt
2005-08-04 23:48 Ian Pratt
2005-08-05 12:23 ` Sean Dague
2005-08-05 14:09 Ian Pratt

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.