* ARP cache problems / slow connect times in routed mode - Bug #596 opened
@ 2006-03-31 20:49 Matt Ayres
2006-04-01 16:05 ` Matt Ayres
0 siblings, 1 reply; 6+ messages in thread
From: Matt Ayres @ 2006-03-31 20:49 UTC (permalink / raw)
To: xen-devel
Synopsis:
A user of mine has debugged this issue for me. It seems a Xen guest in
routed mode wants to arp cache any host it connects to with the MAC
address FE:FF:FF:FF:FF:FF. The user also identified long connection
times due to this. While a remote host is in the arp cache connection
times are fast (30ms or so), when it is not it can be well over 1000ms.
They have provided me the tcpdump output that proves this. They also
proved it is due to the ARP cache by statically adding a remote host to
the ARP cache and noting that connection times are very low.
Full debugging information is attached to the bug.
Bug URL: http://bugzilla.xensource.com/bugzilla/show_bug.cgi?id=596
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: ARP cache problems / slow connect times in routed mode - Bug #596 opened
2006-03-31 20:49 ARP cache problems / slow connect times in routed mode - Bug #596 opened Matt Ayres
@ 2006-04-01 16:05 ` Matt Ayres
2006-04-01 16:56 ` Keir Fraser
0 siblings, 1 reply; 6+ messages in thread
From: Matt Ayres @ 2006-04-01 16:05 UTC (permalink / raw)
To: xen-devel
Matt Ayres wrote:
> Synopsis:
>
> A user of mine has debugged this issue for me. It seems a Xen guest in
> routed mode wants to arp cache any host it connects to with the MAC
> address FE:FF:FF:FF:FF:FF. The user also identified long connection
> times due to this. While a remote host is in the arp cache connection
> times are fast (30ms or so), when it is not it can be well over 1000ms.
> They have provided me the tcpdump output that proves this. They also
> proved it is due to the ARP cache by statically adding a remote host to
> the ARP cache and noting that connection times are very low.
>
> Full debugging information is attached to the bug.
>
> Bug URL: http://bugzilla.xensource.com/bugzilla/show_bug.cgi?id=596
>
I have assigned this by to myself and marked it as INVALID. It appears
to be specific to CentOS / Fedora and my specific setup.
Thank you,
Matt Ayres
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: ARP cache problems / slow connect times in routed mode - Bug #596 opened
2006-04-01 16:05 ` Matt Ayres
@ 2006-04-01 16:56 ` Keir Fraser
2006-04-01 17:12 ` Matt Ayres
0 siblings, 1 reply; 6+ messages in thread
From: Keir Fraser @ 2006-04-01 16:56 UTC (permalink / raw)
To: Matt Ayres; +Cc: xen-devel
On 1 Apr 2006, at 17:05, Matt Ayres wrote:
>> A user of mine has debugged this issue for me. It seems a Xen guest
>> in routed mode wants to arp cache any host it connects to with the
>> MAC address FE:FF:FF:FF:FF:FF. The user also identified long
>> connection times due to this. While a remote host is in the arp cache
>> connection times are fast (30ms or so), when it is not it can be well
>> over 1000ms. They have provided me the tcpdump output that proves
>> this. They also proved it is due to the ARP cache by statically
>> adding a remote host to the ARP cache and noting that connection
>> times are very low.
>> Full debugging information is attached to the bug.
>> Bug URL: http://bugzilla.xensource.com/bugzilla/show_bug.cgi?id=596
>
> I have assigned this by to myself and marked it as INVALID. It
> appears to be specific to CentOS / Fedora and my specific setup.
We'll be interested to learn the full details if you manage to work out
what's going on. :-)
-- Keir
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: ARP cache problems / slow connect times in routed mode - Bug #596 opened
2006-04-01 16:56 ` Keir Fraser
@ 2006-04-01 17:12 ` Matt Ayres
2006-04-01 17:32 ` Keir Fraser
0 siblings, 1 reply; 6+ messages in thread
From: Matt Ayres @ 2006-04-01 17:12 UTC (permalink / raw)
To: Keir Fraser; +Cc: xen-devel
Keir Fraser wrote:
>
> On 1 Apr 2006, at 17:05, Matt Ayres wrote:
>
>>> A user of mine has debugged this issue for me. It seems a Xen guest
>>> in routed mode wants to arp cache any host it connects to with the
>>> MAC address FE:FF:FF:FF:FF:FF. The user also identified long
>>> connection times due to this. While a remote host is in the arp cache
>>> connection times are fast (30ms or so), when it is not it can be well
>>> over 1000ms. They have provided me the tcpdump output that proves
>>> this. They also proved it is due to the ARP cache by statically
>>> adding a remote host to the ARP cache and noting that connection
>>> times are very low.
>>> Full debugging information is attached to the bug.
>>> Bug URL: http://bugzilla.xensource.com/bugzilla/show_bug.cgi?id=596
>>
>> I have assigned this by to myself and marked it as INVALID. It
>> appears to be specific to CentOS / Fedora and my specific setup.
>
> We'll be interested to learn the full details if you manage to work out
> what's going on. :-)
>
I know exactly what went wrong. I chose to use 169.254.1.1 as the IP to
assign to my vif interfaces. Inside the guest a static route is added
for 169.254.1.1/24 via eth0 and then a default gateway to 169.254.1.1.
I chose this as various proxy ARP howto's use it and it is reserved
"link local" space, which made sense.
CentOS (RHEL) / Fedora add a static route for 169.254.0.0/18 for DHCP
purposes. I see no reason why, it's not required by any other
distribution and removing it doesn't make DHCP not work. Anyhow, it
appears having the finer-grained /24 route was causing all remote IP's
to be cached in the ARP table as local. Removing my /24 static route
fixes everything and causes only 169.254.1.1 to be in the ARP cache.
Perhaps the community can enlighten me, who is in the wrong here, RedHat
or I? We support many other distributions (Gentoo, Debian, Ubuntu,
Mandriva/Mandrake, Slackware) and no others want to add the link local
network as a static route.
The other oddity is why does having the /24 statically routed along with
the /18 cause any IP on the internet to be added to the ARP cache? That
part right there is what is most confusing to myself.
I fixed it, but I'm far from completely understanding it.
Thank you,
Matt Ayres
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: ARP cache problems / slow connect times in routed mode - Bug #596 opened
2006-04-01 17:12 ` Matt Ayres
@ 2006-04-01 17:32 ` Keir Fraser
2006-04-01 19:43 ` Matt Ayres
0 siblings, 1 reply; 6+ messages in thread
From: Keir Fraser @ 2006-04-01 17:32 UTC (permalink / raw)
To: Matt Ayres; +Cc: xen-devel
On 1 Apr 2006, at 18:12, Matt Ayres wrote:
> Perhaps the community can enlighten me, who is in the wrong here,
> RedHat or I? We support many other distributions (Gentoo, Debian,
> Ubuntu, Mandriva/Mandrake, Slackware) and no others want to add the
> link local network as a static route.
RedHat-based distros are configured with 'zeroconf' support by default,
which is what adds the 169.254 route iirc. If you don't care for that
(I can't imagine many people do) then it's a good idea to add
'NOZEROCONF=yes' to /etc/sysconfig/network.
> The other oddity is why does having the /24 statically routed along
> with the /18 cause any IP on the internet to be added to the ARP
> cache? That part right there is what is most confusing to myself.
Agreed, that doesn't seem to make any sense!
-- Keir
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: ARP cache problems / slow connect times in routed mode - Bug #596 opened
2006-04-01 17:32 ` Keir Fraser
@ 2006-04-01 19:43 ` Matt Ayres
0 siblings, 0 replies; 6+ messages in thread
From: Matt Ayres @ 2006-04-01 19:43 UTC (permalink / raw)
To: Keir Fraser; +Cc: xen-devel
Keir Fraser wrote:
>
>> The other oddity is why does having the /24 statically routed along
>> with the /18 cause any IP on the internet to be added to the ARP
>> cache? That part right there is what is most confusing to myself.
>
> Agreed, that doesn't seem to make any sense!
>
> -- Keir
I figured it out. Proxy ARP is _very_ touchy when it comes to subnets.
Since the netmask is a /32 subnet on the host vif the route works if
added as 169.254.1.1/32 on the guest (versus a /24). This allows RedHat
to do what they want and I need not worry if a user turns of zeroconf
support on their own now.
Thanks,
Matt
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2006-04-01 19:43 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-03-31 20:49 ARP cache problems / slow connect times in routed mode - Bug #596 opened Matt Ayres
2006-04-01 16:05 ` Matt Ayres
2006-04-01 16:56 ` Keir Fraser
2006-04-01 17:12 ` Matt Ayres
2006-04-01 17:32 ` Keir Fraser
2006-04-01 19:43 ` Matt Ayres
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.