From: Jacek Konieczny <jajcus@jajcus.net>
To: Ian Campbell <Ian.Campbell@citrix.com>
Cc: Mariusz Mazur <mmazur@axeos.com>, xen-devel@lists.xen.org
Subject: Re: [BUG] VIF rate limiting locks up network in the whole system
Date: Fri, 09 May 2014 13:44:04 +0200 [thread overview]
Message-ID: <536CBF84.9070400@jajcus.net> (raw)
In-Reply-To: <1399631552.9513.152.camel@kazak.uk.xensource.com>
On 05/09/14 12:32, Ian Campbell wrote:
> On Fri, 2014-05-09 at 12:25 +0200, Jacek Konieczny wrote:
>
>>> Do they perhaps differ between the working and non-working case
>>> (despite the input configuration being the same)?
>>
>> I will check that. I think this can be safely done even on a production
>> server, still running the old Xen and kernel.
>
> Just to be clear I meant working with rate= (on the old setup) and not
> working without rate= (on the new setup). Working without rate= won't
> tell us much, since those keys simply won't be present..
Yes, I understood that.
I used the same xl configuration file on two hosts.
Xen 4.4.0, Linux 3.13.6 (not-working setup):
/local/domain/0/backend/vif/24/0/frontend =
"/local/domain/24/device/vif/0" (n0,r24)
/local/domain/0/backend/vif/24/0/frontend-id = "24" (n0,r24)
/local/domain/0/backend/vif/24/0/online = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/state = "4" (n0,r24)
/local/domain/0/backend/vif/24/0/script = "/etc/xen/scripts/vif-bridge"
(n0,r24)
/local/domain/0/backend/vif/24/0/mac = "02:00:0f:ff:00:1f" (n0,r24)
/local/domain/0/backend/vif/24/0/rate = "800,50000" (n0,r24)
/local/domain/0/backend/vif/24/0/bridge = "xenbr0" (n0,r24)
/local/domain/0/backend/vif/24/0/handle = "0" (n0,r24)
/local/domain/0/backend/vif/24/0/type = "vif" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-sg = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-gso-tcpv4 = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-gso-tcpv6 = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-ipv6-csum-offload = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-rx-copy = "1" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-rx-flip = "0" (n0,r24)
/local/domain/0/backend/vif/24/0/feature-split-event-channels = "1"
(n0,r24)
/local/domain/0/backend/vif/24/0/hotplug-status = "connected" (n0,r24)
Xen 4.2.1, kernel 3.7.1 (old working setup, I don't have 4.3 and newer
kernel handy):
/local/domain/0/backend/vif/20/0/frontend =
"/local/domain/20/device/vif/0" (n0,r20)
/local/domain/0/backend/vif/20/0/frontend-id = "20" (n0,r20)
/local/domain/0/backend/vif/20/0/online = "1" (n0,r20)
/local/domain/0/backend/vif/20/0/state = "4" (n0,r20)
/local/domain/0/backend/vif/20/0/script = "/etc/xen/scripts/vif-bridge"
(n0,r20)
/local/domain/0/backend/vif/20/0/mac = "02:00:0d:ff:00:1f" (n0,r20)
/local/domain/0/backend/vif/20/0/rate = "800,50000" (n0,r20)
/local/domain/0/backend/vif/20/0/bridge = "br1" (n0,r20)
/local/domain/0/backend/vif/20/0/handle = "0" (n0,r20)
/local/domain/0/backend/vif/20/0/type = "vif" (n0,r20)
/local/domain/0/backend/vif/20/0/feature-sg = "1" (n0,r20)
/local/domain/0/backend/vif/20/0/feature-gso-tcpv4 = "1" (n0,r20)
/local/domain/0/backend/vif/20/0/feature-rx-copy = "1" (n0,r20)
/local/domain/0/backend/vif/20/0/feature-rx-flip = "0" (n0,r20)
/local/domain/0/backend/vif/20/0/hotplug-status = "connected" (n0,r20)
No change in the 'rate' value here, but the 'features' are different.
>>> Those keys then affect netback's behaviour which is why I am interested
>>> in whether the kernel version has changed.
>>
> I think having confirmed that the xenstore keys are unchanged then the
> kernel side should be the focus.
I have booted the Xen 4.4.0 host with an older kernel: 3.7.10
The system does not lock up any more.
Xenstore variables for the backend:
/local/domain/1/device/vif/0/backend = "/local/domain/0/backend/vif/1/0"
(n1,r0)
/local/domain/1/device/vif/0/backend-id = "0" (n1,r0)
/local/domain/1/device/vif/0/state = "4" (n1,r0)
/local/domain/1/device/vif/0/handle = "0" (n1,r0)
/local/domain/1/device/vif/0/mac = "02:00:0f:ff:00:1f" (n1,r0)
/local/domain/1/device/vif/0/tx-ring-ref = "9" (n1,r0)
/local/domain/1/device/vif/0/rx-ring-ref = "768" (n1,r0)
/local/domain/1/device/vif/0/event-channel = "11" (n1,r0)
/local/domain/1/device/vif/0/request-rx-copy = "1" (n1,r0)
/local/domain/1/device/vif/0/feature-rx-notify = "1" (n1,r0)
/local/domain/1/device/vif/0/feature-sg = "1" (n1,r0)
/local/domain/1/device/vif/0/feature-gso-tcpv4 = "1" (n1,r0)
/local/domain/1/device/vif/0/feature-gso-tcpv6 = "1" (n1,r0)
/local/domain/1/device/vif/0/feature-ipv6-csum-offload = "1" (n1,r0)
Is it possible, that one of the features introduced by the 3.13 kernel is
faulty (e.g. the 'feature-split-event-channels')?
Is there a way to selectively enable/disable those features without changing
the kernel?
I will also try the 3.14.3 kernel, but I need to prepare it first.
Greets,
Jacek
next prev parent reply other threads:[~2014-05-09 11:44 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-05-09 7:03 [BUG] VIF rate limiting locks up network in the whole system Jacek Konieczny
2014-05-09 9:18 ` Ian Campbell
2014-05-09 10:25 ` Jacek Konieczny
2014-05-09 10:32 ` Ian Campbell
2014-05-09 11:44 ` Jacek Konieczny [this message]
2014-05-09 11:55 ` Ian Campbell
2014-05-09 12:44 ` Jacek Konieczny
2014-05-09 13:01 ` Ian Campbell
2014-05-09 13:43 ` Jacek Konieczny
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=536CBF84.9070400@jajcus.net \
--to=jajcus@jajcus.net \
--cc=Ian.Campbell@citrix.com \
--cc=mmazur@axeos.com \
--cc=xen-devel@lists.xen.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).