From: Jason Wang <jasowang@redhat.com>
To: "Michael S. Tsirkin" <mst@redhat.com>
Cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org,
kvm@vger.kernel.org, virtualization@lists.linux-foundation.org
Subject: Re: [PATCH V2 1/2] vhost_net: stop polling socket during rx processing
Date: Tue, 31 May 2016 11:14:10 +0800 [thread overview]
Message-ID: <574D0182.9020701@redhat.com> (raw)
In-Reply-To: <20160530184211-mutt-send-email-mst@redhat.com>
On 2016年05月30日 23:47, Michael S. Tsirkin wrote:
> On Mon, May 30, 2016 at 02:47:53AM -0400, Jason Wang wrote:
>> We don't stop rx polling socket during rx processing, this will lead
>> unnecessary wakeups from under layer net devices (E.g
>> sock_def_readable() form tun). Rx will be slowed down in this
>> way. This patch avoids this by stop polling socket during rx
>> processing. A small drawback is that this introduces some overheads in
>> light load case because of the extra start/stop polling, but single
>> netperf TCP_RR does not notice any change. In a super heavy load case,
>> e.g using pktgen to inject packet to guest, we get about ~8.8%
>> improvement on pps:
>>
>> before: ~1240000 pkt/s
>> after: ~1350000 pkt/s
>>
>> Signed-off-by: Jason Wang <jasowang@redhat.com>
>> ---
>> drivers/vhost/net.c | 56 +++++++++++++++++++++++++++--------------------------
>> 1 file changed, 29 insertions(+), 27 deletions(-)
>>
>> diff --git a/drivers/vhost/net.c b/drivers/vhost/net.c
>> index 10ff494..e91603b 100644
>> --- a/drivers/vhost/net.c
>> +++ b/drivers/vhost/net.c
>> @@ -301,6 +301,32 @@ static bool vhost_can_busy_poll(struct vhost_dev *dev,
>> !vhost_has_work(dev);
>> }
>>
>> +static void vhost_net_disable_vq(struct vhost_net *n,
>> + struct vhost_virtqueue *vq)
>> +{
>> + struct vhost_net_virtqueue *nvq =
>> + container_of(vq, struct vhost_net_virtqueue, vq);
>> + struct vhost_poll *poll = n->poll + (nvq - n->vqs);
>> + if (!vq->private_data)
>> + return;
>> + vhost_poll_stop(poll);
>> +}
>> +
>> +static int vhost_net_enable_vq(struct vhost_net *n,
>> + struct vhost_virtqueue *vq)
>> +{
>> + struct vhost_net_virtqueue *nvq =
>> + container_of(vq, struct vhost_net_virtqueue, vq);
>> + struct vhost_poll *poll = n->poll + (nvq - n->vqs);
>> + struct socket *sock;
>> +
>> + sock = vq->private_data;
>> + if (!sock)
>> + return 0;
>> +
>> + return vhost_poll_start(poll, sock->file);
>> +}
>> +
>> static int vhost_net_tx_get_vq_desc(struct vhost_net *net,
>> struct vhost_virtqueue *vq,
>> struct iovec iov[], unsigned int iov_size,
> BTW we might want to rename these functions, name no longer
> reflects function ...
Do you mean adding something reflect busy polling in the name? Then the
name may be too long or have suggestion on the name?
>
>
>> @@ -627,6 +653,7 @@ static void handle_rx(struct vhost_net *net)
>> if (!sock)
>> goto out;
>> vhost_disable_notify(&net->dev, vq);
>> + vhost_net_disable_vq(net, vq);
>>
>> vhost_hlen = nvq->vhost_hlen;
>> sock_hlen = nvq->sock_hlen;
>> @@ -715,9 +742,10 @@ static void handle_rx(struct vhost_net *net)
>> total_len += vhost_len;
>> if (unlikely(total_len >= VHOST_NET_WEIGHT)) {
>> vhost_poll_queue(&vq->poll);
>> - break;
>> + goto out;
>> }
>> }
>> + vhost_net_enable_vq(net, vq);
> OK so if sock is readable but RX VQ is empty, this will
> immediately schedule another round of handle_rx and so ad
> infinitum,
>
> Looks like a bug.
Yes it is, will change the above headcount check to:
/* OK, now we need to know about added descriptors. */
if (!headcount) {
if (unlikely(vhost_enable_notify(&net->dev, vq))) {
/* They have slipped one in as we were
* doing that: check again. */
vhost_disable_notify(&net->dev, vq);
continue;
}
/* Nothing new? Wait for eventfd to tell us
* they refilled. */
goto out;
}
>
>
>> out:
>> mutex_unlock(&vq->mutex);
>> }
>> @@ -796,32 +824,6 @@ static int vhost_net_open(struct inode *inode, struct file *f)
>> return 0;
>> }
>>
>> -static void vhost_net_disable_vq(struct vhost_net *n,
>> - struct vhost_virtqueue *vq)
>> -{
>> - struct vhost_net_virtqueue *nvq =
>> - container_of(vq, struct vhost_net_virtqueue, vq);
>> - struct vhost_poll *poll = n->poll + (nvq - n->vqs);
>> - if (!vq->private_data)
>> - return;
>> - vhost_poll_stop(poll);
>> -}
>> -
>> -static int vhost_net_enable_vq(struct vhost_net *n,
>> - struct vhost_virtqueue *vq)
>> -{
>> - struct vhost_net_virtqueue *nvq =
>> - container_of(vq, struct vhost_net_virtqueue, vq);
>> - struct vhost_poll *poll = n->poll + (nvq - n->vqs);
>> - struct socket *sock;
>> -
>> - sock = vq->private_data;
>> - if (!sock)
>> - return 0;
>> -
>> - return vhost_poll_start(poll, sock->file);
>> -}
>> -
>> static struct socket *vhost_net_stop_vq(struct vhost_net *n,
>> struct vhost_virtqueue *vq)
>> {
>> --
>> 1.8.3.1
> --
> To unsubscribe from this list: send the line "unsubscribe kvm" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
_______________________________________________
Virtualization mailing list
Virtualization@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/virtualization
next prev parent reply other threads:[~2016-05-31 3:14 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <1464590874-39539-1-git-send-email-jasowang@redhat.com>
2016-05-30 6:47 ` [PATCH V2 1/2] vhost_net: stop polling socket during rx processing Jason Wang
2016-05-30 15:47 ` Michael S. Tsirkin
2016-05-31 3:14 ` Jason Wang [this message]
2016-05-30 6:47 ` [PATCH V2 2/2] vhost_net: conditionally enable tx polling Jason Wang
2016-05-30 15:55 ` Michael S. Tsirkin
[not found] ` <20160530155521.GA5427@redhat.com>
2016-05-31 3:23 ` Jason Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=574D0182.9020701@redhat.com \
--to=jasowang@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mst@redhat.com \
--cc=netdev@vger.kernel.org \
--cc=virtualization@lists.linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).