All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Michael S. Tsirkin" <mst@redhat.com>
To: Sasha Levin <levinsasha928@gmail.com>
Cc: markmc@redhat.com, kvm@vger.kernel.org,
	linux-kernel@vger.kernel.org,
	virtualization@lists.linux-foundation.org,
	Avi Kivity <avi@redhat.com>
Subject: Re: [PATCH] virtio-ring: Use threshold for switching to indirect descriptors
Date: Thu, 1 Dec 2011 12:26:42 +0200	[thread overview]
Message-ID: <20111201102640.GB8822@redhat.com> (raw)
In-Reply-To: <1322726977.3259.3.camel@lappy>

On Thu, Dec 01, 2011 at 10:09:37AM +0200, Sasha Levin wrote:
> On Thu, 2011-12-01 at 09:58 +0200, Michael S. Tsirkin wrote:
> > On Thu, Dec 01, 2011 at 01:12:25PM +1030, Rusty Russell wrote:
> > > On Wed, 30 Nov 2011 18:11:51 +0200, Sasha Levin <levinsasha928@gmail.com> wrote:
> > > > On Tue, 2011-11-29 at 16:58 +0200, Avi Kivity wrote:
> > > > > On 11/29/2011 04:54 PM, Michael S. Tsirkin wrote:
> > > > > > > 
> > > > > > > Which is actually strange, weren't indirect buffers introduced to make
> > > > > > > the performance *better*? From what I see it's pretty much the
> > > > > > > same/worse for virtio-blk.
> > > > > >
> > > > > > I know they were introduced to allow adding very large bufs.
> > > > > > See 9fa29b9df32ba4db055f3977933cd0c1b8fe67cd
> > > > > > Mark, you wrote the patch, could you tell us which workloads
> > > > > > benefit the most from indirect bufs?
> > > > > >
> > > > > 
> > > > > Indirects are really for block devices with many spindles, since there
> > > > > the limiting factor is the number of requests in flight.  Network
> > > > > interfaces are limited by bandwidth, it's better to increase the ring
> > > > > size and use direct buffers there (so the ring size more or less
> > > > > corresponds to the buffer size).
> > > > > 
> > > > 
> > > > I did some testing of indirect descriptors under different workloads.
> > > 
> > > MST and I discussed getting clever with dynamic limits ages ago, but it
> > > was down low on the TODO list.  Thanks for diving into this...
> > > 
> > > AFAICT, if the ring never fills, direct is optimal.  When the ring
> > > fills, indirect is optimal (we're better to queue now than later).
> > > 
> > > Why not something simple, like a threshold which drops every time we
> > > fill the ring?
> > > 
> > > struct vring_virtqueue
> > > {
> > > ...
> > >         int indirect_thresh;
> > > ...
> > > }
> > > 
> > > virtqueue_add_buf_gfp()
> > > {
> > > ...
> > > 
> > >         if (vq->indirect &&
> > >             (vq->vring.num - vq->num_free) + out + in > vq->indirect_thresh)
> > >                 return indirect()
> > > ...
> > > 
> > > 	if (vq->num_free < out + in) {
> > >                 if (vq->indirect && vq->indirect_thresh > 0)
> > >                         vq->indirect_thresh--;
> > >         
> > > ...
> > > }
> > > 
> > > Too dumb?
> > > 
> > > Cheers,
> > > Rusty.
> > 
> > We'll presumably need some logic to increment is back,
> > to account for random workload changes.
> > Something like slow start?
> 
> We can increment it each time the queue was less than 10% full, it
> should act like slow start, no?

No, we really shouldn't get an empty ring as long as things behave
well. What I meant is something like:

#define VIRTIO_DECREMENT 2
#define VIRTIO_INCREMENT 1
                if (vq->num_free < out + in) {
		     if (vq->indirect && vq->indirect_thresh > VIRTIO_DECREMENT)
                         vq->indirect_thresh /= VIRTIO_DECREMENT;
		} else {
			if (vq->indirect_thresh < vq->num)
				vq->indirect_thresh += VIRTIO_INCREMENT;
		}

So we try to avoid indirect but the moment there's no space, we decrease
the threshold drastically.  If you make the increment/decrement module
parameters it's easy to try different values.


> -- 
> 
> Sasha.

WARNING: multiple messages have this Message-ID (diff)
From: "Michael S. Tsirkin" <mst@redhat.com>
To: Sasha Levin <levinsasha928@gmail.com>
Cc: Rusty Russell <rusty@rustcorp.com.au>,
	Avi Kivity <avi@redhat.com>,
	linux-kernel@vger.kernel.org,
	virtualization@lists.linux-foundation.org, kvm@vger.kernel.org,
	markmc@redhat.com
Subject: Re: [PATCH] virtio-ring: Use threshold for switching to indirect descriptors
Date: Thu, 1 Dec 2011 12:26:42 +0200	[thread overview]
Message-ID: <20111201102640.GB8822@redhat.com> (raw)
In-Reply-To: <1322726977.3259.3.camel@lappy>

On Thu, Dec 01, 2011 at 10:09:37AM +0200, Sasha Levin wrote:
> On Thu, 2011-12-01 at 09:58 +0200, Michael S. Tsirkin wrote:
> > On Thu, Dec 01, 2011 at 01:12:25PM +1030, Rusty Russell wrote:
> > > On Wed, 30 Nov 2011 18:11:51 +0200, Sasha Levin <levinsasha928@gmail.com> wrote:
> > > > On Tue, 2011-11-29 at 16:58 +0200, Avi Kivity wrote:
> > > > > On 11/29/2011 04:54 PM, Michael S. Tsirkin wrote:
> > > > > > > 
> > > > > > > Which is actually strange, weren't indirect buffers introduced to make
> > > > > > > the performance *better*? From what I see it's pretty much the
> > > > > > > same/worse for virtio-blk.
> > > > > >
> > > > > > I know they were introduced to allow adding very large bufs.
> > > > > > See 9fa29b9df32ba4db055f3977933cd0c1b8fe67cd
> > > > > > Mark, you wrote the patch, could you tell us which workloads
> > > > > > benefit the most from indirect bufs?
> > > > > >
> > > > > 
> > > > > Indirects are really for block devices with many spindles, since there
> > > > > the limiting factor is the number of requests in flight.  Network
> > > > > interfaces are limited by bandwidth, it's better to increase the ring
> > > > > size and use direct buffers there (so the ring size more or less
> > > > > corresponds to the buffer size).
> > > > > 
> > > > 
> > > > I did some testing of indirect descriptors under different workloads.
> > > 
> > > MST and I discussed getting clever with dynamic limits ages ago, but it
> > > was down low on the TODO list.  Thanks for diving into this...
> > > 
> > > AFAICT, if the ring never fills, direct is optimal.  When the ring
> > > fills, indirect is optimal (we're better to queue now than later).
> > > 
> > > Why not something simple, like a threshold which drops every time we
> > > fill the ring?
> > > 
> > > struct vring_virtqueue
> > > {
> > > ...
> > >         int indirect_thresh;
> > > ...
> > > }
> > > 
> > > virtqueue_add_buf_gfp()
> > > {
> > > ...
> > > 
> > >         if (vq->indirect &&
> > >             (vq->vring.num - vq->num_free) + out + in > vq->indirect_thresh)
> > >                 return indirect()
> > > ...
> > > 
> > > 	if (vq->num_free < out + in) {
> > >                 if (vq->indirect && vq->indirect_thresh > 0)
> > >                         vq->indirect_thresh--;
> > >         
> > > ...
> > > }
> > > 
> > > Too dumb?
> > > 
> > > Cheers,
> > > Rusty.
> > 
> > We'll presumably need some logic to increment is back,
> > to account for random workload changes.
> > Something like slow start?
> 
> We can increment it each time the queue was less than 10% full, it
> should act like slow start, no?

No, we really shouldn't get an empty ring as long as things behave
well. What I meant is something like:

#define VIRTIO_DECREMENT 2
#define VIRTIO_INCREMENT 1
                if (vq->num_free < out + in) {
		     if (vq->indirect && vq->indirect_thresh > VIRTIO_DECREMENT)
                         vq->indirect_thresh /= VIRTIO_DECREMENT;
		} else {
			if (vq->indirect_thresh < vq->num)
				vq->indirect_thresh += VIRTIO_INCREMENT;
		}

So we try to avoid indirect but the moment there's no space, we decrease
the threshold drastically.  If you make the increment/decrement module
parameters it's easy to try different values.


> -- 
> 
> Sasha.

  reply	other threads:[~2011-12-01 10:26 UTC|newest]

Thread overview: 82+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-11-29  9:33 [PATCH] virtio-ring: Use threshold for switching to indirect descriptors Sasha Levin
2011-11-29  9:33 ` Sasha Levin
2011-11-29 12:56 ` Michael S. Tsirkin
2011-11-29 12:56   ` Michael S. Tsirkin
2011-11-29 13:34   ` Sasha Levin
2011-11-29 13:34     ` Sasha Levin
2011-11-29 13:54     ` Michael S. Tsirkin
2011-11-29 14:21       ` Sasha Levin
2011-11-29 14:21         ` Sasha Levin
2011-11-29 14:54         ` Michael S. Tsirkin
2011-11-29 14:54           ` Michael S. Tsirkin
2011-11-29 14:58           ` Avi Kivity
2011-11-29 14:58             ` Avi Kivity
2011-11-30 16:11             ` Sasha Levin
2011-11-30 16:11               ` Sasha Levin
2011-11-30 16:17               ` Sasha Levin
2011-11-30 16:17                 ` Sasha Levin
2011-12-01  2:42               ` Rusty Russell
2011-12-01  2:42                 ` Rusty Russell
2011-12-01  7:58                 ` Michael S. Tsirkin
2011-12-01  7:58                   ` Michael S. Tsirkin
2011-12-01  8:09                   ` Sasha Levin
2011-12-01  8:09                     ` Sasha Levin
2011-12-01 10:26                     ` Michael S. Tsirkin [this message]
2011-12-01 10:26                       ` Michael S. Tsirkin
2011-12-02  0:46                       ` Rusty Russell
2011-12-02  0:46                         ` Rusty Russell
2011-12-03 11:50                         ` Sasha Levin
2011-12-03 11:50                           ` Sasha Levin
2011-12-04 11:06                           ` Michael S. Tsirkin
2011-12-04 11:06                             ` Michael S. Tsirkin
2011-12-04 15:15                             ` Michael S. Tsirkin
2011-12-04 15:15                               ` Michael S. Tsirkin
2011-12-04 11:52                           ` Avi Kivity
2011-12-04 11:52                             ` Avi Kivity
2011-12-04 12:01                             ` Michael S. Tsirkin
2011-12-04 12:01                               ` Michael S. Tsirkin
2011-12-04 12:06                               ` Avi Kivity
2011-12-04 12:06                                 ` Avi Kivity
2011-12-04 15:11                                 ` Michael S. Tsirkin
2011-12-04 15:11                                   ` Michael S. Tsirkin
2011-12-04 15:16                                   ` Avi Kivity
2011-12-04 15:16                                     ` Avi Kivity
2011-12-04 16:00                                     ` Michael S. Tsirkin
2011-12-04 16:00                                       ` Michael S. Tsirkin
2011-12-04 16:33                                       ` Avi Kivity
2011-12-04 16:33                                         ` Avi Kivity
2011-12-05  0:10                                     ` Rusty Russell
2011-12-05  0:10                                       ` Rusty Russell
2011-12-05  9:52                                       ` Avi Kivity
2011-12-05  9:52                                         ` Avi Kivity
2011-12-06  5:07                                         ` Rusty Russell
2011-12-06  5:07                                           ` Rusty Russell
2011-12-06  9:58                                           ` Avi Kivity
2011-12-06  9:58                                             ` Avi Kivity
2011-12-06 12:03                                             ` Rusty Russell
2011-12-06 12:03                                             ` Rusty Russell
2011-12-07 13:37                                               ` Avi Kivity
2011-12-07 13:37                                                 ` Avi Kivity
2011-12-04 12:13                             ` Sasha Levin
2011-12-04 12:13                               ` Sasha Levin
2011-12-04 16:22                               ` Michael S. Tsirkin
2011-12-04 16:22                                 ` Michael S. Tsirkin
2011-12-04 17:34                                 ` Sasha Levin
2011-12-04 17:34                                   ` Sasha Levin
2011-12-04 17:37                                   ` Avi Kivity
2011-12-04 17:37                                     ` Avi Kivity
2011-12-04 17:39                                     ` Sasha Levin
2011-12-04 17:39                                       ` Sasha Levin
2011-12-04 18:23                                       ` Sasha Levin
2011-12-04 18:23                                         ` Sasha Levin
2011-12-07 14:02                                         ` Sasha Levin
2011-12-07 14:02                                           ` Sasha Levin
2011-12-07 15:48                                           ` Michael S. Tsirkin
2011-12-07 15:48                                             ` Michael S. Tsirkin
2011-12-08  9:44                                             ` Rusty Russell
2011-12-08  9:44                                             ` Rusty Russell
2011-12-08 10:37                                               ` Sasha Levin
2011-12-08 10:37                                                 ` Sasha Levin
2011-12-09  5:33                                                 ` Rusty Russell
2011-12-09  5:33                                                 ` Rusty Russell
2011-11-29 13:54     ` Michael S. Tsirkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20111201102640.GB8822@redhat.com \
    --to=mst@redhat.com \
    --cc=avi@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=levinsasha928@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=markmc@redhat.com \
    --cc=virtualization@lists.linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.