Re: [Qemu-devel] [PATCH v2] virtio-blk: Fix double completion for werror=stop

qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed

From: Fam Zheng <famz@redhat.com>
To: Stefan Hajnoczi <stefanha@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
	lvivier@redhat.com, qemu-block@nongnu.org, pl@kamp.de,
	qemu-devel@nongnu.org, qemu-stable@nongnu.org,
	pbonzini@redhat.com, dgibson@redhat.com
Subject: Re: [Qemu-devel] [PATCH v2] virtio-blk: Fix double completion for werror=stop
Date: Tue, 17 Nov 2015 15:28:29 +0800	[thread overview]
Message-ID: <20151117072829.GF28076@ad.usersys.redhat.com> (raw)
In-Reply-To: <20151117065858.GH16268@stefanha-x1.localdomain>

On Tue, 11/17 14:58, Stefan Hajnoczi wrote:
> On Mon, Nov 16, 2015 at 02:10:36PM +0800, Fam Zheng wrote:
> > When a request R is absorbed by request M, it is appended to the
> > "mr_next" queue led by M, and is completed together with the completion
> > of M, in virtio_blk_rw_complete.
> > 
> > With error policy equals stop, if M has an I/O error, now R also gets
> > prepended to the per device DMA restart queue, which will be retried
> > when VM resumes.  It leads to a double completion (in symptoms of memory
> > corruption or use after free).
> > 
> > Adding R to the queue is superfluous, only M needs to be in the queue.
> > 
> > Fix this by marking request R as "merged" and skipping it in
> > virtio_blk_handle_rw_error.

The commit message is outdated. "merged" requests are actually skipped in
virtio_blk_handle_request.

> > 
> > Cc: qemu-stable@nongnu.org
> > Signed-off-by: Fam Zheng <famz@redhat.com>
> > 
> > ---
> > 
> > v2: Don't lose the request in migration. [Paolo]
> > ---
> >  hw/block/virtio-blk.c          | 7 +++++++
> >  include/hw/virtio/virtio-blk.h | 1 +
> >  2 files changed, 8 insertions(+)
> > 
> > diff --git a/hw/block/virtio-blk.c b/hw/block/virtio-blk.c
> > index e70fccf..5cdb06f 100644
> > --- a/hw/block/virtio-blk.c
> > +++ b/hw/block/virtio-blk.c
> > @@ -36,6 +36,7 @@ VirtIOBlockReq *virtio_blk_alloc_request(VirtIOBlock *s)
> >      req->in_len = 0;
> >      req->next = NULL;
> >      req->mr_next = NULL;
> > +    req->merged = false;
> >      return req;
> >  }
> >  
> > @@ -344,6 +345,7 @@ static inline void submit_requests(BlockBackend *blk, MultiReqBuffer *mrb,
> >          for (i = start + 1; i < start + num_reqs; i++) {
> >              qemu_iovec_concat(qiov, &mrb->reqs[i]->qiov, 0,
> >                                mrb->reqs[i]->qiov.size);
> > +            mrb->reqs[i]->merged = true;
> >              mrb->reqs[i - 1]->mr_next = mrb->reqs[i];
> >              nb_sectors += mrb->reqs[i]->qiov.size / BDRV_SECTOR_SIZE;
> >          }
> > @@ -511,6 +513,11 @@ void virtio_blk_handle_request(VirtIOBlockReq *req, MultiReqBuffer *mrb)
> >                - sizeof(struct virtio_blk_inhdr);
> >      iov_discard_back(in_iov, &in_num, sizeof(struct virtio_blk_inhdr));
> >  
> > +    if (req->merged) {
> > +        /* Enough for restarting a (migrated) merged request, no need to
> > +         * actually submit I/O. */
> > +        return;
> > +    }

This is not enough.

There is a risk that the coalesced requests being restarted here are also
merged, which will leak the requests originally merged into it.

> >      type = virtio_ldl_p(VIRTIO_DEVICE(req->dev), &req->out.type);
> >  
> >      /* VIRTIO_BLK_T_OUT defines the command direction. VIRTIO_BLK_T_BARRIER
> > diff --git a/include/hw/virtio/virtio-blk.h b/include/hw/virtio/virtio-blk.h
> > index 6bf5905..db4adf4 100644
> > --- a/include/hw/virtio/virtio-blk.h
> > +++ b/include/hw/virtio/virtio-blk.h
> > @@ -70,6 +70,7 @@ typedef struct VirtIOBlockReq {
> >      size_t in_len;
> >      struct VirtIOBlockReq *next;
> >      struct VirtIOBlockReq *mr_next;
> > +    bool merged;
> >      BlockAcctCookie acct;
> >  } VirtIOBlockReq;
> 
> I'm not sure if this patch truly fixes the bug:
> 
> virtio_blk_rw_complete() doesn't do req->mr_next = NULL.  There is a
> potential double-free if resubmitting failed requests doesn't overwrite
> themr_next field.
> 
> This can be fixed by adding req->mr_next = NULL to the loop in
> virtio_blk_rw_complete().
> 
> Is that enough to solve the bug?  I don't think adding a new field is
> necessary.
> 
> If not, please explain the double-free.

The first free is the expected one when the coalesced request is completed.
The second free is because virtio_blk_rw_complete was also called on requests
who have "merged == true", which is a mistake in virtio_blk_dma_restart_bh.

I don't think adding req->mr_next in virtio_blk_rw_complete in the I/O error
path is right or helpful.

Fam

next prev parent reply	other threads:[~2015-11-17  7:28 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-11-16  6:10 [Qemu-devel] [PATCH v2] virtio-blk: Fix double completion for werror=stop Fam Zheng
2015-11-16 12:21 ` Laurent Vivier
2015-11-17  6:58 ` Stefan Hajnoczi
2015-11-17  7:28   ` Fam Zheng [this message]
2015-11-17  9:30     ` Stefan Hajnoczi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20151117072829.GF28076@ad.usersys.redhat.com \
    --to=famz@redhat.com \
    --cc=dgibson@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=lvivier@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=pl@kamp.de \
    --cc=qemu-block@nongnu.org \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-stable@nongnu.org \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).