From: Kevin Wolf <kwolf@redhat.com>
To: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
Cc: qemu-devel@nongnu.org, qemu-block@nongnu.org, famz@redhat.com,
stefanha@redhat.com, mreitz@redhat.com, jcody@redhat.com,
eblake@redhat.com, jsnow@redhat.com, den@openvz.org
Subject: Re: [Qemu-devel] [PATCH 1/2] block: add BDRV_REQ_SERIALISING flag
Date: Wed, 4 Jul 2018 17:08:33 +0200 [thread overview]
Message-ID: <20180704150833.GG4334@localhost.localdomain> (raw)
In-Reply-To: <0e6e9f14-ad45-51c5-0d14-3e2e3dcaf5cb@virtuozzo.com>
Am 04.07.2018 um 16:44 hat Vladimir Sementsov-Ogievskiy geschrieben:
> 03.07.2018 21:07, Vladimir Sementsov-Ogievskiy wrote:
> > Serialized writes should be used in copy-on-write of backup(sync=none)
> > for image fleecing scheme.
> >
> > Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
> > ---
> > include/block/block.h | 5 ++++-
> > block/io.c | 4 ++++
> > 2 files changed, 8 insertions(+), 1 deletion(-)
> >
> > diff --git a/include/block/block.h b/include/block/block.h
> > index e5c7759a0c..107113aad5 100644
> > --- a/include/block/block.h
> > +++ b/include/block/block.h
> > @@ -58,8 +58,11 @@ typedef enum {
> > * content. */
> > BDRV_REQ_WRITE_UNCHANGED = 0x40,
> > + /* Force request serializing. Only for writes. */
> > + BDRV_REQ_SERIALISING = 0x80,
> > +
> > /* Mask of valid flags */
> > - BDRV_REQ_MASK = 0x7f,
> > + BDRV_REQ_MASK = 0xff,
> > } BdrvRequestFlags;
> > typedef struct BlockSizes {
> > diff --git a/block/io.c b/block/io.c
> > index 1a2272fad3..d5ba078514 100644
> > --- a/block/io.c
> > +++ b/block/io.c
> > @@ -1572,6 +1572,10 @@ static int coroutine_fn bdrv_aligned_pwritev(BdrvChild *child,
> > max_transfer = QEMU_ALIGN_DOWN(MIN_NON_ZERO(bs->bl.max_transfer, INT_MAX),
> > align);
> > + if (flags & BDRV_REQ_SERIALISING) {
> > + mark_request_serialising(req, bdrv_get_cluster_size(bs));
> > + }
> > +
> > waited = wait_serialising_requests(req);
> > assert(!waited || !req->serialising);
>
> Kevin, about this assertion, introduced in 28de2dcd88de "block: Assert
> serialisation assumptions in pwritev"? Will not it fail with fleecing
> scheme? I'm afraid it will, when we will wait for client read with our
> request, marked serializing a moment ago...
Hm, looks like it yes.
> Can we just switch it to assert(!waited || !req->partial);, setting
> req->partial in bdrv_co_pwritev for parts of unaligned requests? And allow
> new flag only for aligned requests?
>
> Other ideas?
The commit message of 28de2dcd88de tells you what we need to do (and
that just changing the assertion is wrong):
If a request calls wait_serialising_requests() and actually has to wait
in this function (i.e. a coroutine yield), other requests can run and
previously read data (like the head or tail buffer) could become
outdated. In this case, we would have to restart from the beginning to
read in the updated data.
However, we're lucky and don't actually need to do that: A request can
only wait in the first call of wait_serialising_requests() because we
mark it as serialising before that call, so any later requests would
wait. So as we don't wait in practice, we don't have to reload the data.
This is an important assumption that may not be broken or data
corruption will happen. Document it with some assertions.
So we may need to return -EAGAIN here, check that in the caller and
repeat the write request from the very start.
Kevin
next prev parent reply other threads:[~2018-07-04 15:08 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-07-03 18:07 [Qemu-devel] [PATCH 0/2] fix image fleecing Vladimir Sementsov-Ogievskiy
2018-07-03 18:07 ` [Qemu-devel] [PATCH 1/2] block: add BDRV_REQ_SERIALISING flag Vladimir Sementsov-Ogievskiy
2018-07-04 14:44 ` Vladimir Sementsov-Ogievskiy
2018-07-04 15:08 ` Kevin Wolf [this message]
2018-07-04 16:11 ` Vladimir Sementsov-Ogievskiy
2018-07-04 16:20 ` Kevin Wolf
2018-07-04 16:36 ` Vladimir Sementsov-Ogievskiy
2018-07-04 17:06 ` Vladimir Sementsov-Ogievskiy
2018-07-05 8:34 ` Kevin Wolf
2018-07-03 18:07 ` [Qemu-devel] [PATCH 2/2] block/backup: fix fleecing scheme: use serialized writes Vladimir Sementsov-Ogievskiy
2018-07-04 10:39 ` Kevin Wolf
2018-07-04 12:31 ` Vladimir Sementsov-Ogievskiy
2018-07-04 13:20 ` Vladimir Sementsov-Ogievskiy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180704150833.GG4334@localhost.localdomain \
--to=kwolf@redhat.com \
--cc=den@openvz.org \
--cc=eblake@redhat.com \
--cc=famz@redhat.com \
--cc=jcody@redhat.com \
--cc=jsnow@redhat.com \
--cc=mreitz@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@redhat.com \
--cc=vsementsov@virtuozzo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).