From: Kevin Wolf <kwolf@redhat.com>
To: Max Reitz <mreitz@redhat.com>
Cc: fam@euphon.net,
Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>,
qemu-block@nongnu.org, qemu-devel@nongnu.org, den@openvz.org,
jsnow@redhat.com
Subject: Re: [Qemu-devel] [PATCH v3 1/9] block: add .bdrv_need_rw_file_child_during_reopen_rw handler
Date: Fri, 2 Aug 2019 17:42:56 +0200 [thread overview]
Message-ID: <20190802154256.GE6379@localhost.localdomain> (raw)
In-Reply-To: <2a105159-ab90-8f7c-bba9-4cec27e6144c@redhat.com>
[-- Attachment #1: Type: text/plain, Size: 6483 bytes --]
Am 31.07.2019 um 14:09 hat Max Reitz geschrieben:
> On 25.07.19 11:18, Vladimir Sementsov-Ogievskiy wrote:
> > On reopen to rw parent may need rw access to child in .prepare, for
> > example qcow2 needs to write IN_USE flags into stored bitmaps
> > (currently it is done in a hacky way after commit and don't work).
> > So, let's introduce such logic.
> >
> > The drawback is that in worst case bdrv_reopen_set_read_only may finish
> > with error and in some intermediate state: some nodes reopened RW and
> > some are not. But this is a way to fix bug around reopening qcow2
> > bitmaps in the following commits.
>
> This commit message doesn’t really explain what this patch does.
>
> > Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
> > ---
> > include/block/block_int.h | 2 +
> > block.c | 144 ++++++++++++++++++++++++++++++++++----
> > 2 files changed, 133 insertions(+), 13 deletions(-)
> >
> > diff --git a/include/block/block_int.h b/include/block/block_int.h
> > index 3aa1e832a8..7bd6fd68dd 100644
> > --- a/include/block/block_int.h
> > +++ b/include/block/block_int.h
> > @@ -531,6 +531,8 @@ struct BlockDriver {
> > uint64_t parent_perm, uint64_t parent_shared,
> > uint64_t *nperm, uint64_t *nshared);
> >
> > + bool (*bdrv_need_rw_file_child_during_reopen_rw)(BlockDriverState *bs);
> > +
> > /**
> > * Bitmaps should be marked as 'IN_USE' in the image on reopening image
> > * as rw. This handler should realize it. It also should unset readonly
> > diff --git a/block.c b/block.c
> > index cbd8da5f3b..3c8e1c59b4 100644
> > --- a/block.c
> > +++ b/block.c
> > @@ -1715,10 +1715,12 @@ static void bdrv_get_cumulative_perm(BlockDriverState *bs, uint64_t *perm,
> > uint64_t *shared_perm);
> >
> > typedef struct BlockReopenQueueEntry {
> > - bool prepared;
> > - bool perms_checked;
> > - BDRVReopenState state;
> > - QSIMPLEQ_ENTRY(BlockReopenQueueEntry) entry;
> > + bool reopened_file_child_rw;
> > + bool changed_file_child_perm_rw;
> > + bool prepared;
> > + bool perms_checked;
> > + BDRVReopenState state;
> > + QSIMPLEQ_ENTRY(BlockReopenQueueEntry) entry;
> > } BlockReopenQueueEntry;
> >
> > /*
> > @@ -3421,6 +3423,105 @@ BlockReopenQueue *bdrv_reopen_queue(BlockReopenQueue *bs_queue,
> > keep_old_opts);
> > }
> >
> > +static int bdrv_reopen_set_read_only_drained(BlockDriverState *bs,
> > + bool read_only,
> > + Error **errp)
> > +{
> > + BlockReopenQueue *queue;
> > + QDict *opts = qdict_new();
> > +
> > + qdict_put_bool(opts, BDRV_OPT_READ_ONLY, read_only);
> > +
> > + queue = bdrv_reopen_queue(NULL, bs, opts, true);
> > +
> > + return bdrv_reopen_multiple(queue, errp);
> > +}
> > +
> > +/*
> > + * handle_recursive_reopens
> > + *
> > + * On fail it needs rollback_recursive_reopens to be called.
>
> It would be nice if this description actually said anything about what
> the function is supposed to do.
>
> > + */
> > +static int handle_recursive_reopens(BlockReopenQueueEntry *current,
> > + Error **errp)
> > +{
> > + int ret;
> > + BlockDriverState *bs = current->state.bs;
> > +
> > + /*
> > + * We use the fact that in reopen-queue children are always following
> > + * parents.
> > + * TODO: Switch BlockReopenQueue to be QTAILQ and use
> > + * QTAILQ_FOREACH_REVERSE.
>
> Why don’t you do that first? It would make the code more obvious at
> least to me.
>
> > + */
> > + if (QSIMPLEQ_NEXT(current, entry)) {
> > + ret = handle_recursive_reopens(QSIMPLEQ_NEXT(current, entry), errp);
> > + if (ret < 0) {
> > + return ret;
> > + }
> > + }
> > +
> > + if ((current->state.flags & BDRV_O_RDWR) && bs->file && bs->drv &&
> > + bs->drv->bdrv_need_rw_file_child_during_reopen_rw &&
> > + bs->drv->bdrv_need_rw_file_child_during_reopen_rw(bs))
> > + {
> > + if (!bdrv_is_writable(bs->file->bs)) {
> > + ret = bdrv_reopen_set_read_only_drained(bs->file->bs, false, errp);
>
> Hm. Sorry, I find this all a bit hard to understand. (No comments and
> all.)
>
> I understand that this is for an RO -> RW transition? Everything is
> still RO, but the parent will need an RW child before it transitions to
> RW itself.
>
>
> I’m going to be honest up front, I don’t like this very much. But I
> think it may be a reasonable solution for now.
>
> As I remember, the problem was that when reopening a qcow2 node from RO
> to RW, we need to write something in .prepare() (because it can fail),
> but naturally no .prepare() is called after any .commit(), so no matter
> the order of nodes in the ReopenQueue, the child node will never be RW
> by this point.
>
> Hm. To me that mostly means that making the whole reopen process a
> transaction was just a dream that turns out not to work.
This patch already looks somewhat complicated to me, and what you're
proposing below sounds another order of magnitude more complex.
But I think the important point is your last sentence above. Once we
acknowledge that we can't possibly maintain full transaction semantics,
I'll ask this naive question: What prevents us from just keeping the
additional write in .commit?
It is expected to work in the common case, so we're only talking about
the behaviour in error cases anyway. If something fails and we can't do
things in a transactionable way, we need to decide what the surprise
result will look like, because we can neither guarantee a rollback nor
successful completion.
If the write fails unexpectedly, and we end up with a qcow2 image that
is opened r/w, but has read-only bitmaps - wouldn't that be a reasonable
result? It seems much easier to explain than some dependency subchain
already being committed and the rest rolled back.
So, I guess my question is, what is the specifc scenario you're trying
to fix with this series (why isn't the final patch a test case that
would answer this question?), and are we sure that the cure isn't worse
than the disease?
Kevin
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 801 bytes --]
next prev parent reply other threads:[~2019-08-02 15:43 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-07-25 9:18 [Qemu-devel] [PATCH v3 for-4.1? 0/9] qcow2-bitmaps: rewrite reopening logic Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 1/9] block: add .bdrv_need_rw_file_child_during_reopen_rw handler Vladimir Sementsov-Ogievskiy
2019-07-31 12:09 ` Max Reitz
2019-08-01 14:02 ` Vladimir Sementsov-Ogievskiy
2019-08-01 19:06 ` Max Reitz
2019-08-02 8:49 ` Vladimir Sementsov-Ogievskiy
2019-08-02 15:42 ` Kevin Wolf [this message]
2019-08-02 16:23 ` Vladimir Sementsov-Ogievskiy
2019-08-02 16:41 ` Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 2/9] iotests.py: add event_wait_log and events_wait_log helpers Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 3/9] iotests: add test 260 to check bitmap life after snapshot + commit Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 4/9] block/qcow2-bitmap: get rid of bdrv_has_changed_persistent_bitmaps Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 5/9] block/qcow2-bitmap: drop qcow2_reopen_bitmaps_rw_hint() Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 6/9] block/qcow2-bitmap: do not remove bitmaps on reopen-ro Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 7/9] block/qcow2-bitmap: fix and improve qcow2_reopen_bitmaps_rw Vladimir Sementsov-Ogievskiy
2019-07-25 9:18 ` [Qemu-devel] [PATCH v3 8/9] block/qcow2-bitmap: fix reopening bitmaps to RW Vladimir Sementsov-Ogievskiy
2019-07-25 9:19 ` [Qemu-devel] [PATCH v3 9/9] qcow2-bitmap: move bitmap reopen-rw code to qcow2_reopen_prepare Vladimir Sementsov-Ogievskiy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190802154256.GE6379@localhost.localdomain \
--to=kwolf@redhat.com \
--cc=den@openvz.org \
--cc=fam@euphon.net \
--cc=jsnow@redhat.com \
--cc=mreitz@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=vsementsov@virtuozzo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).