From: Paolo Bonzini <pbonzini@redhat.com>
To: Alexey Kardashevskiy <aik@ozlabs.ru>, qemu-devel@nongnu.org
Cc: Kevin Wolf <kwolf@redhat.com>, Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: [Qemu-devel] [RFC PATCH] block/migration: Disable cache invalidate for incoming migration
Date: Thu, 02 Oct 2014 11:45:06 +0200 [thread overview]
Message-ID: <542D1EA2.6060607@redhat.com> (raw)
In-Reply-To: <1412239972-23493-1-git-send-email-aik@ozlabs.ru>
Il 02/10/2014 10:52, Alexey Kardashevskiy ha scritto:
> When migrated using libvirt with "--copy-storage-all", at the end of
> migration there is race between NBD mirroring task trying to do flush
> and migration completion, both end up invalidating cache. Since qcow2
> driver does not handle this situation very well, random crashes happen.
>
> This disables the BDRV_O_INCOMING flag for the block device being migrated
> and restores it when NBD task is done.
>
> Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru>
> ---
>
>
> The commit log is not full and most likely incorrect as well
> as the patch :) Please, help. Thanks!
>
> The patch seems to fix the initial problem though.
>
>
> btw is there any easy way to migrate one QEMU to another
> using NBD (i.e. not using "migrate -b") and not using libvirt?
> What would the command line be? Debugging with libvirt is real
> pain :(
>
>
> ---
> block.c | 17 ++++-------------
> migration.c | 1 -
> nbd.c | 11 +++++++++++
> 3 files changed, 15 insertions(+), 14 deletions(-)
>
> diff --git a/block.c b/block.c
> index c5a251c..ed72e0a 100644
> --- a/block.c
> +++ b/block.c
> @@ -5073,6 +5073,10 @@ void bdrv_invalidate_cache_all(Error **errp)
> QTAILQ_FOREACH(bs, &bdrv_states, device_list) {
> AioContext *aio_context = bdrv_get_aio_context(bs);
>
> + if (!(bs->open_flags & BDRV_O_INCOMING)) {
> + continue;
> + }
> +
> aio_context_acquire(aio_context);
> bdrv_invalidate_cache(bs, &local_err);
> aio_context_release(aio_context);
This part is okay, though perhaps we should add it to
bdrv_invalidate_cache instead?
> @@ -5083,19 +5087,6 @@ void bdrv_invalidate_cache_all(Error **errp)
> }
> }
>
> -void bdrv_clear_incoming_migration_all(void)
> -{
> - BlockDriverState *bs;
> -
> - QTAILQ_FOREACH(bs, &bdrv_states, device_list) {
> - AioContext *aio_context = bdrv_get_aio_context(bs);
> -
> - aio_context_acquire(aio_context);
> - bs->open_flags = bs->open_flags & ~(BDRV_O_INCOMING);
> - aio_context_release(aio_context);
> - }
> -}
> -
> int bdrv_flush(BlockDriverState *bs)
> {
> Coroutine *co;
> diff --git a/migration.c b/migration.c
> index 8d675b3..c49a05a 100644
> --- a/migration.c
> +++ b/migration.c
> @@ -103,7 +103,6 @@ static void process_incoming_migration_co(void *opaque)
> }
> qemu_announce_self();
>
> - bdrv_clear_incoming_migration_all();
> /* Make sure all file formats flush their mutable metadata */
> bdrv_invalidate_cache_all(&local_err);
> if (local_err) {
This part I don't understand.
Shouldn't you at least be adding
bs->open_flags = bs->open_flags & ~(BDRV_O_INCOMING);
to bdrv_invalidate_cache?
> diff --git a/nbd.c b/nbd.c
> index e9b539b..7b479c0 100644
> --- a/nbd.c
> +++ b/nbd.c
> @@ -106,6 +106,7 @@ struct NBDExport {
> off_t dev_offset;
> off_t size;
> uint32_t nbdflags;
> + bool restore_incoming;
> QTAILQ_HEAD(, NBDClient) clients;
> QTAILQ_ENTRY(NBDExport) next;
>
> @@ -972,6 +973,13 @@ NBDExport *nbd_export_new(BlockDriverState *bs, off_t dev_offset,
> exp->ctx = bdrv_get_aio_context(bs);
> bdrv_ref(bs);
> bdrv_add_aio_context_notifier(bs, bs_aio_attached, bs_aio_detach, exp);
> +
> + if (bs->open_flags & BDRV_O_INCOMING) {
> + bdrv_invalidate_cache(bs, NULL);
> + exp->restore_incoming = !!(bs->open_flags & BDRV_O_INCOMING);
> + bs->open_flags &= ~(BDRV_O_INCOMING);
> + }
> +
> return exp;
> }
>
> @@ -1021,6 +1029,9 @@ void nbd_export_close(NBDExport *exp)
> if (exp->bs) {
> bdrv_remove_aio_context_notifier(exp->bs, bs_aio_attached,
> bs_aio_detach, exp);
> + if (exp->restore_incoming) {
> + exp->bs->open_flags |= BDRV_O_INCOMING;
> + }
> bdrv_unref(exp->bs);
> exp->bs = NULL;
> }
>
For this, I don't think you even need exp->restore_incoming, and then it
can simply be a one-liner
+ bdrv_invalidate_cache(bs, NULL);
if you modify bdrv_invalidate_cache instead of bdrv_invalidate_cache_all.
Paolo
next prev parent reply other threads:[~2014-10-02 9:45 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-10-02 8:52 [Qemu-devel] [RFC PATCH] block/migration: Disable cache invalidate for incoming migration Alexey Kardashevskiy
2014-10-02 9:45 ` Paolo Bonzini [this message]
2014-10-02 10:19 ` Alexey Kardashevskiy
2014-10-02 14:52 ` Stefan Hajnoczi
2014-10-03 4:12 ` Alexey Kardashevskiy
2014-10-06 10:03 ` Stefan Hajnoczi
2014-10-06 22:47 ` Alexey Kardashevskiy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=542D1EA2.6060607@redhat.com \
--to=pbonzini@redhat.com \
--cc=aik@ozlabs.ru \
--cc=kwolf@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.