From: Lukas Straub <lukasstraub2@web.de>
To: "Zhang, Chen" <chen.zhang@intel.com>
Cc: "Marc-André Lureau" <marcandre.lureau@redhat.com>,
"Jason Wang" <jasowang@redhat.com>,
qemu-devel <qemu-devel@nongnu.org>,
"Li Zhijian" <lizhijian@cn.fujitsu.com>,
"Paolo Bonzini" <pbonzini@redhat.com>
Subject: Re: [PATCH v4 5/6] net/colo-compare.c, softmmu/vl.c: Check that colo-compare is active
Date: Thu, 7 May 2020 17:54:21 +0200 [thread overview]
Message-ID: <20200507175421.185b48f8@luklap> (raw)
In-Reply-To: <0380e994e53947cd961cc363d46561d3@intel.com>
[-- Attachment #1: Type: text/plain, Size: 6995 bytes --]
On Thu, 7 May 2020 11:38:04 +0000
"Zhang, Chen" <chen.zhang@intel.com> wrote:
> > -----Original Message-----
> > From: Lukas Straub <lukasstraub2@web.de>
> > Sent: Monday, May 4, 2020 6:28 PM
> > To: qemu-devel <qemu-devel@nongnu.org>
> > Cc: Zhang, Chen <chen.zhang@intel.com>; Li Zhijian
> > <lizhijian@cn.fujitsu.com>; Jason Wang <jasowang@redhat.com>; Marc-
> > André Lureau <marcandre.lureau@redhat.com>; Paolo Bonzini
> > <pbonzini@redhat.com>
> > Subject: [PATCH v4 5/6] net/colo-compare.c, softmmu/vl.c: Check that colo-
> > compare is active
> >
> > If the colo-compare object is removed before failover and a checkpoint
> > happens, qemu crashes because it tries to lock the destroyed event_mtx in
> > colo_notify_compares_event.
> >
> > Fix this by checking if everything is initialized by introducing a new variable
> > colo_compare_active which is protected by a new mutex
> > colo_compare_mutex. The new mutex also protects against concurrent
> > access of the net_compares list and makes sure that
> > colo_notify_compares_event isn't active while we destroy event_mtx and
> > event_complete_cond.
> >
> > With this it also is again possible to use colo without colo-compare (periodic
> > mode) and to use multiple colo-compare for multiple network interfaces.
> >
>
> Hi Lukas,
>
> For this case I think we don't need to touch vl.c code, we can solve this issue from another perspective:
> How to remove colo-compare?
> User will use qemu-monitor or QMP command to disable an object, so we just need return operation failed
> When user try to remove colo-compare object while COLO is running.
Yeah, but that still leaves the other problem that colo can't be used without colo-compare (qemu crashes then).
Regards,
Lukas Straub
> Thanks
> Zhang Chen
>
> > Signed-off-by: Lukas Straub <lukasstraub2@web.de>
> > ---
> > net/colo-compare.c | 35 +++++++++++++++++++++++++++++------
> > net/colo-compare.h | 1 +
> > softmmu/vl.c | 2 ++
> > 3 files changed, 32 insertions(+), 6 deletions(-)
> >
> > diff --git a/net/colo-compare.c b/net/colo-compare.c index
> > 56db3d3bfc..c7572d75e9 100644
> > --- a/net/colo-compare.c
> > +++ b/net/colo-compare.c
> > @@ -54,6 +54,8 @@ static NotifierList colo_compare_notifiers = #define
> > REGULAR_PACKET_CHECK_MS 3000 #define DEFAULT_TIME_OUT_MS 3000
> >
> > +static QemuMutex colo_compare_mutex;
> > +static bool colo_compare_active;
> > static QemuMutex event_mtx;
> > static QemuCond event_complete_cond;
> > static int event_unhandled_count;
> > @@ -906,6 +908,12 @@ static void check_old_packet_regular(void *opaque)
> > void colo_notify_compares_event(void *opaque, int event, Error **errp) {
> > CompareState *s;
> > + qemu_mutex_lock(&colo_compare_mutex);
> > +
> > + if (!colo_compare_active) {
> > + qemu_mutex_unlock(&colo_compare_mutex);
> > + return;
> > + }
> >
> > qemu_mutex_lock(&event_mtx);
> > QTAILQ_FOREACH(s, &net_compares, next) { @@ -919,6 +927,7 @@ void
> > colo_notify_compares_event(void *opaque, int event, Error **errp)
> > }
> >
> > qemu_mutex_unlock(&event_mtx);
> > + qemu_mutex_unlock(&colo_compare_mutex);
> > }
> >
> > static void colo_compare_timer_init(CompareState *s) @@ -1274,7 +1283,14
> > @@ static void colo_compare_complete(UserCreatable *uc, Error **errp)
> > s->vnet_hdr);
> > }
> >
> > + qemu_mutex_lock(&colo_compare_mutex);
> > + if (!colo_compare_active) {
> > + qemu_mutex_init(&event_mtx);
> > + qemu_cond_init(&event_complete_cond);
> > + colo_compare_active = true;
> > + }
> > QTAILQ_INSERT_TAIL(&net_compares, s, next);
> > + qemu_mutex_unlock(&colo_compare_mutex);
> >
> > s->out_sendco.s = s;
> > s->out_sendco.chr = &s->chr_out;
> > @@ -1290,9 +1306,6 @@ static void colo_compare_complete(UserCreatable
> > *uc, Error **errp)
> >
> > g_queue_init(&s->conn_list);
> >
> > - qemu_mutex_init(&event_mtx);
> > - qemu_cond_init(&event_complete_cond);
> > -
> > s->connection_track_table =
> > g_hash_table_new_full(connection_key_hash,
> > connection_key_equal,
> > g_free, @@ -1384,12 +1397,19 @@ static void
> > colo_compare_finalize(Object *obj)
> >
> > qemu_bh_delete(s->event_bh);
> >
> > + qemu_mutex_lock(&colo_compare_mutex);
> > QTAILQ_FOREACH(tmp, &net_compares, next) {
> > if (tmp == s) {
> > QTAILQ_REMOVE(&net_compares, s, next);
> > break;
> > }
> > }
> > + if (QTAILQ_EMPTY(&net_compares)) {
> > + colo_compare_active = false;
> > + qemu_mutex_destroy(&event_mtx);
> > + qemu_cond_destroy(&event_complete_cond);
> > + }
> > + qemu_mutex_unlock(&colo_compare_mutex);
> >
> > AioContext *ctx = iothread_get_aio_context(s->iothread);
> > aio_context_acquire(ctx);
> > @@ -1413,15 +1433,18 @@ static void colo_compare_finalize(Object *obj)
> > object_unref(OBJECT(s->iothread));
> > }
> >
> > - qemu_mutex_destroy(&event_mtx);
> > - qemu_cond_destroy(&event_complete_cond);
> > -
> > g_free(s->pri_indev);
> > g_free(s->sec_indev);
> > g_free(s->outdev);
> > g_free(s->notify_dev);
> > }
> >
> > +void colo_compare_init_globals(void)
> > +{
> > + colo_compare_active = false;
> > + qemu_mutex_init(&colo_compare_mutex);
> > +}
> > +
> > static const TypeInfo colo_compare_info = {
> > .name = TYPE_COLO_COMPARE,
> > .parent = TYPE_OBJECT,
> > diff --git a/net/colo-compare.h b/net/colo-compare.h index
> > 22ddd512e2..eb483ac586 100644
> > --- a/net/colo-compare.h
> > +++ b/net/colo-compare.h
> > @@ -17,6 +17,7 @@
> > #ifndef QEMU_COLO_COMPARE_H
> > #define QEMU_COLO_COMPARE_H
> >
> > +void colo_compare_init_globals(void);
> > void colo_notify_compares_event(void *opaque, int event, Error **errp);
> > void colo_compare_register_notifier(Notifier *notify); void
> > colo_compare_unregister_notifier(Notifier *notify); diff --git a/softmmu/vl.c
> > b/softmmu/vl.c index 32c0047889..a913ed5469 100644
> > --- a/softmmu/vl.c
> > +++ b/softmmu/vl.c
> > @@ -112,6 +112,7 @@
> > #include "qapi/qmp/qerror.h"
> > #include "sysemu/iothread.h"
> > #include "qemu/guest-random.h"
> > +#include "net/colo-compare.h"
> >
> > #define MAX_VIRTIO_CONSOLES 1
> >
> > @@ -2906,6 +2907,7 @@ void qemu_init(int argc, char **argv, char **envp)
> > precopy_infrastructure_init();
> > postcopy_infrastructure_init();
> > monitor_init_globals();
> > + colo_compare_init_globals();
> >
> > if (qcrypto_init(&err) < 0) {
> > error_reportf_err(err, "cannot initialize crypto: ");
> > --
> > 2.20.1
>
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2020-05-07 16:04 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-05-04 10:27 [PATCH v4 0/6] colo-compare bugfixes Lukas Straub
2020-05-04 10:28 ` [PATCH v4 1/6] net/colo-compare.c: Create event_bh with the right AioContext Lukas Straub
2020-05-04 10:28 ` [PATCH v4 2/6] chardev/char.c: Use qemu_co_sleep_ns if in coroutine Lukas Straub
2020-05-04 10:28 ` [PATCH v4 3/6] net/colo-compare.c: Fix deadlock in compare_chr_send Lukas Straub
2020-05-07 11:00 ` Zhang, Chen
2020-05-07 15:51 ` Lukas Straub
2020-05-08 2:19 ` Zhang, Chen
2020-05-08 6:08 ` Lukas Straub
2020-05-08 6:28 ` Zhang, Chen
2020-05-08 7:56 ` Lukas Straub
2020-05-11 8:30 ` Zhang, Chen
2020-05-04 10:28 ` [PATCH v4 4/6] net/colo-compare.c: Only hexdump packets if tracing is enabled Lukas Straub
2020-05-04 11:27 ` Philippe Mathieu-Daudé
2020-05-04 11:58 ` Philippe Mathieu-Daudé
2020-05-04 10:28 ` [PATCH v4 5/6] net/colo-compare.c, softmmu/vl.c: Check that colo-compare is active Lukas Straub
2020-05-07 11:38 ` Zhang, Chen
2020-05-07 15:54 ` Lukas Straub [this message]
2020-05-08 2:26 ` Zhang, Chen
2020-05-08 3:55 ` Derek Su
2020-05-08 6:10 ` Lukas Straub
2020-05-08 6:50 ` Zhang, Chen
2020-05-09 12:21 ` Lukas Straub
2020-05-11 8:49 ` Zhang, Chen
2020-05-12 17:28 ` Dr. David Alan Gilbert
2020-05-04 10:28 ` [PATCH v4 6/6] net/colo-compare.c: Correct ordering in complete and finalize Lukas Straub
2020-05-07 13:26 ` Zhang, Chen
2020-05-07 16:09 ` Lukas Straub
2020-05-08 3:01 ` Zhang, Chen
2020-05-11 8:33 ` Zhang, Chen
2020-05-15 9:15 ` [PATCH v4 0/6] colo-compare bugfixes Zhang, Chen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200507175421.185b48f8@luklap \
--to=lukasstraub2@web.de \
--cc=chen.zhang@intel.com \
--cc=jasowang@redhat.com \
--cc=lizhijian@cn.fujitsu.com \
--cc=marcandre.lureau@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).