* [PATCH 0/2] Multifd fixes
@ 2024-08-01 17:40 Fabiano Rosas
2024-08-01 17:41 ` [PATCH 1/2] migration: Fix cleanup of iochannel in file migration Fabiano Rosas
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Fabiano Rosas @ 2024-08-01 17:40 UTC
To: qemu-devel; +Cc: Peter Xu
Hi, a couple of multifd fixes for issues that Jim spotted while working
on mapped-ram for libvirt.
CI run: https://gitlab.com/farosas/qemu/-/pipelines/1397467740
Fabiano Rosas (2):
migration: Fix cleanup of iochannel in file migration
migration/multifd: Fix multifd_send_setup cleanup when channel
creation fails
migration/file.c | 2 --
migration/multifd.c | 26 +++++++++++++++-----------
2 files changed, 15 insertions(+), 13 deletions(-)
--
2.35.3
* [PATCH 1/2] migration: Fix cleanup of iochannel in file migration
2024-08-01 17:40 [PATCH 0/2] Multifd fixes Fabiano Rosas
@ 2024-08-01 17:41 ` Fabiano Rosas
2024-08-01 18:39 ` Peter Xu
2024-08-01 17:41 ` [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails Fabiano Rosas
2024-08-02 12:47 ` [PATCH 0/2] Multifd fixes Fabiano Rosas
2 siblings, 1 reply; 7+ messages in thread
From: Fabiano Rosas @ 2024-08-01 17:41 UTC
To: qemu-devel; +Cc: Peter Xu, Jim Fehlig
The QIOChannelFile object already has its reference decremented by
g_autoptr. Trying to unref an extra time causes:
ERROR:../qom/object.c:1241:object_unref: assertion failed: (obj->ref > 0)
Fixes: a701c03dec ("migration: Drop reference to QIOChannel if file seeking fails")
Fixes: 6d3279655a ("migration: Fix file migration with fdset")
Reported-by: Jim Fehlig <jfehlig@suse.com>
Signed-off-by: Fabiano Rosas <farosas@suse.de>
---
migration/file.c | 2 --
1 file changed, 2 deletions(-)
diff --git a/migration/file.c b/migration/file.c
index db870f2cf0..6451a21c86 100644
--- a/migration/file.c
+++ b/migration/file.c
@@ -112,7 +112,6 @@ void file_start_outgoing_migration(MigrationState *s,
         error_setg_errno(errp, errno,
                          "failed to truncate migration file to offset %" PRIx64,
                          offset);
-        object_unref(OBJECT(fioc));
         return;
     }
@@ -120,7 +119,6 @@ void file_start_outgoing_migration(MigrationState *s,
     ioc = QIO_CHANNEL(fioc);
     if (offset && qio_channel_io_seek(ioc, offset, SEEK_SET, errp) < 0) {
-        object_unref(OBJECT(fioc));
         return;
     }
     qio_channel_set_name(ioc, "migration-file-outgoing");
--
2.35.3
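
The double unref fixed by this patch can be sketched in isolation. What
follows is a minimal, hypothetical model and not QEMU code: Obj,
obj_init(), obj_unref() and AUTO_OBJ are invented names, and the GCC/Clang
cleanup attribute stands in for g_autoptr, which is built on the same
attribute. To keep the sketch free of undefined behaviour, the object
lives in caller-provided storage and an extra unref bumps a counter
instead of tripping an assertion.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a reference-counted object. */
typedef struct Obj {
    int ref;
} Obj;

/* Counts unrefs attempted on an object whose count is already zero. */
static int underflows;

static Obj *obj_init(Obj *storage)
{
    assert(storage != NULL);
    storage->ref = 1;
    return storage;
}

static void obj_unref(Obj *o)
{
    if (o->ref <= 0) {
        underflows++;   /* the real code asserts (obj->ref > 0) here */
        return;
    }
    o->ref--;
}

/* Runs automatically when an AUTO_OBJ variable goes out of scope. */
static void obj_autounref(Obj **p)
{
    if (*p) {
        obj_unref(*p);
    }
}

#define AUTO_OBJ __attribute__((cleanup(obj_autounref))) Obj *

/* Pre-patch shape: explicit unref on the error path plus the automatic one. */
static int start_buggy(Obj *storage, int fail)
{
    AUTO_OBJ o = obj_init(storage);

    if (fail) {
        obj_unref(o);   /* first unref, ref drops to 0 */
        return -1;      /* scope exit runs obj_autounref: second unref */
    }
    return 0;
}

/* Post-patch shape: the error path relies on the automatic unref only. */
static int start_fixed(Obj *storage, int fail)
{
    AUTO_OBJ o = obj_init(storage);

    if (fail) {
        return -1;      /* scope exit drops the single reference */
    }
    return 0;
}
```

With fail set, start_buggy() increments underflows once, while
start_fixed() leaves it untouched and ends with a reference count of 0.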
* [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails
2024-08-01 17:40 [PATCH 0/2] Multifd fixes Fabiano Rosas
2024-08-01 17:41 ` [PATCH 1/2] migration: Fix cleanup of iochannel in file migration Fabiano Rosas
@ 2024-08-01 17:41 ` Fabiano Rosas
2024-08-01 18:38 ` Peter Xu
2024-08-02 12:47 ` [PATCH 0/2] Multifd fixes Fabiano Rosas
2 siblings, 1 reply; 7+ messages in thread
From: Fabiano Rosas @ 2024-08-01 17:41 UTC
To: qemu-devel; +Cc: Peter Xu, qemu-stable, Jim Fehlig
When a channel fails to create, the code currently just returns. This
is wrong for two reasons:
1) Channel n+1 will not get to initialize its semaphores, leading to
an assert when terminate_threads tries to post to them:
qemu-system-x86_64: ../util/qemu-thread-posix.c:92:
qemu_mutex_lock_impl: Assertion `mutex->initialized' failed.
2) (theoretical) If channel n-1 has already started creation, returning
early defeats the purpose of the channels_created logic, which is in
place to prevent migrate_fd_cleanup() from running while channels are
still being created.
This cannot really happen today because the current failure cases
for multifd_new_send_channel_create() are all synchronous,
resulting from qio_channel_file_new_path() getting a bad
filename. This would hit all channels equally.
But I don't want to set a trap for future people, so have all
channels try to create (even if failing), and only fail after the
channels_created semaphore has been posted.
While here, remove the error_report_err call. There's one already at
migrate_fd_cleanup later on.
Cc: qemu-stable@nongnu.org
Reported-by: Jim Fehlig <jfehlig@suse.com>
Fixes: bd8b0a8f82 ("migration/multifd: Move multifd_send_setup error handling in to the function")
Signed-off-by: Fabiano Rosas <farosas@suse.de>
---
migration/multifd.c | 26 +++++++++++++++-----------
1 file changed, 15 insertions(+), 11 deletions(-)
diff --git a/migration/multifd.c b/migration/multifd.c
index 0b4cbaddfe..552f9723c8 100644
--- a/migration/multifd.c
+++ b/migration/multifd.c
@@ -1156,7 +1156,6 @@ static bool multifd_new_send_channel_create(gpointer opaque, Error **errp)
 bool multifd_send_setup(void)
 {
     MigrationState *s = migrate_get_current();
-    Error *local_err = NULL;
     int thread_count, ret = 0;
     uint32_t page_count = MULTIFD_PACKET_SIZE / qemu_target_page_size();
     bool use_packets = multifd_use_packets();
@@ -1177,6 +1176,7 @@ bool multifd_send_setup(void)
     for (i = 0; i < thread_count; i++) {
         MultiFDSendParams *p = &multifd_send_state->params[i];
+        Error *local_err = NULL;
         qemu_sem_init(&p->sem, 0);
         qemu_sem_init(&p->sem_sync, 0);
@@ -1196,7 +1196,8 @@ bool multifd_send_setup(void)
         p->write_flags = 0;
         if (!multifd_new_send_channel_create(p, &local_err)) {
-            return false;
+            migrate_set_error(s, local_err);
+            ret = -1;
         }
     }
@@ -1209,24 +1210,27 @@ bool multifd_send_setup(void)
         qemu_sem_wait(&multifd_send_state->channels_created);
     }
+    if (ret) {
+        goto err;
+    }
+
     for (i = 0; i < thread_count; i++) {
         MultiFDSendParams *p = &multifd_send_state->params[i];
+        Error *local_err = NULL;
         ret = multifd_send_state->ops->send_setup(p, &local_err);
         if (ret) {
-            break;
+            migrate_set_error(s, local_err);
+            goto err;
         }
     }
-    if (ret) {
-        migrate_set_error(s, local_err);
-        error_report_err(local_err);
-        migrate_set_state(&s->state, MIGRATION_STATUS_SETUP,
-                          MIGRATION_STATUS_FAILED);
-        return false;
-    }
-
     return true;
+
+err:
+    migrate_set_state(&s->state, MIGRATION_STATUS_SETUP,
+                      MIGRATION_STATUS_FAILED);
+    return false;
 }
 bool multifd_recv(void)
--
2.35.3
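
The control-flow change in this patch (initialize every channel, remember
a failure, and only bail out after the loop) can be modelled without any
QEMU machinery. The sketch below is hypothetical: Chan, chan_create(),
NCHAN and the fail_at parameter are invented for illustration, a boolean
stands in for qemu_sem_init() having run, and the wait on the
channels_created semaphore is reduced to a comment.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define NCHAN 4

/* Hypothetical per-channel state. */
typedef struct Chan {
    bool sem_initialized;   /* stands in for qemu_sem_init() having run */
    bool created;
} Chan;

/* Hypothetical creation hook: fails for the channel at index fail_at. */
static bool chan_create(Chan *c, int idx, int fail_at)
{
    if (idx == fail_at) {
        return false;
    }
    c->created = true;
    return true;
}

/* Pre-patch shape: return on the first failure, skipping later channels. */
static bool setup_early_return(Chan *ch, int fail_at)
{
    for (int i = 0; i < NCHAN; i++) {
        ch[i].sem_initialized = true;
        if (!chan_create(&ch[i], i, fail_at)) {
            return false;
        }
    }
    return true;
}

/* Post-patch shape: record the failure, finish the loop, then fail. */
static bool setup_deferred_failure(Chan *ch, int fail_at)
{
    bool failed = false;

    for (int i = 0; i < NCHAN; i++) {
        ch[i].sem_initialized = true;
        if (!chan_create(&ch[i], i, fail_at)) {
            failed = true;  /* keep going so every channel is initialized */
        }
    }
    /* The real code waits for all channels_created posts at this point. */
    return !failed;
}

/* Channels that a later cleanup pass could not safely post to. */
static int count_uninitialized(const Chan *ch)
{
    int n = 0;

    assert(ch != NULL);
    for (int i = 0; i < NCHAN; i++) {
        if (!ch[i].sem_initialized) {
            n++;
        }
    }
    return n;
}
```

Both variants report failure when one channel cannot be created. With the
early return, every channel after the failing one is left uninitialized,
which is the state a cleanup pass that posts to each channel's semaphore
would trip over; with the deferred failure, none are.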
* Re: [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails
2024-08-01 17:41 ` [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails Fabiano Rosas
@ 2024-08-01 18:38 ` Peter Xu
2024-08-01 19:14 ` Fabiano Rosas
0 siblings, 1 reply; 7+ messages in thread
From: Peter Xu @ 2024-08-01 18:38 UTC
To: Fabiano Rosas; +Cc: qemu-devel, qemu-stable, Jim Fehlig
On Thu, Aug 01, 2024 at 02:41:01PM -0300, Fabiano Rosas wrote:
> When a channel fails to create, the code currently just returns. This
> is wrong for two reasons:
>
> 1) Channel n+1 will not get to initialize its semaphores, leading to
> an assert when terminate_threads tries to post to them:
>
> qemu-system-x86_64: ../util/qemu-thread-posix.c:92:
> qemu_mutex_lock_impl: Assertion `mutex->initialized' failed.
>
> 2) (theoretical) If channel n-1 has already started creation, returning
> early defeats the purpose of the channels_created logic, which is in
> place to prevent migrate_fd_cleanup() from running while channels are
> still being created.
>
> This cannot really happen today because the current failure cases
> for multifd_new_send_channel_create() are all synchronous,
> resulting from qio_channel_file_new_path() getting a bad
> filename. This would hit all channels equally.
>
> But I don't want to set a trap for future people, so have all
> channels try to create (even if failing), and only fail after the
> channels_created semaphore has been posted.
>
> While here, remove the error_report_err call. There's one already at
> migrate_fd_cleanup later on.
>
> Cc: qemu-stable@nongnu.org
> Reported-by: Jim Fehlig <jfehlig@suse.com>
> Fixes: bd8b0a8f82 ("migration/multifd: Move multifd_send_setup error handling in to the function")
Should it be this one instead?
b7b03eb614 ("migration/multifd: Add outgoing QIOChannelFile support")
> Signed-off-by: Fabiano Rosas <farosas@suse.de>
Reviewed-by: Peter Xu <peterx@redhat.com>
PS: what's your plan for your other multifd SendData series? I've been a bit
overloaded with downstream stuff and still have plenty of review debt
(CPR is one of them and needs follow-ups), so I may be a bit delayed in
reading that one. I assume it's next-release stuff anyway, but let
me know otherwise.
Thanks,
--
Peter Xu
* Re: [PATCH 1/2] migration: Fix cleanup of iochannel in file migration
2024-08-01 17:41 ` [PATCH 1/2] migration: Fix cleanup of iochannel in file migration Fabiano Rosas
@ 2024-08-01 18:39 ` Peter Xu
0 siblings, 0 replies; 7+ messages in thread
From: Peter Xu @ 2024-08-01 18:39 UTC
To: Fabiano Rosas; +Cc: qemu-devel, Jim Fehlig
On Thu, Aug 01, 2024 at 02:41:00PM -0300, Fabiano Rosas wrote:
> The QIOChannelFile object already has its reference decremented by
> g_autoptr. Trying to unref an extra time causes:
>
> ERROR:../qom/object.c:1241:object_unref: assertion failed: (obj->ref > 0)
>
> Fixes: a701c03dec ("migration: Drop reference to QIOChannel if file seeking fails")
> Fixes: 6d3279655a ("migration: Fix file migration with fdset")
> Reported-by: Jim Fehlig <jfehlig@suse.com>
> Signed-off-by: Fabiano Rosas <farosas@suse.de>
Ouch..
Reviewed-by: Peter Xu <peterx@redhat.com>
--
Peter Xu
* Re: [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails
2024-08-01 18:38 ` Peter Xu
@ 2024-08-01 19:14 ` Fabiano Rosas
0 siblings, 0 replies; 7+ messages in thread
From: Fabiano Rosas @ 2024-08-01 19:14 UTC
To: Peter Xu; +Cc: qemu-devel, qemu-stable, Jim Fehlig
Peter Xu <peterx@redhat.com> writes:
> On Thu, Aug 01, 2024 at 02:41:01PM -0300, Fabiano Rosas wrote:
>> When a channel fails to create, the code currently just returns. This
>> is wrong for two reasons:
>>
>> 1) Channel n+1 will not get to initialize its semaphores, leading to
>> an assert when terminate_threads tries to post to them:
>>
>> qemu-system-x86_64: ../util/qemu-thread-posix.c:92:
>> qemu_mutex_lock_impl: Assertion `mutex->initialized' failed.
>>
>> 2) (theoretical) If channel n-1 has already started creation, returning
>> early defeats the purpose of the channels_created logic, which is in
>> place to prevent migrate_fd_cleanup() from running while channels are
>> still being created.
>>
>> This cannot really happen today because the current failure cases
>> for multifd_new_send_channel_create() are all synchronous,
>> resulting from qio_channel_file_new_path() getting a bad
>> filename. This would hit all channels equally.
>>
>> But I don't want to set a trap for future people, so have all
>> channels try to create (even if failing), and only fail after the
>> channels_created semaphore has been posted.
>>
>> While here, remove the error_report_err call. There's one already at
>> migrate_fd_cleanup later on.
>>
>> Cc: qemu-stable@nongnu.org
>> Reported-by: Jim Fehlig <jfehlig@suse.com>
>> Fixes: bd8b0a8f82 ("migration/multifd: Move multifd_send_setup error handling in to the function")
>
> Should it be this one instead?
>
> b7b03eb614 ("migration/multifd: Add outgoing QIOChannelFile support")
Yep, thanks. I'll fix it up.
>
>> Signed-off-by: Fabiano Rosas <farosas@suse.de>
>
> Reviewed-by: Peter Xu <peterx@redhat.com>
>
> PS: what's your plan for your other multifd SendData series? I've been a bit
> overloaded with downstream stuff and still have plenty of review debt
> (CPR is one of them and needs follow-ups), so I may be a bit delayed in
> reading that one. I assume it's next-release stuff anyway, but let
> me know otherwise.
That one is pretty much ready. From my side I don't intend to change anything
else, apart from addressing review comments. And it's definitely 9.2 material.
I think CPR is more important at this point because it's been lagging
behind for a while.
I have a PR to send with these fixes, and I need to catch up on that virtio-net
discussion. After that I should be able to get some reviews done.
>
> Thanks,
* Re: [PATCH 0/2] Multifd fixes
2024-08-01 17:40 [PATCH 0/2] Multifd fixes Fabiano Rosas
2024-08-01 17:41 ` [PATCH 1/2] migration: Fix cleanup of iochannel in file migration Fabiano Rosas
2024-08-01 17:41 ` [PATCH 2/2] migration/multifd: Fix multifd_send_setup cleanup when channel creation fails Fabiano Rosas
@ 2024-08-02 12:47 ` Fabiano Rosas
2 siblings, 0 replies; 7+ messages in thread
From: Fabiano Rosas @ 2024-08-02 12:47 UTC
To: qemu-devel, Fabiano Rosas; +Cc: Peter Xu
On Thu, 01 Aug 2024 14:40:59 -0300, Fabiano Rosas wrote:
> on mapped-ram for libvirt.
>
> CI run: https://gitlab.com/farosas/qemu/-/pipelines/1397467740
>
> Fabiano Rosas (2):
> migration: Fix cleanup of iochannel in file migration
> migration/multifd: Fix multifd_send_setup cleanup when channel
> creation fails
>
> [...]
Queued, thanks!