qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] Improve error propagation via return path
@ 2025-10-21  7:52 Dhruv Choudhary
  2025-10-21 14:34 ` Peter Xu
  0 siblings, 1 reply; 7+ messages in thread
From: Dhruv Choudhary @ 2025-10-21  7:52 UTC (permalink / raw)
  To: Peter Xu, Fabiano Rosas, qemu-devel; +Cc: Dhruv Choudhary

Use the return-path thread to send error details from the
destination to the source on a migration failure. Management
applications can then query the source QEMU for errors, as
the single source of truth, making failures easy to trace.

Signed-off-by: Dhruv Choudhary <dhruv.choudhary@nutanix.com>
---
 migration/migration.c | 25 +++++++++++++++++++++++--
 1 file changed, 23 insertions(+), 2 deletions(-)

diff --git a/migration/migration.c b/migration/migration.c
index a63b46bbef..123cffb286 100644
--- a/migration/migration.c
+++ b/migration/migration.c
@@ -87,6 +87,7 @@ enum mig_rp_message_type {
     MIG_RP_MSG_RECV_BITMAP,  /* send recved_bitmap back to source */
     MIG_RP_MSG_RESUME_ACK,   /* tell source that we are ready to resume */
     MIG_RP_MSG_SWITCHOVER_ACK, /* Tell source it's OK to do switchover */
+    MIG_RP_MSG_ERROR,        /* propogate error to source */
 
     MIG_RP_MSG_MAX
 };
@@ -608,6 +609,17 @@ int migrate_send_rp_req_pages(MigrationIncomingState *mis,
     return migrate_send_rp_message_req_pages(mis, rb, start);
 }
 
+static void migrate_send_rp_error(MigrationIncomingState *mis, Error *errp)
+{
+    const char *rpmsg = error_get_pretty(errp);
+    if (!mis->to_src_file) {
+        mis->to_src_file = qemu_file_get_return_path(mis->from_src_file);
+    }
+    migrate_send_rp_message(mis, MIG_RP_MSG_ERROR,
+                            (uint16_t)(strlen(rpmsg) + 1),
+                            (char *)rpmsg);
+}
+
 static bool migration_colo_enabled;
 bool migration_incoming_colo_enabled(void)
 {
@@ -905,8 +917,12 @@ process_incoming_migration_co(void *opaque)
     }
 
     if (ret < 0) {
-        error_prepend(&local_err, "load of migration failed: %s: ",
-                      strerror(-ret));
+        error_prepend(&local_err, "destination error : load of migration failed:
+                       %s: ", strerror(-ret));
+        /* Check if return path is enabled and then send error to source */
+        if (migrate_postcopy_ram() || migrate_return_path()) {
+            migrate_send_rp_error(mis, local_err);
+        }
         goto fail;
     }
 
@@ -2437,6 +2453,7 @@ static struct rp_cmd_args {
     [MIG_RP_MSG_RECV_BITMAP]    = { .len = -1, .name = "RECV_BITMAP" },
     [MIG_RP_MSG_RESUME_ACK]     = { .len =  4, .name = "RESUME_ACK" },
     [MIG_RP_MSG_SWITCHOVER_ACK] = { .len =  0, .name = "SWITCHOVER_ACK" },
+    [MIG_RP_MSG_ERROR]          = { .len = -1, .name = "ERROR"},
     [MIG_RP_MSG_MAX]            = { .len = -1, .name = "MAX" },
 };
 
@@ -2667,6 +2684,10 @@ static void *source_return_path_thread(void *opaque)
             trace_source_return_path_thread_switchover_acked();
             break;
 
+        case MIG_RP_MSG_ERROR:
+            error_setg(&err, "%s", (char *)buf);
+            goto out;
+
         default:
             break;
         }
-- 
2.39.3



^ permalink raw reply related	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2025-10-22  8:33 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-10-21  7:52 [PATCH] Improve error propagation via return path Dhruv Choudhary
2025-10-21 14:34 ` Peter Xu
2025-10-21 14:54   ` Vladimir Sementsov-Ogievskiy
2025-10-21 15:24     ` Peter Xu
2025-10-21 20:31       ` Fabiano Rosas
2025-10-21 21:18         ` Peter Xu
2025-10-22  8:32         ` Daniel P. Berrangé

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).