From: Lukas Straub <lukasstraub2@web.de>
To: Zhang Chen <zhangckid@gmail.com>
Cc: "Zhijian Li (Fujitsu)" <lizhijian@fujitsu.com>,
Peter Xu <peterx@redhat.com>,
"zhanghailiang@xfusion.com" <zhanghailiang@xfusion.com>,
"farosas@suse.de" <farosas@suse.de>,
"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [PATCH] migration: Fix transition to COLO state from precopy
Date: Mon, 24 Nov 2025 09:38:57 +0100 [thread overview]
Message-ID: <20251124093857.1fa22ded@penguin> (raw)
In-Reply-To: <CAK3tnvK2h4gDYZn_P-mQPhM5qvSkPy2FJ_EzKqyMU9ZcyT8TTA@mail.gmail.com>
[-- Attachment #1: Type: text/plain, Size: 2911 bytes --]
On Thu, 6 Nov 2025 11:21:56 +0800
Zhang Chen <zhangckid@gmail.com> wrote:
> On Thu, Nov 6, 2025 at 9:10 AM Zhijian Li (Fujitsu)
> <lizhijian@fujitsu.com> wrote:
> >
> >
> >
> > On 06/11/2025 04:58, Peter Xu wrote:
> > > On Tue, Nov 04, 2025 at 09:36:06AM +0800, Li Zhijian wrote:
> > >> Commit 4881411136 ("migration: Always set DEVICE state") set a new DEVICE
> > >> state before completed during migration, which broke the original transition
> > >> to COLO. The migration flow for precopy has changed to:
> > >> active -> pre-switchover -> device -> completed.
> > >>
> > >> This patch updates the transition state to ensure that the Pre-COLO
> > >> state corresponds to DEVICE state correctly.
> > >>
> > >> Fixes: 4881411136 ("migration: Always set DEVICE state")
> > >> Signed-off-by: Li Zhijian <lizhijian@fujitsu.com>
> > >> ---
> > >> [...]
> > >
> > > Thanks a lot for fixing it, Zhijian. It means I broke COLO already for
> > > 10.0/10.1..
> > >
> > > Hailiang/Chen, do you still know anyone who is using COLO, especially in
> > > enterprise? I don't expect any individual using it.. It definitely
> > > complicates migration logics all over the places. Fabiano and I discussed
> > > a few times on removing legacy code and COLO was always in the list.
> > >
> > > We used to discuss RDMA obsoletion too, that's when Huawei developers at
> > > least tried to re-implement the whole RDMA using rsocket, that didn't land
> > > only because of a perf regression. Meanwhile, Zhijian also provided an
> > > unit test, which we rely on recently to not break RDMA at the minimum.
> > >
> > > If we do not have known users, I sincerely want to discuss with you on
> > > obsoletion and removal of COLO from qemu codebase. Do you see feasible?
> > >
> > > Zhijian, do you have any input here?
> >
> >
> > If we don't have any known users, I personally have no objection to removing COLO.
> >
> > From my previous understanding, its use cases are rather limited, and the checkpointing overhead is significant.
> > Moreover, with the continuous development of Cloud Native over the past decade, service-based
> > FT/HA solutions have become very mature, which shrinks the use cases for VM-based FT solutions even further.
> >
> > I think it's worth keeping if we have:
> >
> > - Active users who depend on it.
> > - A unit test for the COLO framework.
> >
> > Thanks
> > Zhijian
> >
> >
>
> Add CC Lukas.
>
> [...]
Hello Everyone,
Thanks for bringing this to my attention.
I will write a migration unit-test and take maintainership of the colo
components.
Peter, what is your plan with refactoring the migration code and where
is the colo code blocking you?
I have quite a few cleanup patches lying around. Are you open to take
these in advance before the next merge window opens?
Best regards,
Lukas Straub
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2025-11-24 8:39 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-04 1:36 [PATCH] migration: Fix transition to COLO state from precopy Li Zhijian via
2025-11-04 1:49 ` Zhijian Li (Fujitsu)
2025-11-04 2:40 ` Zhang Chen
2025-11-05 20:58 ` Peter Xu
2025-11-06 1:09 ` Zhijian Li (Fujitsu)
2025-11-06 3:21 ` Zhang Chen
2025-11-06 3:24 ` Zhang Chen
2025-11-06 22:07 ` Peter Xu
2025-11-24 0:35 ` Jason Wang
2025-11-24 1:56 ` Zhang Chen
2025-11-24 8:38 ` Lukas Straub [this message]
2025-11-24 19:28 ` Peter Xu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20251124093857.1fa22ded@penguin \
--to=lukasstraub2@web.de \
--cc=farosas@suse.de \
--cc=lizhijian@fujitsu.com \
--cc=peterx@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=zhangckid@gmail.com \
--cc=zhanghailiang@xfusion.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.