From: Nick Thomas <nick@bytemark.co.uk>
To: Alexander Graf <agraf@suse.de>, Marcelo Tosatti <mtosatti@redhat.com>
Cc: qemu-devel@nongnu.org, kvm@vger.kernel.org
Subject: Re: [PATCH] kvmclock: Ensure time in migration never goes backward
Date: Wed, 07 May 2014 11:04:55 +0100
Message-ID: <536A0547.1080706@bytemark.co.uk>
In-Reply-To: <53688C56.6020109@suse.de>

Hi all,

On 06/05/14 08:16, Alexander Graf wrote:
>
> On 06.05.14 01:23, Marcelo Tosatti wrote:
>
>> 1) By what algorithm you retrieve
>> and compare time in kvmclock guest structure and KVM_GET_CLOCK.
>> What are the results of the comparison.
>> And whether and backwards time was visible in the guest.
>
> I've managed to get my hands on a broken migration stream from Nick.
> There I looked at the curr_clocksource structure and saw that the last
> seen time on the kvmclock clock source was greater than the value that
> the kvmclock device migrated.

We've been seeing live migration failures where the guest sees time go
backwards (= massive forward leap to the kernel, apparently) for a
while now, affecting perhaps 5-10% of the migrations we do (usually a
large proportion of the migrations on a few hosts, rather than an even
spread). We first saw them in December, when we tried an upgrade to
QEMU 1.7.1 and a 3.mumble (3.10?) kernel, from 1.5.0 and Debian's 3.2.

My testing at the time seemed to indicate that either upgrade on its
own - QEMU or kernel - caused the problems to show up. The guest symptom
is that the kernel enters a tight loop in __run_timers and stays there.
In the end, I gave up and downgraded us again without any clear idea of
what was happening, or why.

In April, we finally put together a fairly reliable test case. This
patch resolves the guest hangs in that test, and I've also been able to
carry out more than 1000 migrations of production guests without seeing
the issue recur. So,

Tested-by: Nick Thomas <nick@bytemark.co.uk>

/Nick