From mboxrd@z Thu Jan 1 00:00:00 1970 From: Kouya Shimura Subject: Re: [PATCH 2/2] x86/hvm: fix corrupt ACPI PM-Timer during live migration Date: Thu, 16 May 2013 14:29:00 +0900 Message-ID: <51946E9C.5070204@jp.fujitsu.com> References: <5110A2DD.1080603@jp.fujitsu.com> <511A42E902000078000BDB5E@nat28.tlf.novell.com> <511C7FB7.90907@jp.fujitsu.com> <511E742A02000078000BEDAE@nat28.tlf.novell.com> <51247E56.4060904@jp.fujitsu.com> <5138C72802000078000C3FB0@nat28.tlf.novell.com> <514A9BC4.40508@jp.fujitsu.com> <514AB78A.3010109@jp.fujitsu.com> <514AE87902000078000C76E3@nat28.tlf.novell.com> <514BB003.1070903@jp.fujitsu.com> <514C1E2F02000078000C7A93@nat28.tlf.novell.com> <5193B85702000078000D684D@nat28.tlf.novell.com> <68122F7B6C6F3D1C589CAB91@nimrod.local> <5193BDAB02000078000D688A@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; Format="flowed" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <5193BDAB02000078000D688A@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Alex Bligh Cc: Tim Deegan , Diana Crisan , Keir Fraser , Jan Beulich , xen-devel@lists.xen.org List-Id: xen-devel@lists.xenproject.org On 05/15/2013 11:54 PM, Jan Beulich wrote: >>>> On 15.05.13 at 16:49, Alex Bligh wrote: >> --On 15 May 2013 15:31:19 +0100 Jan Beulich wrote: >> >>>> Did this ever get committed? I can't immediately find a commit. >>> >>> No, it didn't - no-one knowing that code well enough ever acked >>> patch 1, and without that we can't apply the patch here. >> >> I certainly am not someone who knows that code well, so can't help >> with that. But I (or more accurately Diana) can reliably replicate >> live migrate on HVM and qemu-upstream DM causing (a) ACPI entries >> to disappear from xenstore and (b) walltime to fail to advance in >> the migrated domain until the walltime is manually set (stuck >> clock). >> >> Is this likely to be related? > > I'd like to defer to Kouya to tell whether that matches the > symptoms he saw. I don't think my fix is related to Alex's problem. I observed that gettimeofday() sometimes goes backward *a few seconds* on migration. In linux OSs, ACPI Timer value is masked by 24bit. The valid range: (1 / 3.579545MHz) * 0xffffff = 4.7sec So, the effect of the corrupt ACPI timer is at most 4.7 sec. The clock shouldn't be frozen for a long time. Apparently (a) sounds like a problem of xl toolstack. -- Kouya