Live Migration Across AMD and Intel

All of lore.kernel.org
 help / color / mirror / Atom feed

* Live Migration Across AMD and Intel
@ 2008-12-02  9:05 Igor Chubin
  2008-12-02  9:23 ` Keir Fraser
  0 siblings, 1 reply; 7+ messages in thread
From: Igor Chubin @ 2008-12-02  9:05 UTC (permalink / raw)
  To: xen-devel

Hello, All.

Do I understand correctly:

for the present it is possible to make domain live migration
across two hosts, one of which is Intel-CPU-based and
another is AMD-CPU-based?

Does CPUID feature of Xen 3.3 permit live migration only 
between hosts with CPUs with different flags but 
produce by the same vendor?

What is the difference (from domU perspective)
between such CPUs, that prohibits live migration?

Thank you.

-- 
WBR, i.m.chubin

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Live Migration Across AMD and Intel
  2008-12-02  9:05 Live Migration Across AMD and Intel Igor Chubin
@ 2008-12-02  9:23 ` Keir Fraser
  2008-12-02  9:44   ` James Harper
  0 siblings, 1 reply; 7+ messages in thread
From: Keir Fraser @ 2008-12-02  9:23 UTC (permalink / raw)
  To: Igor Chubin, xen-devel

On 2/12/08 09:05, "Igor Chubin" <igor@chub.in> wrote:

> Hello, All.
> 
> Do I understand correctly:
> 
> for the present it is possible to make domain live migration
> across two hosts, one of which is Intel-CPU-based and
> another is AMD-CPU-based?
> 
> Does CPUID feature of Xen 3.3 permit live migration only
> between hosts with CPUs with different flags but
> produce by the same vendor?
> 
> What is the difference (from domU perspective)
> between such CPUs, that prohibits live migration?

Supported instructions differ (even at the basic level of SYSENTER versus
SYSCALL in some situations). Supported architectural MSRs differ too.

Actually it could be forced to work sometimes, but it's not particularly
something we want to be claiming to support.

 -- Keir

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: Live Migration Across AMD and Intel
  2008-12-02  9:23 ` Keir Fraser
@ 2008-12-02  9:44   ` James Harper
  2008-12-02 10:07     ` Igor Chubin
  2008-12-02 10:14     ` Christoph Egger
  0 siblings, 2 replies; 7+ messages in thread
From: James Harper @ 2008-12-02  9:44 UTC (permalink / raw)
  To: Keir Fraser, Igor Chubin, xen-devel

> On 2/12/08 09:05, "Igor Chubin" <igor@chub.in> wrote:
> 
> > Hello, All.
> >
> > Do I understand correctly:
> >
> > for the present it is possible to make domain live migration
> > across two hosts, one of which is Intel-CPU-based and
> > another is AMD-CPU-based?
> >
> > Does CPUID feature of Xen 3.3 permit live migration only
> > between hosts with CPUs with different flags but
> > produce by the same vendor?
> >
> > What is the difference (from domU perspective)
> > between such CPUs, that prohibits live migration?
> 
> Supported instructions differ (even at the basic level of SYSENTER
versus
> SYSCALL in some situations). Supported architectural MSRs differ too.
> 
> Actually it could be forced to work sometimes, but it's not
particularly
> something we want to be claiming to support.
> 

This appeared on slashdot a while ago...
http://linux.slashdot.org/linux/08/11/07/1535235.shtml. I'm not sure
that their source is that reputable though. If you were emulating a 486
it might work :)

James 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Live Migration Across AMD and Intel
  2008-12-02  9:44   ` James Harper
@ 2008-12-02 10:07     ` Igor Chubin
  2008-12-02 10:32       ` Keir Fraser
  2008-12-02 10:53       ` Thorolf Godawa
  2008-12-02 10:14     ` Christoph Egger
  1 sibling, 2 replies; 7+ messages in thread
From: Igor Chubin @ 2008-12-02 10:07 UTC (permalink / raw)
  To: James Harper; +Cc: xen-devel, Keir Fraser

On Di, Dez 02, 2008 at 08:44:53 +1100, James Harper wrote:
> > On 2/12/08 09:05, "Igor Chubin" <igor@chub.in> wrote:
> > 
> > > Hello, All.
> > >
> > > Do I understand correctly:
> > >
> > > for the present it is possible to make domain live migration
> > > across two hosts, one of which is Intel-CPU-based and
> > > another is AMD-CPU-based?
> > >
> > > Does CPUID feature of Xen 3.3 permit live migration only
> > > between hosts with CPUs with different flags but
> > > produce by the same vendor?
> > >
> > > What is the difference (from domU perspective)
> > > between such CPUs, that prohibits live migration?
> > 
> > Supported instructions differ (even at the basic level of SYSENTER
> versus
> > SYSCALL in some situations). Supported architectural MSRs differ too.
> > 
> > Actually it could be forced to work sometimes, but it's not
> particularly
> > something we want to be claiming to support.
> > 
> 
> This appeared on slashdot a while ago...
> http://linux.slashdot.org/linux/08/11/07/1535235.shtml. I'm not sure
> that their source is that reputable though. If you were emulating a 486
> it might work :)
> 


Thank you Keir and James, 

I know about this posting on slashdot.
Also here is related video [1] with a small presentation.

They claim there that live migration across AMD and Intel 
works on KVM. As far as I understand now it works in
some particular (and rare) situations but it is not true in general.

Thank you.

[1] http://www.youtube.com/watch?v=EuhU6jJjpAQ

> James 

-- 
WBR, i.m.chubin

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Live Migration Across AMD and Intel
  2008-12-02  9:44   ` James Harper
  2008-12-02 10:07     ` Igor Chubin
@ 2008-12-02 10:14     ` Christoph Egger
  1 sibling, 0 replies; 7+ messages in thread
From: Christoph Egger @ 2008-12-02 10:14 UTC (permalink / raw)
  To: xen-devel; +Cc: James Harper, Igor Chubin, Keir Fraser

[-- Attachment #1: Type: text/plain, Size: 2323 bytes --]

On Tuesday 02 December 2008 10:44:53 James Harper wrote:
> > On 2/12/08 09:05, "Igor Chubin" <igor@chub.in> wrote:
> > > Hello, All.
> > >
> > > Do I understand correctly:
> > >
> > > for the present it is possible to make domain live migration
> > > across two hosts, one of which is Intel-CPU-based and
> > > another is AMD-CPU-based?
> > >
> > > Does CPUID feature of Xen 3.3 permit live migration only
> > > between hosts with CPUs with different flags but
> > > produce by the same vendor?
> > >
> > > What is the difference (from domU perspective)
> > > between such CPUs, that prohibits live migration?

You need to emulate SYSENTER for the 64bit case
completely (including the MSRs  MSR_IA32_SYSENTER_CS,
MSR_IA32_SYSENTER_EIP and MSR_IA32_SYSENTER_ESP)
for the 32bit compat and 64bit modes.

The three MSRs also exist on AMD, but unlike on Intel
they are always 32bit. So you have to emulate them
with a 64bit storage. The SYSENTER fields in the vmcb
are 64bit wide for alignment purpose, but only the
lower 32bits are used (meaning the guest still only see's
a 32bit wide MSRs, upper 4-bytes are zero-filled by hw).
So you can't use them as 64bit storage to emulate them.


Another issue are the x87 FPU instructions. Both AMD and Intel
CPUs compute with more than 64bits internally and the results
are different on the additional bits. If the FPU is heavily used,
(e.g. in a loop), differences appear and comparisons of results
will fail after migration which are intended to succeed. Attached
is a fpu testprogram which shows the FPU differences.


A third case is the idle-loop in the guests. Intel prefers monitor/mwait,
AMD prefers hlt. Moreover, monitor/mwait has been introduced in AMD Family10h,
so if you're migrating to an AMD K8, Xen has to emulate the monitor/mwait
instructions.


Hope, that clearifies things up.

Christoph


-- 
AMD Saxony, Dresden, Germany
Operating System Research Center

Legal Information:
AMD Saxony Limited Liability Company & Co. KG
Sitz (Geschäftsanschrift):
   Wilschdorfer Landstr. 101, 01109 Dresden, Deutschland
Registergericht Dresden: HRA 4896
vertretungsberechtigter Komplementär:
   AMD Saxony LLC (Sitz Wilmington, Delaware, USA)
Geschäftsführer der AMD Saxony LLC:
   Dr. Hans-R. Deppe, Thomas McCoy

[-- Attachment #2: fpu.c --]
[-- Type: text/x-csrc, Size: 8187 bytes --]


#include <stdio.h>
#include <stdint.h>
#include <stdarg.h>
#include <stdbool.h>
#include <stdlib.h>
#include <limits.h>
#include <math.h>
#include <assert.h>
#include <errno.h>

/*
 * inline asm constraint reference
 * http://blogs.sun.com/x86be/entry/gcc_style_asm_inlining_support
 */

#define MAXLOOP		10000000ULL

#ifndef M_PIl		/* GNU extension */
#define M_PIl		3.1415926535897932384626433832795029L
#endif

#define FPREGS_NUM	8
#define FPREG_SIZE	10
#define FPSTACK_SIZE	(FPREGS_NUM * FPREG_SIZE)

/*
 * http://lxr.linux.no/linux+v2.6.27.5/include/asm-x86/processor.h#L283
 */
struct i387_fsave {
	uint32_t fs_cwd; /* FPU Control Word             */
	uint32_t fs_swd; /* FPU Status Word              */
	uint32_t fs_twd; /* FPU Tag Word                 */
	uint32_t fs_fip; /* FPU IP Offset                */
	uint32_t fs_fcs; /* FPU IP Selector              */
	uint32_t fs_foo; /* FPU Operand Pointer Offset   */
	uint32_t fs_fos; /* FPU Operand Pointer Selector */

	/* 8*10 bytes for each FP-reg = 80 bytes:                       */
	uint8_t fs_st_space[FPSTACK_SIZE];

	/* Software status information [not touched by FSAVE ]:         */
	uint32_t fs_status;
};

void snprintfhex(char *buf, size_t size, const long double val)
{
	size_t i, len;
	const uint8_t *uval;

	len = 0;
	uval = (const uint8_t *)&val;

	for (i = 0; i < size; i++)
		len += snprintf(&buf[len], size - len, "%02x", uval[size - 1 - i]);
}

void printfhex(const char *fmt, const char *str, const long double val, ...)
{
	char buf[100];
	char buf2[200];
	va_list ap;

	snprintfhex(buf, sizeof(buf), val);
	snprintf(buf2, sizeof(buf2), "%s%s%s", fmt, buf, str);

	va_start(ap, val);
	vprintf(buf2, ap);
	va_end(ap);
}

void print_fsave(struct i387_fsave f, const char *msg)
{
	int i;
	long double *val;

	printf("\tCtrl: %08x %08x %08x %08x %08x %08x\n",
		f.fs_cwd, f.fs_swd, f.fs_twd, f.fs_fip, f.fs_fcs, f.fs_foo, f.fs_fos);

	for (i = 0; i < FPREGS_NUM; i++) {
		val = (long double *)&(f.fs_st_space[i * FPREG_SIZE]);

		printfhex("\t%i %20.20Le ", (i % 2) ? "\n" : "", *val, i, *val);
	}

	printf("\n");
}

void test_add(const char *msg)
{
	float f;
	double d;
	long double ld;
	uint64_t i;

	printf("%s: %s\n", __func__, msg);

	f = 0.0;
	d = 0.0;
	ld = 0.0;

	printfhex("f: %20.20e, ", "\n", f, f);
	printfhex("d: %20.20le, ", "\n", d, d);
	printfhex("ld: %20.20Le, ", "\n", ld, ld);

	for (i = 0; i < MAXLOOP; i++) {
		f += 0.1;
		d += 0.1;
		ld += 0.1;
	}

	printfhex("f: %20.20e, ", "\n", f, f);
	printfhex("d: %20.20le, ", "\n", d, d);
	printfhex("ld: %20.20Le, ", "\n", ld, ld);
}


void test_sin(const char *msg)
{
	float f;
	double d;
	long double ld;
	uint64_t i;

	printf("%s: %s\n", __func__, msg);

#if 0
	sin(ln(2) radians) = ~ 0.63896
#endif
	f = 0.0;
	d = 0.0;
	ld = 0.0;

	asm volatile( "fldln2;" "fsin;" : "=t" (f));
	asm volatile( "fldln2;" "fsin;" : "=t" (d));
	asm volatile( "fldln2;" "fsin;" : "=t" (ld));

	printfhex("f: %20.20e, ", "\n", f, f);
	printfhex("d: %20.20le, ", "\n", d, d);
	printfhex("ld: %20.20Le, ", "\n", ld, ld);
}

void test_manysin(const char *msg)
{
	long double ld = 0.0, ldout = 0.0;
	int i;

	printf("%s: %s\n", __func__, msg);

	/* Whole range */
	for (ld = 2.0e-63, i = -63; i < 63; ld *= -10.0, i++) {
		asm volatile( "fsin"
			: "=t" (ldout)
			: "0" (ld)
			);

		printfhex("sin( %20.20Le, ", "", ld, ld);
		printfhex(") = %20.20Le, ", "\n", ldout, ldout);
	}

	/* -2pi to 2pi */
	for (ld = -2.0*M_PIl; ld < 2.0*M_PIl; ld += 0.03)
	{
		asm volatile( "fsin"
			: "=t" (ldout)
			: "0" (ld)
			);

		printfhex("sin( %20.20Le, ", "", ld, ld);
		printfhex(") = %20.20Le, ", "\n", ldout, ldout);
	}



	ld = 0.7853975; /* pi/4: sin(ld) = 1/sqrt2 = 0.707 */
	asm volatile( "fsin"
		: "=t" (ldout)
		: "0" (ld)
		);

	printfhex("sin( %20.20Le, ", "", ld, ld);
	printfhex(") = %20.20Le, ", "\n", ldout, ldout);
}


/* sse */
void test_recip(const char *msg)
{
	float f, fout;
	int i;

	printf("%s: %s\n", __func__, msg);

	for (f = 1.0e-44, i = -44; i < 44; f *= -10.0, i++) {
		asm volatile( "rcpss %1, %0"
			: "=x" (fout)
			: "x" (f)
			);

		printfhex("rcpss( %20.20e, ", "", f, f);
		printfhex(" ) = %20.20e, ", "\n", fout, fout);

		asm volatile( "rsqrtss %1, %0"
			: "=x" (fout)
			: "x" (f)
			);

		printfhex("rsqrtss( %20.20e, ", "", f, f);
		printfhex(" ) = %20.20e, ", "", fout, fout);
	}

	printf("\n");
}

/* sse */
void test_addsubmuldiv(const char *msg)
{
	float f1 = 1.0e-44;
	float f2 = -3.124e-44;
	float fout;
	double d1 = 1.0e-300;
	double d2 = -3.124e-300;
	double dout;

	while (d1 < 1.0e300) {
		/* Add */
		asm volatile("movd %1, %0; addss %2, %0"
			: "=x" (fout) /*0*/
			: "m" (f1) /*1*/, "m" (f2) /*2*/
			);
		asm volatile("movsd %1, %0; addsd %2, %0"
			: "=x" (dout) /*0*/
			: "m" (d1) /*1*/, "m" (d2) /*2*/
			);

		printfhex("addss( %20.20e, ", "", f1, f1);
		printfhex(" %20.20e", "", f2, f2);
		printfhex(" ) = %20.20e, ", "\n", fout, fout);

		printfhex("addsd( %20.20le, ", "", d1, d1);
		printfhex(" %20.20le", "", d2, d2);
		printfhex(" ) = %20.20le, ", "\n", dout, dout); 

		printfhex("d1 + d2 = %20.20le, ", "\n", d1 + d2, d1 + d2);

		/* Sub */
		asm volatile("movd %1, %0; subss %2, %0"
			: "=x" (fout) /*0*/
			: "m" (f1) /*1*/, "m" (f2) /*2*/
			);
		asm volatile("movsd %1, %0; subsd %2, %0"
			: "=x" (dout) /*0*/
			: "m" (d1) /*1*/, "m" (d2) /*2*/
			);

		printfhex("subss( %20.20e, ", "", f1, f1);
		printfhex(" %20.20e", "", f2, f2);
		printfhex(" ) = %20.20e, ", "\n", fout, fout);

		printfhex("subsd( %20.20le, ", "", d1, d1);
		printfhex(" %20.20le", "", d2, d2);
		printfhex(" ) = %20.20le, ", "\n", dout, dout); 

		printfhex("d1 - d2 = %20.20le, ", "\n", d1 - d2, d1 - d2);

		/* Mul */
		asm volatile( "movd %1, %0; mulss %2, %0"
			: "=x" (fout) /*0*/
			: "m" (f1) /*1*/, "m" (f2) /*2*/
			);
		asm volatile( "movsd %1, %0; mulsd %2, %0"
			: "=x" (dout) /*0*/
			: "m" (d1) /*1*/, "m" (d2) /*2*/
			);

		printfhex("mulss( %20.20e, ", "", f1, f1);
		printfhex(" %20.20e", "", f2, f2);
		printfhex(" ) = %20.20e, ", "\n", fout, fout);

		printfhex("mulsd( %20.20le, ", "", d1, d1);
		printfhex(" %20.20le", "", d2, d2);
		printfhex(" ) = %20.20le, ", "\n", dout, dout); 

		printfhex("d1 * d2 = %20.20le, ", "\n", d1 * d2, d1 * d2);

		/* Div */
		asm volatile("movd %1, %0; divss %2, %0"
			: "=x" (fout) /*0*/
			: "m" (f1) /*1*/, "m" (f2) /*2*/
			);
		asm volatile("movsd %1, %0; divsd %2, %0"
			: "=x" (dout) /*0*/
			: "m" (d1) /*1*/, "m" (d2) /*2*/
			);

		printfhex("divss( %20.20e, ", "", f1, f1);
		printfhex(" %20.20e", "", f2, f2);
		printfhex(" ) = %20.20e, ", "\n", fout, fout);

		printfhex("divsd( %20.20le, ", "", d1, d1);
		printfhex(" %20.20le", "", d2, d2);
		printfhex(" ) = %20.20le, ", "\n", dout, dout); 

		printfhex("d1 / d2 = %20.20le, ", "\n", d1 / d2, d1 / d2);

		f1 *= -4.0 * 3.14159;
		d1 *= -4.0 * 3.14159;
		f2 *= -5.0 * 2.78182;
		d2 *= -5.0 * 2.78182;
	}

	printf("\n");
}

void test_fsave(const char *msg)
{
	printf("%s: %s\n", __func__, msg);
	struct i387_fsave f;

	asm("finit; fld1; fldl2e; fldl2t; fldlg2; fldln2; fldz; fldpi; fldpi; fcos; fsave %0"
		:
		: "m"(f)
		);
	print_fsave(f, "filled stack");


/*
 * Stack:
 * 0: cos(pi) = -1.0
 * 1: pi = 3.14...
 * 2: 0.0
 * 3: ln(2) = 0.6931...
 * 4: log10(2) = 0.3010...
 * 5: log2(10) = 3.3219...
 * 6: log2(e) = 1.4426...
 * 7: 1.0
 */

}

#define ALLTESTS	6

int main(int argc, const char * const argv[])
{
	int i, runtests;
	int testnr;


	printf("sizeof(float): %u\n", sizeof(float));
	printf("sizeof(double): %u\n", sizeof(double));
	printf("sizeof(long double): %u\n", sizeof(long double));

	runtests = argc;
	if (runtests == 1)
		runtests = ALLTESTS;

	for (i = 1; i <= runtests; i++) {
		if (argc == 1) {
			testnr = i;
		} else {
			testnr = (int)strtol(argv[i-1], NULL, 0);
			if (errno == ERANGE)
				continue;
		}

		switch (testnr) {
		case 1:
			test_add("floating add");
			break;
		case 2:
			test_sin("floating sine");
			break;
		case 3:
			test_fsave("fsave");
			break;
		case 4:
			test_manysin("sin of +- 2e-63 .. 2e63");
			break;
		case 5:
			test_recip("rcpss & rsqrtss");
			break;
		case 6:
			test_addsubmuldiv("sse add, sub, mul, div");
			break;
		}
	}

	return 0;
}

[-- Attachment #3: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Live Migration Across AMD and Intel
  2008-12-02 10:07     ` Igor Chubin
@ 2008-12-02 10:32       ` Keir Fraser
  2008-12-02 10:53       ` Thorolf Godawa
  1 sibling, 0 replies; 7+ messages in thread
From: Keir Fraser @ 2008-12-02 10:32 UTC (permalink / raw)
  To: Igor Chubin, James Harper; +Cc: xen-devel

On 2/12/08 10:07, "Igor Chubin" <igor@chub.in> wrote:

>> This appeared on slashdot a while ago...
>> http://linux.slashdot.org/linux/08/11/07/1535235.shtml. I'm not sure
>> that their source is that reputable though. If you were emulating a 486
>> it might work :)
> 
> Thank you Keir and James,
> 
> I know about this posting on slashdot.
> Also here is related video [1] with a small presentation.
> 
> They claim there that live migration across AMD and Intel
> works on KVM. As far as I understand now it works in
> some particular (and rare) situations but it is not true in general.

Yeah, it can definitely work in a range of cases. It's definitely demo-able.
But claiming it works in general in production environments is something
else!

 -- Keir

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Live Migration Across AMD and Intel
  2008-12-02 10:07     ` Igor Chubin
  2008-12-02 10:32       ` Keir Fraser
@ 2008-12-02 10:53       ` Thorolf Godawa
  1 sibling, 0 replies; 7+ messages in thread
From: Thorolf Godawa @ 2008-12-02 10:53 UTC (permalink / raw)
  To: xen-devel

Hi,

> As far as I understand now it works in some particular (and rare)
> situations but it is not true in general.
mostly if you have one old and one modern CPU it works on one direction.

During some prototyping a while ago I was always able to migrate from an
old Intel P3-system to an "modern" AMD Athlon-system, while the way back
most times result in a crash of the VM.

But I had better success with Intel P4 and Intel Xeon CPUs (both with
first generation of HVM-support), where I could do migration on both
directions without any errors!
-- 

Chau y hasta luego,

Thorolf

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2008-12-02 10:53 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-12-02  9:05 Live Migration Across AMD and Intel Igor Chubin
2008-12-02  9:23 ` Keir Fraser
2008-12-02  9:44   ` James Harper
2008-12-02 10:07     ` Igor Chubin
2008-12-02 10:32       ` Keir Fraser
2008-12-02 10:53       ` Thorolf Godawa
2008-12-02 10:14     ` Christoph Egger

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.