From: Keir Fraser <keir@xensource.com>
To: "Graham, Simon" <Simon.Graham@stratus.com>,
	Keir Fraser <keir@xensource.com>,
	xen-devel@lists.xensource.com
Subject: Re: DomU crash during migration when suspending source domain
Date: Wed, 14 Feb 2007 14:56:41 +0000
Message-ID: <C1F8D3A9.9664%keir@xensource.com>
In-Reply-To: <342BAC0A5467384983B586A6B0B3767104A6A851@EXNA.corp.stratus.com>

On 14/2/07 14:43, "Graham, Simon" <Simon.Graham@stratus.com> wrote:

> Do you plan to do this for PV domains as well as HVM?

Yes, we have a special paravirtualised CPUID interface which Linux uses. So
this can be done.
 
> I guess I'm not quite sure I fully understand -- since we hot remove all
> the processors (but one - I guess that is an issue) and then hot add
> them again after migration, you would think it would be OK to hot add a
> completely different processor -- of course there will be issues with
> the Linux code given that you can't actually test this on a
> non-virtualized system.

You might indeed think that. Unfortunately code can depend on the fact that
all x86 systems (at least so far) have symmetric cache hierarchies. In the
case of this particular code, num_cache_leaves is latched during boot based
on CPU0's CPUID result. This value is then considered safe to use for all
CPUs forever more, which is not a good assumption in your case.

In this particular case it is quite arguable that
cache_remove_shared_cpu_map() should check cpuid4_info[i] != NULL, just as
is done in cache_shared_cpu_map_setup(). I can make this fix in our tree but
something similar ought to be submitted upstream too. I'm pretty certain
that this will fix your crash.
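
To make the proposed check concrete, here is a minimal user-space sketch of the idea. It is a simplified model, not the kernel source: the struct layout, NR_CPUS, the fixed leaf count and the cache_setup_cpu() helper are all assumptions made for illustration. Only the names num_cache_leaves, cpuid4_info and cache_remove_shared_cpu_map() come from the discussion above.

```c
#include <stddef.h>
#include <stdlib.h>

#define NR_CPUS 4   /* assumption: small fixed CPU count for the model */

/* Hypothetical stand-in for the per-leaf cache descriptor. */
struct cache_info {
    unsigned long shared_cpu_map;   /* bitmask of CPUs sharing this leaf */
};

/* Latched once at boot from CPU0's CPUID result, then reused for all CPUs. */
static unsigned int num_cache_leaves = 3;

/* Per-CPU array of leaf descriptors; NULL until that CPU has been set up. */
static struct cache_info *cpuid4_info[NR_CPUS];

/* Hypothetical helper: allocate descriptors for one CPU. Each leaf starts
 * out shared only with the CPU itself. Returns 0 on success, -1 on failure. */
int cache_setup_cpu(unsigned int cpu)
{
    unsigned int leaf;

    cpuid4_info[cpu] = calloc(num_cache_leaves, sizeof(struct cache_info));
    if (cpuid4_info[cpu] == NULL)
        return -1;
    for (leaf = 0; leaf < num_cache_leaves; leaf++)
        cpuid4_info[cpu][leaf].shared_cpu_map = 1UL << cpu;
    return 0;
}

/* Clear 'cpu' from the shared map of every sibling on the given leaf.
 * Siblings whose descriptors were never allocated (or already freed)
 * are skipped instead of dereferenced. */
void cache_remove_shared_cpu_map(unsigned int cpu, unsigned int leaf)
{
    struct cache_info *this_leaf;
    unsigned int sibling;

    if (cpuid4_info[cpu] == NULL)
        return;
    this_leaf = &cpuid4_info[cpu][leaf];

    for (sibling = 0; sibling < NR_CPUS; sibling++) {
        if (!(this_leaf->shared_cpu_map & (1UL << sibling)))
            continue;
        if (cpuid4_info[sibling] == NULL)
            continue;   /* the guard argued for above */
        cpuid4_info[sibling][leaf].shared_cpu_map &= ~(1UL << cpu);
    }
}
```

Without the guard on the sibling pointer, a CPU whose descriptors are missing but whose bit is still set in another CPU's shared map would be dereferenced through a NULL pointer.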

> You misunderstand my point -- in an FT environment, you MUST be able to
> upgrade and repair hardware without taking the domain down -- clearly
> this would normally be to an equivalent or higher functionality system
> but we can't guarantee that there won't be a new spiffy processor that
> causes this same issue to arise or that we won't run into some similar
> issue when replacing faulty hardware (the original system might no
> longer be available for example).

Upgrading upwards actually tends to be okay. I can't think of any practical
examples of how that might fail. After all, worst case we can hide the extra
features from the guest since we have some control over CPUID. *Downgrading*
is the problem!
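
As a rough illustration of why the two directions differ, treat the CPUID feature flags as a bitmask. This is only a sketch: the function names and the single 32-bit word are assumptions for the example, not Xen's actual interface.

```c
#include <stdint.h>

/* Hypothetical: the features a guest gets to see are the host's features
 * restricted to a chosen baseline, so extra host features can be hidden. */
uint32_t guest_visible_features(uint32_t host_features,
                                uint32_t baseline_features)
{
    return host_features & baseline_features;
}

/* Hypothetical: returns nonzero if the destination host provides every
 * feature the guest has already been shown, zero otherwise. */
int migration_is_safe(uint32_t guest_features, uint32_t dest_host_features)
{
    return (guest_features & ~dest_host_features) == 0;
}
```

Moving to a host with a superset of features passes the check, since the surplus bits can simply be masked off. Moving to a host that lacks a bit the guest has already seen fails it, and no amount of masking can add the missing feature back.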

 -- Keir
