From: Chegu Vinod <chegu_vinod@hp.com>
To: qemu-devel <qemu-devel@nongnu.org>
Subject: [Qemu-devel] Live Migration of a large guest : guest frozen on the destination host
Date: Mon, 11 Jun 2012 07:02:06 -0700
Message-ID: <4FD5FA5E.1020305@hp.com>
Hello,
I am having some issues trying to live migrate a large guest and would
like to get some pointers on how to go about debugging this. Here is
some information on the configuration.
_Hardware:_
Two DL980s, each with 80 Westmere cores + 1 TB of RAM, connected by a
private 10G NIC link (back to back).
_Host software used:_
Host kernel: 3.4.1
Qemu versions used:
Case 1: upstream qemu (1.1.50) - from qemu.git
Case 2: 1.0.92 + Juan Quintela's huge_memory changes
_Guest:_
40 VCPUs + 512GB
_Guest software used:_
RHEL6.3 RC1 (had some basic boot issues with the 3.4.1 kernel and udevd).
The guest is booted off an FC LUN (visible to both hosts).
[Note: I am not using virsh/virt-manager etc., just qemu itself to start
the guest, and the qemu monitor to drive the live migration. I have set
the migration speed to 10G but have not changed the downtime
(default: 30ms).]
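To put those two settings in perspective, here is a rough
back-of-the-envelope sketch. It assumes that "10G" means a 10 Gbit/s
link that migration can use in full, that 512GB means 512 GiB, and that
qemu only enters the final stop-and-copy stage once the remaining dirty
memory fits into the allowed downtime at the measured bandwidth. The
function names are hypothetical and only serve the illustration.

```python
def downtime_budget_bytes(link_gbit_per_s: float, max_downtime_ms: float) -> float:
    """Bytes that can cross the link within the allowed downtime window."""
    bytes_per_second = link_gbit_per_s * 1e9 / 8  # assumes a fully usable link
    return bytes_per_second * (max_downtime_ms / 1000.0)


def full_copy_seconds(guest_ram_gib: float, link_gbit_per_s: float) -> float:
    """Seconds needed for a single pass over all guest RAM at line rate."""
    bytes_per_second = link_gbit_per_s * 1e9 / 8
    return guest_ram_gib * 2**30 / bytes_per_second


if __name__ == "__main__":
    # Settings from this report: 10G link, 30 ms downtime, 512GB guest.
    budget = downtime_budget_bytes(10, 30)
    print(f"dirty memory allowed at stop-and-copy: {budget / 1e6:.1f} MB")
    print(f"one full pass over guest RAM: {full_copy_seconds(512, 10):.0f} s")
```

Under these assumptions only a few tens of MB may still be dirty when
the guest is paused, while a single pass over guest RAM takes several
minutes, so a workload that dirties memory quickly leaves very little
headroom.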
I tried to live migrate this large guest using either of the qemu
versions (i.e. Case 1 or Case 2) and observed the following:
When the guest was idle, I was able to live migrate it and have it come
up fine on the other host. I was able to interact with the guest on the
destination host.
With workloads (e.g. AIM7-compute, SpecJBB or Google Stress App Test
(SAT)) running in the guest, live migration behaves differently: after
a while the source host claims that the live migration is complete, but
the guest on the destination host is often in a "frozen/hung" state. I
can't interact with it or ping it. I am still trying to capture more
information, but was also hoping to get some clues/tips from the
experts on these mailing lists.
[BTW, is there a way to take a snapshot of the guest's image on the
source host just before the "downtime" (i.e. the start of stage 3) and
compare it with the guest's image on the destination host just before
it is about to resume? Is such a debugging feature already available?]
Thanks
Vinod