From: Andres Lagar Cavilla <andreslc@cs.toronto.edu>
To: xen-devel@lists.xensource.com
Subject: Live Migration Error
Date: Fri, 13 May 2005 12:07:33 -0500
Message-ID: <4284DED5.4050706@cs.toronto.edu>

Hi,

I've been scanning the list and have seen reports of problems with live
migration, so I thought I might add a bit more entropy.

I'm trying to do a live migration within the same physical host, i.e.
xm migrate --live 'whatever' localhost

It fails with 'Error: errors: suspend, failed, Callback timed out'.

The transfer of memory pages seems to work up to the point where the
domain needs to be suspended for the final transfer. The funny thing is
that it used to work before, gloriously, and I haven't made any
software/hardware changes. At some point an xm save command failed with
a timeout, and from then on live migration has failed with this message.

Non-live migration works perfectly, including between different physical
hosts. save/restore also works flawlessly.

For the record, I use nfsroot.

I have attached xfrd.log. I can post other information, just ask.

Thanks a lot
Andres

xfrd.log:
(xfr.migrate 6 "(domain (id 6) (name AndresNfsDomain) (memory 511)
(maxmem 524288) (state -b---) (cpu 1) (cpu_time 0.10838393) (up_time
27.1105668545) (start_time 1115999325.85) (console (status listening)
(id 12) (domain 6) (local_port 12) (remote_port 1) (console_port 9606))
(devices (vif (idx 0) (vif 0) (mac 00:80:84:00:00:11) (vifname vif6.0)
(evtchn 13 3) (index 0))) (config (vm (name AndresNfsDomain) (memory 512)
(image (linux (kernel /boot/vmlinuz-2.6.11-xenU) (ip
192.168.70.45:192.168.70.106:192.168.70.254:255.255.255.0:virtuality:eth0:off)
(root /dev/nfs)
(args 'nfsroot=192.168.70.106:/mnt/nfs2,rsize=32768,wsize=32768 4')))
(device (vif (mac 00:80:84:00:00:11))))))" localhost 8002 1 0)[DEBUG]
Conn_sxpr< err=0
[DEBUG] Conn_connect> addr=127.0.0.1:8002
[DEBUG] Conn_init> flags=1
[DEBUG] Conn_init> write stream...
[DEBUG] stream_init>mode=w flags=1 compress=0
[DEBUG] stream_init> unbuffer...
[DEBUG] stream_init< err=0
[DEBUG] Conn_init> read stream...
[DEBUG] stream_init>mode=r flags=1 compress=0
[DEBUG] stream_init> unbuffer...
[DEBUG] stream_init< err=0
[DEBUG] Conn_sxpr>
(xfr.err 0)[DEBUG] Conn_sxpr< err=0
[1115999352.965314] xc_linux_save start 6
xc_linux_save start 6
[1115999352.966265] Saving memory pages: iter 1 0%
Saving memory pages: iter 1 0%4344 [INF] XFRD> Xfr service for
127.0.0.1:54931
[DEBUG] Conn_init> flags=1
[DEBUG] Conn_init> write stream...
[DEBUG] stream_init>mode=w flags=1 compress=0
[DEBUG] stream_init> unbuffer...
[DEBUG] stream_init< err=0
[DEBUG] Conn_init> read stream...
[DEBUG] stream_init>mode=r flags=1 compress=0
[DEBUG] stream_init> unbuffer...
[DEBUG] stream_init< err=0
[DEBUG] Conn_sxpr>
(xfr.hello 1 0)[DEBUG] Conn_sxpr< err=0
[DEBUG] Conn_sxpr>
(xfr.xfr 6)[DEBUG] Conn_sxpr< err=0
[1115999352.971066] xc_linux_restore start
xc_linux_restore start
[1115999352.991648] Created domain 7
Created domain 7
[1115999353.003196] Reloading memory pages: 0%
Reloading memory pages: 5%
5%
10%
10%
10%FNI 765 : [1000007e,1020] pte=00bec063, mfn=00000bec, pfn=ffffffff
[mfn]=deadbeef
15%
15%
20%
20%
25%
25%
30%
30%
35%
35%
40%
40%
45%
45%
50%
50%
55%
55%
60%
60%
65%
65%
70%
70%
75%
75%
80%
80%
85%
85%
90%
90%
95%
95%
1: sent 130824, skipped 243,
1: sent 130824, skipped 243, delta 2629ms, dom0 100%, target 71%, sent
1630Mb/s, dirtied 4Mb/s 321 pages
[1115999355.596112] Saving memory pages: iter 2 0%
2: sent 320, skipped 0, 2 0%
2: sent 320, skipped 0, delta 11ms, dom0 0%, target 100%, sent 953Mb/s,
dirtied 35Mb/s 12 pages
[1115999355.607606] Saving memory pages: iter 3 0%
3: sent 12, skipped 0, r 3 0%
3: sent 12, skipped100%
100%[DEBUG] Conn_sxpr>
(xfr.err 22)[DEBUG] Conn_sxpr< err=0
Retry suspend domain (120)
#... This repeats 198 times in total ...#
Retry suspend domain (120)
Unable to suspend domain. (120)
Unable to suspend domain. (120)
Domain appears not to have suspended: 120
Domain appears not to have suspended: 120
4343 [WRN] XFRD> Transfer errors:
4343 [WRN] XFRD> state=XFR_STATE err=1
4343 [INF] XFRD> Xfr service err=1
Error when reading from state file
Error when reading from state file
4344 [INF] XFRD> Xfr service err=1
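
A minimal sketch for summarizing a log like the one above, in case it helps when comparing a working run against a failing one. It is not part of Xen or xfrd. The function name `summarize` and the regular expressions are assumptions based only on the line shapes visible in this log: the complete per-iteration lines ("N: sent X, skipped Y, delta Zms, ..."), the "Retry suspend domain" lines, and the "Unable to suspend domain" message. Truncated duplicates of the iteration lines are ignored.

```python
import re

# Complete per-iteration summary line, e.g.
# "1: sent 130824, skipped 243, delta 2629ms, dom0 100%, ..."
# Truncated duplicates without the "delta" field do not match.
ITER_RE = re.compile(r"^(\d+): sent (\d+), skipped (\d+), delta (\d+)ms")

# Suspend retry line, e.g. "Retry suspend domain (120)"
RETRY_RE = re.compile(r"^Retry suspend domain \((\d+)\)")


def summarize(log_text):
    """Collect per-iteration page counts and suspend retries from log text."""
    iterations = {}
    retries = 0
    for raw in log_text.splitlines():
        line = raw.strip()
        m = ITER_RE.match(line)
        if m:
            n, sent, skipped, delta_ms = map(int, m.groups())
            iterations[n] = {
                "sent": sent,
                "skipped": skipped,
                "delta_ms": delta_ms,
            }
            continue
        if RETRY_RE.match(line):
            retries += 1
    return {
        "iterations": iterations,
        "suspend_retries": retries,
        # Treated as failed if the log contains the explicit failure message.
        "suspended": "Unable to suspend domain" not in log_text,
    }


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python summarize_xfrd.py xfrd.log
    with open(sys.argv[1]) as f:
        print(summarize(f.read()))
```

Note that a log pasted with elided repeats, as above, will undercount the retries, so the count is only meaningful on the full file.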