xen-devel.lists.xenproject.org archive mirror
* Internal error during live migration saving
@ 2011-09-13 21:01 Nathan March
  2011-09-14 17:53 ` Shriram Rajagopalan
  0 siblings, 1 reply; 4+ messages in thread
From: Nathan March @ 2011-09-13 21:01 UTC (permalink / raw)
  To: xen-devel

Just wondering if this is a known bug?

Trying to migrate this VM off to a different dom0 results in the error
below. Other VMs (started at around the same time as this one) migrated
off fine, and I've tried a few different target servers, all with the
same result.

[2011-09-13 13:48:24 3996] DEBUG (XendCheckpoint:124) [xc_save]: /usr/lib/xen/bin/xc_save 29 77 0 0 1
[2011-09-13 13:48:24 3996] INFO (XendCheckpoint:423) xc_save: failed to get the suspend evtchn port
[2011-09-13 13:48:24 3996] INFO (XendCheckpoint:423)
[2011-09-13 13:49:03 3996] DEBUG (XendCheckpoint:394) suspend
[2011-09-13 13:49:03 3996] DEBUG (XendCheckpoint:127) In saveInputHandler suspend
[2011-09-13 13:49:03 3996] DEBUG (XendCheckpoint:129) Suspending 77 ...
[2011-09-13 13:49:03 3996] DEBUG (XendDomainInfo:524) XendDomainInfo.shutdown(suspend)
[2011-09-13 13:49:03 3996] DEBUG (XendDomainInfo:1881) XendDomainInfo.handleShutdownWatch
[2011-09-13 13:50:06 3996] DEBUG (XendDomainInfo:1881) XendDomainInfo.handleShutdownWatch
[2011-09-13 13:50:06 3996] INFO (XendCheckpoint:423) xc: error: Suspend request failed: Internal error
[2011-09-13 13:50:06 3996] INFO (XendCheckpoint:423) xc: error: Domain appears not to have suspended: Internal error
[2011-09-13 13:50:06 3996] ERROR (XendCheckpoint:185) Save failed on domain globish (77) - resuming.
Traceback (most recent call last):
  File "/usr/lib64/python2.6/site-packages/xen/xend/XendCheckpoint.py", line 146, in save
    forkHelper(cmd, fd, saveInputHandler, False)
  File "/usr/lib64/python2.6/site-packages/xen/xend/XendCheckpoint.py", line 395, in forkHelper
    inputHandler(line, child.tochild)
  File "/usr/lib64/python2.6/site-packages/xen/xend/XendCheckpoint.py", line 131, in saveInputHandler
    dominfo.waitForSuspend()
  File "/usr/lib64/python2.6/site-packages/xen/xend/XendDomainInfo.py", line 2998, in waitForSuspend
    raise XendError(msg)
XendError: Timeout waiting for domain 77 to suspend
[2011-09-13 13:50:06 3996] DEBUG (XendDomainInfo:3135) XendDomainInfo.resumeDomain(77)

xend-debug.log and the target dom0 logs don't show anything of value.

This is xen 4.1.1 on linux 3.0.3
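
For reference, the log shows how long xend waited for the guest before
giving up: from "Suspending 77 ..." at 13:49:03 to the ERROR entry at
13:50:06, i.e. 63 seconds. A minimal sketch of pulling that interval out
of an xend.log excerpt follows. It assumes Python 3 and the
"[date time pid] LEVEL (Module:line) message" layout seen in this log;
the function names parse_xend_log and suspend_wait_seconds are made up
for illustration and are not part of xend.

```python
import re
from datetime import datetime

# Prefix of an xend log entry, as it appears in the excerpt:
# [YYYY-MM-DD HH:MM:SS pid] LEVEL (Module:line) message
LINE_RE = re.compile(
    r"^\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \d+\] (\w+) \((\w+):(\d+)\)\s*(.*)$"
)


def parse_xend_log(text):
    """Return a list of (timestamp, level, module, message) tuples.

    Lines without the bracketed prefix (for example text wrapped by a
    mail client) are appended to the message of the previous entry.
    """
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = LINE_RE.match(line)
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            entries.append([stamp, match.group(2), match.group(3), match.group(5)])
        elif entries:
            entries[-1][3] = (entries[-1][3] + " " + line).strip()
    return [tuple(entry) for entry in entries]


def suspend_wait_seconds(entries):
    """Seconds between the 'Suspending <domid>' entry and the first ERROR."""
    start = next(ts for ts, _, _, msg in entries if msg.startswith("Suspending"))
    end = next(ts for ts, level, _, _ in entries if level == "ERROR")
    return (end - start).total_seconds()


if __name__ == "__main__":
    sample = """\
[2011-09-13 13:49:03 3996] DEBUG (XendCheckpoint:129) Suspending 77 ...
[2011-09-13 13:50:06 3996] ERROR (XendCheckpoint:185) Save failed on
domain globish (77) - resuming.
"""
    print(suspend_wait_seconds(parse_xend_log(sample)))
```

The traceback lines that follow the ERROR entry carry no timestamp, so
this sketch folds them into that entry's message rather than treating
them as separate events.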

- Nathan

-- 
Nathan March <nathan@gt.net>
Gossamer Threads Inc. http://www.gossamer-threads.com/
Tel: (604) 687-5804 Fax: (604) 687-5806


* Re: Internal error during live migration saving
  2011-09-13 21:01 Internal error during live migration saving Nathan March
@ 2011-09-14 17:53 ` Shriram Rajagopalan
  2011-09-14 17:58   ` Nathan March
  0 siblings, 1 reply; 4+ messages in thread
From: Shriram Rajagopalan @ 2011-09-14 17:53 UTC (permalink / raw)
  To: Nathan March; +Cc: xen-devel

On Tue, Sep 13, 2011 at 2:01 PM, Nathan March <nathan@gt.net> wrote:
> Just wondering if this is a known bug?
>
> Trying to migrate the VM off to a diff dom0 results in the below error.
> Other VMs migrated off fine (started at around the same time as this vm) and
> I've tried a few different target servers, all resulting in the same thing.
>

Were the other domains Linux 3.0.3 as well?

Did you try xm save -c (or the xl equivalent)? That should exercise the
same code path where this error seems to appear.

Also, make sure you have CONFIG_XEN_SAVE_RESTORE enabled.


* Re: Internal error during live migration saving
  2011-09-14 17:53 ` Shriram Rajagopalan
@ 2011-09-14 17:58   ` Nathan March
  2011-09-14 18:09     ` Shriram Rajagopalan
  0 siblings, 1 reply; 4+ messages in thread
From: Nathan March @ 2011-09-14 17:58 UTC (permalink / raw)
  To: rshriram; +Cc: xen-devel


On 9/14/2011 10:53 AM, Shriram Rajagopalan wrote:
> Were other domains linux 3.0.3 as well ?
All the dom0s run 3.0.3 and all the domUs run 2.6.32.27 (with grsec).

I did a cold reboot of the VM and now it migrates properly.

> Did you try xm save -c (or the xl equivalent) ? This should be
> activating the same
> code path where this error seems to appear.
>
> Also, make sure you have CONFIG_XEN_SAVE_RESTORE enabled.
Unfortunately I didn't think to try it. I do have that option set on
both dom0 and domU.

- Nathan


* Re: Internal error during live migration saving
  2011-09-14 17:58   ` Nathan March
@ 2011-09-14 18:09     ` Shriram Rajagopalan
  0 siblings, 0 replies; 4+ messages in thread
From: Shriram Rajagopalan @ 2011-09-14 18:09 UTC (permalink / raw)
  To: Nathan March; +Cc: xen-devel

On Wed, Sep 14, 2011 at 10:58 AM, Nathan March <nathan@gt.net> wrote:
>> Were other domains linux 3.0.3 as well ?
>
> All the dom0's are 3.0.3 and all the domU's are 2.6.32.27 (w/ grsec).
>
> I did a cold reboot of the VM and now it migrates properly.
>
> Unfortunately I didn't think to try it. I do have that set on both dom0 and
> domu.
>
> - Nathan
>
>

Oh, I assumed the domUs were Linux 3.0.3. That config option has no
meaning for dom0s.
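
Since the option only matters in the guest kernel, the check belongs
inside the domU. A minimal sketch of such a check follows. It assumes
Python 3 in the guest and that the kernel config is readable from
/proc/config.gz or /boot/config-<release>, neither of which is
guaranteed on a given system; the helper names option_enabled and
read_running_kernel_config are made up for illustration.

```python
import gzip
import os
import platform


def option_enabled(config_text, option):
    """True if `option` is set to y or m in the given kernel config text.

    Lines such as '# CONFIG_FOO is not set' count as disabled.
    """
    for line in config_text.splitlines():
        line = line.strip()
        if line.startswith(option + "="):
            return line.split("=", 1)[1] in ("y", "m")
    return False


def read_running_kernel_config():
    """Best-effort read of the running kernel's config; None if unavailable."""
    candidates = ["/proc/config.gz", "/boot/config-" + platform.release()]
    for path in candidates:
        if not os.path.exists(path):
            continue
        opener = gzip.open if path.endswith(".gz") else open
        try:
            with opener(path, "rt") as handle:
                return handle.read()
        except OSError:
            continue
    return None


if __name__ == "__main__":
    text = read_running_kernel_config()
    if text is None:
        print("kernel config not found; check the build tree instead")
    else:
        print("CONFIG_XEN_SAVE_RESTORE enabled:",
              option_enabled(text, "CONFIG_XEN_SAVE_RESTORE"))
```

Run inside the guest, this only confirms how the kernel was built. It
says nothing about why a particular domain failed to suspend.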
