From: Shriram Rajagopalan <rshriram@cs.ubc.ca>
To: Ian Jackson
Cc: "xen-devel@lists.xen.org" <xen-devel@lists.xen.org>
Subject: Re: [PATCH v5 00/21] libxl: domain save/restore: run in a separate process
Date: Wed, 27 Jun 2012 12:06:15 -0400

On Wed, Jun 27, 2012 at 9:46 AM, Ian Jackson wrote:
> Shriram Rajagopalan writes ("Re: [PATCH v5 00/21] libxl: domain
> save/restore: run in a separate process"):
> > Ian,
> > The code segfaults. Here are the system details and error traces from gdb.
>
> Thanks.
>
> > My setup:
> >
> > dom0: ubuntu 64-bit, 2.6.32-39 (pvops kernel),
> >       running the latest xen-4.2-unstable (built from your repo);
> >       tools stack also built from your repo (which I hope has all the latest patches).
> >
> > domU: ubuntu 32-bit PV, xenolinux kernel (2.6.32.2 - Novell SUSE version)
> >       with suspend event channel support
> >
> > As a sanity check, I tested xl remus with the latest tip from the xen-unstable
> > mercurial repo, c/s 25496:e08cf97e76f0.
> >
> > Blackhole replication (to /dev/null) and localhost replication worked as expected,
> > and the guest recovered properly without any issues.
>
> Thanks for the test runes.  That didn't work entirely properly for
> me, even with the xen-unstable baseline.
>
> I did this
>   xl -vvvv remus -b -i 100 debian.guest.osstest dummy >remus.log 2>&1 &
> The result was that the guest's networking broke.  The guest shows up
> in xl list as
>   debian.guest.osstest        7   512     1     ---ss-       5.2
> and is still responsive on its pv console.

This is normal. You are suspending every 100ms, so when you see ---ss- you
just happened to run "xl list" right when the guest was suspended. :)

Do an "xl top" and you will see the guest's state oscillate between --b-- and
--s--, depending on the checkpoint interval. Or run "xl list" several times in
a row.
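For example, a quick throwaway loop like this (using your test guest's name;
any domU under remus works) samples the state column often enough to catch it
flipping as checkpoints fire:

  # sample the state column ~20 times over one second (GNU sleep accepts fractions)
  for i in $(seq 1 20); do xl list debian.guest.osstest | tail -n 1; sleep 0.05; done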
> After I killed the remus
> process, the guest's networking was still broken.

That is strange. xl remus has literally no networking support on the remus
front, so it shouldn't affect anything in the guest. In fact, I repeated your
test on my box, where the guest was continuously pinging a host. Pings
continued to work, and so did ssh.

> At the start, the guest prints this on its console:
>  [   36.017241] WARNING: g.e. still in use!
>  [   36.021056] WARNING: g.e. still in use!
>  [   36.024740] WARNING: g.e. still in use!
>  [   36.024763] WARNING: g.e. still in use!
>
> If I try the rune with "localhost" I would have expected, surely, to
> see a domain with the incoming migration ?  But I don't.  I tried
> killing the `xl remus' process and the guest became wedged.

With the "-b" option the second argument (localhost|dummy) is ignored. Did you
try the command without the -b option, i.e.

  xl remus -vvv -e domU localhost

But I was partially able to reproduce some of your test results without your
patches (i.e. on the xen-unstable baseline). See the end of this mail for more
details.

> However, when I apply my series, I can indeed produce an assertion
> failure:
>
>  xc: detail: All memory is saved
>  xc: error: Could not get domain info (3 = No such process): Internal error
>  libxl: error: libxl.c:388:libxl_domain_resume: xc_domain_resume failed for domain 3077579968: No such process
>  xl: libxl_event.c:1426: libxl__ao_inprogress_gc: Assertion `ao->magic == 0xA0FACE00ul' failed.
>
> So I have indeed made matters worse.
>
> > Blackhole replication:
> > ================
> > xl error:
> > ----------
> > xc: error: Could not get domain info (3 = No such process): Internal error
> > libxl: error: libxl.c:388:libxl_domain_resume: xc_domain_resume failed for domain 4154075147: No such process
> > libxl: error: libxl_dom.c:1184:libxl__domain_save_device_model: unable to open qemu save file ?8b: No such file or directory
>
> I don't see that at all.
>
> NB that PV guests may have a qemu for certain disk backends, or
> consoles, depending on the configuration.  Can you show me your domain
> config ?  Mine is below.

Ah, that explains the qemu-related calls.

My guest config (from the tests on the 32-bit PV domU w/ suspend event channel
support):

kernel = "/home/kernels/vmlinuz-2.6.32.2-xenu"
memory = 1024
name = "xltest2"
vcpus = 2
vif = [ 'mac=00:16:3e:00:00:01,bridge=eth0' ]
disk = [ 'phy:/dev/drbd1,xvda1,w' ]
hostname = "rshriram-vm3"
root = "/dev/xvda1 ro"
extra = "console=xvc0 3"
on_poweroff = 'destroy'
on_reboot   = 'destroy'
on_crash    = 'coredump-destroy'

NB: This guest kernel has suspend-event-channel support, which I suppose is
available in all SUSE kernels. If you would just like to use mine, the source
tarball (2.6.32.2 version + kernel config) is at
http://aramis.nss.cs.ubc.ca/xenolinux-2.6.32.2.tar.gz

> > I also ran xl in GDB to get a stack trace and hopefully some useful debug info.
> > gdb traces: http://pastebin.com/7zFwFjW4
>
> I get a different crash - see above.
>
> > Localhost replication: Partial success, but xl still segfaults
> > dmesg shows
> > [ 1399.254849] xl[4716]: segfault at 0 ip 00007f979483a417 sp 00007fffe06043e0 error 6 in libxenlight.so.2.0.0[7f9794807000+4d000]
>
> I see exactly the same thing with `localhost' instead of `dummy'.  And
> I see no incoming domain.
>
> I will investigate the crash I see.  In the meantime can you try to
> help me see why it doesn't work for me even with the baseline ?

I also tested with a 64-bit 3.3.0 PV kernel (w/o suspend event channel support).

guest config:

kernel = "/home/kernels/vmlinuz-3.3.0-rc1-xenu"
memory = 1024
name = "xl-ubuntu-pv64"
vcpus = 2
vif = [ 'mac=00:16:3e:00:00:03, bridge=eth0' ]
disk = [ 'phy:/dev/vgdrbd/ubuntu-pv64,xvda1,w' ]
hostname = "rshriram-vm1"
root = "/dev/xvda1 ro"
extra = "console=hvc0 3"

With the xen-unstable baseline:
Test 1. Blackhole replication
  command: nohup xl remus -vvv -e -b -i 100 xl-ubuntu-pv64 dummy >blackhole.log 2>&1 &
  result:  works (networking included)

  debug output:
  libxl: debug: libxl_dom.c:687:libxl__domain_suspend_common_callback: issuing PV suspend request via XenBus control node
  libxl: debug: libxl_dom.c:691:libxl__domain_suspend_common_callback: wait for the guest to acknowledge suspend request
  libxl: debug: libxl_dom.c:738:libxl__domain_suspend_common_callback: guest acknowledged suspend request
  libxl: debug: libxl_dom.c:742:libxl__domain_suspend_common_callback: wait for the guest to suspend
  libxl: debug: libxl_dom.c:754:libxl__domain_suspend_common_callback: guest has suspended

  caveat: killing remus doesn't do a proper cleanup, i.e. if you kill it while
          the domain is suspended, it leaves the domain in the suspended state
          (where libxl waits for the guest to suspend). It's a pain. In the
          xend/python version, I added a SIGUSR1 handler, so that one could do
          "pkill -USR1 -f remus" and exit remus gracefully, without wedging the
          domU.

          * I do not know if adding signal handlers is frowned upon in xl land :)
            If there is some protocol in place to handle such things, I would be
            happy to send a patch that ensures that the guest is "resumed" while
            doing blackhole replication. (A rough sketch of the idea is at the
            end of this mail.)

Test 2. Localhost replication w/ failover by destroying the primary VM
  command: nohup xl remus -vvv -b -i 100 xl-ubuntu-pv64 localhost >blackhole.log 2>&1 &
  result:  works (networking included)
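Just to illustrate the behaviour I have in mind, here is a sketch; this is not
the real remus/xl code, and take_checkpoint is only a stand-in for the actual
suspend/replicate/resume step:

#!/bin/sh
# Sketch only: SIGUSR1 asks the checkpoint loop to stop at a safe point,
# i.e. after the guest has been resumed from the current checkpoint,
# so stopping remus never leaves the domU suspended.
stop=0
trap 'stop=1' USR1

take_checkpoint() {
    # placeholder for: suspend guest, send/discard dirty state, resume guest
    sleep 0.1
}

while [ "$stop" -eq 0 ]; do
    take_checkpoint
done

echo "remus: clean exit, guest left running"

The real patch would obviously hook this into xl/libxl rather than a shell
loop; the sketch is only about where the exit happens.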