From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ian Campbell Subject: Re: stubdom migration failure on merlot* XSM related (Was: [adhoc test] 65682: tolerable FAIL]) Date: Fri, 11 Dec 2015 14:05:33 +0000 Message-ID: <1449842733.30975.21.camel@citrix.com> References: <1449835954.30975.0.camel@citrix.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <1449835954.30975.0.camel@citrix.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Ian Jackson , Wei Liu , Daniel De Graaf Cc: xen-devel List-Id: xen-devel@lists.xenproject.org On Fri, 2015-12-11 at 12:12 +0000, Ian Campbell wrote: > I think 10 repetitions indicates it probably wasn't a fluke, however I've > kicked off a repeat (flight 65738) of the test to increase confidence. > > In the test-amd64-amd64-xl-qemut-stubdom-debianhvm-amd64-xsm case I've set > debianhvm_general_timeoutfactor to 2, which should allow the ssh to > complete inside the now 20s timeout, which will then run the 10x case so we > can see how repeated migrations occur. This is not a fix, just a > workaround/diagnostic aid. > > I've left test-amd64-amd64-xl-qemut-stubdom-debianhvm-amd64 alone in the > repeat since I want to check the pass wasn't a fluke. That repeat has completed, results are at http://logs.test-lab.xenproject.org/osstest/logs/65738/ (build was reused from the previous adhoc flight 65682) Both test cases passed, remember that the -xsm one had a doubled time out. http://logs.test-lab.xenproject.org/osstest/logs/65738/test-amd64-amd64-xl-qemut-stubdom-debianhvm-amd64-xsm/13.ts-guest-localmigrate.log has (ssh debug* elided): 015-12-11 10:58:49 Z executing ssh ... -v root@172.16.145.159 date OpenSSH_6.0p1 Debian-4+deb7u2, OpenSSL 1.0.1e 11 Feb 2013 [...] Authenticated to 172.16.145.159 ([172.16.145.159]:22). [...] Fri Dec 11 10:59:04 UTC 2015 [...] Transferred: sent 3064, received 2032 bytes, in 0.0 seconds Bytes per second: sent 301038.8, received 199644.5 [...] 2015-12-11 10:59:05 Z execution took 15 seconds [**>20/2**]: timeout 50 ssh -[...] root@172.16.145.159 date In the repeated case http://logs.test-lab.xenproject.org/osstest/logs/65738/test-amd64-amd64-xl-qemut-stubdom-debianhvm-amd64-xsm/15.ts-guest-localmigrate.log it looks like 4 out of the 10 reps took 15-16s, the rest were below the theshold for logging. The non-XSM case passes completely again. Ian.