From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:46485) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fLNHU-0005xD-QY for qemu-devel@nongnu.org; Wed, 23 May 2018 02:29:53 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fLNHQ-0008Q9-Rr for qemu-devel@nongnu.org; Wed, 23 May 2018 02:29:52 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:38844 helo=mx1.redhat.com) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1fLNHQ-0008PO-Md for qemu-devel@nongnu.org; Wed, 23 May 2018 02:29:48 -0400 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.rdu2.redhat.com [10.11.54.4]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id BF7025BCB6 for ; Wed, 23 May 2018 06:29:45 +0000 (UTC) From: Juan Quintela In-Reply-To: <20180522162037.GG6233@localhost.localdomain> (Kevin Wolf's message of "Tue, 22 May 2018 18:20:37 +0200") References: <20180515234017.2277-1-quintela@redhat.com> <20180515234017.2277-11-quintela@redhat.com> <20180518085902.GB4833@localhost.localdomain> <20180518103441.GD2478@work-vm> <20180518121430.GC4833@localhost.localdomain> <20180522162037.GG6233@localhost.localdomain> Reply-To: quintela@redhat.com Date: Wed, 23 May 2018 08:29:32 +0200 Message-ID: <87muwq6g1f.fsf@secure.mitica> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Subject: Re: [Qemu-devel] [PULL 10/40] migration: Delay start of migration main routines List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Kevin Wolf Cc: "Dr. David Alan Gilbert" , lvivier@redhat.com, qemu-devel@nongnu.org, peterx@redhat.com Kevin Wolf wrote: > Am 18.05.2018 um 14:14 hat Kevin Wolf geschrieben: >> Am 18.05.2018 um 12:34 hat Dr. David Alan Gilbert geschrieben: >> > * Kevin Wolf (kwolf@redhat.com) wrote: >> > > Am 16.05.2018 um 01:39 hat Juan Quintela geschrieben: >> > > > We need to make sure that we have started all the multifd threads. >> > > >=20 >> > > > Signed-off-by: Juan Quintela >> > > > Reviewed-by: Daniel P. Berrang=C3=A9 >> > >=20 >> > > This commit makes qemu-iotests 091 hang for me. Either it breaks >> > > backward compatibility intentionally and we need to update the test >> > > case, or there is a bug somewhere. >> >=20 >> > It's not an intentional break. >> > And the avocado tcp and exec migrations pass OK, so hmm. >>=20 >> In case it helps, 169 fails as well and I got a core dump of an aborting >> QEMU process: >>=20 >> (gdb) bt >> #0 0x00007ff079f779fb in raise () at /lib64/libc.so.6 >> #1 0x00007ff079f79800 in abort () at /lib64/libc.so.6 >> #2 0x00007ff079f700da in __assert_fail_base () at /lib64/libc.so.6 >> #3 0x00007ff079f70152 in () at /lib64/libc.so.6 >> #4 0x000055c2126f067b in bdrv_close_all () at block.c:3375 >> #5 0x000055c2123c54a6 in main (argc=3D, argv=3D> out>, envp=3D) at vl.c:4682 >>=20 >> If I understand correctly, that assertion failure means that someone is >> still holding a reference to a block device after all user-owned >> references have been closed. I suppose this was the source qemu and >> the migration hasn't been completed properly, though I haven't looked at >> the code yet and this idea might be completely wrong. >>=20 >> Anyway, 091 is certainly the simpler test case to play with, but maybe >> this gives you another hint. > > Any news on this? This is starting to become really annoying as a > hanging test suite impacts my ability to properly test block layer > patches. > > If there is no hope of quickly getting a proper fix for this, we may > have to revert something for now to fight the symptoms at least. Hi I am looking into this. I have been on vacation the end of last week, and to make things nicer, body decided that taking a stomach flu before returning was a great idea. Coping with email/tests. Later, Juan.