From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:54739) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cRHHj-0000Rs-I3 for qemu-devel@nongnu.org; Wed, 11 Jan 2017 06:41:45 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cRHHg-00060k-Ei for qemu-devel@nongnu.org; Wed, 11 Jan 2017 06:41:43 -0500 Received: from mail-wm0-x22e.google.com ([2a00:1450:400c:c09::22e]:38581) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1cRHHg-00060G-8Y for qemu-devel@nongnu.org; Wed, 11 Jan 2017 06:41:40 -0500 Received: by mail-wm0-x22e.google.com with SMTP id r144so25696306wme.1 for ; Wed, 11 Jan 2017 03:41:40 -0800 (PST) References: <000301d259dc$f9d097c0$ed71c740$@ru> <000601d25a95$12b1b9f0$38152dd0$@ru> <20161220102126.GE5602@stefanha-x1.localdomain> <002501d25ab1$af024b00$0d06e100$@ru> <000301d25b4f$20018440$60048cc0$@ru> <000801d26bd9$dca56db0$95f04910$@ru> From: Alex =?utf-8?Q?Benn=C3=A9e?= In-reply-to: <000801d26bd9$dca56db0$95f04910$@ru> Date: Wed, 11 Jan 2017 11:41:37 +0000 Message-ID: <87o9zd3jta.fsf@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Subject: Re: [Qemu-devel] qemu-2.8-rc4 is broken List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Pavel Dovgalyuk Cc: 'Stefan Hajnoczi' , 'qemu-devel' , 'Paolo Bonzini' , 'Pavel Dovgalyuk' , 'Peter Maydell' Pavel Dovgalyuk writes: >> From: Stefan Hajnoczi [mailto:stefanha@gmail.com] >> On Wed, Dec 21, 2016 at 5:57 AM, Pavel Dovgalyuk wrote: >> >> -----Original Message----- >> >> From: Stefan Hajnoczi [mailto:stefanha@gmail.com] >> >> On Tue, Dec 20, 2016 at 11:10 AM, Pavel Dovgalyuk wrote: >> >> >> From: Stefan Hajnoczi [mailto:stefanha@gmail.com] >> >> >> On Tue, Dec 20, 2016 at 10:45:44AM +0300, Pavel Dovgalyuk wrote: >> >> >> > It also fails much earlier when I enable logs with "-d int -D log". >> >> >> > >> >> >> Looks like a heap corruption bug since free() is failing. >> >> > >> >> > Seems to be a race condition. >> >> > When I add logs into invalidate_page_bitmap, the bug disappears. >> >> > It seems that someone tries to free the same page bitmap twice and simultaneously. >> >> >> >> Does the following workaround prevent the crashes? >> >> >> >> -global apic-common.vapic=off >> > >> > Yes, this option helps. >> > Thank you. >> >> Good news. This can be fixed in 2.8.1 once someone finds a solution. > > It seems that something still goes wrong. > I'm using this workaround, but there is a kind of deadlock in translation. > call_rcu_thread hangs at some moment in qemu_event_wait. > > As far as I understand, it is used by QHT in translate-all.c. > I can't get more information yet, because logging makes everything too slow. There are a number of users of RCU bit for QHT I think it only gets activated when it needs to re-size its hash table on insertion of new TranslationBlocks. Can you get a backtrace of all threads when it deadlocks? -- Alex Bennée