From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([208.118.235.92]:48345) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UK23e-00086e-Hy for qemu-devel@nongnu.org; Mon, 25 Mar 2013 03:43:07 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1UK23c-0004P5-Vd for qemu-devel@nongnu.org; Mon, 25 Mar 2013 03:43:06 -0400 Received: from mx1.redhat.com ([209.132.183.28]:33064) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UK23c-0004Oq-NY for qemu-devel@nongnu.org; Mon, 25 Mar 2013 03:43:04 -0400 Message-ID: <51500001.1060109@redhat.com> Date: Mon, 25 Mar 2013 08:42:57 +0100 From: Gerd Hoffmann MIME-Version: 1.0 References: <514C21C6.3070800@greensocs.com> <20130322165039.32aae1fb@doriath> <20130322173904.66d2f5ce@doriath> In-Reply-To: <20130322173904.66d2f5ce@doriath> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Subject: Re: [Qemu-devel] Abort in monitor_puts. List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Luiz Capitulino Cc: Anthony Liguori , qemu-devel , =?UTF-8?B?S09OUkFEIEZyw6lkw6lyaWM=?= On 03/22/13 22:39, Luiz Capitulino wrote: > On Fri, 22 Mar 2013 16:50:39 -0400 > Luiz Capitulino wrote: >=20 >> On Fri, 22 Mar 2013 10:17:58 +0100 >> KONRAD Fr=C3=A9d=C3=A9ric wrote: >> >>> Hi, >>> >>> Seems there is an issue with the current git (found by toddf on IRC). >>> >>> To reproduce: >>> >>> ./qemu-system-x86_64 --monitor stdio --nographic >>> >>> and put "?" it should abort. >>> >>> Here is the backtrace: >>> >>> #0 0x00007f77cd347935 in raise () from /lib64/libc.so.6 >>> #1 0x00007f77cd3490e8 in abort () from /lib64/libc.so.6 >>> #2 0x00007f77cd3406a2 in __assert_fail_base () from /lib64/libc.so.6 >>> #3 0x00007f77cd340752 in __assert_fail () from /lib64/libc.so.6 >>> #4 0x00007f77d1c1f226 in monitor_puts (mon=3D, >>> str=3D) at=20 >> >> Yes, it's easy to reproduce. Bisect says: >> >> f628926bb423fa8a7e0b114511400ea9df38b76a is the first bad commit >> commit f628926bb423fa8a7e0b114511400ea9df38b76a >> Author: Gerd Hoffmann >> Date: Tue Mar 19 10:57:56 2013 +0100 >> >> fix monitor >> =20 >> chardev flow control broke monitor, fix it by adding watch support. >> =20 >> Signed-off-by: Anthony Liguori >> >> My impression is that monitor_puts() in being called in parallel. >=20 > Not all. >=20 > What's happening is that qemu_chr_fe_write() is returning < 0, > mon->outbuf_index is not reset and is full, this causes the assert in > monitor_puts() to trig. >=20 > The previous version of monitor_flush() ignores errors, and everything > works, so doing the same thing here fixes the problem :) No, ignoring errors breaks qmp because the output isn't valid json any more when you cut off something ... > For some reason I'm unable to see what the error code is. Gerd, do you = think > the patch below is reasonable? If it's not, how should we handle errors= here? No, it's not. Ignoring the error for errno =3D EAGAIN breaks flow control. Ignoring the error for errno !=3D EAGAIN (and maybe logging a debug message) would be ok, but I suspect it's actually EAGAIN you get here. Just go for a larger buffer? cheers, Gerd