From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([208.118.235.92]:56438) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Uf4dt-0001lw-DU for qemu-devel@nongnu.org; Wed, 22 May 2013 04:43:38 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Uf4dl-0003MO-IR for qemu-devel@nongnu.org; Wed, 22 May 2013 04:43:29 -0400 Received: from mx1.redhat.com ([209.132.183.28]:54898) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Uf4dl-0003M8-3N for qemu-devel@nongnu.org; Wed, 22 May 2013 04:43:21 -0400 Message-ID: <519C84AB.6050203@redhat.com> Date: Wed, 22 May 2013 10:41:15 +0200 From: Paolo Bonzini MIME-Version: 1.0 References: <518C8FD7.9080201@profihost.ag> <20130510074217.GB1500@stefanha-thinkpad.redhat.com> <518CB8E4.5090305@profihost.ag> <51924A40.6090209@profihost.ag> <5192543C.4010305@profihost.ag> <6BCF67A6-A1D5-4B4D-B83D-DC32F7621C43@profihost.ag> In-Reply-To: <6BCF67A6-A1D5-4B4D-B83D-DC32F7621C43@profihost.ag> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Subject: Re: [Qemu-devel] segfault in aio_bh_poll async.c:80 WAS: Re: kvm process disappears List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Stefan Priebe - Profihost AG Cc: Josh Durgin , Stefan Hajnoczi , qemu-devel , "pve-devel@pve.proxmox.com" , Michael Roth Il 22/05/2013 08:26, Stefan Priebe - Profihost AG ha scritto: >> Hi, >> >> as i can't reproduce no ;-( i just saw the kernel segfault message and >> used addr2line and a qemu dbg package to get the code line. > > I've now seen this again for two or three times. It always happens > when we do an fstrim inside the guest. > And I've seen this first since josh async rbd patch. This one? commit dc7588c1eb3008bda53dde1d6b890cd299758155 Author: Josh Durgin Date: Fri Mar 29 13:03:23 2013 -0700 rbd: add an asynchronous flush The existing bdrv_co_flush_to_disk implementation uses rbd_flush(), which is sychronous and causes the main qemu thread to block until it is complete. This results in unresponsiveness and extra latency for the guest. Fix this by using an asynchronous version of flush. This was added to librbd with a special #define to indicate its presence, since it will be backported to stable versions. Thus, there is no need to check the version of librbd. Implement this as bdrv_aio_flush, since it matches other aio functions in the rbd block driver, and leave out bdrv_co_flush_to_disk when the asynchronous version is available. Reported-by: Oliver Francke Signed-off-by: Josh Durgin Signed-off-by: Stefan Hajnoczi Do you see it even with "-drive discard=off"? Paolo