From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([208.118.235.92]:48978) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TNfPa-0006XQ-Pm for qemu-devel@nongnu.org; Mon, 15 Oct 2012 03:48:35 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1TNfPU-0006sp-Ur for qemu-devel@nongnu.org; Mon, 15 Oct 2012 03:48:30 -0400 Received: from mail-bk0-f45.google.com ([209.85.214.45]:55273) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TNfPU-0006sc-Nr for qemu-devel@nongnu.org; Mon, 15 Oct 2012 03:48:24 -0400 Received: by mail-bk0-f45.google.com with SMTP id jf3so1916661bkc.4 for ; Mon, 15 Oct 2012 00:48:23 -0700 (PDT) Date: Mon, 15 Oct 2012 09:48:19 +0200 From: Stefan Hajnoczi Message-ID: <20121015074819.GB24883@stefanha-thinkpad.redhat.com> References: <1349962403.4696.51.camel@storm> <20121012083315.GB14822@stefanha-thinkpad.redhat.com> <1350032009.4696.84.camel@storm> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <1350032009.4696.84.camel@storm> Subject: Re: [Qemu-devel] Silent filesystem/qcow2 corruptions with qemu-kvm-1.0 and 1.1.1 List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Tiziano =?iso-8859-1?Q?M=FCller?= Cc: qemu-devel On Fri, Oct 12, 2012 at 10:53:29AM +0200, Tiziano Müller wrote: > Am Freitag, den 12.10.2012, 10:33 +0200 schrieb Stefan Hajnoczi: > > On Thu, Oct 11, 2012 at 03:33:23PM +0200, Tiziano Müller wrote: > > > Checking the image using `qemu-img check` then gives something like > > > this: > > > > > > ERROR OFLAG_COPIED: offset=3bc30000 refcount=1 > > > ERROR offset=c7e331: Cluster is not properly aligned; L2 entry > > > corrupted. > > > > Is any other program accessing the qcow2 image on the host while the VM > > is running? > > > For example, are you running qemu-img on the image while the VM is > > running? > > On some VMs we tried to extract filesystem snapshots at runtime: > > qemu-img convert -s snapshot-id original.qcow2 snapshot.qcow2 > > (yes, that's not consistent, we're switching to external snapshots). > But that should open the image read-only, right? > > Other operations where the qemu-monitor-commands "savevm" and "delvm". > > Although: we created a new qcow2 and even in that the filesystem got > corrupted without any of the above actions. So we're pretty confident > that those operations are not the sole cause. Okay, that's consistent with the other symptoms you've reported. It's not clear whether the corruption arises inside qcow2 or if something else is causing corruption and qcow2/xfs get upset. That is the next step to debugging this - hopefully it will become possible to reproduce it reliably, at which point it is much easier to debug :). Stefan