public inbox for kvm@vger.kernel.org
 help / color / mirror / Atom feed
From: Aurelien Jarno <aurelien-rXXEIb44qovR7s880joybQ@public.gmane.org>
To: Avi Kivity <avi-atKUWr5tajBWk0Htik3J/w@public.gmane.org>
Cc: kvm-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
Subject: Re: Data corruption in guest using KVM
Date: Sun, 22 Jul 2007 15:38:18 +0200	[thread overview]
Message-ID: <20070722133818.GG16993@hall.aurel32.net> (raw)
In-Reply-To: <46A30CA3.3090100-atKUWr5tajBWk0Htik3J/w@public.gmane.org>

[-- Attachment #1: Type: text/plain, Size: 2936 bytes --]

On Sun, Jul 22, 2007 at 10:52:03AM +0300, Avi Kivity wrote:
> Aurelien Jarno wrote:
> > Hi all,
> >
> > For a long time I am seeing data corruption in guests when using KVM,
> > but I am convinced only since today that the problem comes from KVM.
> >
> > The symptoms are a few bytes that are mangled to 0x00 in a file that has
> > been written. For now I have only seen 2 or 4 consecutive bytes mangled,
> > but that may due to statistics given the limited samples.
> >
> > The problem appears very rarely. I am only seeing it when doing huge 
> > compilations (for example gcc or glibc), and not for every build. Note
> > that I am only detecting build failures, so I can miss some corruptions.
> >
> > Note that I have observed the problem on GNU/Linux, GNU/kFreeBSD and
> > plain FreeBSD, for both 32 and 64-bit guests. I always used 64-bit 
> > hosts, and I have seen the problem on both Core 2 and Athlon 64 CPU
> > (always multi-core).
> >
> > I have never seen such corruptions using QEMU, so I would say the
> > problem does not comes from the disk emulation, though it may be due to
> > statistics. Note that I have made a lot of compilation in a MIPS QEMU
> > guest (a few hundred of hours), without any problem. This platform uses
> > the same IDE controller as the one in KVM.
> >
> > Does anybody have seen the same kind of problem? Without a way to 
> > reproduce the corruption, I think it will be very difficult to debug 
> > the problem.
> 
> Did you observe anything about the corruption?  For example, are the 
> offsets at page boundary?  Can you provide a corrupted file and the 
> same, non-corrupted file as a reference?

For now I am still trying to find an easy way to reproduce it. You will
find below a sample of a bad and a good file. I have gzipped them to
make sure they will not be mangled once more by a MUA or a MTA.

What is strange with this sample is that the size of the file is not the
same. I will try to get more corrupted file.

I have been able to reproduce the bug with one or multiple guests
running, so it is not dependent on the number of guests running.


> For the 32-bit case, were the guests pae, nonpae, or both?

I am using nonpae guests (I only give 1GB of memory to the guests).


> How would I go about reproducing this?   Is a single ./configure; make 
> clean; make in a loop compiling gcc sufficient?

Yes basically that's what I am doing but on the glibc sources as I get 
more "success" to reproduce the bug. Note that you should run the 
configure in a different directory from the sources.

I generally observed the bug every 10 to 15 builds. One build takes
about 45 minutes here.

-- 
  .''`.  Aurelien Jarno	            | GPG: 1024D/F1BCDB73
 : :' :  Debian developer           | Electrical Engineer
 `. `'   aurel32-8fiUuRrzOP0dnm+yROfE0A@public.gmane.org         | aurelien-rXXEIb44qovR7s880joybQ@public.gmane.org
   `-    people.debian.org/~aurel32 | www.aurel32.net

[-- Attachment #2: sem_close.o.d.bad.gz --]
[-- Type: application/octet-stream, Size: 1494 bytes --]

[-- Attachment #3: sem_close.o.d.good.gz --]
[-- Type: application/octet-stream, Size: 1471 bytes --]

[-- Attachment #4: Type: text/plain, Size: 315 bytes --]

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/

[-- Attachment #5: Type: text/plain, Size: 186 bytes --]

_______________________________________________
kvm-devel mailing list
kvm-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
https://lists.sourceforge.net/lists/listinfo/kvm-devel

  parent reply	other threads:[~2007-07-22 13:38 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-07-21 17:22 Data corruption in guest using KVM Aurelien Jarno
     [not found] ` <20070721172248.GA1555-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-21 17:46   ` Anthony Liguori
     [not found]     ` <46A24675.1010506-rdkfGonbjUSkNkDKm+mE6A@public.gmane.org>
2007-07-21 17:54       ` Aurelien Jarno
     [not found]         ` <20070721175404.GA3665-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-21 18:03           ` Anthony Liguori
     [not found]             ` <46A24A7E.6040104-rdkfGonbjUSkNkDKm+mE6A@public.gmane.org>
2007-07-21 18:39               ` Aurelien Jarno
     [not found]                 ` <20070721183924.GA5108-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-21 21:00                   ` Missing my posts to this lists Simon Gao
     [not found]                     ` <46A273F4.2040001-g4dUTk+gKbW4mfPA/iJWtA@public.gmane.org>
2007-07-22  7:53                       ` Avi Kivity
2007-07-22  7:52   ` Data corruption in guest using KVM Avi Kivity
     [not found]     ` <46A30CA3.3090100-atKUWr5tajBWk0Htik3J/w@public.gmane.org>
2007-07-22 13:38       ` Aurelien Jarno [this message]
     [not found]         ` <20070722133818.GG16993-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-22 13:46           ` Avi Kivity
     [not found]             ` <46A35FAB.701-atKUWr5tajBWk0Htik3J/w@public.gmane.org>
2007-07-22 16:44               ` Aurelien Jarno
     [not found]                 ` <20070722164454.GA26166-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-22 17:34                   ` Avi Kivity
     [not found]                     ` <46A3952D.2020009-atKUWr5tajBWk0Htik3J/w@public.gmane.org>
2007-07-22 18:14                       ` Aurelien Jarno
2007-07-22 23:34                       ` Aurelien Jarno
     [not found]                         ` <20070722233429.GA10146-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-23  6:08                           ` Aurelien Jarno
2007-07-23  8:04                           ` Avi Kivity
2007-07-22 15:14           ` Avi Kivity
     [not found]             ` <46A37448.1010008-atKUWr5tajBWk0Htik3J/w@public.gmane.org>
2007-07-22 15:19               ` Aurelien Jarno
     [not found]                 ` <20070722151913.GA22621-OqXK5JiLQY5aJl8KAwiEcA@public.gmane.org>
2007-07-22 15:24                   ` Avi Kivity

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070722133818.GG16993@hall.aurel32.net \
    --to=aurelien-rxxeib44qovr7s880joybq@public.gmane.org \
    --cc=avi-atKUWr5tajBWk0Htik3J/w@public.gmane.org \
    --cc=kvm-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox