xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed
From: Joanna Rutkowska <joanna@invisiblethingslab.com>
To: xen-devel@lists.xensource.com
Subject: Re: Xen 4.0.0x allows for data corruption in Dom0
Date: Tue, 09 Mar 2010 00:48:20 +0100	[thread overview]
Message-ID: <4B958CC4.1040105@invisiblethingslab.com> (raw)
In-Reply-To: <4B958B14.5030805@goop.org>


[-- Attachment #1.1: Type: text/plain, Size: 2115 bytes --]

On 03/09/2010 12:41 AM, Jeremy Fitzhardinge wrote:
> On 03/08/2010 03:23 PM, Joanna Rutkowska wrote:
>> But the corruptions always happen in 32-bytes chunks, which might
>> suggest it's not a page-related problem (e.g. wrongly re-used page), as
>> in that case we would be observing (at least sometimes) much bigger
>> chunks of corrupted data, I think.
>>    
> 
> Given that the domU doesn't have any devices or much going on, it could
> easily be corrupting memory in only small amounts.
> 
But see, before I tried this with such a small dummy do-nothing DomU
(which I did for the purpose of reporting to xen-devel), I experienced
very similar corruption when running regular VMs, i.e. with normal linux
and all the usual apps inside them. Same pattern of corruption.

>> The reason why I still believe it's a hypervisor related thing, it that
>> I'm currently using the very *same* Dom0 kernel (very recent
>> xen/stable-2.6.31) with Xen 3.4.2 and the system is damn stable. And I
>> really mean extensive use with 5-7 VMs running all the time doing
>> various things from Web browsing to kernel building.
>>    
> 
> OK, it's always good to get some positive feedback.
> 

At least one full-time user of the pvops kernel ;)

>> If I was to make an educated guess I would say it's something related to
>> some interrupt handling, i.e. Xen mishandling it, e.g. the handler is
>> writing out-of-buffer somewhere and it just happens to land in the Dom0
>> fs buffer used by e.g. dd operation.
>>    
> 
> 
> It would be interesting to see what happens if you write the file with
> the test domain paused (xm pause ...).  If the corruption continues,
> then it is almost certainly Xen.

Right.

> If it stops, then it either means the
> corruption was caused by pages inappropriately shared between dom0 and
> domU, or something like vcpu context switch is corrupting memory (which
> would be very sad).
> 

Unfortunately, I cannot do any more tests. We have downgraded all our
test machines to Xen 3.4.2 and are using them for other things now. Sorry.

joanna.


[-- Attachment #1.2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 226 bytes --]

[-- Attachment #2: Type: text/plain, Size: 138 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

  reply	other threads:[~2010-03-08 23:48 UTC|newest]

Thread overview: 33+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <4B922A89.2060105@invisiblethingslab.com>
2010-03-08 22:24 ` Xen 4.0.0x allows for data corruption in Dom0 Jeremy Fitzhardinge
2010-03-08 22:34   ` Joanna Rutkowska
2010-03-08 23:12     ` Jeremy Fitzhardinge
2010-03-08 23:23       ` Joanna Rutkowska
2010-03-08 23:41         ` Jeremy Fitzhardinge
2010-03-08 23:48           ` Joanna Rutkowska [this message]
2010-03-09  0:18         ` James Harper
2010-03-09  0:20           ` Joanna Rutkowska
2010-03-08 23:32 ` Daniel Stodden
     [not found]   ` <4B958A42.4000407@invisiblethingslab.com>
2010-03-08 23:46     ` Daniel Stodden
     [not found] <C7B80F95.C5F3%keir.fraser@eu.citrix.com>
2010-03-06 13:37 ` Joanna Rutkowska
2010-03-06 17:18   ` Keir Fraser
     [not found] <C7B7F4C4.C5D8%keir.fraser@eu.citrix.com>
2010-03-06 13:36 ` Keir Fraser
2010-03-07 14:36   ` Pasi Kärkkäinen
2010-03-07 14:39     ` Keir Fraser
2010-03-07 16:12       ` Pasi Kärkkäinen
2010-03-08 23:22         ` Daniel Stodden
2010-03-08 23:30           ` Joanna Rutkowska
2010-03-08 23:52             ` Daniel Stodden
2010-03-08 23:56               ` Joanna Rutkowska
2010-03-09  0:33                 ` Daniel Stodden
2010-03-09  8:25           ` Pasi Kärkkäinen
2010-03-09  9:37             ` Jan Beulich
2010-03-09 10:15               ` Jan Beulich
2010-03-09 10:17                 ` Keir Fraser
2010-03-09 10:15               ` Keir Fraser
2010-03-09 10:25                 ` Pasi Kärkkäinen
2010-03-09 10:43                   ` Keir Fraser
2010-03-09 12:03                     ` Pasi Kärkkäinen
2010-03-09 10:42                 ` Jan Beulich
2010-03-09 23:28                 ` Jeremy Fitzhardinge
2010-03-10  1:33                   ` Dan Magenheimer
2010-03-10 18:02                     ` Jeremy Fitzhardinge

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B958CC4.1040105@invisiblethingslab.com \
    --to=joanna@invisiblethingslab.com \
    --cc=xen-devel@lists.xensource.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).