qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Anthony Liguori <anthony@codemonkey.ws>
To: Avi Kivity <avi@qumranet.com>
Cc: Blue Swirl <blauwirbel@gmail.com>,
	Laurent Vivier <Laurent.Vivier@bull.net>,
	qemu-devel@nongnu.org, Paul Brook <paul@codesourcery.com>
Subject: Re: [Qemu-devel] Re: [PATCH][v2] Align file accesses with cache=off (O_DIRECT)
Date: Wed, 21 May 2008 09:26:14 -0500	[thread overview]
Message-ID: <48343106.4070801@codemonkey.ws> (raw)
In-Reply-To: <48342F05.2090603@qumranet.com>

Avi Kivity wrote:
> Anthony Liguori wrote:
>>
>> "cached" is not a terribly accurate term.  O_DIRECT avoids the host 
>> page cache but it doesn't guarantee that the disk is using 
>> write-through.  For that, you need to use hdparm.
>>
>> O_SYNC basically turns the host page cache into a write-through 
>> cache.  In terms of data integrity, the only question that matters is 
>> whether you're misleading the guest into thinking data is on the disk 
>> when it isn't.  Both O_DIRECT and O_SYNC accomplish this.
>>
>> If you just are concerned with data integrity, O_SYNC is probably 
>> better because you get the benefits of host caching.  O_DIRECT is 
>> really for circumstances where you know that using the host page 
>> cache is going to reduce performance.
>
> In one specific circumstance O_SYNC has data integrity problems: 
> shared disks with guests running on different hosts (or even a guest 
> on one host, sharing a disk with another host).  In these cases, two 
> reads can return different values without an intervening write.

Are you assuming the underlying disk sharing protocol does not keep the 
page cache coherent?

> In the general case, O_DIRECT gives better performance.  It avoids 
> copying from the host pagecache to guest memory, and if you have spare 
> memory to benefit from caching, give it to the guest; the nearer to 
> the data consumer the cache is, the faster it performs.

This assumes the only thing you're running on the machine is VMs.  If 
you're just running one VM on your desktop, it is probably preferable to 
go through the host page cache.  In particular, the host page cache can 
be automatically aged and adjusted whereas, in general, you cannot 
reduce the guests page cache size.

> In one specific case O_SYNC (or regular read/write cached operation) 
> is better, rebooting the same guest over and over, as the guest cache 
> is flushed on reboot.  Not a very interesting case, but unfortunately 
> one that is very visible.
>
> (if you have a backing file for a COW disk, then opening the backing 
> file without O_DIRECT may be a good idea too, as the file can be 
> shared among many guests).

FWIW, we really only need to use O_SYNC when the guest has disabled 
write-back.  I think we should do that unconditionally too as it's an 
issue of correctness.

Regards,

Anthony Liguori

  reply	other threads:[~2008-05-21 14:26 UTC|newest]

Thread overview: 48+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-05-20 11:32 [Qemu-devel] [PATCH][v2] Align file accesses with cache=off (O_DIRECT) Laurent Vivier
2008-05-20 19:47 ` [Qemu-devel] " Anthony Liguori
2008-05-20 22:36   ` Jamie Lokier
2008-05-20 22:52     ` Paul Brook
2008-05-20 22:59       ` Laurent Vivier
2008-05-21  0:54         ` Paul Brook
2008-05-21  7:59           ` Laurent Vivier
2008-05-21  0:58       ` Anthony Liguori
2008-05-21  1:04         ` Jamie Lokier
2008-05-21  1:05         ` Anthony Liguori
2008-05-21  8:06           ` Kevin Wolf
2008-05-21  1:05         ` Paul Brook
2008-05-21  1:14           ` Anthony Liguori
2008-05-21  8:24             ` Kevin Wolf
2008-05-21 12:26               ` Jamie Lokier
2008-05-21 12:37                 ` Avi Kivity
2008-05-21 13:41                   ` Jamie Lokier
2008-05-21 13:55                     ` Anthony Liguori
2008-05-21 14:17                       ` Avi Kivity
2008-05-21 14:26                         ` Anthony Liguori [this message]
2008-05-21 14:57                           ` Avi Kivity
2008-05-21 15:34                             ` Jamie Lokier
2008-05-21 16:02                               ` Anthony Liguori
2008-05-21 16:24                                 ` Jamie Lokier
2008-05-21 16:48                                   ` Avi Kivity
2008-05-21 17:01                                     ` Andrea Arcangeli
2008-05-21 17:18                                       ` Avi Kivity
2008-05-21 17:47                                         ` Andrea Arcangeli
2008-05-21 17:53                                           ` Anthony Liguori
2008-05-21 18:08                                             ` Andrea Arcangeli
2008-05-21 18:25                                               ` Anthony Liguori
2008-05-21 20:13                                                 ` Andrea Arcangeli
2008-05-21 20:35                                                   ` Anthony Liguori
2008-05-21 20:42                                                     ` Andrea Arcangeli
2008-05-21 18:29                                           ` Avi Kivity
2008-05-21 16:45                                 ` Avi Kivity
2008-05-21 16:44                               ` Avi Kivity
2008-05-20 23:04     ` Laurent Vivier
2008-05-20 23:13       ` Jamie Lokier
2008-05-21  1:00     ` Anthony Liguori
2008-05-21  1:19       ` Jamie Lokier
2008-05-21  2:12         ` Anthony Liguori
2008-05-21  8:27           ` Andreas Färber
2008-05-21 14:06             ` Anthony Liguori
2008-05-21 15:31               ` Jamie Lokier
2008-05-21 11:43           ` Jamie Lokier
2008-05-23  9:12   ` Laurent Vivier
2008-05-28  7:01     ` Kevin Wolf

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=48343106.4070801@codemonkey.ws \
    --to=anthony@codemonkey.ws \
    --cc=Laurent.Vivier@bull.net \
    --cc=avi@qumranet.com \
    --cc=blauwirbel@gmail.com \
    --cc=paul@codesourcery.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).