From: Avi Kivity <avi@qumranet.com>
To: Anthony Liguori <anthony@codemonkey.ws>
Cc: Blue Swirl <blauwirbel@gmail.com>,
Laurent Vivier <Laurent.Vivier@bull.net>,
qemu-devel@nongnu.org, Paul Brook <paul@codesourcery.com>
Subject: Re: [Qemu-devel] Re: [PATCH][v2] Align file accesses with cache=off (O_DIRECT)
Date: Wed, 21 May 2008 17:57:08 +0300 [thread overview]
Message-ID: <48343844.1050107@qumranet.com> (raw)
In-Reply-To: <48343106.4070801@codemonkey.ws>
Anthony Liguori wrote:
> Avi Kivity wrote:
>> Anthony Liguori wrote:
>>>
>>> "cached" is not a terribly accurate term. O_DIRECT avoids the host
>>> page cache but it doesn't guarantee that the disk is using
>>> write-through. For that, you need to use hdparm.
>>>
>>> O_SYNC basically turns the host page cache into a write-through
>>> cache. In terms of data integrity, the only question that matters
>>> is whether you're misleading the guest into thinking data is on the
>>> disk when it isn't. Both O_DIRECT and O_SYNC accomplish this.
>>>
>>> If you just are concerned with data integrity, O_SYNC is probably
>>> better because you get the benefits of host caching. O_DIRECT is
>>> really for circumstances where you know that using the host page
>>> cache is going to reduce performance.
>>
>> In one specific circumstance O_SYNC has data integrity problems:
>> shared disks with guests running on different hosts (or even a guest
>> on one host, sharing a disk with another host). In these cases, two
>> reads can return different values without an intervening write.
>
> Are you assuming the underlying disk sharing protocol does not keep
> the page cache coherent?
>
I was assuming access to a raw partition. But yes, with a cluster file
system this objection goes away.
At a significant cost, though. You're now running two cache coherency
protocols on top of each other.
>> In the general case, O_DIRECT gives better performance. It avoids
>> copying from the host pagecache to guest memory, and if you have
>> spare memory to benefit from caching, give it to the guest; the
>> nearer to the data consumer the cache is, the faster it performs.
>
> This assumes the only thing you're running on the machine is VMs. If
> you're just running one VM on your desktop, it is probably preferable
> to go through the host page cache. In particular, the host page cache
> can be automatically aged and adjusted whereas, in general, you cannot
> reduce the guests page cache size.
Agreed. For casual uses, O_DIRECT is overkill. It does get rid of the
data copies, though.
>
>> In one specific case O_SYNC (or regular read/write cached operation)
>> is better, rebooting the same guest over and over, as the guest cache
>> is flushed on reboot. Not a very interesting case, but unfortunately
>> one that is very visible.
>>
>> (if you have a backing file for a COW disk, then opening the backing
>> file without O_DIRECT may be a good idea too, as the file can be
>> shared among many guests).
>
> FWIW, we really only need to use O_SYNC when the guest has disabled
> write-back. I think we should do that unconditionally too as it's an
> issue of correctness.
If the guest is used for non critical applications (like testing distro
installers), then it's just a slowdown.
Even if the guest did not disable disk writeback, pagecache and disk
write caches have vastly different characteristics, so I think we should
set O_SYNC there as well.
Here's a summary of the use cases I saw so far:
- casual use, no critical data: write back cache
- backing file shared among many guests: read-only, cached
- desktop system, but don't lose my data: O_SYNC
(significant resources on the host)
- dedicated virtualization engine: O_DIRECT
(most host resources assigned to guests)
--
error compiling committee.c: too many arguments to function
next prev parent reply other threads:[~2008-05-21 14:57 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-05-20 11:32 [Qemu-devel] [PATCH][v2] Align file accesses with cache=off (O_DIRECT) Laurent Vivier
2008-05-20 19:47 ` [Qemu-devel] " Anthony Liguori
2008-05-20 22:36 ` Jamie Lokier
2008-05-20 22:52 ` Paul Brook
2008-05-20 22:59 ` Laurent Vivier
2008-05-21 0:54 ` Paul Brook
2008-05-21 7:59 ` Laurent Vivier
2008-05-21 0:58 ` Anthony Liguori
2008-05-21 1:04 ` Jamie Lokier
2008-05-21 1:05 ` Anthony Liguori
2008-05-21 8:06 ` Kevin Wolf
2008-05-21 1:05 ` Paul Brook
2008-05-21 1:14 ` Anthony Liguori
2008-05-21 8:24 ` Kevin Wolf
2008-05-21 12:26 ` Jamie Lokier
2008-05-21 12:37 ` Avi Kivity
2008-05-21 13:41 ` Jamie Lokier
2008-05-21 13:55 ` Anthony Liguori
2008-05-21 14:17 ` Avi Kivity
2008-05-21 14:26 ` Anthony Liguori
2008-05-21 14:57 ` Avi Kivity [this message]
2008-05-21 15:34 ` Jamie Lokier
2008-05-21 16:02 ` Anthony Liguori
2008-05-21 16:24 ` Jamie Lokier
2008-05-21 16:48 ` Avi Kivity
2008-05-21 17:01 ` Andrea Arcangeli
2008-05-21 17:18 ` Avi Kivity
2008-05-21 17:47 ` Andrea Arcangeli
2008-05-21 17:53 ` Anthony Liguori
2008-05-21 18:08 ` Andrea Arcangeli
2008-05-21 18:25 ` Anthony Liguori
2008-05-21 20:13 ` Andrea Arcangeli
2008-05-21 20:35 ` Anthony Liguori
2008-05-21 20:42 ` Andrea Arcangeli
2008-05-21 18:29 ` Avi Kivity
2008-05-21 16:45 ` Avi Kivity
2008-05-21 16:44 ` Avi Kivity
2008-05-20 23:04 ` Laurent Vivier
2008-05-20 23:13 ` Jamie Lokier
2008-05-21 1:00 ` Anthony Liguori
2008-05-21 1:19 ` Jamie Lokier
2008-05-21 2:12 ` Anthony Liguori
2008-05-21 8:27 ` Andreas Färber
2008-05-21 14:06 ` Anthony Liguori
2008-05-21 15:31 ` Jamie Lokier
2008-05-21 11:43 ` Jamie Lokier
2008-05-23 9:12 ` Laurent Vivier
2008-05-28 7:01 ` Kevin Wolf
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=48343844.1050107@qumranet.com \
--to=avi@qumranet.com \
--cc=Laurent.Vivier@bull.net \
--cc=anthony@codemonkey.ws \
--cc=blauwirbel@gmail.com \
--cc=paul@codesourcery.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.