All of lore.kernel.org
 help / color / mirror / Atom feed
* Shouldn't backend devices for VMX domain disks be opened with O_DIRECT?
@ 2006-02-02 21:34 Steve Dobbelstein
  2006-02-02 21:46 ` Anthony Liguori
  0 siblings, 1 reply; 10+ messages in thread
From: Steve Dobbelstein @ 2006-02-02 21:34 UTC (permalink / raw)
  To: xen-devel

While running some disk performance tests for VMX domains we noticed that
writes to the backend device for a VMX domain's disk go through the buffer
cache, that is, they are not written immediately to disk.  Shouldn't the
I/Os go straight to the backend device, i.e., the device should be opened
with O_DIRECT or some such?  From the domain's perspective it expects the
data to be physically on the device, but in reality it is not.   There are
things, such a writes to a file system journal, that the OS in the domain
will expect to be on disk.  If the whole system crashes before the buffer
cache in dom0 is written to disk, those writes may not be on the disk.
When the domain is started again it may find the file system in an
inconsistent state, due to writes to the journal that didn't make it to
disk, and may not be able to recover.

It seems to me that if a domain expects things to be physically on its
frontend device that they should be physically on the backend device as
well.  Or am I missing something from the bigger picture?

Steve D.

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2006-02-03 15:42 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-02-02 21:34 Shouldn't backend devices for VMX domain disks be opened with O_DIRECT? Steve Dobbelstein
2006-02-02 21:46 ` Anthony Liguori
2006-02-02 22:28   ` Steve Dobbelstein
2006-02-02 22:41     ` Philip R. Auld
2006-02-03  0:09       ` Anthony Liguori
2006-02-03  0:31         ` Luciano Miguel Ferreira Rocha
2006-02-03  2:40         ` Rik van Riel
2006-02-03  2:42         ` Stephen Tweedie
2006-02-03  2:50           ` Anthony Liguori
2006-02-03 15:42             ` Stephen C. Tweedie

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.