From: Anthony Liguori <aliguori@us.ibm.com>
To: Stephen Tweedie <sct@redhat.com>
Cc: Steve Dobbelstein <steved@us.ibm.com>,
"Philip R. Auld" <pauld@egenera.com>,
"xen-devel@lists.xensource.com" <xen-devel@lists.xensource.com>
Subject: Re: Shouldn't backend devices for VMX domain disks be opened with O_DIRECT?
Date: Thu, 02 Feb 2006 20:50:28 -0600 [thread overview]
Message-ID: <43E2C4F4.1060509@us.ibm.com> (raw)
In-Reply-To: <1138934528.4374.13.camel@localhost.localdomain>
Stephen Tweedie wrote:
>Hi,
>
>On Thu, 2006-02-02 at 18:09 -0600, Anthony Liguori wrote:
>
>
>
>>Referring to the original question, which has been quoted away,
>>journaling doesn't require that data be written to disk per-say but that
>>writes occur in a particular order. A journal is always recoverable
>>given that writes occur in the expected order.
>>
>>
>
>Sure... it's *internally* consistent, maybe. But you need more than
>that. You need guarantees that things are on disk, else external
>consistency guarantees will be broken.
>
>
Ok, this is certainly correct (but not the original point).
>Consider things like sendmail fsync()ing a spool file before telling the
>sender that the email has been accepted. After that acknowledgement,
>the sender can delete the mail from its queues knowing that the
>recipient MTA definitely has the data, and even if it crashes, the mail
>won't be lost. Databases frequently have similar consistency
>requirements. If a power failure loses writes that you have told the
>domU have completed --- even if you maintain write ordering --- then you
>*are* putting application correctness at risk, there's no doubt about
>it.
>
>
Ok, this is a good argument for using O_SYNC.
>Fortunately, that's just what blkback is doing --- it's using submit_bio
>to submit the write IOs without waiting for completion, and is using the
>bio's bi_end_io callback to process the IO completion once it is hard on
>disk.
>
>
Yup, the question here is with the device model which doesn't use the
block frontend/backend. Would O_DIRECT be helpful over O_SYNC?
Regards,
Anthony Liguori
>--Stephen
>
>
>
>
>
next prev parent reply other threads:[~2006-02-03 2:50 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-02-02 21:34 Shouldn't backend devices for VMX domain disks be opened with O_DIRECT? Steve Dobbelstein
2006-02-02 21:46 ` Anthony Liguori
2006-02-02 22:28 ` Steve Dobbelstein
2006-02-02 22:41 ` Philip R. Auld
2006-02-03 0:09 ` Anthony Liguori
2006-02-03 0:31 ` Luciano Miguel Ferreira Rocha
2006-02-03 2:40 ` Rik van Riel
2006-02-03 2:42 ` Stephen Tweedie
2006-02-03 2:50 ` Anthony Liguori [this message]
2006-02-03 15:42 ` Stephen C. Tweedie
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=43E2C4F4.1060509@us.ibm.com \
--to=aliguori@us.ibm.com \
--cc=pauld@egenera.com \
--cc=sct@redhat.com \
--cc=steved@us.ibm.com \
--cc=xen-devel@lists.xensource.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.