From: Jeff Cody <jcody@redhat.com>
To: Kevin Wolf <kwolf@redhat.com>
Cc: supriyak@linux.vnet.ibm.com, pbonzini@redhat.com,
eblake@redhat.com, qemu-devel@nongnu.org, stefanha@gmail.com
Subject: Re: [Qemu-devel] [PATCH 3/7] block: raw-posix image file reopen
Date: Wed, 05 Sep 2012 12:43:16 -0400 [thread overview]
Message-ID: <50478124.8030801@redhat.com> (raw)
In-Reply-To: <50477020.7050903@redhat.com>
On 09/05/2012 11:30 AM, Kevin Wolf wrote:
> Am 30.08.2012 20:47, schrieb Jeff Cody:
>> This is derived from the Supriya Kannery's reopen patches.
>>
>> This contains the raw-posix driver changes for the bdrv_reopen_*
>> functions. All changes are staged into a temporary scratch buffer
>> during the prepare() stage, and copied over to the live structure
>> during commit(). Upon abort(), all changes are abandoned, and the
>> live structures are unmodified.
>>
>> The _prepare() will create an extra fd - either by means of a dup,
>> if possible, or opening a new fd if not (for instance, access
>> control changes). Upon _commit(), the original fd is closed and
>> the new fd is used. Upon _abort(), the duplicate/new fd is closed.
>>
>> Signed-off-by: Jeff Cody <jcody@redhat.com>
>> ---
>> block/raw-posix.c | 153 +++++++++++++++++++++++++++++++++++++++++++++++++-----
>> 1 file changed, 139 insertions(+), 14 deletions(-)
>>
>> diff --git a/block/raw-posix.c b/block/raw-posix.c
>> index 6be20b1..48086d7 100644
>> --- a/block/raw-posix.c
>> +++ b/block/raw-posix.c
>> @@ -140,6 +140,15 @@ typedef struct BDRVRawState {
>> #endif
>> } BDRVRawState;
>>
>> +typedef struct BDRVRawReopenState {
>> + BDRVReopenState reopen_state;
>> + int fd;
>> + int open_flags;
>> + uint8_t *aligned_buf;
>> + unsigned aligned_buf_size;
>> + BDRVRawState *stash_s;
>> +} BDRVRawReopenState;
>> +
>> static int fd_open(BlockDriverState *bs);
>> static int64_t raw_getlength(BlockDriverState *bs);
>>
>> @@ -185,6 +194,28 @@ static int raw_normalize_devicepath(const char **filename)
>> }
>> #endif
>>
>> +static void raw_parse_flags(int bdrv_flags, int *open_flags)
>> +{
>> + assert(open_flags != NULL);
>> +
>> + *open_flags |= O_BINARY;
>> + *open_flags &= ~O_ACCMODE;
>> + if (bdrv_flags & BDRV_O_RDWR) {
>> + *open_flags |= O_RDWR;
>> + } else {
>> + *open_flags |= O_RDONLY;
>> + }
>> +
>> + /* Use O_DSYNC for write-through caching, no flags for write-back caching,
>> + * and O_DIRECT for no caching. */
>> + if ((bdrv_flags & BDRV_O_NOCACHE)) {
>> + *open_flags |= O_DIRECT;
>> + }
>> + if (!(bdrv_flags & BDRV_O_CACHE_WB)) {
>> + *open_flags |= O_DSYNC;
>> + }
>> +}
>
> The code motion would ideally be a separate patch.
>
OK
>> +
>> static int raw_open_common(BlockDriverState *bs, const char *filename,
>> int bdrv_flags, int open_flags)
>> {
>> @@ -196,20 +227,8 @@ static int raw_open_common(BlockDriverState *bs, const char *filename,
>> return ret;
>> }
>>
>> - s->open_flags = open_flags | O_BINARY;
>> - s->open_flags &= ~O_ACCMODE;
>> - if (bdrv_flags & BDRV_O_RDWR) {
>> - s->open_flags |= O_RDWR;
>> - } else {
>> - s->open_flags |= O_RDONLY;
>> - }
>> -
>> - /* Use O_DSYNC for write-through caching, no flags for write-back caching,
>> - * and O_DIRECT for no caching. */
>> - if ((bdrv_flags & BDRV_O_NOCACHE))
>> - s->open_flags |= O_DIRECT;
>> - if (!(bdrv_flags & BDRV_O_CACHE_WB))
>> - s->open_flags |= O_DSYNC;
>> + s->open_flags = open_flags;
>> + raw_parse_flags(bdrv_flags, &s->open_flags);
>>
>> s->fd = -1;
>> fd = qemu_open(filename, s->open_flags, 0644);
>> @@ -283,6 +302,109 @@ static int raw_open(BlockDriverState *bs, const char *filename, int flags)
>> return raw_open_common(bs, filename, flags, 0);
>> }
>>
>> +static int raw_reopen_prepare(BDRVReopenState *state, Error **errp)
>> +{
>> + BDRVRawState *s;
>> + BDRVRawReopenState *raw_s;
>> + int ret = 0;
>> +
>> + assert(state != NULL);
>> + assert(state->bs != NULL);
>> +
>> + s = state->bs->opaque;
>> +
>> + state->opaque = g_malloc0(sizeof(BDRVRawReopenState));
>> + raw_s = state->opaque;
>> +
>> + raw_parse_flags(state->flags, &raw_s->open_flags);
>> +
>> + /*
>> + * If we didn't have BDRV_O_NOCACHE set before, we may not have allocated
>> + * aligned_buf
>> + */
>> + if ((state->flags & BDRV_O_NOCACHE)) {
>> + /*
>> + * Allocate a buffer for read/modify/write cycles. Choose the size
>> + * pessimistically as we don't know the block size yet.
>> + */
>> + raw_s->aligned_buf_size = 32 * MAX_BLOCKSIZE;
>> + raw_s->aligned_buf = qemu_memalign(MAX_BLOCKSIZE,
>> + raw_s->aligned_buf_size);
>> +
>> + if (raw_s->aligned_buf == NULL) {
>> + ret = -1;
>> + goto error;
>> + }
>
> Even though it's pretty small, I think I would factor this out into a
> small static helper to make sure it's kept in sync with raw_open_common().
>
OK, and like your suggestion above, I'll put that in a separate code
motion patch.
>> + }
>> +
>> + int fcntl_flags = O_APPEND | O_ASYNC | O_NONBLOCK;
>> +#ifdef O_NOATIME
>> + fcntl_flags |= O_NOATIME;
>> +#endif
>> + if ((raw_s->open_flags & ~fcntl_flags) == (s->open_flags & ~fcntl_flags)) {
>> + /* dup the original fd */
>> + /* TODO: use qemu fcntl wrapper */
>> + raw_s->fd = fcntl(s->fd, F_DUPFD_CLOEXEC, 0);
>> + if (raw_s->fd == -1) {
>> + ret = -1;
>> + goto error;
>> + }
>> + ret = fcntl_setfl(raw_s->fd, raw_s->open_flags);
>> + } else {
>> + raw_s->fd = qemu_open(state->bs->filename, raw_s->open_flags, 0644);
>> + if (raw_s->fd == -1) {
>> + ret = -1;
>> + }
>
> Ignoring this part for now, with qemu_dup_flags() it's going to look a
> bit different. In particular, I'm hoping that we don't get a second
> fcntl_flags enumeration here, but can just fall back to qemu_open()
> whenever qemu_dup_flags() fails.
That will require modification to qemu_dup_flags()... I believe
qemu_dup_flags() silently filters out fcntl incompatible flags.
Maybe it would be best to create a small helper function in osdep.c, that
fetches the fcntl_flags. Then qemu_dup_flags() and this function would
use the same helper to fetch fcntl_flags. The results of that would
determine if we call qemu_dup_flags() or qemu_open().
Although, I do think it makes sense to always try qemu_open() if
qemu_dup_flags() fails for some reason.
>
> If we do need to keep fcntl_flags here, we'll probably want to add
> O_DIRECT to it.
>
>> + }
>> +error:
>> + return ret;
>> +}
>> +
>> +
>> +static void raw_reopen_commit(BDRVReopenState *state)
>> +{
>> + BDRVRawReopenState *raw_s = state->opaque;
>> + BDRVRawState *s = state->bs->opaque;
>> +
>> + if (raw_s->aligned_buf != NULL) {
>> + if (s->aligned_buf) {
>> + qemu_vfree(s->aligned_buf);
>> + }
>> + s->aligned_buf = raw_s->aligned_buf;
>> + s->aligned_buf_size = raw_s->aligned_buf_size;
>> + }
>> +
>> + s->open_flags = raw_s->open_flags;
>> +
>> + close(s->fd);
>> + s->fd = raw_s->fd;
>> +
>> + g_free(state->opaque);
>> + state->opaque = NULL;
>> +}
>
> I think s->use_aio must be changed as well, it depends on
> BDRV_O_NOCACHE. Maybe we even need to do it in prepare, laio_init() may
> need to be called.
Yes, good catch, thanks.
>
> Kevin
>
next prev parent reply other threads:[~2012-09-05 16:43 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-08-30 18:47 [Qemu-devel] [PATCH 0/7] block: bdrv_reopen() patches Jeff Cody
2012-08-30 18:47 ` [Qemu-devel] [PATCH 1/7] block: correctly set the keep_read_only flag Jeff Cody
2012-09-05 12:47 ` Kevin Wolf
2012-09-05 13:08 ` Jeff Cody
2012-09-05 13:12 ` Kevin Wolf
2012-08-30 18:47 ` [Qemu-devel] [PATCH 2/7] block: Framework for reopening files safely Jeff Cody
2012-09-05 15:09 ` Kevin Wolf
2012-09-05 16:38 ` Jeff Cody
2012-09-11 14:57 ` Jeff Cody
2012-09-11 15:14 ` Kevin Wolf
2012-09-11 15:36 ` Jeff Cody
2012-09-11 15:41 ` Kevin Wolf
2012-08-30 18:47 ` [Qemu-devel] [PATCH 3/7] block: raw-posix image file reopen Jeff Cody
2012-08-30 22:15 ` Eric Blake
2012-08-31 14:42 ` Jeff Cody
2012-08-31 14:49 ` Kevin Wolf
2012-08-31 15:10 ` Jeff Cody
2012-09-05 15:30 ` Kevin Wolf
2012-09-05 16:43 ` Jeff Cody [this message]
2012-09-06 9:23 ` Kevin Wolf
2012-09-06 15:34 ` Corey Bryant
2012-09-07 10:40 ` Kevin Wolf
2012-09-07 14:29 ` Corey Bryant
2012-08-30 18:47 ` [Qemu-devel] [PATCH 4/7] block: raw " Jeff Cody
2012-08-30 18:47 ` [Qemu-devel] [PATCH 5/7] block: qed " Jeff Cody
2012-08-30 18:47 ` [Qemu-devel] [PATCH 6/7] block: qcow2 " Jeff Cody
2012-08-30 18:47 ` [Qemu-devel] [PATCH 7/7] block: qcow " Jeff Cody
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50478124.8030801@redhat.com \
--to=jcody@redhat.com \
--cc=eblake@redhat.com \
--cc=kwolf@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@gmail.com \
--cc=supriyak@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.