From: Max Reitz <mreitz@redhat.com>
To: "Denis V. Lunev" <den@openvz.org>
Cc: Kevin Wolf <kwolf@redhat.com>, Peter Lieven <pl@kamp.de>,
Fam Zheng <famz@redhat.com>,
qemu-devel@nongnu.org, Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: [Qemu-devel] [PATCH 4/7] block: use fallocate(FALLOC_FL_ZERO_RANGE) in handle_aiocb_write_zeroes
Date: Tue, 27 Jan 2015 12:30:01 -0500
Message-ID: <54C7CB19.4010904@redhat.com>
In-Reply-To: <1422366699-17473-5-git-send-email-den@openvz.org>
On 2015-01-27 at 08:51, Denis V. Lunev wrote:
> This efficiently writes zeroes on Linux if the kernel is capable enough.
> FALLOC_FL_ZERO_RANGE correctly handles all cases, including and not
> including file expansion.
>
> Signed-off-by: Denis V. Lunev <den@openvz.org>
> CC: Kevin Wolf <kwolf@redhat.com>
> CC: Stefan Hajnoczi <stefanha@redhat.com>
> CC: Peter Lieven <pl@kamp.de>
> CC: Fam Zheng <famz@redhat.com>
> ---
> block/raw-posix.c | 16 ++++++++++++++--
> configure | 19 +++++++++++++++++++
> 2 files changed, 33 insertions(+), 2 deletions(-)
Okay, now the "ret" in handle_aiocb_write_zeroes() is necessary, so
please disregard my statement about removing it in patch 3.
> diff --git a/block/raw-posix.c b/block/raw-posix.c
> index 24e1fab..3c35b2f 100644
> --- a/block/raw-posix.c
> +++ b/block/raw-posix.c
> @@ -60,7 +60,7 @@
> #define FS_NOCOW_FL 0x00800000 /* Do not cow file */
> #endif
> #endif
> -#ifdef CONFIG_FALLOCATE_PUNCH_HOLE
> +#if defined(CONFIG_FALLOCATE_PUNCH_HOLE) || defined(CONFIG_FALLOCATE_ZERO_RANGE)
> #include <linux/falloc.h>
> #endif
> #if defined (__FreeBSD__) || defined(__FreeBSD_kernel__)
> @@ -902,7 +902,7 @@ static int translate_err(int err)
> return err;
> }
>
> -#if defined(CONFIG_FALLOCATE_PUNCH_HOLE)
> +#if defined(CONFIG_FALLOCATE_PUNCH_HOLE) || defined(CONFIG_FALLOCATE_ZERO_RANGE)
> static int do_fallocate(int fd, int mode, off_t offset, off_t len)
> {
> do {
> @@ -955,6 +955,18 @@ static ssize_t handle_aiocb_write_zeroes(RawPosixAIOData *aiocb)
> }
> #endif
>
> +#ifdef CONFIG_FALLOCATE_ZERO_RANGE
> + if (s->has_write_zeroes) {
> + ret = do_fallocate(s->fd, FALLOC_FL_ZERO_RANGE,
> + aiocb->aio_offset, aiocb->aio_nbytes);
> + if (ret == 0 || ret != -ENOTSUP) {
> + return ret;
> + }
> + s->has_write_zeroes = false;
> + return ret;
> + }
First, you probably want to simply fall through here. Right now, the
first call fails immediately with -ENOTSUP, while the second call (with
has_write_zeroes already cleared) falls through. After this patch, it
doesn't make a difference, but after the next one, it might.
Second, while using s->has_write_zeroes here seems correct to me, I
personally don't like sharing it with handle_aiocb_write_zeroes_block().
If you do introduce a new flag like "has_zero_range", please don't
make it a bit field (I will give you an R-b regardless of whether you
make it a bit field or not; I just won't like it).
Feel free to keep has_write_zeroes, though: while it doesn't look good
to me, it certainly is correct from a technical perspective.
Max
2015-01-27 13:51 [Qemu-devel] [PATCH v4 0/7] eliminate data write in bdrv_write_zeroes on Linux in raw-posix.c Denis V. Lunev
2015-01-27 13:51 ` [Qemu-devel] [PATCH 1/7] block/raw-posix: create translate_err helper to merge errno values Denis V. Lunev
2015-01-27 16:50 ` Max Reitz
2015-01-27 13:51 ` [Qemu-devel] [PATCH 2/7] block/raw-posix: create do_fallocate helper Denis V. Lunev
2015-01-27 16:57 ` Max Reitz
2015-01-27 13:51 ` [Qemu-devel] [PATCH 3/7] block/raw-posix: refactor handle_aiocb_write_zeroes a bit Denis V. Lunev
2015-01-27 17:13 ` Max Reitz
2015-01-27 13:51 ` [Qemu-devel] [PATCH 4/7] block: use fallocate(FALLOC_FL_ZERO_RANGE) in handle_aiocb_write_zeroes Denis V. Lunev
2015-01-27 17:30 ` Max Reitz [this message]
2015-01-27 13:51 ` [Qemu-devel] [PATCH 5/7] block: use fallocate(FALLOC_FL_PUNCH_HOLE) & fallocate(0) to write zeroes Denis V. Lunev
2015-01-27 17:48 ` Max Reitz
2015-01-27 13:51 ` [Qemu-devel] [PATCH 6/7] block/raw-posix: call plain fallocate in handle_aiocb_write_zeroes Denis V. Lunev
2015-01-27 17:57 ` Max Reitz
2015-01-27 18:19 ` Denis V. Lunev
2015-01-27 18:24 ` Max Reitz
2015-01-27 18:33 ` Denis V. Lunev
2015-01-27 13:51 ` [Qemu-devel] [PATCH 7/7] block/raw-posix: set max_write_zeroes to INT_MAX for regular files Denis V. Lunev
2015-01-27 18:05 ` Max Reitz
2015-01-27 18:11 ` Denis V. Lunev
2015-01-28 6:39 ` Denis V. Lunev
-- strict thread matches above, loose matches on Subject: below --
2015-01-28 18:38 [Qemu-devel] [PATCH v5 0/7] eliminate data write in bdrv_write_zeroes on Linux in raw-posix.c Denis V. Lunev
2015-01-28 18:38 ` [Qemu-devel] [PATCH 4/7] block: use fallocate(FALLOC_FL_ZERO_RANGE) in handle_aiocb_write_zeroes Denis V. Lunev
2015-01-29 22:40 ` Max Reitz
2015-01-30 8:42 [Qemu-devel] [PATCH v6 0/7] eliminate data write in bdrv_write_zeroes on Linux in raw-posix.c Denis V. Lunev
2015-01-30 8:42 ` [Qemu-devel] [PATCH 4/7] block: use fallocate(FALLOC_FL_ZERO_RANGE) in handle_aiocb_write_zeroes Denis V. Lunev