From: "Denis V. Lunev" <den@openvz.org>
To: Max Reitz <mreitz@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>, Fam Zheng <famz@redhat.com>,
Peter Lieven <pl@kamp.de>,
qemu-devel@nongnu.org, Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: [Qemu-devel] [PATCH 6/7] block/raw-posix: call plain fallocate in handle_aiocb_write_zeroes
Date: Fri, 30 Jan 2015 08:38:56 +0300
Message-ID: <54CB18F0.3020700@openvz.org>
In-Reply-To: <54CAB927.4010006@redhat.com>
On 30/01/15 01:50, Max Reitz wrote:
> On 2015-01-28 at 13:38, Denis V. Lunev wrote:
>> There is a possibility that we are extending our image and thus writing
>> zeroes beyond the end of the file. In this case we do not need to punch
>> a hole first to make sure that there is no data in the file at this
>> offset (the pre-condition for fallocate(0) to work). We can simply call
>> fallocate(0).
>>
>> This improves the performance of writing zeroes even on really old
>> platforms that do not have FALLOC_FL_PUNCH_HOLE.
>>
>> Before this patch, do_fallocate was compiled in only when either
>> CONFIG_FALLOCATE_PUNCH_HOLE or CONFIG_FALLOCATE_ZERO_RANGE was defined.
>> Now it is guarded by CONFIG_FALLOCATE, which is defined when Linux
>> fallocate is available; posix_fallocate is a completely different story
>> (CONFIG_POSIX_FALLOCATE). CONFIG_FALLOCATE is a mandatory prerequisite
>> for both CONFIG_FALLOCATE_PUNCH_HOLE and CONFIG_FALLOCATE_ZERO_RANGE,
>> thus we are on the safe side.
>>
>> Signed-off-by: Denis V. Lunev <den@openvz.org>
>> CC: Max Reitz <mreitz@redhat.com>
>> CC: Kevin Wolf <kwolf@redhat.com>
>> CC: Stefan Hajnoczi <stefanha@redhat.com>
>> CC: Peter Lieven <pl@kamp.de>
>> CC: Fam Zheng <famz@redhat.com>
>> ---
>> block/raw-posix.c | 14 +++++++++++++-
>> 1 file changed, 13 insertions(+), 1 deletion(-)
>>
>> diff --git a/block/raw-posix.c b/block/raw-posix.c
>> index 2e24829..3db911a 100644
>> --- a/block/raw-posix.c
>> +++ b/block/raw-posix.c
>> @@ -147,6 +147,7 @@ typedef struct BDRVRawState {
>> bool has_discard:1;
>> bool has_write_zeroes:1;
>> bool discard_zeroes:1;
>> + bool has_fallocate;
>> bool needs_alignment;
>> } BDRVRawState;
>> @@ -452,6 +453,7 @@ static int raw_open_common(BlockDriverState *bs, QDict *options,
>> }
>> if (S_ISREG(st.st_mode)) {
>> s->discard_zeroes = true;
>> + s->has_fallocate = true;
>> }
>> if (S_ISBLK(st.st_mode)) {
>> #ifdef BLKDISCARDZEROES
>> @@ -902,7 +904,7 @@ static int translate_err(int err)
>> return err;
>> }
>> -#if defined(CONFIG_FALLOCATE_PUNCH_HOLE) || defined(CONFIG_FALLOCATE_ZERO_RANGE)
>> +#ifdef CONFIG_FALLOCATE
>> static int do_fallocate(int fd, int mode, off_t offset, off_t len)
>> {
>> do {
>> @@ -980,6 +982,16 @@ static ssize_t handle_aiocb_write_zeroes(RawPosixAIOData *aiocb)
>> }
>> #endif
>> +#ifdef CONFIG_FALLOCATE
>> + if (s->has_fallocate && aiocb->aio_offset >= bdrv_getlength(aiocb->bs)) {
>> + int ret = do_fallocate(s->fd, 0, aiocb->aio_offset, aiocb->aio_nbytes);
>> + if (ret == 0 || ret != -ENOTSUP) {
>> + return ret;
>> + }
>> + s->has_fallocate = false;
>> + }
>> +#endif
>> +
>> return -ENOTSUP;
>> }
>
> Now that you do have has_fallocate, I think you should be using it in
> patch 5 as well. So I think you should either make this patch add
> it in the area touched by patch 5, or introduce has_fallocate in
> patch 5 already and use it there.
>
> Max
OK. No problem. I do not think that it is ever possible, but
why not?
I will reorder these patches in the patchset to minimize changes.
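
For readers following the thread, here is a rough standalone sketch of the
idea discussed above: try plain fallocate(fd, 0, ...) only when the range
starts at or beyond EOF, and remember an "unsupported" result so later calls
skip the attempt. This is not the QEMU code. struct zero_state,
plain_fallocate and zero_beyond_eof are hypothetical names, fstat() stands in
for bdrv_getlength(), and treating ENOSYS like EOPNOTSUPP is an assumption
about what the error-merging helper from patch 1 does. Linux only.

```c
#define _GNU_SOURCE
#include <errno.h>
#include <fcntl.h>
#include <stdbool.h>
#include <sys/stat.h>
#include <sys/types.h>

/* Hypothetical stand-in for the relevant fields of BDRVRawState. */
struct zero_state {
    int fd;
    bool has_fallocate;   /* cleared after the first "not supported" result */
};

/* Call fallocate() with mode 0, retrying on EINTR.
 * Returns 0 on success or a negative errno value. */
static int plain_fallocate(int fd, off_t offset, off_t len)
{
    int ret;

    do {
        ret = fallocate(fd, 0, offset, len);
    } while (ret == -1 && errno == EINTR);

    return ret == 0 ? 0 : -errno;
}

/* Zero the range [offset, offset + len) only if it starts at or beyond EOF.
 * Returns 0 on success, -ENOTSUP if the caller has to fall back to another
 * method (for example writing a zero buffer), or another negative errno
 * value on a real failure. */
static int zero_beyond_eof(struct zero_state *s, off_t offset, off_t len)
{
    struct stat st;
    int ret;

    if (fstat(s->fd, &st) < 0) {
        return -errno;
    }
    /* Existing data below EOF would survive fallocate(0), so skip it there. */
    if (!s->has_fallocate || offset < st.st_size) {
        return -ENOTSUP;
    }

    ret = plain_fallocate(s->fd, offset, len);
    if (ret == -EOPNOTSUPP || ret == -ENOSYS) {
        /* Assumption: both mean "never going to work on this file". */
        s->has_fallocate = false;
        return -ENOTSUP;
    }
    return ret;
}
```

A caller would initialise has_fallocate to true for regular files only and
fall back to its existing zero-writing path whenever -ENOTSUP comes back.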