From: Anton Nefedov <anton.nefedov@virtuozzo.com>
To: Alberto Garcia <berto@igalia.com>, qemu-devel@nongnu.org
Cc: qemu-block@nongnu.org, kwolf@redhat.com, mreitz@redhat.com,
eblake@redhat.com, den@virtuozzo.com
Subject: Re: [Qemu-devel] [PATCH v6 8/9] qcow2: skip writing zero buffers to empty COW areas
Date: Wed, 17 Jan 2018 17:12:44 +0300 [thread overview]
Message-ID: <11f70cde-3f73-ad03-3d67-1e2eee2a04e4@virtuozzo.com> (raw)
In-Reply-To: <w51h8rlwzp8.fsf@maestria.local.igalia.com>
On 16/1/2018 7:11 PM, Alberto Garcia wrote:
> On Tue 16 Jan 2018 02:04:29 PM CET, Anton Nefedov wrote:
>
>> iotest 060:
>> write to the discarded cluster does not trigger COW anymore.
>> so, break on write_aio event instead, will work for the test
>> (but write won't fail anymore, so update reference output)
>
> I'm wondering about this. The reason why the write doesn't fail anymore
> is because after this patch we're breaking in write_aio as you say:
>
> BLKDBG_EVENT(bs->file, BLKDBG_WRITE_AIO);
> trace_qcow2_writev_data(qemu_coroutine_self(),
> cluster_offset + offset_in_cluster);
> ret = bdrv_co_pwritev(bs->file,
> cluster_offset + offset_in_cluster,
> cur_bytes, &hd_qiov, 0);
>
> When the image is marked as corrupted then bs->drv is set to NULL, but
> bs->file->drv is still valid. So QEMU goes forward and writes into the
> image.
>
> Should we check bs->drv after BLKDBG_EVENT() or perhaps set
> bs->file->bs->drv = NULL when an image is corrupted?
>
I don't know. On one hand we'll catch and cancel some of in-flight
requests which is rather good.
It feels though like the drv check that the test uses to get error on is
mostly because the driver function is used directly.
>> +static bool is_zero_cow(BlockDriverState *bs, QCowL2Meta *m)
>> +{
>> + if (bs->encrypted) {
>> + return false;
>> + }
>
> I found this a bit confusing because is_zero_cow() can be interpreted as
> "the region we're going to copy only contains zeroes" or "we're only
> going to write zeroes".
>
> In the first case the bs->encrypted test does not belong there, because
> that region may perfectly well contain only zeroes and bs->encrypted
> tells us nothing about it.
>
> In the second case the test is fine because bs->encrypted means that
> we're definitely going to write something other than zeroes.
>
> I think it's worth adding a comment clarifying this in order to avoid
> confusion, or perhaps renaming the function to make it more explicit
> (cow_writes_as_zeroes() or something like that).
>
Agree. I'd rather take bs->encrypted check out.
>> +static void handle_alloc_space(BlockDriverState *bs, QCowL2Meta *l2meta)
>> +{
>> + BDRVQcow2State *s = bs->opaque;
>> + QCowL2Meta *m;
>> +
>> + for (m = l2meta; m != NULL; m = m->next) {
>> + int ret;
>> +
>> + if (!m->cow_start.nb_bytes && !m->cow_end.nb_bytes) {
>> + continue;
>> + }
>> +
>> + if (!is_zero_cow(bs, m)) {
>> + continue;
>> + }
>> +
>> + /* instead of writing zero COW buffers,
>> + efficiently zero out the whole clusters */
>> + ret = bdrv_co_pwrite_zeroes(bs->file, m->alloc_offset,
>> + m->nb_clusters * s->cluster_size,
>> + BDRV_REQ_ALLOCATE);
>> + if (ret < 0) {
>> + continue;
>> + }
>
> Is it always fine to simply ignore the error and go on?
>
Good point, probably error codes other than ENOTSUP and EAGAIN should be
propagated.
>> --- a/tests/qemu-iotests/060
>> +++ b/tests/qemu-iotests/060
>> @@ -160,7 +160,7 @@ poke_file "$TEST_IMG" '131084' "\x00\x00" # 0x2000c
>> # any unallocated cluster, leading to an attempt to overwrite the second L2
>> # table. Finally, resume the COW write and see it fail (but not crash).
>> echo "open -o file.driver=blkdebug $TEST_IMG
>> -break cow_read 0
>> +break write_aio 0
>> aio_write 0k 1k
>> wait_break 0
>> write 64k 64k
>
> Apart from what I wrote in the beginning of the e-mail, if you're
> changing the semantics of this test you should also update the
> comment. With your patch the COW no longer stops before doing the read,
> and after being resumed it no longer crashes.
>
In fact, the change makes the test quite useless.
I will fix COW instead (i.e. use a real backing file).
Also I think I missed to create a new blkdbg event, it looks those are
generally put before bdrv_x(bs->file) calls.
next prev parent reply other threads:[~2018-01-17 14:13 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-01-16 13:04 [Qemu-devel] [PATCH v6 0/9] qcow2: cluster space preallocation Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 1/9] mirror: inherit supported write/zero flags Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 2/9] blkverify: set " Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 3/9] block: introduce BDRV_REQ_ALLOCATE flag Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 4/9] block: treat BDRV_REQ_ALLOCATE as serialising Anton Nefedov
2018-01-16 20:43 ` Eric Blake
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 5/9] file-posix: support BDRV_REQ_ALLOCATE Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 6/9] block: support BDRV_REQ_ALLOCATE in passthrough drivers Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 7/9] qcow2: move is_zero() up Anton Nefedov
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 8/9] qcow2: skip writing zero buffers to empty COW areas Anton Nefedov
2018-01-16 16:11 ` Alberto Garcia
2018-01-17 14:12 ` Anton Nefedov [this message]
2018-01-16 13:04 ` [Qemu-devel] [PATCH v6 9/9] iotest 134: test cluster-misaligned encrypted write Anton Nefedov
2018-01-16 20:45 ` Eric Blake
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=11f70cde-3f73-ad03-3d67-1e2eee2a04e4@virtuozzo.com \
--to=anton.nefedov@virtuozzo.com \
--cc=berto@igalia.com \
--cc=den@virtuozzo.com \
--cc=eblake@redhat.com \
--cc=kwolf@redhat.com \
--cc=mreitz@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).