From: Jean-Louis Dupond <jean-louis@dupond.be>
To: qemu-devel@nongnu.org, kwolf@redhat.com, hreitz@redhat.com,
 andrey.drobyshev@virtuozzo.com
Subject: Re: [PATCH 1/2] qcow2: handle discard-no-unref in measure
Date: Wed, 5 Jun 2024 12:59:19 +0200
Message-ID: <5b49cbb9-9b28-4a0d-b897-77492392333d@dupond.be>
In-Reply-To: <20240605090639.3402698-2-jean-louis@dupond.be>
On 5/06/2024 11:06, Jean-Louis Dupond wrote:
> When doing a measure on an image with a backing file and
> discard-no-unref is enabled, the code should take this into account.
>
> If for example you have a snapshot image with a base, and you do a
> discard within the snapshot, it will be ZERO and ALLOCATED, but without
> host offset.
> Now if we commit this snapshot, and the clusters in the base image were
> allocated, the clusters will only be set to ZERO, but the host offset
> will not be cleared.
> Therefor ZERO & ALLOCATED clusters in the top image need to check the
> base to see if space will be freed or not, to have a correct measure
> output.
>
> Bug-Url: https://gitlab.com/qemu-project/qemu/-/issues/2369
> Signed-off-by: Jean-Louis Dupond <jean-louis@dupond.be>
> ---
> block/qcow2.c | 36 +++++++++++++++++++++++++++++++++---
> 1 file changed, 33 insertions(+), 3 deletions(-)
>
> diff --git a/block/qcow2.c b/block/qcow2.c
> index 956128b409..1ce7ebbab4 100644
> --- a/block/qcow2.c
> +++ b/block/qcow2.c
> @@ -5163,9 +5163,16 @@ static BlockMeasureInfo *qcow2_measure(QemuOpts *opts, BlockDriverState *in_bs,
> } else {
> int64_t offset;
> int64_t pnum = 0;
> + BlockDriverState *parent = bdrv_filter_or_cow_bs(in_bs);
> + BDRVQcow2State *s = NULL;
> +
> + if (parent) {
> + s = parent->opaque;
> + }
>
> for (offset = 0; offset < ssize; offset += pnum) {
> int ret;
> + int retp = 0;
>
> ret = bdrv_block_status_above(in_bs, NULL, offset,
> ssize - offset, &pnum, NULL,
> @@ -5176,10 +5183,33 @@ static BlockMeasureInfo *qcow2_measure(QemuOpts *opts, BlockDriverState *in_bs,
> goto err;
> }
>
> - if (ret & BDRV_BLOCK_ZERO) {
> + /* If we have a parent in the chain and the current block is zero but allocated,
> + * then we want to check the allocation state of the parent block.
> + * If it was allocated and now zero, we want
> + * to include it into the calculation, cause it will not free space when
> + * committing the top into base with discard-no-unref enabled.
> + */
> + if (parent &&
> + ((ret & (BDRV_BLOCK_ZERO | BDRV_BLOCK_ALLOCATED)) ==
> + (BDRV_BLOCK_ZERO | BDRV_BLOCK_ALLOCATED)) &&
> + s->discard_no_unref) {
> + int64_t pnum_parent = 0;
> + retp = bdrv_block_status_above(parent, NULL, offset,
> + ssize - offset, &pnum_parent, NULL,
> + NULL);
> + // Check if parent block has an offset
> + if (retp & BDRV_BLOCK_OFFSET_VALID) {
> + pnum = retp;
This should be `pnum = pnum_parent`, of course :)
(retp holds the block status flags, not a length.)
> + }
> + }
> + if (ret & BDRV_BLOCK_ZERO && !retp) {
> /* Skip zero regions (safe with no backing file) */
> - } else if ((ret & (BDRV_BLOCK_DATA | BDRV_BLOCK_ALLOCATED)) ==
> - (BDRV_BLOCK_DATA | BDRV_BLOCK_ALLOCATED)) {
> + } else if (((ret & (BDRV_BLOCK_DATA | BDRV_BLOCK_ALLOCATED)) ==
> + (BDRV_BLOCK_DATA | BDRV_BLOCK_ALLOCATED)) ||
> + (((ret & (BDRV_BLOCK_ZERO | BDRV_BLOCK_ALLOCATED)) ==
> + (BDRV_BLOCK_ZERO | BDRV_BLOCK_ALLOCATED)) &&
> + s && s->discard_no_unref &&
> + retp & BDRV_BLOCK_OFFSET_VALID)) {
> /* Extend pnum to end of cluster for next iteration */
> pnum = ROUND_UP(offset + pnum, cluster_size) - offset;
>
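To make the intended behaviour easier to review, here is a minimal standalone sketch of the per-extent decision from the hunk above, with the `pnum = pnum_parent` correction applied. It is not the patch itself: the `sk_` names are hypothetical, the flag values are local stand-ins for QEMU's BDRV_BLOCK_* constants, and `retp` is assumed to be 0 whenever the parent was not queried.

```c
#include <stdbool.h>
#include <stdint.h>

/* Local stand-ins for this sketch only; QEMU defines the real
 * BDRV_BLOCK_* flags in its block-layer headers. */
enum {
    SK_BLOCK_DATA         = 0x01,
    SK_BLOCK_ZERO         = 0x02,
    SK_BLOCK_OFFSET_VALID = 0x04,
    SK_BLOCK_ALLOCATED    = 0x10,
};

/*
 * Hypothetical helper: should this extent be added to "required"?
 * ret  - status of the top image extent
 * retp - status of the parent extent, 0 if the parent was not queried
 * parent_no_unref - parent exists and has discard-no-unref enabled
 */
static bool sk_extent_is_counted(int ret, int retp, bool parent_no_unref)
{
    const int data_alloc = SK_BLOCK_DATA | SK_BLOCK_ALLOCATED;
    const int zero_alloc = SK_BLOCK_ZERO | SK_BLOCK_ALLOCATED;

    if ((ret & SK_BLOCK_ZERO) && !retp) {
        /* Plain zero region: skipped, as before the patch */
        return false;
    }
    if ((ret & data_alloc) == data_alloc) {
        return true;
    }
    /* Zero in the top image, but the parent keeps its host offset */
    return parent_no_unref &&
           (ret & zero_alloc) == zero_alloc &&
           (retp & SK_BLOCK_OFFSET_VALID) != 0;
}

/* Hypothetical helper: extent length to use, with the fix applied */
static int64_t sk_extent_length(int64_t pnum, int64_t pnum_parent, int retp)
{
    return (retp & SK_BLOCK_OFFSET_VALID) ? pnum_parent : pnum;
}
```

A zero-but-allocated top extent is counted only when the parent reports a valid host offset and discard-no-unref is on; in that case the length comes from the parent query rather than from the flags value.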
With that fix, the patch seems to work fine in my tests with the following commands:
./build/qemu-img create -f qcow2 /tmp/test.qcow2 128M
./build/qemu-io -c 'open /tmp/test.qcow2' -c 'write 0 8M' \
    -c 'write 56M 20M' -c 'write 10M 8M' -c 'write 24M 32M'
./build/qemu-img create -f qcow2 -b /tmp/test.qcow2 -F qcow2 \
    /tmp/test_snap.qcow2
./build/qemu-io \
    -c 'open -o discard=unmap,discard-no-unref=on /tmp/test_snap.qcow2' \
    -c 'write 16M 8M' -c 'discard 60M 20M' -c 'write 84M 10M'
./build/qemu-img measure --output json -O qcow2 'json:{"file":
{"driver": "file", "filename": "/tmp/test_snap.qcow2"}, "driver":
"qcow2", "backing": {"driver": "qcow2", "file": {"driver": "file",
"filename": "/tmp/test.qcow2"}, "backing": null}}'
./build/qemu-img measure --output json -O qcow2 'json:{"file":
{"driver": "file", "filename": "/tmp/test_snap.qcow2"}, "driver":
"qcow2", "discard":"unmap", "discard-no-unref":true, "backing":
{"driver": "qcow2", "discard-no-unref":true, "file": {"driver": "file",
"filename": "/tmp/test.qcow2"}, "backing": null}}'
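For orientation, the interesting part of this test is the portion of the discarded range (60M to 80M in the snapshot) that sits on top of data written to the base. The sketch below only does that interval arithmetic from the qemu-io commands above; the `sk_` names are hypothetical, and it assumes every offset is cluster-aligned, which holds for these MiB-aligned writes with the default 64 KiB cluster size. It says nothing about the actual `measure` output.

```c
#include <stdint.h>

#define SK_MIB (1024 * 1024LL)

/* Hypothetical type: a guest byte range [start, start + len) */
struct sk_range {
    int64_t start;
    int64_t len;
};

/* Guest ranges written to /tmp/test.qcow2 by the qemu-io commands above
 * (the four ranges do not overlap each other) */
static const struct sk_range sk_base_writes[] = {
    {  0 * SK_MIB,  8 * SK_MIB },
    { 56 * SK_MIB, 20 * SK_MIB },
    { 10 * SK_MIB,  8 * SK_MIB },
    { 24 * SK_MIB, 32 * SK_MIB },
};

/* Bytes of r that intersect any range in set; set must be disjoint */
static int64_t sk_overlap(struct sk_range r, const struct sk_range *set, int n)
{
    int64_t total = 0;

    for (int i = 0; i < n; i++) {
        int64_t lo = r.start > set[i].start ? r.start : set[i].start;
        int64_t end_r = r.start + r.len;
        int64_t end_s = set[i].start + set[i].len;
        int64_t hi = end_r < end_s ? end_r : end_s;

        if (hi > lo) {
            total += hi - lo;
        }
    }
    return total;
}
```

Calling `sk_overlap` with the discard range `{ 60 * SK_MIB, 20 * SK_MIB }` and the four base writes gives 16 MiB (60M to 76M); the remaining 4 MiB of the discard (76M to 80M) covers a range that was never written in the base.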
But it does not seem to work when the base image has ZERO ALLOCATED
clusters that overlap with the ZERO ALLOCATED clusters in the snapshot,
because bdrv_block_status_above then sees them as a single zero cluster.
This happens, for example, when the base VM was initially running without
discard-no-unref and only enabled it later.
Any ideas on how to handle that?
Thanks
Jean-Louis