From: Peter Lieven <pl@kamp.de>
To: Kevin Wolf <kwolf@redhat.com>
Cc: Paolo Bonzini <pbonzini@redhat.com>,
"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
"stefanha@redhat.com >> Stefan Hajnoczi" <stefanha@redhat.com>,
mreitz@redhat.com
Subject: Re: [Qemu-devel] [Fwd: [PATCH v2] vpc: Implement bdrv_co_get_block_status()]
Date: Fri, 20 Feb 2015 14:42:15 +0100
Message-ID: <54E739B7.1010601@kamp.de>
In-Reply-To: <20150219120734.GB3893@noname.redhat.com>
On 19.02.2015 at 13:07, Kevin Wolf wrote:
> On 18.02.2015 at 22:03, Peter Lieven wrote:
>> On 18.02.2015 at 21:57, Peter Lieven wrote:
>>> This implements bdrv_co_get_block_status() for VHD images. This can
>>> significantly speed up qemu-img convert operation because only with this
>>> function implemented sparseness can be considered. (Before, converting a
>>> 1 TB empty image took several minutes for me, now it's instantaneous.)
>>>
>>> Signed-off-by: Kevin Wolf <kwolf@redhat.com>
>>> ---
>>> block/vpc.c | 50 ++++++++++++++++++++++++++++++++++++++++++++++++--
>>> 1 file changed, 48 insertions(+), 2 deletions(-)
>>>
>>> diff --git a/block/vpc.c b/block/vpc.c
>>> index 7fddbf0..1533b6a 100644
>>> --- a/block/vpc.c
>>> +++ b/block/vpc.c
>>> @@ -597,6 +597,51 @@ static coroutine_fn int vpc_co_write(BlockDriverState *bs, int64_t sector_num,
>>>      return ret;
>>>  }
>>>
>>> +static int64_t coroutine_fn vpc_co_get_block_status(BlockDriverState *bs,
>>> +        int64_t sector_num, int nb_sectors, int *pnum)
>>> +{
>>> +    BDRVVPCState *s = bs->opaque;
>>> +    VHDFooter *footer = (VHDFooter*) s->footer_buf;
>>> +    int64_t start, offset, next;
>>> +    bool allocated;
>>> +    int n;
>>> +
>>> +    if (be32_to_cpu(footer->type) == VHD_FIXED) {
>>> +        *pnum = nb_sectors;
>>> +        return BDRV_BLOCK_RAW | BDRV_BLOCK_OFFSET_VALID | BDRV_BLOCK_DATA |
>>> +               (sector_num << BDRV_SECTOR_BITS);
>>> +    }
>>> +
>>> +    offset = get_sector_offset(bs, sector_num, 0);
>>> +    start = offset;
>>> +    allocated = (offset != -1);
>>> +    *pnum = 0;
>>> +
>>> +    do {
>>> +        /* All sectors in a block are contiguous (without using the bitmap) */
>>> +        n = ROUND_UP(sector_num + 1, s->block_size / BDRV_SECTOR_SIZE)
>>> +            - sector_num;
>>> +        n = MIN(n, nb_sectors);
>>> +
>>> +        *pnum += n;
>>> +        sector_num += n;
>>> +        nb_sectors -= n;
>>> +        next = start + (*pnum * BDRV_SECTOR_SIZE);
>>> +
>>> +        if (nb_sectors == 0) {
>>> +            break;
>>> +        }
>>> +
>>> +        offset = get_sector_offset(bs, sector_num, 0);
>>> +    } while ((allocated && offset == next) || (!allocated && offset == -1));
>>> +
>>> +    if (allocated) {
>>> +        return BDRV_BLOCK_DATA | BDRV_BLOCK_OFFSET_VALID | start;
>>> +    } else {
>>> +        return 0;
>> Shouldn't this be
>>
>> return BDRV_BLOCK_ZERO;
>>
>> ?
>>
>> vpc_read memsets all blocks with offset == -1 to 0x00.
> Yes, but the blocks are still unallocated, as opposed to allocated as
> zero clusters, and this is indicated by 0.
Okay, then I somehow have to fix that up in the iscsi driver. There
I treat unallocated and anchored identically. Has that changed at some
point after the initial introduction of bdrv_get_block_status?
>
> vpc_get_info() sets bdi->unallocated_blocks_are_zero = true, so we end
> up with bdrv_co_get_block_status() returning BDRV_BLOCK_ZERO, but not
> BDRV_BLOCK_ALLOCATED (which would be set if we had BDRV_BLOCK_ZERO
> here).
>
> I'm not sure if a wrong allocated flag would cause problems currently,
> but it's definitely necessary to get right once we add support for
> differencing images (patches are on the list, pending review).
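
For readers following the flag discussion, here is a minimal sketch of the post-processing Kevin describes. It is not QEMU's code: the names `SK_*` and `sketch_fixup_status` are hypothetical, and the flag values are placeholders chosen for this sketch rather than QEMU's definitions. It only illustrates why a driver returning 0 for an unallocated range can still end up reported as zero, but without the allocated bit.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Placeholder flag values for this sketch only (not QEMU's definitions). */
#define SK_BLOCK_DATA       0x01
#define SK_BLOCK_ZERO       0x02
#define SK_BLOCK_ALLOCATED  0x10

/*
 * Hypothetical model of the generic-layer fixup applied to a driver's
 * return value.  Assumption: the driver itself never sets the ALLOCATED bit.
 */
static int64_t sketch_fixup_status(int64_t drv_ret,
                                   bool unallocated_blocks_are_zero)
{
    int64_t ret = drv_ret;

    assert(!(drv_ret & SK_BLOCK_ALLOCATED));

    if (ret & (SK_BLOCK_DATA | SK_BLOCK_ZERO)) {
        /* The driver claimed the range, so it counts as allocated. */
        ret |= SK_BLOCK_ALLOCATED;
    } else if (unallocated_blocks_are_zero) {
        /* Unallocated, but known to read back as zeroes. */
        ret |= SK_BLOCK_ZERO;
    }
    return ret;
}
```

Under these assumptions, a driver returning 0 with `unallocated_blocks_are_zero` set yields ZERO without ALLOCATED, whereas a driver returning ZERO directly yields ZERO together with ALLOCATED. That is the distinction that would matter once backing (differencing) images are involved.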
>
>> Not for this patch, but couldn't we use your new function to significantly speed up
>> reading of contiguous allocated areas in vpc_read?
> There aren't really contiguous blocks in VHD, you always have a bitmap
> in between. In some cases it might be better to read the bitmap as well
> as the two adjacent blocks and throw that buffer away in order to save
> one read request, but with relatively large block sizes of VHD it's
> probably not going to help that much.
If there is always a bitmap between two clusters, then I do not understand the
loop in your bdrv_co_get_block_status implementation for VPC.
If I understand correctly, you skip over the bitmaps in the loop and report
contiguous sectors as allocated from start.
If there are bitmaps in between, at least qemu-img map would produce wrong
output.
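
To make the question concrete, here is a small standalone model of the loop, under stated assumptions. It is a sketch, not the driver: `sketch_sector_offset` and `sketch_run_length` are hypothetical names, the block size (2 MiB) and bitmap size (one sector) are assumed values, and the layout assumed is "block allocation table entry points at the bitmap, data follows the bitmap".

```c
#include <assert.h>
#include <stdint.h>

#define SK_SECTOR_SIZE    512
#define SK_BLOCK_SECTORS  4096          /* assumed: 2 MiB blocks */
#define SK_BITMAP_BYTES   512           /* assumed: one bitmap sector per block */
#define SK_BAT_UNUSED     0xffffffffu   /* assumed marker for "no block" */

#define SK_ROUND_UP(n, d) ((((n) + (d) - 1) / (d)) * (d))

/* File offset of a guest sector's data, or -1 if its block is unallocated. */
static int64_t sketch_sector_offset(const uint32_t *bat, int64_t bat_entries,
                                    int64_t sector_num)
{
    int64_t idx = sector_num / SK_BLOCK_SECTORS;
    int64_t in_block = sector_num % SK_BLOCK_SECTORS;

    if (idx >= bat_entries || bat[idx] == SK_BAT_UNUSED) {
        return -1;
    }
    /* The table entry addresses the bitmap; the data starts after it. */
    return (int64_t)bat[idx] * SK_SECTOR_SIZE + SK_BITMAP_BYTES
           + in_block * SK_SECTOR_SIZE;
}

/* Same loop shape as the patch: how many sectors form one reported run? */
static int64_t sketch_run_length(const uint32_t *bat, int64_t bat_entries,
                                 int64_t sector_num, int64_t nb_sectors)
{
    int64_t start = sketch_sector_offset(bat, bat_entries, sector_num);
    int allocated = (start != -1);
    int64_t pnum = 0, offset = -1, next, n;

    assert(nb_sectors > 0);

    do {
        /* Advance to the end of the current block, or the end of the request. */
        n = SK_ROUND_UP(sector_num + 1, SK_BLOCK_SECTORS) - sector_num;
        if (n > nb_sectors) {
            n = nb_sectors;
        }
        pnum += n;
        sector_num += n;
        nb_sectors -= n;
        next = start + pnum * SK_SECTOR_SIZE;

        if (nb_sectors == 0) {
            break;
        }
        offset = sketch_sector_offset(bat, bat_entries, sector_num);
    } while ((allocated && offset == next) || (!allocated && offset == -1));

    return pnum;
}
```

In this model, two blocks stored back to back in the file have data offsets that differ by the block size plus the bitmap size, so `offset == next` fails at the block boundary and an allocated run ends there. Only unallocated blocks are merged across boundaries. Whether the real driver behaves the same way depends on get_sector_offset() and the on-disk layout, which this sketch only assumes.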
Peter