qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Max Reitz <mreitz@redhat.com>
To: "Benoît Canet" <benoit.canet@irqsave.net>
Cc: Kevin Wolf <kwolf@redhat.com>,
	qemu-devel@nongnu.org, Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: [Qemu-devel] [PATCH alt 6/7] block/qcow2: Simplify shared L2 handling in amend
Date: Fri, 01 Aug 2014 22:51:47 +0200	[thread overview]
Message-ID: <53DBFDE3.7010402@redhat.com> (raw)
In-Reply-To: <20140731082420.GK707@irqsave.net>

On 31.07.2014 10:24, Benoît Canet wrote:
> The Saturday 26 Jul 2014 à 21:22:10 (+0200), Max Reitz wrote :
>
>> Currently, we have a bitmap for keeping track of which clusters have
>> been created during the zero cluster expansion process. This was
>> necessary because we need to properly increase the refcount for shared
>> L2 tables.
>>
>> However, now we can simply take the L2 refcount and use it for the
>> cluster allocated for expansion. This will be the correct refcount and
>> therefore we don't have to remember that cluster having been allocated
>> any more.
>>
>> Signed-off-by: Max Reitz <mreitz@redhat.com>
>> ---
>>   block/qcow2-cluster.c | 90 ++++++++++++++++-----------------------------------
>>   1 file changed, 28 insertions(+), 62 deletions(-)
>>
>> diff --git a/block/qcow2-cluster.c b/block/qcow2-cluster.c
>> index f8bec6f..e6bff40 100644
>> --- a/block/qcow2-cluster.c
>> +++ b/block/qcow2-cluster.c
>> @@ -1543,20 +1543,12 @@ fail:
>>    * Expands all zero clusters in a specific L1 table (or deallocates them, for
>>    * non-backed non-pre-allocated zero clusters).
>>    *
>> - * expanded_clusters is a bitmap where every bit corresponds to one cluster in
>> - * the image file; a bit gets set if the corresponding cluster has been used for
>> - * zero expansion (i.e., has been filled with zeroes and is referenced from an
>> - * L2 table). nb_clusters contains the total cluster count of the image file,
>> - * i.e., the number of bits in expanded_clusters.
>> - *
>>    * l1_entries and *visited_l1_entries are ued to keep track of progress for
>>    * status_cb(). l1_entries contains the total number of L1 entries and
>>    * *visited_l1_entries counts all visited L1 entries.
>>    */
>>   static int expand_zero_clusters_in_l1(BlockDriverState *bs, uint64_t *l1_table,
>> -                                      int l1_size, uint8_t **expanded_clusters,
>> -                                      uint64_t *nb_clusters,
>> -                                      int64_t *visited_l1_entries,
>> +                                      int l1_size, int64_t *visited_l1_entries,
>>                                         int64_t l1_entries,
>>                                         BlockDriverAmendStatusCB *status_cb)
>>   {
>> @@ -1575,6 +1567,7 @@ static int expand_zero_clusters_in_l1(BlockDriverState *bs, uint64_t *l1_table,
>>       for (i = 0; i < l1_size; i++) {
>>           uint64_t l2_offset = l1_table[i] & L1E_OFFSET_MASK;
>>           bool l2_dirty = false;
>> +        int l2_refcount;
>>   
>>           if (!l2_offset) {
>>               /* unallocated */
>> @@ -1595,33 +1588,19 @@ static int expand_zero_clusters_in_l1(BlockDriverState *bs, uint64_t *l1_table,
>>               goto fail;
>>           }
>>   
>> +        l2_refcount = qcow2_get_refcount(bs, l2_offset >> s->cluster_bits);
>> +        if (l2_refcount < 0) {
>> +            ret = l2_refcount;
>> +            goto fail;
>> +        }
>> +
>>           for (j = 0; j < s->l2_size; j++) {
>>               uint64_t l2_entry = be64_to_cpu(l2_table[j]);
>> -            int64_t offset = l2_entry & L2E_OFFSET_MASK, cluster_index;
>> +            int64_t offset = l2_entry & L2E_OFFSET_MASK;
>>               int cluster_type = qcow2_get_cluster_type(l2_entry);
>>               bool preallocated = offset != 0;
>>   
>> -            if (cluster_type == QCOW2_CLUSTER_NORMAL) {
>> -                cluster_index = offset >> s->cluster_bits;
>> -                assert((cluster_index >= 0) && (cluster_index < *nb_clusters));
>> -                if ((*expanded_clusters)[cluster_index / 8] &
>> -                    (1 << (cluster_index % 8))) {
>> -                    /* Probably a shared L2 table; this cluster was a zero
>> -                     * cluster which has been expanded, its refcount
>> -                     * therefore most likely requires an update. */
>> -                    ret = qcow2_update_cluster_refcount(bs, cluster_index, 1,
>> -                                                        QCOW2_DISCARD_NEVER);
>> -                    if (ret < 0) {
>> -                        goto fail;
>> -                    }
>> -                    /* Since we just increased the refcount, the COPIED flag may
>> -                     * no longer be set. */
>> -                    l2_table[j] = cpu_to_be64(l2_entry & ~QCOW_OFLAG_COPIED);
>> -                    l2_dirty = true;
>> -                }
>> -                continue;
>> -            }
>> -            else if (qcow2_get_cluster_type(l2_entry) != QCOW2_CLUSTER_ZERO) {
>> +            if (cluster_type != QCOW2_CLUSTER_ZERO) {
>>                   continue;
>>               }
>>   
>> @@ -1639,6 +1618,19 @@ static int expand_zero_clusters_in_l1(BlockDriverState *bs, uint64_t *l1_table,
>>                       ret = offset;
>>                       goto fail;
>>                   }
>> +
>> +                if (l2_refcount > 1) {
>> +                    /* For shared L2 tables, set the refcount accordingly (it is
>> +                     * already 1 and needs to be l2_refcount) */
>> +                    ret = qcow2_update_cluster_refcount(bs,
>> +                            offset >> s->cluster_bits, l2_refcount - 1,
>> +                            QCOW2_DISCARD_OTHER);
> This look like a wrong usage of qcow2_update_cluster_refcount:
>
> /*
>   * Increases or decreases the refcount of a given cluster by one.
>   * addend must be 1 or -1.
> Here ^

As far as I can see, that comment no longer applies (anything in 
update_refcount() very well allows arbitrary values). I'll remove it in v2.

>   *
>   * If the return value is non-negative, it is the new refcount of the cluster.
>   * If it is negative, it is -errno and indicates an error.
>   */
> int qcow2_update_cluster_refcount(BlockDriverState *bs,
>                                    int64_t cluster_index,
>                                    int addend,
>                                    enum qcow2_discard_type type)
>
> Also this call is in a loop it would do l2_refcount - 1 * n increments on the refcount.

Hm? The cluster at "offset" is allocated directly before the call to 
qcow2_update_cluster_refcount(). There is no loop which the latter call 
is in, but the allocation is not.

Max

>> +                    if (ret < 0) {
>> +                        qcow2_free_clusters(bs, offset, s->cluster_size,
>> +                                            QCOW2_DISCARD_OTHER);
>> +                        goto fail;
>> +                    }
>> +                }
>
>
>
>>               }
>>   
>>               ret = qcow2_pre_write_overlap_check(bs, 0, offset, s->cluster_size);
>> @@ -1660,29 +1652,12 @@ static int expand_zero_clusters_in_l1(BlockDriverState *bs, uint64_t *l1_table,
>>                   goto fail;
>>               }
>>   
>> -            l2_table[j] = cpu_to_be64(offset | QCOW_OFLAG_COPIED);
>> -            l2_dirty = true;
>> -
>> -            cluster_index = offset >> s->cluster_bits;
>> -
>> -            if (cluster_index >= *nb_clusters) {
>> -                uint64_t old_bitmap_size = (*nb_clusters + 7) / 8;
>> -                uint64_t new_bitmap_size;
>> -                /* The offset may lie beyond the old end of the underlying image
>> -                 * file for growable files only */
>> -                assert(bs->file->growable);
>> -                *nb_clusters = size_to_clusters(s, bs->file->total_sectors *
>> -                                                BDRV_SECTOR_SIZE);
>> -                new_bitmap_size = (*nb_clusters + 7) / 8;
>> -                *expanded_clusters = g_realloc(*expanded_clusters,
>> -                                               new_bitmap_size);
>> -                /* clear the newly allocated space */
>> -                memset(&(*expanded_clusters)[old_bitmap_size], 0,
>> -                       new_bitmap_size - old_bitmap_size);
>> +            if (l2_refcount == 1) {
>> +                l2_table[j] = cpu_to_be64(offset | QCOW_OFLAG_COPIED);
>> +            } else {
>> +                l2_table[j] = cpu_to_be64(offset);
>>               }
>> -
>> -            assert((cluster_index >= 0) && (cluster_index < *nb_clusters));
>> -            (*expanded_clusters)[cluster_index / 8] |= 1 << (cluster_index % 8);
>> +            l2_dirty = true;
>>           }
>>   
>>           if (is_active_l1) {
>> @@ -1749,9 +1724,7 @@ int qcow2_expand_zero_clusters(BlockDriverState *bs,
>>   {
>>       BDRVQcowState *s = bs->opaque;
>>       uint64_t *l1_table = NULL;
>> -    uint64_t nb_clusters;
>>       int64_t l1_entries = 0, visited_l1_entries = 0;
>> -    uint8_t *expanded_clusters;
>>       int ret;
>>       int i, j;
>>   
>> @@ -1762,12 +1735,7 @@ int qcow2_expand_zero_clusters(BlockDriverState *bs,
>>           }
>>       }
>>   
>> -    nb_clusters = size_to_clusters(s, bs->file->total_sectors *
>> -                                   BDRV_SECTOR_SIZE);
>> -    expanded_clusters = g_malloc0((nb_clusters + 7) / 8);
>> -
>>       ret = expand_zero_clusters_in_l1(bs, s->l1_table, s->l1_size,
>> -                                     &expanded_clusters, &nb_clusters,
>>                                        &visited_l1_entries, l1_entries,
>>                                        status_cb);
>>       if (ret < 0) {
>> @@ -1803,7 +1771,6 @@ int qcow2_expand_zero_clusters(BlockDriverState *bs,
>>           }
>>   
>>           ret = expand_zero_clusters_in_l1(bs, l1_table, s->snapshots[i].l1_size,
>> -                                         &expanded_clusters, &nb_clusters,
>>                                            &visited_l1_entries, l1_entries,
>>                                            status_cb);
>>           if (ret < 0) {
>> @@ -1814,7 +1781,6 @@ int qcow2_expand_zero_clusters(BlockDriverState *bs,
>>       ret = 0;
>>   
>>   fail:
>> -    g_free(expanded_clusters);
>>       g_free(l1_table);
>>       return ret;
>>   }
>> -- 
>> 2.0.3
>>
>>

  reply	other threads:[~2014-08-01 20:52 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-26 19:22 [Qemu-devel] [PATCH alt 0/7] block/qcow2: Improve zero cluster expansion Max Reitz
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 1/7] block: Add status callback to bdrv_amend_options() Max Reitz
2014-07-31  7:51   ` Benoît Canet
2014-07-31  8:07     ` Benoît Canet
2014-08-01 20:06     ` Max Reitz
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 2/7] qemu-img: Add progress output for amend Max Reitz
2014-07-31  7:56   ` Benoît Canet
2014-08-01 20:09     ` Max Reitz
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 3/7] qemu-img: Fix insignifcant memleak Max Reitz
2014-07-30 14:58   ` Eric Blake
2014-07-30 20:08     ` Max Reitz
2014-07-31  7:59   ` Benoît Canet
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 4/7] block/qcow2: Implement status CB for amend Max Reitz
2014-07-30 16:28   ` Eric Blake
2014-07-31  8:06   ` Benoît Canet
2014-08-01 20:18     ` Max Reitz
2014-08-01 20:38       ` Eric Blake
2014-08-01 20:48         ` Max Reitz
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 5/7] block/qcow2: Make get_refcount() global Max Reitz
2014-07-31  8:09   ` Benoît Canet
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 6/7] block/qcow2: Simplify shared L2 handling in amend Max Reitz
2014-07-31  8:24   ` Benoît Canet
2014-08-01 20:51     ` Max Reitz [this message]
2014-07-26 19:22 ` [Qemu-devel] [PATCH alt 7/7] iotests: Expand test 061 Max Reitz
2014-07-31  8:30   ` Benoît Canet
2014-08-01 20:34     ` Max Reitz

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=53DBFDE3.7010402@redhat.com \
    --to=mreitz@redhat.com \
    --cc=benoit.canet@irqsave.net \
    --cc=kwolf@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).