All of lore.kernel.org
 help / color / mirror / Atom feed
From: Wei Wang <wei.w.wang@intel.com>
To: "Dr. David Alan Gilbert" <dgilbert@redhat.com>,
	"Michael S. Tsirkin" <mst@redhat.com>
Cc: qemu-devel@nongnu.org, virtio-dev@lists.oasis-open.org,
	quintela@redhat.com, pbonzini@redhat.com,
	liliang.opensource@gmail.com, yang.zhang.wz@gmail.com,
	quan.xu0@gmail.com, nilal@redhat.com, riel@redhat.com
Subject: [virtio-dev] Re: [PATCH v4 2/4] migration: API to clear bits of guest free pages from the dirty bitmap
Date: Thu, 15 Mar 2018 19:10:30 +0800	[thread overview]
Message-ID: <5AAA54A6.9020900@intel.com> (raw)
In-Reply-To: <20180314194258.GI3006@work-vm>

On 03/15/2018 03:42 AM, Dr. David Alan Gilbert wrote:
> * Michael S. Tsirkin (mst@redhat.com) wrote:
>> On Wed, Mar 14, 2018 at 06:11:37PM +0000, Dr. David Alan Gilbert wrote:
>>>> +            used_len = block->used_length - offset;
>>>> +            addr += used_len;
>>>> +        }
>>>> +
>>>> +        start = offset >> TARGET_PAGE_BITS;
>>>> +        npages = used_len >> TARGET_PAGE_BITS;
>>>> +        ram_state->migration_dirty_pages -=
>>>> +                      bitmap_count_one_with_offset(block->bmap, start, npages);
>>>> +        bitmap_clear(block->bmap, start, npages);
>>> If this is happening while the migration is running, this isn't safe -
>>> the migration code could clear a bit at about the same point this
>>> happens, so that the count returned by bitmap_count_one_with_offset
>>> wouldn't match the word that was cleared by bitmap_clear.
>>>
>>> The only way I can see to fix it is to run over the range using
>>> bitmap_test_and_clear_atomic, using the return value to decrement
>>> the number of dirty pages.
>>> But you also need to be careful with the update of the
>>> migration_dirty_pages value itself, because that's also being read
>>> by the migration thread.
>>>
>>> Dave
>> I see that there's migration_bitmap_sync but it does not seem to be
> Do you mean bitmap_mutex?
>
>> taken on all paths. E.g. migration_bitmap_clear_dirty and
>> migration_bitmap_find_dirty are called without that lock sometimes.
>> Thoughts?

Right. The bitmap claims to protect modification of the bitmap, but 
migration_bitmap_clear_dirty doesn't strictly follow the rule.

> Hmm, that doesn't seem to protect much at all!  It looks like it was
> originally added to handle hotplug causing the bitmaps to be resized;
> that extension code was removed in 66103a5 so that lock can probably go.
>
> I don't see how the lock would help us though; the migration thread is
> scanning it most of the time so would have to have the lock held
> most of the time.
>



How about adding the lock to migration_bitmap_clear_dirty, and we will 
have something like this:

migration_bitmap_clear_dirty()
{
     qemu_mutex_lock(&rs->bitmap_mutex);
     ret = test_and_clear_bit(page, rb->bmap);
      if (ret) {
         rs->migration_dirty_pages--;
     }
     ...
     qemu_mutex_unlock(&rs->bitmap_mutex);
}


qemu_guest_free_page_hint()
{
     qemu_mutex_lock(&rs->bitmap_mutex);
     ...
     ram_state->migration_dirty_pages -=
                       bitmap_count_one_with_offset(block->bmap, start, 
npages);
     bitmap_clear(block->bmap, start, npages);
     qemu_mutex_unlock(&rs->bitmap_mutex);
}


The migration thread will hold the lock only when it clears a bit from 
the bitmap. Or would you consider to change it to qemu_spin_lock?

Best,
Wei


---------------------------------------------------------------------
To unsubscribe, e-mail: virtio-dev-unsubscribe@lists.oasis-open.org
For additional commands, e-mail: virtio-dev-help@lists.oasis-open.org


WARNING: multiple messages have this Message-ID (diff)
From: Wei Wang <wei.w.wang@intel.com>
To: "Dr. David Alan Gilbert" <dgilbert@redhat.com>,
	"Michael S. Tsirkin" <mst@redhat.com>
Cc: qemu-devel@nongnu.org, virtio-dev@lists.oasis-open.org,
	quintela@redhat.com, pbonzini@redhat.com,
	liliang.opensource@gmail.com, yang.zhang.wz@gmail.com,
	quan.xu0@gmail.com, nilal@redhat.com, riel@redhat.com
Subject: Re: [Qemu-devel] [PATCH v4 2/4] migration: API to clear bits of guest free pages from the dirty bitmap
Date: Thu, 15 Mar 2018 19:10:30 +0800	[thread overview]
Message-ID: <5AAA54A6.9020900@intel.com> (raw)
In-Reply-To: <20180314194258.GI3006@work-vm>

On 03/15/2018 03:42 AM, Dr. David Alan Gilbert wrote:
> * Michael S. Tsirkin (mst@redhat.com) wrote:
>> On Wed, Mar 14, 2018 at 06:11:37PM +0000, Dr. David Alan Gilbert wrote:
>>>> +            used_len = block->used_length - offset;
>>>> +            addr += used_len;
>>>> +        }
>>>> +
>>>> +        start = offset >> TARGET_PAGE_BITS;
>>>> +        npages = used_len >> TARGET_PAGE_BITS;
>>>> +        ram_state->migration_dirty_pages -=
>>>> +                      bitmap_count_one_with_offset(block->bmap, start, npages);
>>>> +        bitmap_clear(block->bmap, start, npages);
>>> If this is happening while the migration is running, this isn't safe -
>>> the migration code could clear a bit at about the same point this
>>> happens, so that the count returned by bitmap_count_one_with_offset
>>> wouldn't match the word that was cleared by bitmap_clear.
>>>
>>> The only way I can see to fix it is to run over the range using
>>> bitmap_test_and_clear_atomic, using the return value to decrement
>>> the number of dirty pages.
>>> But you also need to be careful with the update of the
>>> migration_dirty_pages value itself, because that's also being read
>>> by the migration thread.
>>>
>>> Dave
>> I see that there's migration_bitmap_sync but it does not seem to be
> Do you mean bitmap_mutex?
>
>> taken on all paths. E.g. migration_bitmap_clear_dirty and
>> migration_bitmap_find_dirty are called without that lock sometimes.
>> Thoughts?

Right. The bitmap claims to protect modification of the bitmap, but 
migration_bitmap_clear_dirty doesn't strictly follow the rule.

> Hmm, that doesn't seem to protect much at all!  It looks like it was
> originally added to handle hotplug causing the bitmaps to be resized;
> that extension code was removed in 66103a5 so that lock can probably go.
>
> I don't see how the lock would help us though; the migration thread is
> scanning it most of the time so would have to have the lock held
> most of the time.
>



How about adding the lock to migration_bitmap_clear_dirty, and we will 
have something like this:

migration_bitmap_clear_dirty()
{
     qemu_mutex_lock(&rs->bitmap_mutex);
     ret = test_and_clear_bit(page, rb->bmap);
      if (ret) {
         rs->migration_dirty_pages--;
     }
     ...
     qemu_mutex_unlock(&rs->bitmap_mutex);
}


qemu_guest_free_page_hint()
{
     qemu_mutex_lock(&rs->bitmap_mutex);
     ...
     ram_state->migration_dirty_pages -=
                       bitmap_count_one_with_offset(block->bmap, start, 
npages);
     bitmap_clear(block->bmap, start, npages);
     qemu_mutex_unlock(&rs->bitmap_mutex);
}


The migration thread will hold the lock only when it clears a bit from 
the bitmap. Or would you consider to change it to qemu_spin_lock?

Best,
Wei

  parent reply	other threads:[~2018-03-15 11:07 UTC|newest]

Thread overview: 56+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-03-07 12:34 [virtio-dev] [PATCH v4 0/4] virtio-balloon: free page hint reporting support Wei Wang
2018-03-07 12:34 ` [Qemu-devel] " Wei Wang
2018-03-07 12:34 ` [virtio-dev] [PATCH v4 1/4] bitmap: bitmap_count_one_with_offset Wei Wang
2018-03-07 12:34   ` [Qemu-devel] " Wei Wang
2018-03-14 16:20   ` Dr. David Alan Gilbert
2018-03-07 12:34 ` [virtio-dev] [PATCH v4 2/4] migration: API to clear bits of guest free pages from the dirty bitmap Wei Wang
2018-03-07 12:34   ` [Qemu-devel] " Wei Wang
2018-03-14 18:11   ` Dr. David Alan Gilbert
2018-03-14 19:16     ` [virtio-dev] " Michael S. Tsirkin
2018-03-14 19:16       ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14 19:42       ` Dr. David Alan Gilbert
2018-03-14 20:38         ` [virtio-dev] " Michael S. Tsirkin
2018-03-14 20:38           ` [Qemu-devel] " Michael S. Tsirkin
2018-03-15 11:10         ` Wei Wang [this message]
2018-03-15 11:10           ` Wei Wang
2018-03-15 10:52     ` [virtio-dev] " Wei Wang
2018-03-15 10:52       ` [Qemu-devel] " Wei Wang
2018-03-15 13:50       ` [virtio-dev] " Michael S. Tsirkin
2018-03-15 13:50         ` [Qemu-devel] " Michael S. Tsirkin
2018-03-16 11:24         ` [virtio-dev] " Wei Wang
2018-03-16 11:24           ` [Qemu-devel] " Wei Wang
2018-03-07 12:34 ` [virtio-dev] [PATCH v4 3/4] virtio-balloon: VIRTIO_BALLOON_F_FREE_PAGE_HINT Wei Wang
2018-03-07 12:34   ` [Qemu-devel] " Wei Wang
2018-03-13 16:49   ` [virtio-dev] " Michael S. Tsirkin
2018-03-13 16:49     ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14  2:43     ` [virtio-dev] " Wei Wang
2018-03-14  2:43       ` [Qemu-devel] " Wei Wang
2018-03-14  2:53       ` Michael S. Tsirkin
2018-03-14  2:53         ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14  6:03         ` Wei Wang
2018-03-14  6:03           ` [Qemu-devel] " Wei Wang
2018-03-14 14:12           ` Michael S. Tsirkin
2018-03-14 14:12             ` [Qemu-devel] " Michael S. Tsirkin
2018-03-15  1:15             ` Wei Wang
2018-03-15  1:15               ` [Qemu-devel] " Wei Wang
2018-03-15  2:47               ` Michael S. Tsirkin
2018-03-15  2:47                 ` [Qemu-devel] " Michael S. Tsirkin
2018-03-15 10:24                 ` Wei Wang
2018-03-15 10:24                   ` [Qemu-devel] " Wei Wang
2018-03-15 13:53                   ` Michael S. Tsirkin
2018-03-15 13:53                     ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14 18:44   ` [Qemu-devel] " Dr. David Alan Gilbert
2018-03-07 12:34 ` [virtio-dev] [PATCH v4 4/4] migration: use the free page hint feature from balloon Wei Wang
2018-03-07 12:34   ` [Qemu-devel] " Wei Wang
2018-03-13 16:35   ` [virtio-dev] " Michael S. Tsirkin
2018-03-13 16:35     ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14  2:41     ` [virtio-dev] " Wei Wang
2018-03-14  2:41       ` [Qemu-devel] " Wei Wang
2018-03-14  2:51       ` [virtio-dev] " Michael S. Tsirkin
2018-03-14  2:51         ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14  6:50         ` [virtio-dev] " Wei Wang
2018-03-14  6:50           ` [Qemu-devel] " Wei Wang
2018-03-14 14:45           ` [virtio-dev] " Michael S. Tsirkin
2018-03-14 14:45             ` [Qemu-devel] " Michael S. Tsirkin
2018-03-14 19:49   ` Dr. David Alan Gilbert
2018-03-16 11:20     ` Wei Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5AAA54A6.9020900@intel.com \
    --to=wei.w.wang@intel.com \
    --cc=dgilbert@redhat.com \
    --cc=liliang.opensource@gmail.com \
    --cc=mst@redhat.com \
    --cc=nilal@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=quan.xu0@gmail.com \
    --cc=quintela@redhat.com \
    --cc=riel@redhat.com \
    --cc=virtio-dev@lists.oasis-open.org \
    --cc=yang.zhang.wz@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.