qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [Qemu-devel] [PATCH 5/7] migration: Fix the migrate auto converge process
@ 2014-02-28  4:11 Gonglei (Arei)
  2014-02-28 10:16 ` Dr. David Alan Gilbert
  0 siblings, 1 reply; 3+ messages in thread
From: Gonglei (Arei) @ 2014-02-28  4:11 UTC (permalink / raw)
  To: qemu-devel@nongnu.org
  Cc: Peter Maydell, Juan Quintela, pl@kamp.de, owasserm@redhat.com,
	aliguori@amazon.com, chenliang (T), pbonzini@redhat.com

It is inaccuracy and complex that using the transfer speed of
migration thread to determine whether the convergence migration.
The dirty page may be compressed by XBZRLE or ZERO_PAGE.The counter
of updating dirty bitmap will be increasing continuously if the
migration can't convergence.

Signed-off-by: ChenLiang <chenliang88@huawei.com>
Signed-off-by: Gonglei <arei.gonglei@huawei.com>
---
 arch_init.c | 26 +++-----------------------
 1 file changed, 3 insertions(+), 23 deletions(-)

diff --git a/arch_init.c b/arch_init.c
index fc71331..2211e0b 100644
--- a/arch_init.c
+++ b/arch_init.c
@@ -107,7 +107,6 @@ int graphic_depth = 32;
 
 const uint32_t arch_type = QEMU_ARCH;
 static bool mig_throttle_on;
-static int dirty_rate_high_cnt;
 static void check_guest_throttling(void);
 
 static uint64_t bitmap_sync_cnt;
@@ -464,17 +463,11 @@ static void migration_bitmap_sync(void)
     uint64_t num_dirty_pages_init = migration_dirty_pages;
     MigrationState *s = migrate_get_current();
     static int64_t start_time;
-    static int64_t bytes_xfer_prev;
     static int64_t num_dirty_pages_period;
     int64_t end_time;
-    int64_t bytes_xfer_now;
 
     increase_bitmap_sync_cnt();
 
-    if (!bytes_xfer_prev) {
-        bytes_xfer_prev = ram_bytes_transferred();
-    }
-
     if (!start_time) {
         start_time = qemu_clock_get_ms(QEMU_CLOCK_REALTIME);
     }
@@ -493,21 +486,9 @@ static void migration_bitmap_sync(void)
     /* more than 1 second = 1000 millisecons */
     if (end_time > start_time + 1000) {
         if (migrate_auto_converge()) {
-            /* The following detection logic can be refined later. For now:
-               Check to see if the dirtied bytes is 50% more than the approx.
-               amount of bytes that just got transferred since the last time we
-               were in this routine. If that happens >N times (for now N==4)
-               we turn on the throttle down logic */
-            bytes_xfer_now = ram_bytes_transferred();
-            if (s->dirty_pages_rate &&
-               (num_dirty_pages_period * TARGET_PAGE_SIZE >
-                   (bytes_xfer_now - bytes_xfer_prev)/2) &&
-               (dirty_rate_high_cnt++ > 4)) {
-                    trace_migration_throttle();
-                    mig_throttle_on = true;
-                    dirty_rate_high_cnt = 0;
-             }
-             bytes_xfer_prev = bytes_xfer_now;
+            if (get_bitmap_sync_cnt() > 15) {
+                mig_throttle_on = true;
+            }
         } else {
              mig_throttle_on = false;
         }
@@ -712,7 +693,6 @@ static int ram_save_setup(QEMUFile *f, void *opaque)
     bitmap_set(migration_bitmap, 0, ram_pages);
     migration_dirty_pages = ram_pages;
     mig_throttle_on = false;
-    dirty_rate_high_cnt = 0;
     reset_bitmap_sync_cnt();
 
     if (migrate_use_xbzrle()) {
-- 
1.7.12.4

Best regards,
-Gonglei

^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [Qemu-devel] [PATCH 5/7] migration: Fix the migrate auto converge process
  2014-02-28  4:11 [Qemu-devel] [PATCH 5/7] migration: Fix the migrate auto converge process Gonglei (Arei)
@ 2014-02-28 10:16 ` Dr. David Alan Gilbert
  2014-02-28 11:03   ` Gonglei
  0 siblings, 1 reply; 3+ messages in thread
From: Dr. David Alan Gilbert @ 2014-02-28 10:16 UTC (permalink / raw)
  To: Gonglei (Arei)
  Cc: chenliang (T), Peter Maydell, Juan Quintela, pl@kamp.de,
	qemu-devel@nongnu.org, aliguori@amazon.com, pbonzini@redhat.com

* Gonglei (Arei) (arei.gonglei@huawei.com) wrote:
> It is inaccuracy and complex that using the transfer speed of
> migration thread to determine whether the convergence migration.
> The dirty page may be compressed by XBZRLE or ZERO_PAGE.The counter
> of updating dirty bitmap will be increasing continuously if the
> migration can't convergence.
> 
> Signed-off-by: ChenLiang <chenliang88@huawei.com>
> Signed-off-by: Gonglei <arei.gonglei@huawei.com>
> ---
>  arch_init.c | 26 +++-----------------------
>  1 file changed, 3 insertions(+), 23 deletions(-)
> 
> diff --git a/arch_init.c b/arch_init.c
> index fc71331..2211e0b 100644
> --- a/arch_init.c
> +++ b/arch_init.c
> @@ -107,7 +107,6 @@ int graphic_depth = 32;
>  
>  const uint32_t arch_type = QEMU_ARCH;
>  static bool mig_throttle_on;
> -static int dirty_rate_high_cnt;
>  static void check_guest_throttling(void);
>  
>  static uint64_t bitmap_sync_cnt;
> @@ -464,17 +463,11 @@ static void migration_bitmap_sync(void)
>      uint64_t num_dirty_pages_init = migration_dirty_pages;
>      MigrationState *s = migrate_get_current();
>      static int64_t start_time;
> -    static int64_t bytes_xfer_prev;
>      static int64_t num_dirty_pages_period;
>      int64_t end_time;
> -    int64_t bytes_xfer_now;
>  
>      increase_bitmap_sync_cnt();
>  
> -    if (!bytes_xfer_prev) {
> -        bytes_xfer_prev = ram_bytes_transferred();
> -    }
> -
>      if (!start_time) {
>          start_time = qemu_clock_get_ms(QEMU_CLOCK_REALTIME);
>      }
> @@ -493,21 +486,9 @@ static void migration_bitmap_sync(void)
>      /* more than 1 second = 1000 millisecons */
>      if (end_time > start_time + 1000) {
>          if (migrate_auto_converge()) {
> -            /* The following detection logic can be refined later. For now:
> -               Check to see if the dirtied bytes is 50% more than the approx.
> -               amount of bytes that just got transferred since the last time we
> -               were in this routine. If that happens >N times (for now N==4)
> -               we turn on the throttle down logic */
> -            bytes_xfer_now = ram_bytes_transferred();
> -            if (s->dirty_pages_rate &&
> -               (num_dirty_pages_period * TARGET_PAGE_SIZE >
> -                   (bytes_xfer_now - bytes_xfer_prev)/2) &&
> -               (dirty_rate_high_cnt++ > 4)) {
> -                    trace_migration_throttle();
> -                    mig_throttle_on = true;
> -                    dirty_rate_high_cnt = 0;
> -             }
> -             bytes_xfer_prev = bytes_xfer_now;
> +            if (get_bitmap_sync_cnt() > 15) {
> +                mig_throttle_on = true;
> +            }

That is a lot simpler, and I suspect as good - again I'd
move that magic '15' to a constant somewhere.

What have you tested this on - have you tested with really big RAM VMs?
What's it's behaviour like with rate-limiting?

Dave
> 
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Qemu-devel] [PATCH 5/7] migration: Fix the migrate auto converge process
  2014-02-28 10:16 ` Dr. David Alan Gilbert
@ 2014-02-28 11:03   ` Gonglei
  0 siblings, 0 replies; 3+ messages in thread
From: Gonglei @ 2014-02-28 11:03 UTC (permalink / raw)
  To: Dr. David Alan Gilbert
  Cc: chenliang (T), Peter Maydell, Juan Quintela, pl@kamp.de,
	qemu-devel@nongnu.org, aliguori@amazon.com, pbonzini@redhat.com

On 2014/2/28 18:16, Dr. David Alan Gilbert wrote:

> * Gonglei (Arei) (arei.gonglei@huawei.com) wrote:
>> It is inaccuracy and complex that using the transfer speed of
>> migration thread to determine whether the convergence migration.
>> The dirty page may be compressed by XBZRLE or ZERO_PAGE.The counter
>> of updating dirty bitmap will be increasing continuously if the
>> migration can't convergence.
>>
>> Signed-off-by: ChenLiang <chenliang88@huawei.com>
>> Signed-off-by: Gonglei <arei.gonglei@huawei.com>
>> ---
>>  arch_init.c | 26 +++-----------------------
>>  1 file changed, 3 insertions(+), 23 deletions(-)
>>
>> diff --git a/arch_init.c b/arch_init.c
>> index fc71331..2211e0b 100644
>> --- a/arch_init.c
>> +++ b/arch_init.c
>> @@ -107,7 +107,6 @@ int graphic_depth = 32;
>>  
>>  const uint32_t arch_type = QEMU_ARCH;
>>  static bool mig_throttle_on;
>> -static int dirty_rate_high_cnt;
>>  static void check_guest_throttling(void);
>>  
>>  static uint64_t bitmap_sync_cnt;
>> @@ -464,17 +463,11 @@ static void migration_bitmap_sync(void)
>>      uint64_t num_dirty_pages_init = migration_dirty_pages;
>>      MigrationState *s = migrate_get_current();
>>      static int64_t start_time;
>> -    static int64_t bytes_xfer_prev;
>>      static int64_t num_dirty_pages_period;
>>      int64_t end_time;
>> -    int64_t bytes_xfer_now;
>>  
>>      increase_bitmap_sync_cnt();
>>  
>> -    if (!bytes_xfer_prev) {
>> -        bytes_xfer_prev = ram_bytes_transferred();
>> -    }
>> -
>>      if (!start_time) {
>>          start_time = qemu_clock_get_ms(QEMU_CLOCK_REALTIME);
>>      }
>> @@ -493,21 +486,9 @@ static void migration_bitmap_sync(void)
>>      /* more than 1 second = 1000 millisecons */
>>      if (end_time > start_time + 1000) {
>>          if (migrate_auto_converge()) {
>> -            /* The following detection logic can be refined later. For now:
>> -               Check to see if the dirtied bytes is 50% more than the approx.
>> -               amount of bytes that just got transferred since the last time we
>> -               were in this routine. If that happens >N times (for now N==4)
>> -               we turn on the throttle down logic */
>> -            bytes_xfer_now = ram_bytes_transferred();
>> -            if (s->dirty_pages_rate &&
>> -               (num_dirty_pages_period * TARGET_PAGE_SIZE >
>> -                   (bytes_xfer_now - bytes_xfer_prev)/2) &&
>> -               (dirty_rate_high_cnt++ > 4)) {
>> -                    trace_migration_throttle();
>> -                    mig_throttle_on = true;
>> -                    dirty_rate_high_cnt = 0;
>> -             }
>> -             bytes_xfer_prev = bytes_xfer_now;
>> +            if (get_bitmap_sync_cnt() > 15) {
>> +                mig_throttle_on = true;
>> +            }
> 
> That is a lot simpler, and I suspect as good - again I'd
> move that magic '15' to a constant somewhere.
> 

Thanks, Check it.

> What have you tested this on - have you tested with really big RAM VMs?
> What's it's behaviour like with rate-limiting?
> 
> Dave

Yeah, We have tested it using 25G memory VM as the first mail said.

>>
> --
> Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK


Best regards,
-Gonglei

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2014-02-28 11:04 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-02-28  4:11 [Qemu-devel] [PATCH 5/7] migration: Fix the migrate auto converge process Gonglei (Arei)
2014-02-28 10:16 ` Dr. David Alan Gilbert
2014-02-28 11:03   ` Gonglei

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).