qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Peter Lieven <pl@kamp.de>
To: Alexey Kardashevskiy <aik@ozlabs.ru>
Cc: Paolo Bonzini <pbonzini@redhat.com>,
	"qemu-ppc@nongnu.org" <qemu-ppc@nongnu.org>,
	"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>,
	David Gibson <david@gibson.dropbear.id.au>
Subject: Re: [Qemu-devel] broken incoming migration
Date: Tue, 04 Jun 2013 12:56:19 +0200	[thread overview]
Message-ID: <51ADC7D3.8010008@kamp.de> (raw)
In-Reply-To: <51AC6A26.7060309@ozlabs.ru>

On 03.06.2013 12:04, Alexey Kardashevskiy wrote:
> On 05/31/2013 12:38 AM, Peter Lieven wrote:
>>
>>
>> Am 30.05.2013 um 15:41 schrieb "Paolo Bonzini" <pbonzini@redhat.com>:
>>
>>> Il 30/05/2013 11:08, Peter Lieven ha scritto:
>>>> Am 30.05.2013 10:18, schrieb Alexey Kardashevskiy:
>>>>> On 05/30/2013 05:49 PM, Paolo Bonzini wrote:
>>>>>> Il 30/05/2013 09:44, Alexey Kardashevskiy ha scritto:
>>>>>>> Hi!
>>>>>>>
>>>>>>> I found the migration broken on pseries platform, specifically, this patch
>>>>>>> broke it:
>>>>>>>
>>>>>>> f1c72795af573b24a7da5eb52375c9aba8a37972
>>>>>>> migration: do not sent zero pages in bulk stage
>>>>>>>
>>>>>>> The idea is not to send zero pages to the destination guest which is
>>>>>>> expected to have 100% empty RAM.
>>>>>>>
>>>>>>> However on pseries plaftorm the guest always has some stuff in the RAM as a
>>>>>>> part of initialization (device tree, system firmware and rtas (?)) so it is
>>>>>>> not completely empty. As the source guest cannot detect this, it skips some
>>>>>>> pages during migration and we get a broken destination guest. Bug.
>>>>>>>
>>>>>>> While the idea is ok in general, I do not see any easy way to fix it as
>>>>>>> neither QEMUMachine::init nor QEMUMachine::reset callbacks has information
>>>>>>> about whether we are about to receive a migration or not (-incoming
>>>>>>> parameter) and we cannot move device-tree and system firmware
>>>>>>> initialization anywhere else.
>>>>>>>
>>>>>>> ram_bulk_stage is static and cannot be disabled from the platform
>>>>>>> initialization code.
>>>>>>>
>>>>>>> So what would the community suggest?
>>>>>> Revert the patch. :)
>>>>> I'll wait for 24 hours (forgot to cc: the author) and then post a revert
>>>>> patch :)
>>>> does this problem only occur on pseries emulation?
>>> Probably not.  On a PC, it would occur if you had 4K of zeros in the
>>> source BIOS but not in the destination BIOS.  When you reboot, the BIOS
>>> image is wrong.
>>>
>>>> not sending zero pages is not only a performance benefit it also makes
>>>> overcomitted memory usable. the madv_dontneed seems to kick in asynchronously
>>>> and memory is not available immediately.
>>> You could also scan the page for nonzero values before writing it.
>> i had this in mind, but then choosed the other approach.... turned out to be a bad idea.
>>
>> alexey: i will prepare a patch later today, could you then please verify it fixes your problem.
>
> Yes I can, where is the patch? :)

its on my todo for today. sorry, have been a bit busy lately.

Peter

  reply	other threads:[~2013-06-04 10:56 UTC|newest]

Thread overview: 49+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-05-30  7:44 [Qemu-devel] broken incoming migration Alexey Kardashevskiy
2013-05-30  7:49 ` Alexey Kardashevskiy
2013-05-30  7:49 ` Paolo Bonzini
2013-05-30  8:18   ` Alexey Kardashevskiy
2013-05-30  9:08     ` Peter Lieven
2013-05-30  9:31       ` Alexey Kardashevskiy
2013-05-30 13:00       ` Paolo Bonzini
2013-05-30 13:38         ` Alexey Kardashevskiy
2013-05-30 14:08           ` Paolo Bonzini
2013-05-30 14:38         ` Peter Lieven
2013-05-30 14:41           ` Paolo Bonzini
2013-06-04 13:52             ` Peter Lieven
2013-06-04 14:14               ` Paolo Bonzini
2013-06-04 14:38                 ` Peter Lieven
2013-06-04 14:40                   ` Paolo Bonzini
2013-06-04 14:48                     ` Peter Lieven
2013-06-04 15:17                       ` Paolo Bonzini
2013-06-04 19:15                         ` Peter Lieven
2013-06-05  3:37                           ` Alexey Kardashevskiy
2013-06-05  6:09                             ` Peter Lieven
2013-06-09  4:12                               ` liu ping fan
2013-06-09  7:22                                 ` Peter Lieven
2013-06-04 15:10                     ` Peter Lieven
2013-06-08  8:27                       ` Wenchao Xia
2013-06-08  8:30                         ` Alexey Kardashevskiy
2013-06-09  2:16                           ` Wenchao Xia
2013-06-09  2:34                             ` Alexey Kardashevskiy
2013-06-09  2:52                               ` [Qemu-devel] [Qemu-ppc] " Benjamin Herrenschmidt
2013-06-09  3:01                                 ` Alexey Kardashevskiy
2013-06-09  3:01                               ` [Qemu-devel] " Wenchao Xia
2013-06-09  3:09                                 ` Alexey Kardashevskiy
2013-06-09  3:31                                   ` Wenchao Xia
2013-06-09  7:27                                   ` Peter Lieven
2013-06-10  6:39                                     ` Alexey Kardashevskiy
2013-06-10  6:50                                       ` Peter Lieven
2013-06-10  6:55                                         ` Alexey Kardashevskiy
2013-06-10  8:44                                           ` Peter Lieven
2013-06-10  9:10                                             ` Alexey Kardashevskiy
2013-06-10  9:33                                               ` [Qemu-devel] [Qemu-ppc] " Benjamin Herrenschmidt
2013-06-10  9:42                                                 ` Peter Lieven
2013-06-09  2:53                             ` Benjamin Herrenschmidt
2013-06-12 14:00                               ` Paolo Bonzini
2013-06-12 14:11                                 ` Benjamin Herrenschmidt
2013-06-12 20:10                                   ` Paolo Bonzini
2013-06-13  2:41                                     ` Wenchao Xia
2013-06-03 10:04           ` [Qemu-devel] " Alexey Kardashevskiy
2013-06-04 10:56             ` Peter Lieven [this message]
2013-06-08  8:24         ` Wenchao Xia
2013-05-30 10:18 ` Peter Maydell

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=51ADC7D3.8010008@kamp.de \
    --to=pl@kamp.de \
    --cc=aik@ozlabs.ru \
    --cc=david@gibson.dropbear.id.au \
    --cc=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-ppc@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).