From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:54034) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1c7gus-0007Sm-Kt for qemu-devel@nongnu.org; Fri, 18 Nov 2016 06:01:11 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1c7gur-0006tP-Hn for qemu-devel@nongnu.org; Fri, 18 Nov 2016 06:01:10 -0500 Date: Fri, 18 Nov 2016 16:45:05 +1100 From: David Gibson Message-ID: <20161118054505.GD31640@umbus.fritz.box> References: <1479433227-29238-1-git-send-email-mdroth@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="rqzD5py0kzyFAOWN" Content-Disposition: inline In-Reply-To: <1479433227-29238-1-git-send-email-mdroth@linux.vnet.ibm.com> Subject: Re: [Qemu-devel] [PATCH for-2.8 0/3] spapr: fix breakage of memory unplug after migration List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Michael Roth Cc: qemu-devel@nongnu.org, qemu-ppc@nongnu.org, duanj@linux.vnet.ibm.com, bharata@linux.vnet.ibm.com, dgilbert@redhat.com, quintela@redhat.com, amit.shah@redhat.com --rqzD5py0kzyFAOWN Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, Nov 17, 2016 at 07:40:24PM -0600, Michael Roth wrote: > These patches are based on David's ppc-for-2.8 tree, and are also > available from: >=20 > https://github.com/mdroth/qemu/commits/spapr-cas-migration >=20 > Currently, memory hotplugged to a pseries guest cannot be removed after > the guest has been migrated. This is due to 2 issues: >=20 > 1) The coldplugged state of memory on the target side is one where the > corresponding DRC's allocation state is: >=20 > allocation_state =3D=3D unallocated, > awaiting_allocation =3D=3D true, >=20 > When the guest attempts to unplug memory on the target side, it first > checks that allocation_state =3D=3D allocated. If we fix this, the gue= st > can successfully notify QEMU of completion on it's end, but then the > DRC code sees that awaiting_allocation =3D=3D true, so it defers the > finalizing of the LMB and corresponding DIMM since it assumes that > the DIMM must have been previously allocated before it can be removed. >=20 > To address this, we pull in patches 1-2 from Jian Jun's DRC migration > series: >=20 > https://lists.gnu.org/archive/html/qemu-ppc/2016-10/msg00048.html >=20 > with some minor changes relating to prior review comments, and > the addition of migrating the DRC's awaiting_allocation value, which > wasn't part of the original patch. This doesn't address the full scope > of the issues Jian Jun was looking at (involving synchronizing state > when migration occurs during fairly small race windows), just this > particular case, which is more user visible since the time window is > indefinite. >=20 > 2) The ability to unplug memory is gated on the QEMU side by a check as > to whether or not support for newer-style hotplug events was negotiated > via CAS during boot. The check is performed by checking the correspond= ing > entry in the sPAPROptionVector structure. However, since this value is= n't > migrated currently, we are unable to unplug until after the guest rebo= ots. >=20 > We address that here by adding migration support for sPAPROptionVector= s, > and including the CAS-negotiated vector as part of the migration stream > for any cases where we advertise newer-style hotplug event support to > the guest. >=20 > David, >=20 > These fixes ended up going out much later than planned. I'm not sure > if you're planning another pull for 2.8 or not, and realize there are > some patches here not specifically pseries-related so it's > understandable if we opt to pursue these for 2.9/2.8.1 instead. But if > possible I'm hoping to get these in so that the memory unplug > support is fully functional for 2.8. Yeah, I'm still expecting to push a few bugfixes in before 2.8. So, I've merged these patches into ppc-for-2.8 (fixing a couple of trivial style nits along the way). I have a couple of comments that I'll make on the patches, but they're not important enough to stop these going in ASAP. Unfortunately, of course, this is not the only migration breakage we have at the moment. I'm presently wrestling with both breakage due to changes in the insns_flags masks, and due to the reworking of the mmio windows for the PHB. >=20 >=20 > hw/core/qdev.c | 6 +++++- > hw/ppc/spapr.c | 68 +++++++++++++++++++++++++++++++++++++++= +++++++++++++++++++++++++++++ > hw/ppc/spapr_drc.c | 70 +++++++++++++++++++++++++++++++++++++++= +++++++++++++++++++++++++++++++ > hw/ppc/spapr_ovec.c | 12 ++++++++++++ > hw/ppc/spapr_pci.c | 22 ++++++++++++++++++++++ > include/hw/ppc/spapr_drc.h | 9 +++++++++ > include/hw/ppc/spapr_ovec.h | 4 ++++ > include/hw/qdev-core.h | 9 +++++++++ > migration/savevm.c | 4 ++-- > 9 files changed, 201 insertions(+), 3 deletions(-) >=20 --=20 David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson --rqzD5py0kzyFAOWN Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAEBCAAGBQJYLpVfAAoJEGw4ysog2bOSnrQQAIJQctvhJGPsVNdE7JznJHIm XA+v/J/6l9c7Morl6PQPJ41gjdD8LuTOtKrVJXmBLqLbNdOn99xpHsh9NvIuxPJw DG2llFPxEPtyBxZlTGpvyr/IUPDBkV7cRTszZN3lAqIo0aRsUrQ875/ENcZBosQm scwFk/sss9g7YYGVva2UA2agOlBPtjgdVIfdJU/vV72DI7nz93woXkJ+Zpo9R49G EY5fkGAyq8B+7IiQDByY4JiP/9IrkHOjuIhyvrrTOVtWCtB+vWxRcPdAsbmRPELr b3+gfGdqO/7tZ1MT1j7jDSzS9T67md4RI9U4UYyvIhDR+ecbJCD+Kn7w0EiybeNU CazvS9tYXWIQgWrrNsEeJuu8Pqieq533JDicHf2MnPtQUysOckjJulIxFl6Ikpnp EyWA8tkvX0MFknjEBrbZT2TYmzDzwvHIET+Mkhiu0pafyt4l6cM6LxEzyMM7l9nX lth/85af2IOdEAQbkzksz/qhuRxR8xENTfIStna6cnBNOTptvO7D2XTFLLVlnHXt NyE71ajdOMyc6EAwOdIprzC2Pf5G0+/j15khE8kbqjd+Qt9fdMJw5i67mCIoku8Y mAKua8QR36NLTTlnZ25jWDVBAhWlpOI1/vK9g+P333X+g3lJ0oUgUF1nbH1H+k9y Nk5I0c0Ltrz672GIpMdM =IQF6 -----END PGP SIGNATURE----- --rqzD5py0kzyFAOWN--