From: David Gibson <david@gibson.dropbear.id.au>
To: "Cédric Le Goater" <clg@kaod.org>
Cc: groug@kaod.org, abologna@redhat.com, qemu-ppc@nongnu.org,
qemu-devel@nongnu.org, aik@ozlabs.ru
Subject: Re: [Qemu-devel] [PATCH 8/9] spapr: Limit available pagesizes to provide a consistent guest environment
Date: Thu, 21 Jun 2018 23:58:10 +1000
Message-ID: <20180621135810.GC2009@umbus.fritz.box>
In-Reply-To: <471b50ee-8f18-9fcc-10c7-8f5bca8c8cd3@kaod.org>
On Thu, Jun 21, 2018 at 02:50:32PM +0200, Cédric Le Goater wrote:
> On 06/21/2018 01:52 PM, David Gibson wrote:
> > On Thu, Jun 21, 2018 at 09:01:27AM +0200, Cédric Le Goater wrote:
> >> On 06/18/2018 08:36 AM, David Gibson wrote:
> >>> KVM HV has some limitations (deriving from the hardware) that mean not all
> >>> host-cpu supported pagesizes may be usable in the guest. At present this
> >>> means that KVM guests and TCG guests may see different available page sizes
> >>> even if they notionally have the same vcpu model. This is confusing and
> >>> also prevents migration between TCG and KVM.
> >>>
> >>> This patch makes the environment consistent by always allowing the same set
> >>> of pagesizes. Since we can't remove the KVM limitations, we do this by
> >>> always applying the same limitations it has, even to TCG guests.
> >>>
> >>> Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
> >>>
> >>> ---
> >>> hw/ppc/spapr_caps.c | 33 +++++++++++++++++++++++++++++++++
> >>> 1 file changed, 33 insertions(+)
> >>>
> >>> diff --git a/hw/ppc/spapr_caps.c b/hw/ppc/spapr_caps.c
> >>> index 9fc739b3f5..0584c7c6ab 100644
> >>> --- a/hw/ppc/spapr_caps.c
> >>> +++ b/hw/ppc/spapr_caps.c
> >>> @@ -334,6 +334,38 @@ static void cap_hpt_maxpagesize_apply(sPAPRMachineState *spapr,
> >>> spapr_check_pagesize(spapr, qemu_getrampagesize(), errp);
> >>> }
> >>>
> >>> +static bool spapr_pagesize_cb(void *opaque, uint32_t seg_pshift, uint32_t pshift)
> >>> +{
> >>> + unsigned maxshift = *((unsigned *)opaque);
> >>> +
> >>> + assert(pshift >= seg_pshift);
> >>
> >> you could check that elsewhere.
> >
> > Um.. I'm not sure what you're getting at.
>
> you could put the assert in ppc_hash64_filter_pagesizes(), that is where
> the parameters are coming from.

Yes, but it's here that we're relying on that fact. Checking an
assumption at the point that depends on it is kind of the point of
assert()s.

>
> >>> + /* Don't allow the guest to use pages bigger than the configured
> >>> + * maximum size */
> >>> + if (pshift > maxshift) {
> >>> + return false;
> >>> + }
> >>> +
> >>> + /* For whatever reason, KVM doesn't allow multiple pagesizes
> >>> + * within a segment, *except* for the case of 16M pages in a 4k or
> >>> + * 64k segment. Always exclude other cases, so that TCG and KVM
> >>> + * guests see a consistent environment */
> >>> + if ((pshift != seg_pshift) && (pshift != 24)) {
> >>> + return false;
> >>> + }
> >
> > Note the stanza above, I'll refer to it below.
>
> ok.
>
> >
> >>> +
> >>> + return true;
> >>> +}
> >>
> >> So, do we really need ppc_hash64_filter_pagesizes() to have a callback ?
> >
> > I agree that it seems overly involved, but it was the best way I could
> > see to logically separate the TCG / softmmu specific logic from the
> > spapr specific logic.
>
> ok. I agree then.
>
> Reviewed-by: Cédric Le Goater <clg@kaod.org>
>
> Thanks,
>
> C.
>
> >> It seems that we only use the routine once in the patchset and that the
> >> only thing we need to check is 'maxshift'.
> >
> > Not quite. An earlier draft had this routine just take a max page
> > size and clamp accordingly. But that failed when I wrote the code to
> > check against the KVM capabilities, because KVM also excludes some
> > other pagesize combinations. That's what the stanza I point out above
> > is about.
> >
> >> Do you envision other usage of the routine ?
> >
> > Not really, no.
> >
> >>
> >> Thanks,
> >>
> >> C.
> >>
> >>> +static void cap_hpt_maxpagesize_cpu_apply(sPAPRMachineState *spapr,
> >>> + PowerPCCPU *cpu,
> >>> + uint8_t val, Error **errp)
> >>> +{
> >>> + unsigned maxshift = val;
> >>> +
> >>> + ppc_hash64_filter_pagesizes(cpu, spapr_pagesize_cb, &maxshift);
> >>> +}
> >>> +
> >>> sPAPRCapabilityInfo capability_table[SPAPR_CAP_NUM] = {
> >>> [SPAPR_CAP_HTM] = {
> >>> .name = "htm",
> >>> @@ -401,6 +433,7 @@ sPAPRCapabilityInfo capability_table[SPAPR_CAP_NUM] = {
> >>> .set = spapr_cap_set_pagesize,
> >>> .type = "int",
> >>> .apply = cap_hpt_maxpagesize_apply,
> >>> + .cpu_apply = cap_hpt_maxpagesize_cpu_apply,
> >>> },
> >>> };
> >>>
> >>>
> >>
> >
>
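
To make the rule in the stanza above concrete, here is a minimal standalone
sketch of the same predicate applied to a plain array of (segment shift,
page shift) pairs. It is only an illustration, not what the patch does
internally: struct pagesize_pair, pagesize_allowed() and filter_pairs() are
hypothetical names invented for this example, and the real filtering is done
by ppc_hash64_filter_pagesizes() calling spapr_pagesize_cb().

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for one (segment size, page size) encoding. */
struct pagesize_pair {
    uint32_t seg_pshift;   /* log2 of the segment's base page size */
    uint32_t pshift;       /* log2 of the actual page size */
};

/*
 * Same three checks as spapr_pagesize_cb() in the patch, restated
 * without the opaque pointer so it can be tried in isolation.
 */
static bool pagesize_allowed(uint32_t maxshift,
                             uint32_t seg_pshift, uint32_t pshift)
{
    /* The rule below relies on this, so it is asserted here. */
    assert(pshift >= seg_pshift);

    /* Nothing larger than the configured maximum page size. */
    if (pshift > maxshift) {
        return false;
    }

    /* Mixed sizes within a segment only for 16M (2^24) pages. */
    if ((pshift != seg_pshift) && (pshift != 24)) {
        return false;
    }

    return true;
}

/* Compact the array in place and return how many pairs were kept. */
static size_t filter_pairs(struct pagesize_pair *pairs, size_t n,
                           uint32_t maxshift)
{
    size_t kept = 0;

    for (size_t i = 0; i < n; i++) {
        if (pagesize_allowed(maxshift, pairs[i].seg_pshift,
                             pairs[i].pshift)) {
            pairs[kept++] = pairs[i];
        }
    }
    return kept;
}
```

With a maximum shift of 24, a 64k page in a 4k segment (12/16) is dropped
while 16M pages in 4k or 64k segments (12/24, 16/24) survive, which is the
case a plain "clamp to max size" could not express.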
--
David Gibson | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson