All of lore.kernel.org
 help / color / mirror / Atom feed
From: Greg Kurz <groug@kaod.org>
To: Thomas Huth <thuth@redhat.com>
Cc: David Gibson <david@gibson.dropbear.id.au>,
	qemu-ppc@nongnu.org, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH] ppc: Yet another fix for the huge page support detection mechanism
Date: Fri, 15 Jul 2016 17:18:29 +0200	[thread overview]
Message-ID: <20160715171829.0a9dfd16@bahia.lan> (raw)
In-Reply-To: <5a9731af-6636-7031-4d50-20815cdfb5e0@redhat.com>

[-- Attachment #1: Type: text/plain, Size: 3895 bytes --]

On Fri, 15 Jul 2016 14:28:44 +0200
Thomas Huth <thuth@redhat.com> wrote:

> On 15.07.2016 10:35, David Gibson wrote:
> > On Fri, Jul 15, 2016 at 10:10:25AM +0200, Thomas Huth wrote:  
> >> Commit 86b50f2e1bef ("Disable huge page support if it is not available
> >> for main RAM") already made sure that huge page support is not announced
> >> to the guest if the normal RAM of non-NUMA configurations is not backed
> >> by a huge page filesystem. However, there is one more case that can go
> >> wrong: NUMA is enabled, but the RAM of the NUMA nodes are not configured
> >> with huge page support (and only the memory of a DIMM is configured with
> >> it). When QEMU is started with the following command line for example,
> >> the Linux guest currently crashes because it is trying to use huge pages
> >> on a memory region that does not support huge pages:
> >>
> >>  qemu-system-ppc64 -enable-kvm ... -m 1G,slots=4,maxmem=32G -object \
> >>    memory-backend-file,policy=default,mem-path=/hugepages,size=1G,id=mem-mem1 \
> >>    -device pc-dimm,id=dimm-mem1,memdev=mem-mem1 -smp 2 \
> >>    -numa node,nodeid=0 -numa node,nodeid=1
> >>
> >> To fix this issue, we've got to make sure to disable huge page support,
> >> too, when there is a NUMA node that is not using a memory backend with
> >> huge page support.
> >>
> >> Fixes: 86b50f2e1befc33407bdfeb6f45f7b0d2439a740
> >> Signed-off-by: Thomas Huth <thuth@redhat.com>
> >> ---
> >>  target-ppc/kvm.c | 10 +++++++---
> >>  1 file changed, 7 insertions(+), 3 deletions(-)
> >>
> >> diff --git a/target-ppc/kvm.c b/target-ppc/kvm.c
> >> index 884d564..7a8f555 100644
> >> --- a/target-ppc/kvm.c
> >> +++ b/target-ppc/kvm.c
> >> @@ -389,12 +389,16 @@ static long getrampagesize(void)
> >>  
> >>      object_child_foreach(memdev_root, find_max_supported_pagesize, &hpsize);
> >>  
> >> -    if (hpsize == LONG_MAX) {
> >> +    if (hpsize == LONG_MAX || hpsize == getpagesize()) {
> >>          return getpagesize();
> >>      }
> >>  
> >> -    if (nb_numa_nodes == 0 && hpsize > getpagesize()) {
> >> -        /* No NUMA nodes and normal RAM without -mem-path ==> no huge pages! */
> >> +    /* If NUMA is disabled or the NUMA nodes are not backed with a
> >> +     * memory-backend, then there is at least one node using "normal"
> >> +     * RAM. And since normal RAM has not been configured with "-mem-path"
> >> +     * (what we've checked earlier here already), we can not use huge pages!
> >> +     */
> >> +    if (nb_numa_nodes == 0 || numa_info[0].node_memdev == NULL) {  
> > 
> > Is that second clause sufficient, or do you need to loop through and
> > check the memdev of every node?  
> 
> Checking the first entry should be sufficient. QEMU forces you to
> specify either a memory backend for all NUMA nodes (which we should have
> looked at during the object_child_foreach() some lines earlier), or you
> must not specify a memory backend for any NUMA node at all. You can not
> mix the settings, so checking numa_info[0] is enough.
> 
>  Thomas
> 
> 

And what happens if we specify a hugepage memdev backend to one of the
nodes and a regular RAM memdev backend to the other ?

I actually wanted to try that but I hit an assertion, which isn't
related to this patch I think:

qemu-system-ppc64: memory.c:1934: memory_region_add_subregion_common: 
   Assertion `!subregion->container' failed.

So I tried to trick the logic you are trying to fix the other way
round:

-mem-path /dev/hugepages \
-m 1G,slots=4,maxmem=32G \
-object memory-backend-ram,policy=default,size=1G,id=mem-mem1 \
-device pc-dimm,id=dimm-mem1,memdev=mem-mem1 \
-smp 2 \
-numa node,nodeid=0 -numa node,nodeid=1

The guest fails the same way as before your patch: the hugepage size is
advertised to the guest, but the numa node is associated to regular ram.

--
Greg

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 181 bytes --]

  reply	other threads:[~2016-07-15 15:18 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-07-15  8:10 [Qemu-devel] [PATCH] ppc: Yet another fix for the huge page support detection mechanism Thomas Huth
2016-07-15  8:35 ` David Gibson
2016-07-15 12:28   ` Thomas Huth
2016-07-15 15:18     ` Greg Kurz [this message]
2016-07-15 15:54       ` Thomas Huth
2016-07-15 16:31         ` Greg Kurz
2016-07-15  9:28 ` Greg Kurz
2016-07-18  9:21   ` Thomas Huth
2016-07-18  9:36     ` Greg Kurz
2016-07-18  0:52 ` David Gibson
2016-07-18  8:59   ` [Qemu-devel] [Qemu-ppc] " Greg Kurz
2016-07-18  9:04     ` Thomas Huth
2016-07-18  9:26       ` Greg Kurz
2016-07-18  9:33         ` Thomas Huth
2016-07-18 10:44           ` Greg Kurz
2016-07-18 13:16             ` [Qemu-devel] assert in memory.c line 1934 (was: Yet another fix for the huge page support detection mechanism) Thomas Huth
2016-07-18 13:23             ` [Qemu-devel] [Qemu-ppc] [PATCH] ppc: Yet another fix for the huge page support detection mechanism Greg Kurz
2016-07-18  9:21     ` David Gibson
2016-07-18 10:01       ` Greg Kurz

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160715171829.0a9dfd16@bahia.lan \
    --to=groug@kaod.org \
    --cc=david@gibson.dropbear.id.au \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-ppc@nongnu.org \
    --cc=thuth@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.