qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: David Gibson <david@gibson.dropbear.id.au>
To: Laurent Vivier <lvivier@redhat.com>
Cc: agraf@suse.de, mst@redhat.com, abologna@redhat.com,
	aik@ozlabs.ru, qemu-devel@nongnu.org, qemu-ppc@nongnu.org,
	mdroth@linux.vnet.ibm.com, benh@kernel.crashing.org
Subject: Re: [Qemu-devel] [PATCHv2 5/7] spapr: Adjust placement of PCI host bridge to allow > 1TiB RAM
Date: Thu, 13 Oct 2016 10:48:41 +1100	[thread overview]
Message-ID: <20161012234841.GC18039@umbus.fritz.box> (raw)
In-Reply-To: <b14796de-2042-366f-1e65-eb16c80a1601@redhat.com>

[-- Attachment #1: Type: text/plain, Size: 3961 bytes --]

On Wed, Oct 12, 2016 at 02:06:10PM +0200, Laurent Vivier wrote:
> 
> 
> On 12/10/2016 12:55, David Gibson wrote:
> > On Wed, Oct 12, 2016 at 12:07:50PM +0200, Laurent Vivier wrote:
> >>
> >>
> >> On 12/10/2016 06:44, David Gibson wrote:
> >>> Currently the default PCI host bridge for the 'pseries' machine type is
> >>> constructed with its IO windows in the 1TiB..(1TiB + 64GiB) range in
> >>> guest memory space.  This means that if > 1TiB of guest RAM is specified,
> >>> the RAM will collide with the PCI IO windows, causing serious problems.
> >>>
> >>> Problems won't be obvious until guest RAM goes a bit beyond 1TiB, because
> >>> there's a little unused space at the bottom of the area reserved for PCI,
> >>> but essentially this means that > 1TiB of RAM has never worked with the
> >>> pseries machine type.
> >>>
> >>> This patch fixes this by altering the placement of PHBs on large-RAM VMs.
> >>> Instead of always placing the first PHB at 1TiB, it is placed at the next
> >>> 1 TiB boundary after the maximum RAM address.
> >>>
> >>> Technically, this changes behaviour in a migration-breaking way for
> >>> existing machines with > 1TiB maximum memory, but since having > 1 TiB
> >>> memory was broken anyway, this seems like a reasonable trade-off.
> >>>
> >>> Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
> >>> ---
> >>>  hw/ppc/spapr.c | 12 ++++++++++--
> >>>  1 file changed, 10 insertions(+), 2 deletions(-)
> >>>
> >>> diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
> >>> index f6e9c2a..7cb167c 100644
> >>> --- a/hw/ppc/spapr.c
> >>> +++ b/hw/ppc/spapr.c
> >>> @@ -2376,15 +2376,23 @@ static void spapr_phb_placement(sPAPRMachineState *spapr, uint32_t index,
> >>>                                  unsigned n_dma, uint32_t *liobns, Error **errp)
> >>>  {
> >>>      const uint64_t base_buid = 0x800000020000000ULL;
> >>> -    const hwaddr phb0_base = 0x10000000000ULL; /* 1 TiB */
> >>>      const hwaddr phb_spacing = 0x1000000000ULL; /* 64 GiB */
> >>>      const hwaddr mmio_offset = 0xa0000000; /* 2 GiB + 512 MiB */
> >>>      const hwaddr pio_offset = 0x80000000; /* 2 GiB */
> >>>      const uint32_t max_index = 255;
> >>> +    const hwaddr phb0_alignment = 0x10000000000ULL; /* 1 TiB */
> >>>  
> >>> -    hwaddr phb_base;
> >>> +    uint64_t ram_top = MACHINE(spapr)->ram_size;
> >>> +    hwaddr phb0_base, phb_base;
> >>>      int i;
> >>>  
> >>> +    if (MACHINE(spapr)->maxram_size > ram_top) {
> >>> +        ram_top = spapr->hotplug_memory.base +
> >>> +            memory_region_size(&spapr->hotplug_memory.mr);
> >>> +    }
> >>
> >> Why don't you set directly ram_top to maxram_size?
> > 
> > Because there may be an alignment gap between ram_size and the bottom
> > of the hotplug region.
> 
> Perhaps you could add a comment why we have this check:
> 
> when machine->ram_size == machine->maxram_size, there is no hotpluggable
> memory.

Good idea, I've added something.

> BTW, something strange in hotpluggable memory in hw/ppc/spapr.c:
> 
>      /* initialize hotplug memory address space */
>      if (machine->ram_size < machine->maxram_size) {
>          ram_addr_t hotplug_mem_size = machine->maxram_size -
> machine->ram_size;
> ...
>          spapr->hotplug_memory.base = ROUND_UP(machine->ram_size,
>                                                SPAPR_HOTPLUG_MEM_ALIGN);
>          memory_region_init(&spapr->hotplug_memory.mr, OBJECT(spapr),
>                             "hotplug-memory", hotplug_mem_size);
> 
> So the end of hotpluggable memory can be beyond maxram_size, is that normal?

Yes.  maxram_size is the total possible amount of RAM not counting
gaps, rather than the maximum possible RAM address.

-- 
David Gibson			| I'll have my music baroque, and my code
david AT gibson.dropbear.id.au	| minimalist, thank you.  NOT _the_ _other_
				| _way_ _around_!
http://www.ozlabs.org/~dgibson

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 819 bytes --]

  reply	other threads:[~2016-10-12 23:55 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-10-12  4:44 [Qemu-devel] [PATCHv2 0/7] Improve PCI IO window orgnaization for pseries David Gibson
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 1/7] libqos: Isolate knowledge of spapr memory map to qpci_init_spapr() David Gibson
2016-10-12  8:00   ` Laurent Vivier
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 2/7] libqos: Correct error in PCI hole sizing for spapr David Gibson
2016-10-12  8:07   ` Laurent Vivier
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 3/7] libqos: Limit spapr-pci to 32-bit MMIO for now David Gibson
2016-10-12  8:20   ` Laurent Vivier
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 4/7] spapr_pci: Delegate placement of PCI host bridges to machine type David Gibson
2016-10-12  9:26   ` Laurent Vivier
2016-10-12 23:25     ` David Gibson
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 5/7] spapr: Adjust placement of PCI host bridge to allow > 1TiB RAM David Gibson
2016-10-12 10:07   ` Laurent Vivier
2016-10-12 10:55     ` David Gibson
2016-10-12 12:06       ` Laurent Vivier
2016-10-12 23:48         ` David Gibson [this message]
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 6/7] spapr_pci: Add a 64-bit MMIO window David Gibson
2016-10-12 13:39   ` Laurent Vivier
2016-10-12  4:44 ` [Qemu-devel] [PATCHv2 7/7] spapr: Improved placement of PCI host bridges in guest memory map David Gibson
2016-10-12  8:26 ` [Qemu-devel] [PATCHv2 0/7] Improve PCI IO window orgnaization for pseries no-reply

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20161012234841.GC18039@umbus.fritz.box \
    --to=david@gibson.dropbear.id.au \
    --cc=abologna@redhat.com \
    --cc=agraf@suse.de \
    --cc=aik@ozlabs.ru \
    --cc=benh@kernel.crashing.org \
    --cc=lvivier@redhat.com \
    --cc=mdroth@linux.vnet.ibm.com \
    --cc=mst@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-ppc@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).