* Xenheap placement in a NUMA system
@ 2015-09-28 12:28 Mario Lodde
2015-09-28 13:29 ` Andrew Cooper
0 siblings, 1 reply; 3+ messages in thread
From: Mario Lodde @ 2015-09-28 12:28 UTC (permalink / raw)
To: xen-devel
Hi everyone,
I’m new to Xen and I’m trying to understand how physical memory is managed. If I’m understanding correctly the source code, the hypervisor allocates the memory it needs only from the xenheap, while the rest of the memory is used by dom0 and the domUs.
If we consider a NUMA system, is the entire xenheap concentrated in the boot node or is it spread across the nodes?
In case it’s all stored in the boot node, and dom0’s memory is also contained entirely in the same node, is it correct to say that a failure in another node would just require to shut down the VMs that are running in that node but the rest of the system is still able to run?
Thanks in advance for your help.
Best regards,
Mario
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Xenheap placement in a NUMA system
2015-09-28 12:28 Xenheap placement in a NUMA system Mario Lodde
@ 2015-09-28 13:29 ` Andrew Cooper
2015-09-29 16:12 ` Mario Lodde
0 siblings, 1 reply; 3+ messages in thread
From: Andrew Cooper @ 2015-09-28 13:29 UTC (permalink / raw)
To: Mario Lodde, xen-devel
On 28/09/15 13:28, Mario Lodde wrote:
> Hi everyone,
>
> I’m new to Xen and I’m trying to understand how physical memory is managed. If I’m understanding correctly the source code, the hypervisor allocates the memory it needs only from the xenheap, while the rest of the memory is used by dom0 and the domUs.
To a first approximation, yes.
The difference between the xenheap and domheap is that the xenheap is
always mapped into virtual address space, while the domheap might not be.
It is therefore safe to create a pointer to an item in xenheap memory,
but domheap pages may only be accessed using {,un}map_domheap_page()
(and friends).
> If we consider a NUMA system, is the entire xenheap concentrated in the boot node or is it spread across the nodes?
This is architecture dependent. On x86, the xenheap limit is at the
directmap end, which puts it at the 5TiB physical boundary.
On a small machine (< 5TB of RAM), xenheap pages will be available
across all NUMA nodes. On larger machines, this will not be the case.
> In case it’s all stored in the boot node, and dom0’s memory is also contained entirely in the same node, is it correct to say that a failure in another node would just require to shut down the VMs that are running in that node but the rest of the system is still able to run?
What do you mean by "failure" here? A full DIMM failure would almost
certainly be completely fatal to Xen, while an individual cell failure
does have logic for being propagated to the owning domain, in the hope
that it can take some corrective action.
~Andrew
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Xenheap placement in a NUMA system
2015-09-28 13:29 ` Andrew Cooper
@ 2015-09-29 16:12 ` Mario Lodde
0 siblings, 0 replies; 3+ messages in thread
From: Mario Lodde @ 2015-09-29 16:12 UTC (permalink / raw)
To: Andrew Cooper; +Cc: xen-devel
On 28 Sep 2015, at 15:29, Andrew Cooper <andrew.cooper3@citrix.com> wrote:
>> In case it’s all stored in the boot node, and dom0’s memory is also contained entirely in the same node, is it correct to say that a failure in another node would just require to shut down the VMs that are running in that node but the rest of the system is still able to run?
>
> What do you mean by "failure" here? A full DIMM failure would almost
> certainly be completely fatal to Xen, while an individual cell failure
> does have logic for being propagated to the owning domain, in the hope
> that it can take some corrective action.
>
> ~Andrew
Yes, I mean losing a whole DIMM on a node. I was wondering if it’s possible, assuming the xenheap and dom0 are confined to the boot node, to implement some sort of live migration within nodes of a NUMA host.
At a first glance I thought it was enough to periodically checkpoint all VMs and possibly do some cleanup after a node is detected dead to avoid using it again but I’m surely oversimplifying the problem.
Best regards,
Mario
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2015-09-29 16:12 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-09-28 12:28 Xenheap placement in a NUMA system Mario Lodde
2015-09-28 13:29 ` Andrew Cooper
2015-09-29 16:12 ` Mario Lodde
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.