From: Igor Mammedov <imammedo@redhat.com>
To: Hu Tao <hutao@cn.fujitsu.com>
Cc: pbonzini@redhat.com, lersek@redhat.com, qemu-devel@nongnu.org,
Wanlong Gao <gaowanlong@cn.fujitsu.com>
Subject: Re: [Qemu-devel] [PATCH v18 10/14] numa: add -numa node, memdev= option
Date: Wed, 19 Feb 2014 10:50:24 +0100 [thread overview]
Message-ID: <20140219105024.06c378d7@nial.usersys.redhat.com> (raw)
In-Reply-To: <b87b2fb5255071e1f47c02f25949f2c73c856652.1392794450.git.hutao@cn.fujitsu.com>
On Wed, 19 Feb 2014 15:54:01 +0800
Hu Tao <hutao@cn.fujitsu.com> wrote:
> From: Paolo Bonzini <pbonzini@redhat.com>
>
> This option provides the infrastructure for binding guest NUMA nodes
> to host NUMA nodes. For example:
>
> -object memory-ram,size=1024M,policy=membind,host-nodes=0,id=ram-node0 \
> -numa node,nodeid=0,cpus=0,memdev=ram-node0 \
> -object memory-ram,size=1024M,policy=interleave,host-nodes=1-3,id=ram-node1 \
> -numa node,nodeid=1,cpus=1,memdev=ram-node1
>
> The option replaces "-numa mem".
>
> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
>
> Conflicts:
> include/sysemu/sysemu.h
> numa.c
>
> Signed-off-by: Hu Tao <hutao@cn.fujitsu.com>
> ---
> include/sysemu/sysemu.h | 2 ++
> numa.c | 64 +++++++++++++++++++++++++++++++++++++++++++++++--
> qapi-schema.json | 6 ++++-
> 3 files changed, 69 insertions(+), 3 deletions(-)
>
> diff --git a/include/sysemu/sysemu.h b/include/sysemu/sysemu.h
> index e9da760..acfc0c7 100644
> --- a/include/sysemu/sysemu.h
> +++ b/include/sysemu/sysemu.h
> @@ -12,6 +12,7 @@
> #include "qemu/bitmap.h"
> #include "qom/object.h"
> #include "hw/boards.h"
> +#include "sysemu/hostmem.h"
>
> /* vl.c */
>
> @@ -140,6 +141,7 @@ extern int nb_numa_nodes;
> typedef struct node_info {
> uint64_t node_mem;
> DECLARE_BITMAP(node_cpu, MAX_CPUMASK_BITS);
> + HostMemoryBackend *node_memdev;
> } NodeInfo;
> extern NodeInfo numa_info[MAX_NODES];
> void set_numa_nodes(void);
> diff --git a/numa.c b/numa.c
> index 403b08b..ca55ad7 100644
> --- a/numa.c
> +++ b/numa.c
> @@ -27,6 +27,8 @@
> #include "qapi-visit.h"
> #include "qapi/opts-visitor.h"
> #include "qapi/dealloc-visitor.h"
> +#include "qapi/qmp/qerror.h"
> +
> QemuOptsList qemu_numa_opts = {
> .name = "numa",
> .implied_opt_name = "type",
> @@ -34,10 +36,13 @@ QemuOptsList qemu_numa_opts = {
> .desc = { { 0 } } /* validated with OptsVisitor */
> };
>
> +static int have_memdevs = -1;
> +
> static int numa_node_parse(NumaNodeOptions *opts)
> {
> uint16_t nodenr;
> uint16List *cpus = NULL;
> + Error *local_err = NULL;
>
> if (opts->has_nodeid) {
> nodenr = opts->nodeid;
> @@ -60,6 +65,19 @@ static int numa_node_parse(NumaNodeOptions *opts)
> bitmap_set(numa_info[nodenr].node_cpu, cpus->value, 1);
> }
>
> + if (opts->has_mem && opts->has_memdev) {
> + fprintf(stderr, "qemu: cannot specify both mem= and memdev=\n");
> + return -1;
> + }
> +
> + if (have_memdevs == -1) {
> + have_memdevs = opts->has_memdev;
> + }
> + if (opts->has_memdev != have_memdevs) {
> + fprintf(stderr, "qemu: memdev option must be specified for either "
> + "all or no nodes\n");
> + }
> +
> if (opts->has_mem) {
> int64_t mem_size;
> char *endptr;
> @@ -70,7 +88,19 @@ static int numa_node_parse(NumaNodeOptions *opts)
> }
> numa_info[nodenr].node_mem = mem_size;
> }
> + if (opts->has_memdev) {
> + Object *o;
> + o = object_resolve_path_type(opts->memdev, TYPE_MEMORY_BACKEND, NULL);
> + if (!o) {
> + error_setg(&local_err, "memdev=%s is ambiguous", opts->memdev);
> + qerror_report_err(local_err);
> + return -1;
> + }
>
> + object_ref(o);
> + numa_info[nodenr].node_mem = object_property_get_int(o, "size", NULL);
> + numa_info[nodenr].node_memdev = MEMORY_BACKEND(o);
if you make numa_info QOM object node_memdev link<> property,
then above hunk could be replaced with just setting link.
And node_mem could be replaced with readonly property that reads size
directly from memdev avoiding data duplication.
As side-effect it numa_info will also become accessible for introspection
using QOM interface. Something like:
qom-list /machine/memory-node[X]
qom-get /machine/memory-node[X]/memory_size
> + }
> return 0;
> }
>
> @@ -189,12 +219,42 @@ void set_numa_modes(void)
> }
> }
>
> +static void allocate_system_memory_nonnuma(MemoryRegion *mr, Object *owner,
> + const char *name,
> + QEMUMachineInitArgs *args)
> +{
> + uint64_t ram_size = args->ram_size;
> +
> + memory_region_init_ram(mr, owner, name, ram_size);
> + vmstate_register_ram_global(mr);
> +}
> +
> void memory_region_allocate_system_memory(MemoryRegion *mr, Object *owner,
> const char *name,
> QEMUMachineInitArgs *args)
> {
> uint64_t ram_size = args->ram_size;
> + uint64_t addr = 0;
> + int i;
>
> - memory_region_init_ram(mr, owner, name, ram_size);
> - vmstate_register_ram_global(mr);
> + if (nb_numa_nodes == 0 || !have_memdevs) {
> + allocate_system_memory_nonnuma(mr, owner, name, args);
> + return;
> + }
> +
> + memory_region_init(mr, owner, name, ram_size);
> + for (i = 0; i < nb_numa_nodes; i++) {
> + Error *local_err = NULL;
> + uint64_t size = numa_info[i].node_mem;
> + HostMemoryBackend *backend = numa_info[i].node_memdev;
> + MemoryRegion *seg = host_memory_backend_get_memory(backend, &local_err);
> + if (local_err) {
> + qerror_report_err(local_err);
> + exit(1);
> + }
> +
> + memory_region_add_subregion(mr, addr, seg);
> + vmstate_register_ram_global(seg);
> + addr += size;
> + }
> }
> diff --git a/qapi-schema.json b/qapi-schema.json
> index a2839b8..498ea9b 100644
> --- a/qapi-schema.json
> +++ b/qapi-schema.json
> @@ -4441,7 +4441,10 @@
> #
> # @cpus: #optional VCPUs belong to this node
> #
> -# @mem: #optional memory size of this node
> +# @memdev: #optional memory backend object. If specified for one node,
> +# it must be specified for all nodes.
> +#
> +# @mem: #optional memory size of this node; mutually exclusive with @memdev.
> #
> # Since: 2.0
> ##
> @@ -4449,4 +4452,5 @@
> 'data': {
> '*nodeid': 'uint16',
> '*cpus': ['uint16'],
> + '*memdev': 'str',
> '*mem': 'str' }}
next prev parent reply other threads:[~2014-02-19 10:46 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-02-19 7:53 [Qemu-devel] [PATCH v18 00/14] Add support for binding guest numa nodes to host numa nodes Hu Tao
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 01/14] NUMA: move numa related code to new file numa.c Hu Tao
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 02/14] NUMA: check if the total numa memory size is equal to ram_size Hu Tao
2014-02-25 13:38 ` Eric Blake
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 03/14] NUMA: Add numa_info structure to contain numa nodes info Hu Tao
2014-02-19 9:26 ` Igor Mammedov
2014-02-21 2:54 ` hu tao
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 04/14] NUMA: convert -numa option to use OptsVisitor Hu Tao
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 05/14] NUMA: expand MAX_NODES from 64 to 128 Hu Tao
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 06/14] qapi: add SIZE type parser to string_input_visitor Hu Tao
2014-02-19 9:54 ` Igor Mammedov
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 07/14] add memdev backend infrastructure Hu Tao
2014-02-19 9:15 ` Igor Mammedov
2014-02-19 7:53 ` [Qemu-devel] [PATCH v18 08/14] pc: pass QEMUMachineInitArgs to pc_memory_init Hu Tao
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 09/14] numa: introduce memory_region_allocate_system_memory Hu Tao
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 10/14] numa: add -numa node, memdev= option Hu Tao
2014-02-19 9:50 ` Igor Mammedov [this message]
2014-02-19 11:53 ` Paolo Bonzini
2014-03-04 0:10 ` Eric Blake
2014-03-04 2:20 ` Hu Tao
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 11/14] qapi: make string input visitor parse int list Hu Tao
2014-02-19 8:17 ` Hu Tao
2014-02-19 8:42 ` Paolo Bonzini
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 12/14] qapi: add HostMemPolicy enum type Hu Tao
2014-02-19 9:08 ` Paolo Bonzini
2014-02-19 11:23 ` Igor Mammedov
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 13/14] memory backend: fill memory backend ram fields Hu Tao
2014-02-19 9:03 ` Paolo Bonzini
2014-02-19 9:36 ` Igor Mammedov
2014-02-25 10:20 ` Hu Tao
2014-02-25 14:15 ` Paolo Bonzini
2014-02-26 5:00 ` Hu Tao
2014-02-26 8:47 ` Igor Mammedov
2014-02-26 8:59 ` Hu Tao
2014-02-26 12:19 ` Igor Mammedov
2014-02-26 11:22 ` Paolo Bonzini
2014-02-26 5:57 ` Hu Tao
2014-02-26 9:05 ` Paolo Bonzini
2014-02-26 9:10 ` Igor Mammedov
2014-02-26 10:33 ` Paolo Bonzini
2014-02-26 12:31 ` Igor Mammedov
2014-02-26 12:45 ` Paolo Bonzini
2014-02-26 12:58 ` Marcelo Tosatti
2014-02-26 13:14 ` Paolo Bonzini
2014-02-26 13:43 ` Igor Mammedov
2014-02-26 13:47 ` Paolo Bonzini
2014-02-26 14:25 ` Igor Mammedov
2014-02-26 14:39 ` Paolo Bonzini
2014-02-25 10:09 ` Hu Tao
2014-03-03 3:24 ` Hu Tao
2014-02-19 7:54 ` [Qemu-devel] [PATCH v18 14/14] amp: add query-memdev Hu Tao
2014-02-19 8:14 ` Hu Tao
2014-02-19 9:07 ` Paolo Bonzini
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140219105024.06c378d7@nial.usersys.redhat.com \
--to=imammedo@redhat.com \
--cc=gaowanlong@cn.fujitsu.com \
--cc=hutao@cn.fujitsu.com \
--cc=lersek@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).