From: Igor Mammedov <imammedo@redhat.com>
To: Gavin Shan <gshan@redhat.com>
Cc: peter.maydell@linaro.org, drjones@redhat.com,
richard.henderson@linaro.org, qemu-devel@nongnu.org,
zhenyzha@redhat.com, wangyanan55@huawei.com, qemu-arm@nongnu.org,
shan.gavin@gmail.com
Subject: Re: [PATCH v3 1/4] hw/arm/virt: Consider SMP configuration in CPU topology
Date: Wed, 30 Mar 2022 14:50:46 +0200 [thread overview]
Message-ID: <20220330145046.175de97b@redhat.com> (raw)
In-Reply-To: <e46061b3-cebf-c4be-758e-c465cf7879c1@redhat.com>
On Sat, 26 Mar 2022 02:49:59 +0800
Gavin Shan <gshan@redhat.com> wrote:
> Hi Igor,
>
> On 3/25/22 9:19 PM, Igor Mammedov wrote:
> > On Wed, 23 Mar 2022 15:24:35 +0800
> > Gavin Shan <gshan@redhat.com> wrote:
> >> Currently, the SMP configuration isn't considered when the CPU
> >> topology is populated. In this case, it's impossible to provide
> >> the default CPU-to-NUMA mapping or association based on the socket
> >> ID of the given CPU.
> >>
> >> This takes account of SMP configuration when the CPU topology
> >> is populated. The die ID for the given CPU isn't assigned since
> >> it's not supported on arm/virt machine yet. Besides, the cluster
> >> ID for the given CPU is assigned because it has been supported
> >> on arm/virt machine.
> >>
> >> Signed-off-by: Gavin Shan <gshan@redhat.com>
> >> ---
> >> hw/arm/virt.c | 11 +++++++++++
> >> qapi/machine.json | 6 ++++--
> >> 2 files changed, 15 insertions(+), 2 deletions(-)
> >>
> >> diff --git a/hw/arm/virt.c b/hw/arm/virt.c
> >> index d2e5ecd234..064eac42f7 100644
> >> --- a/hw/arm/virt.c
> >> +++ b/hw/arm/virt.c
> >> @@ -2505,6 +2505,7 @@ static const CPUArchIdList *virt_possible_cpu_arch_ids(MachineState *ms)
> >> int n;
> >> unsigned int max_cpus = ms->smp.max_cpus;
> >> VirtMachineState *vms = VIRT_MACHINE(ms);
> >> + MachineClass *mc = MACHINE_GET_CLASS(vms);
> >>
> >> if (ms->possible_cpus) {
> >> assert(ms->possible_cpus->len == max_cpus);
> >> @@ -2518,6 +2519,16 @@ static const CPUArchIdList *virt_possible_cpu_arch_ids(MachineState *ms)
> >> ms->possible_cpus->cpus[n].type = ms->cpu_type;
> >> ms->possible_cpus->cpus[n].arch_id =
> >> virt_cpu_mp_affinity(vms, n);
> >> +
> >> + assert(!mc->smp_props.dies_supported);
> >> + ms->possible_cpus->cpus[n].props.has_socket_id = true;
> >> + ms->possible_cpus->cpus[n].props.socket_id =
> >> + n / (ms->smp.clusters * ms->smp.cores * ms->smp.threads);
> >> + ms->possible_cpus->cpus[n].props.has_cluster_id = true;
> >> + ms->possible_cpus->cpus[n].props.cluster_id =
> >> + n / (ms->smp.cores * ms->smp.threads);
> >
> > are there any relation cluster values here and number of clusters with
> > what virt_cpu_mp_affinity() calculates?
> >
>
> They're different clusters. The cluster returned by virt_cpu_mp_affinity()
> is reflected to MPIDR_EL1 system register, which is mainly used by VGIC2/3
> interrupt controller to send send group interrupts to the CPU cluster. It's
> notable that the value returned from virt_cpu_mp_affinity() is always
> overrided by KVM. It means this value is only used by TCG for the emulated
> GIC2/GIC3.
>
> The cluster in 'ms->possible_cpus' is passed to ACPI PPTT table to populate
> the CPU topology.
>
>
> >> + ms->possible_cpus->cpus[n].props.has_core_id = true;
> >> + ms->possible_cpus->cpus[n].props.core_id = n / ms->smp.threads;
> >
> >> ms->possible_cpus->cpus[n].props.has_thread_id = true;
> >> ms->possible_cpus->cpus[n].props.thread_id = n;
> > of cause target has the right to decide how to allocate IDs, and mgmt
> > is supposed to query these IDs before using them.
> > But:
> > * IDs within 'props' are supposed to be arch defined.
> > (on x86 IDs in range [0-smp.foo_id), on ppc it something different)
> > Question is what real hardware does here in ARM case (i.e.
> > how .../cores/threads are described on bare-metal)?
> >
>
> On ARM64 bare-metal machine, the core/cluster ID assignment is pretty arbitrary.
> I checked the CPU topology on my bare-metal machine, which has following SMP
> configurations.
>
> # lscpu
> :
> Thread(s) per core: 4
> Core(s) per socket: 28
> Socket(s): 2
>
> smp.sockets = 2
> smp.clusters = 1
> smp.cores = 56 (28 per socket)
> smp.threads = 4
>
> // CPU0-111 belongs to socket0 or package0
> // CPU112-223 belongs to socket1 or package1
> # cat /sys/devices/system/cpu/cpu0/topology/package_cpus
> 00000000,00000000,00000000,0000ffff,ffffffff,ffffffff,ffffffff
> # cat /sys/devices/system/cpu/cpu111/topology/package_cpus
> 00000000,00000000,00000000,0000ffff,ffffffff,ffffffff,ffffffff
> # cat /sys/devices/system/cpu/cpu112/topology/package_cpus
> ffffffff,ffffffff,ffffffff,ffff0000,00000000,00000000,00000000
> # cat /sys/devices/system/cpu/cpu223/topology/package_cpus
> ffffffff,ffffffff,ffffffff,ffff0000,00000000,00000000,00000000
>
> // core/cluster ID spans from 0 to 27 on socket0
> # for i in `seq 0 27`; do cat /sys/devices/system/cpu/cpu$i/topology/core_id; done
> 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
> # for i in `seq 28 55`; do cat /sys/devices/system/cpu/cpu$i/topology/core_id; done
> 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
> # for i in `seq 0 27`; do cat /sys/devices/system/cpu/cpu$i/topology/cluster_id; done
> 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
> # for i in `seq 28 55`; do cat /sys/devices/system/cpu/cpu$i/topology/cluster_id; done
> 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
>
> // However, core/cluster ID starts from 256 on socket1
> # for i in `seq 112 139`; do cat /sys/devices/system/cpu/cpu$i/topology/core_id; done
> 256 257 258 259 260 261 262 263 264 265 266 267 268 269
> 270 271 272 273 274 275 276 277 278 279 280 281 282 283
> # for i in `seq 140 167`; do cat /sys/devices/system/cpu/cpu$i/topology/core_id; done
> 256 257 258 259 260 261 262 263 264 265 266 267 268 269
> 270 271 272 273 274 275 276 277 278 279 280 281 282 283
> # for i in `seq 112 139`; do cat /sys/devices/system/cpu/cpu$i/topology/cluster_id; done
> 256 257 258 259 260 261 262 263 264 265 266 267 268 269
> 270 271 272 273 274 275 276 277 278 279 280 281 282 283
> # for i in `seq 140 167`; do cat /sys/devices/system/cpu/cpu$i/topology/cluster_id; done
> 256 257 258 259 260 261 262 263 264 265 266 267 268 269
> 270 271 272 273 274 275 276 277 278 279 280 281 282 283
so it seems that IDs are repeatable within a socket.
If there no arch defined way or other objections it might be better
to stick to what x86 does for consistency reasons (i.e. socket/die/
cluster/core/thread are in range [0..x) including thread-id being
in range [0..threads) ) instead of inventing arm/virt specific scheme.
>
> > * maybe related: looks like build_pptt() and build_madt() diverge on
> > the meaning of 'ACPI Processor ID' and how it's generated.
> > My understanding of 'ACPI Processor ID' is that it should match
> > across all tables. So UIDs generated in build_pptt() look wrong to me.
> >
> > * maybe related: build_pptt() looks broken wrt core/thread where it
> > may create at the same time a leaf core with a leaf thread underneath it,
> > is such description actually valid?
> >
>
> Yes, the UIDs in MADT/PPTT should match. I'm not sure if I missed anything here.
> I don't see how the UID in MADT and PPTT table are diverged. In both functions,
> 'thread_id' is taken as UID.
>
> In build_pptt(), when the entries for the cores becomes leaf, nothing will be
> pushed into @list, @length becomes zero for the loop to create entries for
> the threads. In this case, we won't have any entries created for threads.
>
> >
> >> }
> >> diff --git a/qapi/machine.json b/qapi/machine.json
> >> index 42fc68403d..99c945f258 100644
> >> --- a/qapi/machine.json
> >> +++ b/qapi/machine.json
> >> @@ -868,10 +868,11 @@
> >> # @node-id: NUMA node ID the CPU belongs to
> >> # @socket-id: socket number within node/board the CPU belongs to
> >> # @die-id: die number within socket the CPU belongs to (since 4.1)
> >> -# @core-id: core number within die the CPU belongs to
> >> +# @cluster-id: cluster number within die the CPU belongs to
> >> +# @core-id: core number within cluster the CPU belongs to
> >
> > s:cluster:cluster/die:
> >
>
> Ok. I will amend it like below in next respin:
>
> # @core-id: core number within cluster/die the CPU belongs to
>
> I'm not sure if we need make similar changes for 'cluster_id' like below?
>
> # @cluster-id: cluster number within die/socket the CPU belongs to
> ^^^^^^^^^^
maybe postpone it till die is supported?
>
> >> # @thread-id: thread number within core the CPU belongs to
> >> #
> >> -# Note: currently there are 5 properties that could be present
> >> +# Note: currently there are 6 properties that could be present
> >> # but management should be prepared to pass through other
> >> # properties with device_add command to allow for future
> >> # interface extension. This also requires the filed names to be kept in
> >> @@ -883,6 +884,7 @@
> >> 'data': { '*node-id': 'int',
> >> '*socket-id': 'int',
> >> '*die-id': 'int',
> >> + '*cluster-id': 'int',
> >> '*core-id': 'int',
> >> '*thread-id': 'int'
> >> }
>
> Thanks,
> Gavin
>
next prev parent reply other threads:[~2022-03-30 12:52 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-03-23 7:24 [PATCH v3 0/4] hw/arm/virt: Fix CPU's default NUMA node ID Gavin Shan
2022-03-23 7:24 ` [PATCH v3 1/4] hw/arm/virt: Consider SMP configuration in CPU topology Gavin Shan
2022-03-25 13:19 ` Igor Mammedov
2022-03-25 18:49 ` Gavin Shan
2022-03-30 12:50 ` Igor Mammedov [this message]
2022-04-02 2:27 ` wangyanan (Y) via
2022-04-03 10:46 ` Gavin Shan
2022-03-30 13:18 ` Igor Mammedov
2022-04-03 10:48 ` Gavin Shan
2022-04-02 2:17 ` wangyanan (Y) via
2022-04-03 11:55 ` Gavin Shan
2022-03-23 7:24 ` [PATCH v3 2/4] hw/arm/virt: Fix CPU's default NUMA node ID Gavin Shan
2022-03-25 13:25 ` Igor Mammedov
2022-04-02 2:02 ` wangyanan (Y) via
2022-04-03 11:57 ` Gavin Shan
2022-03-23 7:24 ` [PATCH v3 3/4] hw/acpi/aml-build: Use existing CPU topology to build PPTT table Gavin Shan
2022-03-30 14:10 ` Igor Mammedov
2022-04-03 14:40 ` Gavin Shan
2022-03-23 7:24 ` [PATCH v3 4/4] hw/arm/virt: Unify ACPI processor ID in MADT and SRAT table Gavin Shan
2022-03-25 14:00 ` Igor Mammedov
2022-03-25 19:08 ` Gavin Shan
2022-03-30 12:52 ` Igor Mammedov
2022-04-03 10:43 ` Gavin Shan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220330145046.175de97b@redhat.com \
--to=imammedo@redhat.com \
--cc=drjones@redhat.com \
--cc=gshan@redhat.com \
--cc=peter.maydell@linaro.org \
--cc=qemu-arm@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=richard.henderson@linaro.org \
--cc=shan.gavin@gmail.com \
--cc=wangyanan55@huawei.com \
--cc=zhenyzha@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).