From: Felix Kuehling <felix.kuehling@amd.com>
To: Jonathan Kim <jonathan.kim@amd.com>, amd-gfx@lists.freedesktop.org
Cc: Sean.Keely@amd.com
Subject: Re: [PATCH] drm/amdkfd: map gpu hive id to xgmi connected cpu
Date: Thu, 14 Oct 2021 13:55:12 -0400 [thread overview]
Message-ID: <e7d59f5e-3db8-5abd-9947-373f868f1219@amd.com> (raw)
In-Reply-To: <20211014174454.3342996-1-jonathan.kim@amd.com>
Am 2021-10-14 um 1:44 p.m. schrieb Jonathan Kim:
> ROCr needs to be able to identify all devices that have direct access to
> fine grain memory, which should include CPUs that are connected to GPUs
> over xGMI. The GPU hive ID can be mapped onto the CPU hive ID since the
> CPU is part of the hive.
>
> Signed-off-by: Jonathan Kim <jonathan.kim@amd.com>
> ---
> drivers/gpu/drm/amd/amdkfd/kfd_topology.c | 22 +++++++++++++++++++++-
> 1 file changed, 21 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_topology.c b/drivers/gpu/drm/amd/amdkfd/kfd_topology.c
> index 98cca5f2b27f..d04c48dfd72b 100644
> --- a/drivers/gpu/drm/amd/amdkfd/kfd_topology.c
> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_topology.c
> @@ -1296,6 +1296,27 @@ int kfd_topology_add_device(struct kfd_dev *gpu)
>
> proximity_domain = atomic_inc_return(&topology_crat_proximity_domain);
>
> + adev = (struct amdgpu_device *)(gpu->kgd);
> +
> + /* Include the CPU in xGMI hive if xGMI connected by assigning it the hive ID. */
> + if (gpu->hive_id && adev->gmc.xgmi.connected_to_cpu) {
> + int i;
> +
> + for (i = 0; i < proximity_domain; i++) {
> + struct kfd_topology_device *to_dev =
> + kfd_topology_device_by_proximity_domain(i);
> +
> + if (!to_dev)
> + continue;
> +
> + if (to_dev->gpu)
> + break;
> +
> + to_dev->node_props.hive_id = gpu->hive_id;
> + break;
On a NUMA system there will be multiple CPU nodes (e.g. in NPS-4 mode).
The "break" statement here means, you'll only update the hive ID on the
first NUMA node.
Other than that, this change makes sense.
Regards,
Felix
> + }
> + }
> +
> /* Check to see if this gpu device exists in the topology_device_list.
> * If so, assign the gpu to that device,
> * else create a Virtual CRAT for this gpu device and then parse that
> @@ -1457,7 +1478,6 @@ int kfd_topology_add_device(struct kfd_dev *gpu)
> dev->node_props.max_waves_per_simd = 10;
> }
>
> - adev = (struct amdgpu_device *)(dev->gpu->kgd);
> /* kfd only concerns sram ecc on GFX and HBM ecc on UMC */
> dev->node_props.capability |=
> ((adev->ras_enabled & BIT(AMDGPU_RAS_BLOCK__GFX)) != 0) ?
next prev parent reply other threads:[~2021-10-14 17:55 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-10-14 17:44 [PATCH] drm/amdkfd: map gpu hive id to xgmi connected cpu Jonathan Kim
2021-10-14 17:55 ` Felix Kuehling [this message]
-- strict thread matches above, loose matches on Subject: below --
2021-10-14 18:12 Jonathan Kim
2021-10-14 18:46 ` Felix Kuehling
2021-10-15 15:11 Jonathan Kim
2021-10-15 21:52 ` Felix Kuehling
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=e7d59f5e-3db8-5abd-9947-373f868f1219@amd.com \
--to=felix.kuehling@amd.com \
--cc=Sean.Keely@amd.com \
--cc=amd-gfx@lists.freedesktop.org \
--cc=jonathan.kim@amd.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox