public inbox for linux-edac@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH v2 0/6] Address Translation support for MI200 and MI300 models
@ 2023-11-29  7:35 Muralidhara M K
  2023-11-29  7:35 ` [PATCH v2 1/6] RAS: Add Address Translation support for MI200 Muralidhara M K
                   ` (5 more replies)
  0 siblings, 6 replies; 7+ messages in thread
From: Muralidhara M K @ 2023-11-29  7:35 UTC (permalink / raw)
  To: linux-edac; +Cc: linux-kernel, bp, mchehab, Muralidhara M K

From: Muralidhara M K <muralidhara.mk@amd.com>

This patchset adds support for MI200 heterogeneous address translation support
and MI300A address translation support, Few fixups on HBM3 memory address maps to
convert on-die(MCA decoded) address to Normalized address.

The patch set depends on the Yazen's patches submitted "AMD Address Translation Library"
https://lore.kernel.org/r/20231005173526.42831-1-yazen.ghannam@amd.com

The patchset does the following

Patch 1:
MI200 heterogeneous address translation support.

Patch 2:
MI300 heterogeneous address translation support.

Patch 3:
Convert HBM3 MCA Decoded address to Normalized address.

Patch 4:
lookup table to get the correct cs instance id for HBM3.

Patch 5:
Convert physical cs id to logical cs id by static lookup
table.

Patch 6:
Identify all 8 column system physical addresses from each HBM3 row and retire all
column addresses when the error is injected to avoid future errors.

Muralidhara M K (6):
  RAS: Add Address Translation support for MI200
  RAS: Add Address Translation support for MI300
  RAS: Add MCA Error address conversion for UMC
  RAS: Add static lookup table to get CS physical ID
  RAS: Add fixed Physical to logical CS ID mapping table
  RAS: EDAC/amd64: Retire all system physical address from HBM3 row

 drivers/edac/amd64_edac.c         |   3 +
 drivers/ras/amd/atl/core.c        |   5 +-
 drivers/ras/amd/atl/dehash.c      | 149 ++++++++++++++++
 drivers/ras/amd/atl/denormalize.c | 110 +++++++++++-
 drivers/ras/amd/atl/internal.h    |  27 ++-
 drivers/ras/amd/atl/map.c         | 158 ++++++++++++++---
 drivers/ras/amd/atl/reg_fields.h  |  34 ++++
 drivers/ras/amd/atl/system.c      |   4 +
 drivers/ras/amd/atl/umc.c         | 284 +++++++++++++++++++++++++++++-
 include/linux/amd-atl.h           |   2 +
 10 files changed, 747 insertions(+), 29 deletions(-)

-- 
2.25.1


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2023-11-29  7:35 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-11-29  7:35 [PATCH v2 0/6] Address Translation support for MI200 and MI300 models Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 1/6] RAS: Add Address Translation support for MI200 Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 2/6] RAS: Add Address Translation support for MI300 Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 3/6] RAS: Add MCA Error address conversion for UMC Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 4/6] RAS: Add static lookup table to get CS physical ID Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 5/6] RAS: Add fixed Physical to logical CS ID mapping table Muralidhara M K
2023-11-29  7:35 ` [PATCH v2 6/6] RAS: EDAC/amd64: Retire all system physical address from HBM3 row Muralidhara M K

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox