[v5,0/5] x86/edac/amd64: Add heterogeneous node support

Message ID 20211025145018.29985-1-nchatrad@amd.com (mailing list archive)

Naveen Krishna Chatradhi Oct. 25, 2021, 2:50 p.m. UTC
On newer heterogeneous systems with AMD CPUs, the data fabrics of GPUs
can be connected directly via custom links.

This patch series does the following:
1. amd_nb.c:
   a. Add support for northbridges on Aldebaran GPU nodes
   b. Export AMD node map details for use by the EDAC and MCE modules

2. mce_amd module:
   a. Identify the node ID where the error occurred and map it to the
      Linux-enumerated node ID

3. amd64_edac module:
   a. Add new family op routines
   b. Enumerate UMCs and HBMs on the GPU nodes
   c. Move fam_type structure into amd64_pvt struct
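
As a rough illustration of item 2.a, the user-space sketch below extracts a
hardware node ID from an MCA_IPID value and maps it to a Linux-enumerated
node ID. It is not the code from this series. The bit position of the node
ID field (bits 47:44), the struct node_map layout, and both helper names are
assumptions made for the example, as is the rule that GPU nodes are
enumerated directly after the CPU nodes.

```c
#include <stdint.h>

/* Assumption: the hardware node ID is a 4-bit field at bits 47:44 of MCA_IPID. */
#define IPID_NODE_SHIFT 44
#define IPID_NODE_MASK  0xFULL

/* Hypothetical node map; the real layout exported by amd_nb.c may differ. */
struct node_map {
	unsigned int cpu_node_count;   /* number of CPU nodes in the system */
	unsigned int gpu_node_base_id; /* first hardware ID used by GPU nodes */
};

/* Extract the hardware node ID from an MCA_IPID register value. */
static unsigned int ipid_to_hw_node(uint64_t ipid)
{
	return (unsigned int)((ipid >> IPID_NODE_SHIFT) & IPID_NODE_MASK);
}

/*
 * Map a GPU hardware node ID to a Linux node ID, assuming GPU nodes are
 * numbered right after the CPU nodes. Returns -1 if the ID is below the
 * GPU base and therefore not a GPU node.
 */
static int hw_node_to_linux_node(const struct node_map *map, unsigned int hw_node)
{
	if (hw_node < map->gpu_node_base_id)
		return -1;
	return (int)(hw_node - map->gpu_node_base_id + map->cpu_node_count);
}
```

With two CPU nodes and a GPU base ID of 8 in this sketch, hardware node 9
would map to Linux node 3.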

This patchset is rebased on top of:
"
commit 07416cadfdfa38283b840e700427ae3782c76f6b
Author: Yazen Ghannam <yazen.ghannam@amd.com>
Date:   Tue Oct 5 15:44:19 2021 +0000

    EDAC/amd64: Handle three rank interleaving mode
"

Muralidhara M K (3):
  x86/amd_nb: Add support for northbridges on Aldebaran
  EDAC/amd64: Extend family ops functions
  EDAC/amd64: Move struct fam_type into amd64_pvt structure

Naveen Krishna Chatradhi (2):
  EDAC/mce_amd: Extract node id from MCA_IPID
  EDAC/amd64: Enumerate memory on Aldebaran GPU nodes

 arch/x86/include/asm/amd_nb.h |   9 +
 arch/x86/kernel/amd_nb.c      | 150 +++++++--
 drivers/edac/amd64_edac.c     | 592 +++++++++++++++++++++++++---------
 drivers/edac/amd64_edac.h     |  39 ++-
 drivers/edac/mce_amd.c        |  24 +-
 include/linux/pci_ids.h       |   1 +
 6 files changed, 630 insertions(+), 185 deletions(-)