mbox series

[v4,0/4] x86/edac/amd64: Add support for noncpu nodes

Message ID 20211014185058.9587-1-nchatrad@amd.com (mailing list archive)
Headers show
Series x86/edac/amd64: Add support for noncpu nodes | expand

Message

Naveen Krishna Chatradhi Oct. 14, 2021, 6:50 p.m. UTC
On newer heterogeneous systems with AMD CPUs the data fabrics of GPUs
can be connected directly via custom links.

This patchset does the following
1. amd_nb.c:
   a. Add support for northbridges on Aldebaran GPU nodes
   b. export AMD node map details to be used by edac and mce modules
	
2. mce_amd module:
   a. Identify the node ID where the error occurred and map the node id
      to linux enumerated node id.

2. Modifies the amd64_edac module
   a. Add new family op routines
   b. Enumerate UMCs and HBMs on the GPU nodes

This patchset is rebased on top of
"
commit 07416cadfdfa38283b840e700427ae3782c76f6b
Author: Yazen Ghannam <yazen.ghannam@amd.com>
Date:   Tue Oct 5 15:44:19 2021 +0000

    EDAC/amd64: Handle three rank interleaving mode
"

Muralidhara M K (2):
  x86/amd_nb: Add support for northbridges on Aldebaran
  EDAC/amd64: Extend family ops functions

Naveen Krishna Chatradhi (2):
  EDAC/mce_amd: Extract node id from MCA_IPID
  EDAC/amd64: Enumerate memory on Aldebaran GPU nodes

 arch/x86/include/asm/amd_nb.h |   9 +
 arch/x86/kernel/amd_nb.c      | 131 +++++++--
 drivers/edac/amd64_edac.c     | 517 +++++++++++++++++++++++++---------
 drivers/edac/amd64_edac.h     |  33 +++
 drivers/edac/mce_amd.c        |  24 +-
 include/linux/pci_ids.h       |   1 +
 6 files changed, 564 insertions(+), 151 deletions(-)