mbox series

[0/7] x86/edac/amd64: Add support for noncpu nodes

Message ID 20210630152828.162659-1-nchatrad@amd.com (mailing list archive)
Headers show
Series x86/edac/amd64: Add support for noncpu nodes | expand

Message

Naveen Krishna Chatradhi June 30, 2021, 3:28 p.m. UTC
On newer heterogeneous systems from AMD with GPU nodes connected via
xGMI links to the CPUs, the GPU dies are interfaced with HBM2 memory.

This patchset applies on top of the following series by Yazen Ghannam
AMD MCA Address Translation Updates
[https://patchwork.kernel.org/project/linux-edac/list/?series=505989]

This patchset does the following
1. Add support for northbridges on Aldebaran
	* x86/amd_nb: Add Aldebaran device to PCI IDs
	* x86/amd_nb: Add support for northbridges on Aldebaran
2. Add HBM memory type in EDAC
	* EDAC/mc: Add new HBM2 memory type
3. Modifies the amd64_edac module to
   a. Handle the UMCs on the noncpu nodes,
	* EDAC/mce_amd: extract node id from InstanceHi in IPID
   b. Enumerate HBM memory and add address translation
	* EDAC/amd64: Enumerate memory on noncpu nodes
   c. Address translation on Data Fabric version 3.5.
	* EDAC/amd64: Add address translation support for DF3.5
	* EDAC/amd64: Add fixed UMC to CS mapping


Aldebaran has 2 Dies (enumerated as a MCx, x= 8 ~ 15) 
  Each Die has 4 UMCs (enumerated as csrowx, x=0~3)
  Each die has 2 root ports, with 4 misc port for each root.
  Each UMC manages 8 UMC channels each connected to 2GB of HBM memory.

Muralidhara M K (3):
  x86/amd_nb: Add Aldebaran device to PCI IDs
  x86/amd_nb: Add support for northbridges on Aldebaran
  EDAC/amd64: Add address translation support for DF3.5

Naveen Krishna Chatradhi (3):
  EDAC/mc: Add new HBM2 memory type
  EDAC/mce_amd: extract node id from InstanceHi in IPID
  EDAC/amd64: Enumerate memory on noncpu nodes

Yazen Ghannam (1):
  EDAC/amd64: Add fixed UMC to CS mapping

 arch/x86/include/asm/amd_nb.h |   6 +
 arch/x86/kernel/amd_nb.c      |  62 +++-
 drivers/edac/amd64_edac.c     | 546 +++++++++++++++++++++++++++++-----
 drivers/edac/amd64_edac.h     |  27 ++
 drivers/edac/edac_mc.c        |   1 +
 drivers/edac/mce_amd.c        |  15 +-
 include/linux/edac.h          |   3 +
 include/linux/pci_ids.h       |   1 +
 8 files changed, 582 insertions(+), 79 deletions(-)