[PATCH 0/7] EDAC/i10nm: Add RRL support for Intel Granite Rapids server

From: Qiuxu Zhuo
Date: Thu Apr 17 2025 - 11:08:40 EST


The RRL (Retry Read error Log) registers provide additional error information
during DDR memory correction. This can assist memory DIMM vendors in isolating
problematic cells within the DIMM and help CSPs predict uncorrectable errors.
This patch series is to add RRL support for Intel Granite Rapids servers and
is structured as follows:

Patch 1-3: Bug fixes and cleanup.

Patch 4-6: Refactor the RRL (Retry Read error Log) code of previous generations
and prepare for Patch 7.

Patch 7 : Add RRL support for Intel Granite Rapids server.

Qiuxu Zhuo (7):
EDAC/skx_common: Fix general protection fault
EDAC/{skx_common,i10nm}: Fix the loss of saved RRL for HBM pseudo channel 0
EDAC/i10nm: Explicitly set the modes of the RRL register sets
EDAC/{skx_common,i10nm}: Structure the per-channel RRL registers
EDAC/{skx_common,i10nm}: Refactor enable_retry_rd_err_log()
EDAC/{skx_common,i10nm}: Refactor show_retry_rd_err_log()
EDAC/{skx_common,i10nm}: Add RRL support for Intel Granite Rapids server

drivers/edac/i10nm_base.c | 477 ++++++++++++++++++++++----------------
drivers/edac/skx_common.c | 1 +
drivers/edac/skx_common.h | 61 ++++-
3 files changed, 330 insertions(+), 209 deletions(-)


base-commit: 8ffd015db85fea3e15a77027fda6c02ced4d2444
--
2.43.0