Re: EDAC: Is it possible to calculate which piece of memory is bad?

From: Justin Piszcz
Date: Tue Mar 30 2010 - 04:00:56 EST




On Tue, 30 Mar 2010, Borislav Petkov wrote:

> From: Justin Piszcz <jpiszcz@xxxxxxxxxxxxxxx>
> Date: Mon, Mar 29, 2010 at 09:40:38AM -0400
>
>> Hello,
>>
>> I see the following errors:
>>
>> EDAC MC0: CE page 0x8abba, offset 0xa10, grain 8, syndrome 0x4758, row 0, channel 0, label "": k8_edac
>
> It looks like it is the first DIMM on your mainboard, i.e., whichever
> gets mapped to channel 0 of the DCT.
>
> Sigh, someday we'll have a better mapping, hopefully, ... :|
>
>> EDAC MC0: CE - no information available: k8_edac Error Overflow set
>> EDAC k8 MC0: extended error code: ECC chipkill x4 error
>> EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
>>
>> Is it possible to use the page or offset to calculate which DIMM is having a
>> problem?
>
>
> --
> Regards/Gruss,
> Boris.
>
> --
> Advanced Micro Devices, Inc.
> Operating Systems Research Center

Hi,

Thanks, how did you make that calculation?

Justin.
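
For reference, a minimal sketch of pulling the fields out of the CE line
quoted above. It assumes 4 KiB pages (PAGE_SHIFT = 12) and that "page" is
the page frame number and "offset" the byte offset within that page. The
names parse_ce and CE_RE are hypothetical, not part of EDAC. The physical
address on its own does not name a DIMM; the row and channel fields are
what point at a slot, and how those map to a physical slot depends on the
board.

```python
import re

PAGE_SHIFT = 12  # assumption: 4 KiB pages

# Pattern for the CE line format shown in this thread (k8_edac).
CE_RE = re.compile(
    r"EDAC MC(?P<mc>\d+): CE page 0x(?P<page>[0-9a-fA-F]+), "
    r"offset 0x(?P<offset>[0-9a-fA-F]+), grain (?P<grain>\d+), "
    r"syndrome 0x(?P<syndrome>[0-9a-fA-F]+), row (?P<row>\d+), "
    r"channel (?P<channel>\d+)"
)


def parse_ce(line):
    """Return the decoded fields of a CE line, or None if it does not match."""
    m = CE_RE.search(line)
    if m is None:
        return None
    page = int(m.group("page"), 16)
    offset = int(m.group("offset"), 16)
    return {
        "mc": int(m.group("mc")),
        "row": int(m.group("row")),
        "channel": int(m.group("channel")),
        "syndrome": int(m.group("syndrome"), 16),
        # Page frame number shifted up, plus the offset within the page.
        "phys_addr": (page << PAGE_SHIFT) | offset,
    }


line = ('EDAC MC0: CE page 0x8abba, offset 0xa10, grain 8, '
        'syndrome 0x4758, row 0, channel 0, label "": k8_edac')
info = parse_ce(line)
print(hex(info["phys_addr"]), info["row"], info["channel"])
```

For the line above this gives physical address 0x8abbaa10 on MC0, row 0,
channel 0.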