Re: Linux & ECC memory

Doug Paul (dbp@dragonsys.com)
Thu, 14 Nov 1996 22:53:02 -0500


On Thu, 14 Nov 1996, Kenneth Albanowski wrote:

> A more subtle issue is whether the ECC memory controller could report
> instances where ECC detection and successful correction took place. It
> would seem to be useful to provide a way for the OS to recognize that
> non-fatal memory errors have occurred, even though they were repaired.

PC hardware can only report hard (uncorrectable) memory errors.
Linux reports these errors with the somewhat cryptic message
"Uhhuh. NMI received. Dazed and confused, but trying to continue"
(see linux/arch/i386/kernel/traps.c).
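
For anyone who wants to watch for that message, here is a minimal
sketch in Python that scans kernel log lines for it. The function name
find_nmi_reports and the marker substring are my own choices for
illustration; the exact wording may differ between kernel versions, and
an NMI can have causes other than a memory parity/ECC error.

```python
# Substring taken from the message quoted above; the exact wording may
# differ between kernel versions, so treat it as an assumption.
NMI_MARKER = "NMI received"


def find_nmi_reports(log_lines):
    """Return (line_number, text) pairs for lines containing the NMI marker.

    log_lines is any iterable of strings, e.g. the lines of a saved
    kernel log. Line numbers start at 1.
    """
    hits = []
    for number, line in enumerate(log_lines, start=1):
        if NMI_MARKER in line:
            hits.append((number, line.rstrip("\n")))
    return hits
```

Feeding it the lines of a saved kernel log gives back only the entries
worth a closer look.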

Soft errors (i.e. those corrected by the ECC hardware) are not reported.
(Many workstations log both soft and hard errors.)
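
To make the soft/hard distinction concrete, here is a toy
single-error-correct, double-error-detect (SECDED) code in Python. It is
only an illustration: it protects 4 data bits with a Hamming(7,4) code
plus one overall parity bit, whereas real memory controllers use much
wider codes, and the names encode and decode are hypothetical. A single
flipped bit is repaired silently (a "soft" error); two flipped bits can
only be detected (a "hard" error).

```python
DATA_POSITIONS = (3, 5, 6, 7)  # data bit positions in a Hamming(7,4) word


def encode(data_bits):
    """Encode 4 data bits into an 8-bit SECDED word (list of 0/1)."""
    word = [0] * 8
    for pos, bit in zip(DATA_POSITIONS, data_bits):
        word[pos] = bit
    # Hamming parity bits live at the power-of-two positions 1, 2, 4.
    word[1] = word[3] ^ word[5] ^ word[7]
    word[2] = word[3] ^ word[6] ^ word[7]
    word[4] = word[5] ^ word[6] ^ word[7]
    # Index 0 holds an overall parity bit covering positions 1..7.
    word[0] = sum(word[1:]) % 2
    return word


def decode(word):
    """Return (status, data_bits); data_bits is None if uncorrectable."""
    word = list(word)  # work on a copy
    syndrome = 0
    for pos in range(1, 8):
        if word[pos]:
            syndrome ^= pos
    overall = sum(word) % 2  # 0 when the overall parity still holds
    if syndrome == 0 and overall == 0:
        status = "clean"
    elif overall == 1:
        # Exactly one bit flipped; the syndrome is its position
        # (0 means the overall parity bit itself, stored at index 0).
        word[syndrome] ^= 1
        status = "corrected"
    else:
        # Overall parity holds but the syndrome is non-zero: two bits flipped.
        return "uncorrectable", None
    return status, [word[pos] for pos in DATA_POSITIONS]
```

The "corrected" case is the one PC hardware keeps to itself; only the
"uncorrectable" case corresponds to an error the OS gets to hear about.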

On both the Intel Venus and the Supermicro P6DNF (both P6 boards with
Natoma chipsets), ECC must be enabled by a BIOS setting.

Doug