Re: Hardware Error Kernel Mini-Summit

From: Andi Kleen
Date: Mon May 24 2010 - 14:31:14 EST


On Mon, May 24, 2010 at 10:35:21AM -0700, Tony Luck wrote:
> On Mon, May 24, 2010 at 8:55 AM, Russ Anderson <rja@xxxxxxx> wrote:
> > ia64 had the Intel defined MCA Spec which defined the interaction
> > between SAL and the kernel.  x86 does not have a similar well
> > defined way of how errors should be handled.  It would be
> > good to agree on how the errors should be handled.
>
> X86 has machine check registers defined by the SDM. It also
> has some f/w <-> OS interactions defined by the APEI sections
> in the latest ACPI spec (chapter 17 of the 4.0a spec released
> last month - see http://acpi.info). Some parts look cleaner than

I should add the Intel Software Developer's manual has quite
precise guidelines on what to do (and the Linux MCE code implements
near all that faithfully)

The ACPI spec isn't quite as precise unfortunately.

-Andi
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/