Re: Kernel panic: Machine check exception

From: Eric W. Biederman
Date: Tue Dec 06 2005 - 06:13:34 EST


Alan Cox <alan@xxxxxxxxxxxxxxxxxxx> writes:

> On Sad, 2005-11-19 at 12:54 -0800, Avuton Olrich wrote:
>> Is there a good way to narrow it down? I guess running a badmem
>> program would be good to start with, otherwise ...(?).
>
> A memory test may be worth doing but most machine checks indicate the
> fault is more serious than bad memory.

Although on the Opteron that is usually what it is (as memory
errors can be reported through the machine check interface)

In this case bank 4 is the appropriate bank. Although the
other bits don't look right for a memory error. I wonder
if it is that darn iommu fault again.

To decode an Opteron machine_check you can look in
the bios and kernel programmers guide. (Possibly the
architecture but I think that is too generic) to see
what all of the bits mean.

It is a pain but is faster than poking blindly in
the dark.

Eric
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/