Re: ECC error in 2.5.64 + some patches

From: Dominik Kubla (dominik@kubla.de)
Date: Thu Mar 27 2003 - 12:19:41 EST


Am Donnerstag, 27. März 2003 18:00 schrieb Chris Wedgwood:

> > Message from syslogd@slovax at Thu Mar 27 05:53:49 2003 ...
> > slovax kernel: Bank 1: 9000000000000151
>
> Status: (9000000000000151) Restart IP valid.
>
> *Exactly* what this means I don't know --- but I'm guessing the CPU is
> overheating. Check fans, air-flow, etc. and see if that helps. So
> far whenever I've seen the above problem it's *ALWAYS* been related to
> the CPU getting too hot.

Well the internal busses and buffers of modern CPU's and in many cases also
the on-die caches have ECC logic. And if i should hazard a guess: "Restart
IP valid" => Restarted Instruction Pre-Fetch resulted in a valid state of the
pre-fetch queue.

In Larry's case i'd remove the cpu cooler, clean everything and reassemble,
since i would assume that there is a hot-spot on the die.

Regards,
  Dominik

-- 
Be at war with your voices, at peace with your neighbors, and let every new
year find you a better man. (Benjamin Franklin, 1706-1790)

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Mon Mar 31 2003 - 22:00:28 EST