Re: FIC PA-2007 & Linux

Alan Cox (alan@lxorguk.ukuu.org.uk)
Tue, 10 Feb 1998 18:57:20 +0000 (GMT)


> kernel compiles, I also built all of XFree86, gcc 2.7.2.3, gcc 2.8.0,
> glibc, and egcs without incident. If this is a hardware problem, it sure
> doesn't manifest itself very easily.

Subtle bit errors are generally very random and very hard to catch. On
machines without parity they also normally result in wrong answers rather
than noticed crashes.

> Just for kicks, I did run some more tests on NT and it didn't fail once.
> Does it have a mechanism for catching these hardware flaws and doing a
> retry or some such?

No but it doesnt hit the machine with the same patterns as Linux. A standard
PC has no useful mechanism for recovery from errors. Parity ram will trap
some memory errors (but not cache coherency or bus errors) and give an NMI.
You pay a serious premium for PC's with proper ECC ram and the like.

> I still would like to work through some of the other things outlined in
> the Sig11 page (ie. disabling cache, etc), but as this is my main machine,
> I can't be out of service for testing for too long.

Understood.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu