APIC error on SMP machine

From: Chris Rankin
Date: Tue Sep 30 2003 - 16:44:34 EST


Linux-2.4.22-SMP, 1 GB RAM, devfs, gcc-3.2.3.

Hi,

Today, my dual PIII (Coppermine) refused to boot, and wrote a large number of these messages to the serial console instead:

APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)
APIC error on CPU1: 04(04)

Can anyone tell me what these might mean, please? The kernel source implies that it's a "Send accept error", but this doesn't help me in an "Ah, I can fix that!" sense.

Does this APIC error just mean that the CPU is unhappy in this slot, and is refusing to listen to the motherboard? Or is the motherboard refusing to listen to the CPU?

Background:
This machine has been misbehaving for a while. I thought I had worked around the problem by underclocking the FSB from 133 MHz to 100 MHz, but that now looks like it was just a "reprieve". I have tried running "nosmp", "pci=noacpi" and "noapic pci=noacpi" without success, and have resorted to yanking the CPU out of this slot entirely. (I suspect that the CPU is fine, however.) I have also restored the FSB to 133 MHz, so I am currently running the SMP kernel on a single 933 MHz PIII.

Cheers,
Chris

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/