CPU not responding on Dual Xeon SuperMicro motherboard (2.6)

From: Alberto Nava
Date: Fri Jun 04 2004 - 22:24:10 EST


Hi,

When booting 2.6 kernels on a Super Micro P4DP6 dual
Xeon motherboard sometimes the kernel fails to initialize one of the
HT CPUs.

I tried with 2.6.5, 2.6.6 and 2.6.7-rc2, and they all exhibit the
problem at least 1 every 10 reboots. The 2.4.25 kernel did not exhibit
the problem on a overnight reboot loop.

The motherboard is SuperMicro P4DP6 and processors are
Intel(R) Xeon(TM) CPU 2.40GHz stepping 07

I've tried on several similar machines with the same results.

Any idea of what's causing this?

ktwc2 syslog: syslogd startup succeeded
ktwc2 kernel: klogd 1.4.1, log source = /proc/kmsg started.
ktwc2 kernel: even in supervisor mode... Ok.
ktwc2 kernel: Calibrating delay loop... 4767.74 BogoMIPS
ktwc2 kernel: kdb version 4.3 by Keith Owens, Scott Lurndal. Copyright SGI, All Rights Reserved
ktwc2 kernel: Dentry cache hash table entries: 262144 (order: 8, 1048576 bytes)
ktwc2 kernel: Inode-cache hash table entries: 131072 (order: 7, 524288 bytes)
ktwc2 kernel: Mount-cache hash table entries: 512 (order: 0, 4096 bytes)
ktwc2 kernel: CPU: Trace cache: 12K uops, L1 D cache: 8K
ktwc2 kernel: CPU: L2 cache: 512K
ktwc2 kernel: CPU: Physical Processor ID: 0
ktwc2 syslog: klogd startup succeeded
ktwc2 kernel: Intel machine check architecture supported.
ktwc2 kernel: Intel machine check reporting enabled on CPU#0.
ktwc2 kernel: CPU#0: Intel P4/Xeon Extended MCE MSRs (12) available
ktwc2 kernel: CPU#0: Thermal monitoring enabled
ktwc2 portmap: portmap startup succeeded
ktwc2 kernel: Enabling fast FPU save and restore... done.
ktwc2 kernel: Enabling unmasked SIMD FPU exception support... done.
ktwc2 kernel: Checking 'hlt' instruction... OK.
ktwc2 kernel: POSIX conformance testing by UNIFIX
ktwc2 kernel: CPU0: Intel(R) Xeon(TM) CPU 2.40GHz stepping 07
ktwc2 kernel: per-CPU timeslice cutoff: 1462.99 usecs.
ktwc2 kernel: task migration cache decay timeout: 2 msecs.
ktwc2 kernel: Getting VERSION: 50014
ktwc2 kernel: Getting VERSION: 50014
ktwc2 kernel: Getting ID: 0
ktwc2 kernel: Getting LVT0: 700
ktwc2 kernel: Getting LVT1: 400
ktwc2 keytable: Loading keymap:
ktwc2 kernel: enabled ExtINT on CPU#0
ktwc2 keytable:
ktwc2 kernel: ESR value before enabling vector: 00000000
ktwc2 keytable: Loading system font:
ktwc2 kernel: ESR value after enabling vector: 00000000
ktwc2 keytable:
ktwc2 kernel: CPU present map: c3
ktwc2 kernel: Booting processor 1/1 eip 3000
ktwc2 kernel: Setting warm reset code and vector.
ktwc2 rc: Starting keytable: succeeded
ktwc2 kernel: 1.
ktwc2 kernel: 2.
ktwc2 kernel: 3.
ktwc2 kernel: Asserting INIT.
ktwc2 kernel: Waiting for send to finish...
ktwc2 kernel: +Deasserting INIT.
ktwc2 kernel: Waiting for send to finish...
ktwc2 kernel: +#startup loops: 2.
ktwc2 kernel: Sending STARTUP #1.
ktwc2 kernel: After apic_write.
ktwc2 kernel: Startup point 1.
ktwc2 kernel: Waiting for send to finish...
ktwc2 kernel: +Sending STARTUP #2.
ktwc2 random: Initializing random number generator: succeeded
ktwc2 kernel: After apic_write.
ktwc2 kernel: Startup point 1.
ktwc2 kernel: Waiting for send to finish...
ktwc2 kernel: +After Startup.
ktwc2 kernel: Before Callout 1.
ktwc2 kernel: After Callout 1.
ktwc2 kernel: Not responding.
.......





-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/