On Mon, 26 Dec 2016, Boris Ostrovsky wrote:
On 12/26/2016 10:45 AM, Markus Trippelsdorf wrote:
The comment in arch/x86/kernel/cpu/mcheck/mce_amd.c says:
1384 * mcheck_init_device should be inited before threshold_init_device to
1385 * initialize mce_device, otherwise a NULL ptr dereference will cause panic.
That's not changed by this commit.
My nightly test hit this as well. AMD only, Intel passed. I haven't
verified whether commit that Markus implicated is the one that caused
this but it's the same BUG signature (but possibly slightly different stack)
[ 1.554351] smpboot: CPU0: AMD Engineering Sample (family: 0x10,
model: 0x4, stepping: 0x1)
...
Is there anything interesting error message before the BUG hits? I'll try
to reproduce on a AMD box tomorrow.