Re: PROBLEM: mce: [Hardware Error] from dmesg -l emerg

From: Borislav Petkov
Date: Tue May 22 2018 - 03:41:23 EST


On Mon, May 21, 2018 at 05:06:14PM -0700, Luck, Tony wrote:
> On Tue, May 22, 2018 at 02:43:37AM +0530, Jeffrin Thalakkottoor wrote:
> > mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 5: ee0000000040110b
> > mce: [Hardware Error]: TSC 0 ADDR 160000080 MISC 5040008086
> > mce: [Hardware Error]: PROCESSOR 0:306d4 TIME 1526932210 SOCKET 0 APIC
> > 0 microcode 2a
>
> The problem is that "mcelog --ascii" is expecting the first line to
> look like:
>
> CPU 0: Machine Check Exception: 0 Bank 5: ee0000000040110b
>
> This seems to have been broken by commit:
>
> cd9c57cad3fe ("x86/MCE: Dump MCE to dmesg if no consumers")
>
> relevent part is this ... where we now conditionally include the
> word "Exception".

Well, mcelog should decode the error signature regardless whether it
says "Exception" or not, IMO.

--
Regards/Gruss,
Boris.

Good mailing practices for 400: avoid top-posting and trim the reply.