Re: [PATCH v3] x86/mce: Try printing all machine check banks known before panic

From: Borislav Petkov
Date: Sun Nov 23 2014 - 15:55:37 EST


On Fri, Nov 21, 2014 at 09:59:49PM +0000, Luck, Tony wrote:
> > Oh, cpu errata. So this would mean that we can't even rely on the
> > contents of the MCA banks, can we?
> >
> > In any case, is any of the information in the MCA banks in such cases
> > even usable then? Because if not, we're definitely barking up the wrong
> > tree...
>
> See above - I think even if there is a bug in the core that isn't setting the
> right bits in the MCi_STATUS register - we could get good data from
> devices out in the uncore.

Btw, since we're talking about errata - I guess you could use X86_BUG
and static_cpu_has_bug* to check for the erratum in do_machine_check and,
on affected CPUs, modify the logging behavior to bypass mce_severity: put
all errors of the last round in mce_log and then dump them out at panic
time.
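
A rough userspace sketch of that idea, not kernel code. Every name in it
(cpu_has_mca_erratum, severity_says_log, check_round, dump_at_panic, the
ring layout) is a hypothetical stand-in: the flag plays the role of a
static_cpu_has_bug* check, and severity_says_log is a toy replacement for
mce_severity. Only the MCi_STATUS bit positions (VAL = bit 63, UC = bit 61)
are taken from the architecture.

```c
#include <inttypes.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

#define NUM_BANKS  4                /* hypothetical bank count */
#define LOG_LEN    16               /* hypothetical log capacity */
#define STATUS_VAL (1ULL << 63)     /* MCi_STATUS: register contents valid */
#define STATUS_UC  (1ULL << 61)     /* MCi_STATUS: uncorrected error */

struct mce_rec  { int bank; uint64_t status; };
struct mce_ring { struct mce_rec entry[LOG_LEN]; size_t next; };

/* Stand-in for a static_cpu_has_bug*-style query (assumption). */
static bool cpu_has_mca_erratum;

/* Toy severity grading: only uncorrected errors are worth logging. */
static bool severity_says_log(uint64_t status)
{
	return (status & STATUS_UC) != 0;
}

static void ring_add(struct mce_ring *log, int bank, uint64_t status)
{
	if (log->next < LOG_LEN)
		log->entry[log->next++] = (struct mce_rec){ bank, status };
}

/*
 * One round over all banks. On a CPU with the erratum the severity
 * grading is bypassed and every valid bank is recorded.
 * Returns the number of records added in this round.
 */
static size_t check_round(struct mce_ring *log,
			  const uint64_t status[NUM_BANKS])
{
	size_t before = log->next;

	for (int bank = 0; bank < NUM_BANKS; bank++) {
		if (!(status[bank] & STATUS_VAL))
			continue;
		if (cpu_has_mca_erratum || severity_says_log(status[bank]))
			ring_add(log, bank, status[bank]);
	}
	return log->next - before;
}

/* Dump everything collected so far, as one would right before panic. */
static void dump_at_panic(const struct mce_ring *log)
{
	for (size_t i = 0; i < log->next; i++)
		printf("bank %d: status 0x%016" PRIx64 "\n",
		       log->entry[i].bank, log->entry[i].status);
}
```

The point of the sketch is only the condition in check_round: the erratum
flag short-circuits the grading, so banks that the grading would have
skipped still end up in the log and get printed by dump_at_panic.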

--
Regards/Gruss,
Boris.

Sent from a fat crate under my desk. Formatting is fine.