Re: [PATCH 1/5] HWPOISON: define VM_FAULT_HWPOISON to 0 whenfeature is disabled

From: Ingo Molnar
Date: Fri Jun 12 2009 - 07:23:32 EST



* Wu Fengguang <fengguang.wu@xxxxxxxxx> wrote:

> So as to eliminate one #ifdef in the c source.
>
> Proposed by Nick Piggin.
>
> CC: Nick Piggin <npiggin@xxxxxxx>
> Signed-off-by: Wu Fengguang <fengguang.wu@xxxxxxxxx>
> ---
> arch/x86/mm/fault.c | 3 +--
> include/linux/mm.h | 7 ++++++-
> 2 files changed, 7 insertions(+), 3 deletions(-)
>
> --- sound-2.6.orig/arch/x86/mm/fault.c
> +++ sound-2.6/arch/x86/mm/fault.c
> @@ -819,14 +819,13 @@ do_sigbus(struct pt_regs *regs, unsigned
> tsk->thread.error_code = error_code;
> tsk->thread.trap_no = 14;
>
> -#ifdef CONFIG_MEMORY_FAILURE
> if (fault & VM_FAULT_HWPOISON) {
> printk(KERN_ERR
> "MCE: Killing %s:%d due to hardware memory corruption fault at %lx\n",
> tsk->comm, tsk->pid, address);
> code = BUS_MCEERR_AR;
> }
> -#endif

Btw., anything like this should happen in close cooperation with the
x86 tree, not as some pure MM feature. I dont see Cc:s and nothing
that indicates that realization. What's going on here?

It is not at all clear to me whether propagating hardware failures
this widely is desired from a general design POV. Most desktop
hardware wont give a damn about this (and if a hardware fault
happens you want to get as far from the crappy hardware as possible)
so i'm not sure how relevant it is and how well tested it will
become in practice.

I.e. really some wider discussion needs to happen on this.

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/