Re: x86-64 sporadic hang in 2.6.23rc7 and 2.6.22

From: Helge Hafting
Date: Fri Oct 05 2007 - 08:12:57 EST


Andi Kleen wrote:
Helge Hafting <helge.hafting@xxxxxxxxxxxxx> writes:
shrink_dcache_memory

That usually means random memory corruption from somewhere -- dcache
tends to use a lot of memory and when it is corrupted anywhere these functions tend to crash while walking the lists.

Unfortunately memory corruption is hard to track down because
the messenger is usually not the one to blame.

Perhaps enable slab debugging and see if it turns
something up. Could be also broken hardware. Does an older kernel
run stable? If yes and if it can be reproduced bisecting would
be good.
I attempted bisecting - and failed. The problem is that
2.6.23rc7 seems very unstable, but 2.6.22rc4 is much better
but not perfect. 2.6.22rc4 only crashed once - it can compile for
hours and swap lots and keep running. But it died at least once.

I'll try running recent kernels with more debugging instead.
I think I used SLUB instead of SLAB - either way I can switch
that over to see if it changes things.

Helge Hafting

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/