Re: nfsd deadlock, 2.6.36-rc3

From: J. Bruce Fields
Date: Wed Sep 08 2010 - 13:51:32 EST


On Wed, Sep 08, 2010 at 10:52:51AM -0600, Tim Gardner wrote:
> The solution appears to be to twiddle with
> /proc/sys/vm/min_free_kbytes and /proc/sys/vm/drop_caches, though
> I'm not sure this addresses the root cause. Perhaps low memory
> really is the root cause.
>
> At any rate, their solution was to set min_free_kbytes to 4GB, and
> to 'echo 1 > /proc/sys/vm/drop_caches' whenever free memory fell
> below 8GB. Not particularly elegant, but it appears to have stopped
> their server from wedging.

That does sound like a workaround rather than a fix. Were there any
diagnostics left in the logs after the lockups? Could you get sysrq-t
dumps and figure out what was waiting on what? If the system was too
wedged for any of that to work, would any fo the watchdog deubgging
options help?

--b.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/