Re: 2.6.34 nfs fsync() deadlock

From: Trond Myklebust
Date: Mon May 31 2010 - 11:41:19 EST


On Mon, 2010-05-31 at 15:41 +0100, Nix wrote:
> I just got a deadlock while doing a postgresql 'make check' on an NFSv3
> TCP mount (the server is Linux 2.6.34 as well; the underlying filesystem
> is ext4). This testsuite is notable for doing a lot of things in
> parallel, each of which is generally doing a lot of fsync()ing.
>
> /proc/mounts for the filesystem of discourse:
>
> package.srvr.nix:/usr/src /usr/src nfs rw,relatime,vers=3,rsize=1048576,wsize=1048576,namlen=255,hard,proto=tcp,timeo=600,retrans=2,sec=sys,mountaddr=192.168.14.15,mountvers=3,mountport=50892,mountproto=udp,addr=192.168.14.15 0 0
>
> /etc/exports for it on the server:
>
> /usr/src mutilate(rw,no_root_squash,no_subtree_check,async)
>
> dmesg:

Can you see if the following 2 attached patches (already sent to
stable@xxxxxxxxxx) suffice to fix the hang?

Cheers
Trond