linux-2.2.0-pre9/final nfs repeatedly crashes after 10 minutes of operation

Leif Johansson (leifj@matematik.su.se)
Sat, 23 Jan 1999 16:22:12 +0100


Hi,

Server : 2.2.0-final, knfsd-981204 (latest), rh5.2+util-linux-2.9g
Clients: several 2.0.36, nfs-server-2.2beta37-1, rh 5.2 and one solaris 2.5.1 (mounts with vers=2)

After at most 10 minutes of nfs service the nfs server says this and the nfsd is as dead as a duck:

Jan 23 14:09:43 fs1 kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000022
Jan 23 14:09:43 fs1 kernel: current->tss.cr3 = 00101000, `r3 = 00101000
Jan 23 14:09:43 fs1 kernel: *pde = 00000000
Jan 23 14:09:43 fs1 kernel: Oops: 0000
Jan 23 14:09:43 fs1 kernel: CPU: 0
Jan 23 14:09:43 fs1 kernel: EIP: 0010:[<c012d66e>]
Jan 23 14:09:43 fs1 kernel: EFLAGS: 00010292
Jan 23 14:09:43 fs1 kernel: eax: 00000000 ebx: c189c2c0 ecx: c189cc40 edx: c0ad804c
Jan 23 14:09:43 fs1 kernel: esi: c189cc40 edi: c0ad8000 ebp: c0ad8000 esp: c0e59ee4
Jan 23 14:09:43 fs1 kernel: ds: 0018 es: 0018 ss: 0018
Jan 23 14:09:43 fs1 kernel: Process nfsd (pid: 309, process nr: 19, stackpage=c0e59000)
Jan 23 14:09:43 fs1 kernel: Stack: c0ad8000 c0ad8000 0000000c c0151584 c0ad8000 c189cc40 c0ad8000 c189c2c0
Jan 23 14:09:43 fs1 kernel: c0093ba4 c0093b60 c0630000 c014df60 c189cc40 c014eae1 c0630000 c0093b60
Jan 23 14:09:43 fs1 kernel: c12428ac 0000000c c0093ba4 c12428dc 0000000c c0630000 c020902c c0eb0014
Jan 23 14:09:43 fs1 kernel: Call Trace: [<c0151584>] [<c014df60>] [<c014eae1>] [<c014e02b>] [<c018399d>] [<c018417f>] [<c018
4c67>]
Jan 23 14:09:43 fs1 kernel: [<c014df60>] [<c014deb3>] [<c014dd14>] [<c01064a3>]
Jan 23 14:09:43 fs1 kernel: Code: 66 8b 50 22 66 81 e2 00 f0 66 81 fa 00 40 0f 94 c2 81 e2 ff
Jan 23 14:10:16 fs1 kernel: fh_verify: oldin/.netscape permission failure, acc=1, error=218103808
Jan 23 14:10:16 fs1 kernel: fh_verify: oldin/.netscape permission failure, acc=4, error=218103808
Jan 23 14:10:17 fs1 kernel: fh_verify: oldin/.netscape permission failure, acc=1, error=218103808

I was unable to kill/restart nfsd and had to hit reset. The problem repeated itself again after ca 10
minutes of operation. This time I called it a day, backed down to 2.0.36 and wrote this email. With the
old kernel it is as stable as a rock so I tend to discount hw problems. The error seemed to occur when
two persons were accessing the same file (the fh_verify error turned up at about the same time). Any idea?

Please cc me any replies since I am not on the list.

Best Regards
Leif Johansson

Leif Johansson Phone: +46 8 164541
Department of Mathematics Fax : +46 8 6126717
Stockholm University email: leifj@matematik.su.se

<This space is left blank for quotational and disclamatory purposes.>

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/