2.2.0-pre7-ac6 oopses

Scott Henry (scotth@sgi.com)
17 Jan 1999 17:58:47 -0800


I'm running on a UMAX 520T notebook (K6-2/300MHz/64MB) with a Red Hat 5.2 base.

The running kernel, from /proc/version:
Linux version 2.2.0-pre7-ac6 (root@washu.corp.sgi.com) (gcc version 2.7.2.3) #2 Sun Jan 17 15:33:20 PST 1999

I was running netscape (v4.0?) while compiling pre7-ac6 with the
arca-VM-23 patches. I'm not sure whether the two are intended to be
used together; there was only one problem, in the '#define INIT_MM'
section of include/linux/sched.h.

Netscape died (and now dies on startup), and make also dies trying
to compile the kernel. The oopses (quite repeatable) are as follows:

Jan 17 17:07:03 washu kernel: Unable to handle kernel NULL pointer dereference at virtual address 0000000f
Jan 17 17:07:03 washu kernel: current->tss.cr3 = 03ae9000, %cr3 = 03ae9000
Jan 17 17:07:03 washu kernel: *pde = 00000000
Jan 17 17:07:03 washu kernel: Oops: 0000
Jan 17 17:07:03 washu kernel: CPU: 0
Jan 17 17:07:03 washu kernel: EIP: 0010:[<c011b043>]
Jan 17 17:07:03 washu kernel: EFLAGS: 00010202
Jan 17 17:07:03 washu kernel: eax: c02ae087 ebx: 00000000 ecx: 0000000c edx: 00000007
Jan 17 17:07:03 washu kernel: esi: c213dea8 edi: 0003084f ebp: c0684000 esp: c289df00
Jan 17 17:07:03 washu kernel: ds: 0018 es: 0018 ss: 0018
Jan 17 17:07:03 washu kernel: Process netscape-naviga (pid: 7911, process nr: 80, stackpage=c289d000)
Jan 17 17:07:03 washu kernel: Stack: c02a30c0 00006000 00007000 c02ae087 c011b4a6 c188d4a0 00007000 00000000
Jan 17 17:07:03 washu kernel: 00001000 402b5000 402b6000 c188d4a0 08792800 00006000 00001000 00004000
Jan 17 17:07:03 washu kernel: 00012000 00000000 00000002 00000001 00006000 c213dea8 c011b83c c188d4a0
Jan 17 17:07:03 washu kernel: Call Trace: [<c011b4a6>] [<c011b83c>] [<c011b788>] [<c016388e>] [<c0122e7e>] [<c01089f0>]
Jan 17 17:07:03 washu kernel: Code: 39 72 08 75 f4 8b 4c 24 10 39 4a 0c 75 eb 8d 42 14 ff 42 14

Jan 17 17:08:24 washu kernel: Unable to handle kernel NULL pointer dereference at virtual address 0000000f
Jan 17 17:08:24 washu kernel: current->tss.cr3 = 001c7000, %cr3 = 001c7000
Jan 17 17:08:24 washu kernel: *pde = 00000000
Jan 17 17:08:24 washu kernel: Oops: 0000
Jan 17 17:08:24 washu kernel: CPU: 0
Jan 17 17:08:24 washu kernel: EIP: 0010:[<c011b30b>]
Jan 17 17:08:24 washu kernel: EFLAGS: 00010202
Jan 17 17:08:24 washu kernel: eax: 00000007 ebx: c02ae087 ecx: c02ae087 edx: 00000000
Jan 17 17:08:24 washu kernel: esi: c37a0860 edi: 00012000 ebp: 00000000 esp: c3dd7f28
Jan 17 17:08:24 washu kernel: ds: 0018 es: 0018 ss: 0018
Jan 17 17:08:24 washu kernel: Process make (pid: 8355, process nr: 87, stackpage=c3dd7000)
Jan 17 17:08:24 washu kernel: Stack: 4000e000 c1a53b60 c2a658a0 c02ae087 4000d000 00001000 c3cc8f40 c252fe60
Jan 17 17:08:24 washu kernel: c2a65ea0 00000001 00000000 c37a0860 c011b83c c1a53b60 c1a53b74 c3dd7f80
Jan 17 17:08:24 washu kernel: c011b788 c1a53b60 4000d000 0806b1f0 00001000 c010c890 00000000 00001000
Jan 17 17:08:24 washu kernel: Call Trace: [<c011b83c>] [<c011b788>] [<c010c890>] [<c0122e7e>] [<c01089f0>]
Jan 17 17:08:24 washu kernel: Code: 39 70 08 75 f0 39 50 0c 75 eb 8d 50 14 ff 40 14 ba 02 00 00
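
A rough cross-check of the two oopses, as a sketch only: both "Code:" lines
start with opcode 39 (x86 CMP r/m32, r32) followed by a ModRM byte and an
8-bit displacement. Assuming the "Code:" bytes begin at the faulting EIP (an
assumption about this kernel's oops format), decoding that first instruction
and adding the displacement to the base register value from the register dump
should give the reported fault address. The helper names below are
hypothetical, and only this one addressing form is handled.

```python
# Sketch: decode the leading "39 /r disp8" instruction from an oops "Code:"
# line and compute base register + displacement.
# Assumption: the Code: bytes start at the faulting EIP.

REGS = ["eax", "ecx", "edx", "ebx", "esp", "ebp", "esi", "edi"]


def decode_cmp_disp8(code_hex):
    """Return (compared_reg, base_reg, disp) for 'cmp %reg, disp8(%base)'."""
    b = [int(x, 16) for x in code_hex.split()]
    if b[0] != 0x39:
        raise ValueError("expected opcode 0x39 (CMP r/m32, r32)")
    modrm = b[1]
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    if mod != 1 or rm == 4:
        raise ValueError("only the mod=01, no-SIB form is handled here")
    disp = b[2] - 256 if b[2] >= 128 else b[2]  # sign-extend disp8
    return REGS[reg], REGS[rm], disp


def fault_address(code_hex, regs):
    """Effective address touched by the decoded instruction."""
    _, base, disp = decode_cmp_disp8(code_hex)
    return (regs[base] + disp) & 0xFFFFFFFF


# Register values copied from the two register dumps above.
oops1 = fault_address("39 72 08 75 f4", {"edx": 0x00000007})  # netscape
oops2 = fault_address("39 70 08 75 f0", {"eax": 0x00000007})  # make
print(decode_cmp_disp8("39 72 08"), hex(oops1))
print(decode_cmp_disp8("39 70 08"), hex(oops2))
```

In both cases the base register holds 00000007 and the displacement is 8,
which matches the reported virtual address 0000000f. That suggests the same
bogus pointer value (7) is being followed in both crashes, though the sketch
says nothing about where that value came from.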

The make oops occurs when compiling the lockd modules under fs:

make[1]: Entering directory `/usr/src/linux-2.2.0-pre7-ac6/fs'
make -C lockd modules
make[1]: *** [_modsubdir_lockd] Segmentation fault
make[1]: Leaving directory `/usr/src/linux-2.2.0-pre7-ac6/fs'
make: *** [_mod_fs] Error 2

Also, my NFS (via autofs) seems to be hung. This machine is a client
mounting a disk on an SGI Indy running IRIX 6.5.2. This has
happened before, and on those occasions (with previous kernels) it
froze out my PCMCIA ethernet card. The symptoms in the message log
are as follows (there's more; this is just part of it):

Jan 17 17:27:54 washu automount[285]: attempting to mount entry /net/skuld.linux
Jan 17 17:27:55 washu kernel: SIG: sigpending lied
Jan 17 17:28:06 washu last message repeated 5 times
Jan 17 17:30:36 washu kernel: SIG: sigpending lied
Jan 17 17:30:36 washu kernel: SIG: sigpending lied
Jan 17 17:30:36 washu automount[8661]: expired /net/skuld.linux
Jan 17 17:30:44 washu automount[285]: attempting to mount entry /net/skuld.linux
Jan 17 17:30:44 washu kernel: SIG: sigpending lied
Jan 17 17:30:45 washu last message repeated 5 times
Jan 17 17:30:46 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:30:51 washu last message repeated 7 times
Jan 17 17:30:57 washu kernel: nfs: server skuld not responding, still trying
Jan 17 17:30:57 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:30:57 washu kernel: nfs: server skuld not responding, still trying
Jan 17 17:30:57 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:30:58 washu last message repeated 2 times
Jan 17 17:30:58 washu kernel: nfs: task 30 can't get a request slot
Jan 17 17:32:08 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:32:39 washu last message repeated 12 times
Jan 17 17:32:45 washu last message repeated 3 times
Jan 17 17:32:51 washu kernel: nfs: task 31 can't get a request slot
Jan 17 17:32:51 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:33:21 washu last message repeated 11 times
Jan 17 17:33:51 washu last message repeated 10 times
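
To get a feel for how often each message really occurs, the syslog
"last message repeated N times" lines can be folded back into the message
they refer to. The following is a minimal sketch over a short excerpt of the
log above; the function name is hypothetical. It also looks up 14 as an errno
value, on the assumption (not confirmed here) that the RPC layer is printing
a standard errno number.

```python
import errno
import re

# Short excerpt copied from the log above.
LOG = """\
Jan 17 17:30:46 washu kernel: RPC: sendmsg returned error 14
Jan 17 17:30:51 washu last message repeated 7 times
Jan 17 17:30:57 washu kernel: nfs: server skuld not responding, still trying
Jan 17 17:30:57 washu kernel: RPC: sendmsg returned error 14
"""

LINE_RE = re.compile(r"^\w{3}\s+\d+ [\d:]{8} \S+ (.*)$")
REPEAT_RE = re.compile(r"last message repeated (\d+) times")


def tally(log_text):
    """Count messages, crediting 'repeated N times' to the previous message."""
    counts = {}
    last = None
    for line in log_text.splitlines():
        m = LINE_RE.match(line)
        if not m:
            continue
        msg = m.group(1)
        rep = REPEAT_RE.fullmatch(msg)
        if rep:
            if last is not None:
                counts[last] += int(rep.group(1))
            continue
        counts[msg] = counts.get(msg, 0) + 1
        last = msg
    return counts


counts = tally(LOG)
for msg, n in counts.items():
    print(n, msg)

# Assumption: "error 14" is an errno value; look up its symbolic name.
print(errno.errorcode.get(14))
```

If the errno assumption holds, 14 is EFAULT ("Bad address"), which would
point at a bad pointer being handed to sendmsg rather than at a network
problem as such.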

Plus, a HUGE amount of the physical memory seems to be locked in
cache, and it only seems to increase. From /proc/meminfo:

        total:    used:    free:  shared: buffers:  cached:
Mem:  64962560 52199424 12763136 17354752  3727360 31481856
Swap: 78409728  6463488 71946240
MemTotal: 63440 kB
MemFree: 12464 kB
MemShared: 16948 kB
Buffers: 3640 kB
Cached: 30744 kB
SwapTotal: 76572 kB
SwapFree: 70260 kB
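
To put a number on the "HUGE amount", the kB lines above can be turned into
percentages of MemTotal. This is only a small sketch over the values quoted
above; the parsing helper is hypothetical, and it shows a single snapshot,
so it cannot say anything about the growth over time.

```python
# The kB-style lines copied from the /proc/meminfo output above.
MEMINFO = """\
MemTotal: 63440 kB
MemFree: 12464 kB
MemShared: 16948 kB
Buffers: 3640 kB
Cached: 30744 kB
SwapTotal: 76572 kB
SwapFree: 70260 kB
"""


def parse_kb(text):
    """Parse 'Name: value kB' lines into a dict of integers (in kB)."""
    out = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if len(parts) == 2 and parts[1] == "kB":
            out[key.strip()] = int(parts[0])
    return out


info = parse_kb(MEMINFO)
cached_pct = 100 * info["Cached"] / info["MemTotal"]
cache_and_buffers_pct = (
    100 * (info["Cached"] + info["Buffers"]) / info["MemTotal"]
)
print(f"cached:           {cached_pct:.1f}% of MemTotal")
print(f"cached + buffers: {cache_and_buffers_pct:.1f}% of MemTotal")
```

On these figures, a little under half of physical memory is page cache, and
a little over half once buffers are included.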

Most of my shells are hung on something. Only my remote stuff (and
local emacs) is currently working. I could open more shells... New
processes are able to start.

I'm going to reboot to this kernel again, to see how reproducible it is...

(I'm reading linux-kernel via a mail-to-news gateway)

-- 
               More Important Drivel from:
 Scott Henry <scotth@sgi.com>   \o^o/   http://reality.sgi.com/scotth/
