2.4.8 oops in ksoftirqd_CPU0

From: Manuel McLure (manuel@mclure.org)
Date: Sat Aug 11 2001 - 22:39:27 EST


Hardware:

MSI K7T Turbo R
Athlon Thunderbird 900MHz (not overclocked)
256MB PC133 SDRAM
Soundblaster Live!

Kernel is 2.4.8 with two patches - Rui Sosa's emu10k1 compile patch and the
linux-2.4.4-openssl-0.9.6a-mppe.patch from
ftp://mirror.binarix.com/ppp-mppe (for encrypted PPTP connections) built
with Athlon optimizations (.config file is attached).

History - kernel 2.4.6 (with CVS emu10k1 driver and the MPPE patch) runs
rock-solid on this machine. Since I upgraded to 2.4.7 (again with CVS
emu10k1 and the MPPE patch) I started seeing hangs. At that point I could
not get oops information, unfortunately. Today I installed 2.4.8
(configured as above) and started getting hangs again. I then set up serial
console and managed to get an Oops. While I was running that Oops through
ksymoops I got another Oops! Following are both Oops:

First Oops:
------------------------------------------------------
ksymoops 2.4.0 on i686 2.4.8. Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.8/ (default)
     -m /boot/System.map-2.4.8 (default)

Warning: You did not tell me where to find symbol information. I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc. ksymoops -h explains the options.

Warning (compare_maps): mismatch on symbol tulip_max_interrupt_work ,
tulip says d190236c, /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
says d190192c. Ignoring /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
entry
Warning (compare_maps): mismatch on symbol tulip_rx_copybreak , tulip says
d1902370, /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o says
d1901930. Ignoring /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
entry
Warning (compare_maps): mismatch on symbol usb_devfs_handle , usbcore says
d18d12e0, /lib/modules/2.4.8/kernel/drivers/usb/usbcore.o says d18d0e00.
Ignoring /lib/modules/2.4.8/kernel/drivers/usb/usbcore.o entry
Unable to handle kernel NULL pointer dereference at virtual address
00000000
d796940f
*pde = 00000000
Oops: 0000
CPU: 0
EIP: 0010:[<d796940f>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010082
eax: cfbf1d80 ebx: 00000000 ecx: 00000001 edx: 00000000
esi: 00000282 edi: 00000000 ebp: c02d0240 esp: c15e7fa8
ds: 0018 es: 0018 ss: 0018
Process ksoftirqd_CPU0 (pid: 3, stackpage=c15e7000)
Stack: c15e6000 c627b2c4 00000000 c0116a34 cfbf1d80 00000001 c02d0280
fffffffe
       00000246 c01167fb c02d0280 c15e6000 c15e6000 c0105000 0008e000
c0116ca5
       00010f00 c15f7fb0 c0105706 c15f7fc8 c0116c10 c15f7fc8
Call Trace: [<c0116a34>] [<c01167fb>] [<c0105000>] [<c0116ca5>]
[<c0105706>] [<c0116c10>]
Code: f6 03 02 75 04 56 9d eb 54 53 e8 e2 23 00 00 8d 44 24 04 50

>>EIP; d796940f <[emu10k1]emu10k1_waveout_bh+f/70> <=====
Trace; c0116a34 <tasklet_hi_action+64/90>
Trace; c01167fb <do_softirq+4b/90>
Trace; c0105000 <_stext+0/0>
Trace; c0116ca5 <ksoftirqd+95/c0>
Trace; c0105706 <kernel_thread+26/30>
Trace; c0116c10 <ksoftirqd+0/c0>
Code; d796940f <[emu10k1]emu10k1_waveout_bh+f/70>
00000000 <_EIP>:
Code; d796940f <[emu10k1]emu10k1_waveout_bh+f/70> <=====
   0: f6 03 02 testb $0x2,(%ebx) <=====
Code; d7969412 <[emu10k1]emu10k1_waveout_bh+12/70>
   3: 75 04 jne 9 <_EIP+0x9> d7969418
<[emu10k1]emu10k1_waveout_bh+18/70>
Code; d7969414 <[emu10k1]emu10k1_waveout_bh+14/70>
   5: 56 push %esi
Code; d7969415 <[emu10k1]emu10k1_waveout_bh+15/70>
   6: 9d popf
Code; d7969416 <[emu10k1]emu10k1_waveout_bh+16/70>
   7: eb 54 jmp 5d <_EIP+0x5d> d796946c
<[emu10k1]emu10k1_waveout_bh+6c/70>
Code; d7969418 <[emu10k1]emu10k1_waveout_bh+18/70>
   9: 53 push %ebx
Code; d7969419 <[emu10k1]emu10k1_waveout_bh+19/70>
   a: e8 e2 23 00 00 call 23f1 <_EIP+0x23f1> d796b800
<[emu10k1]emu10k1_waveout_update+0/a0>
Code; d796941e <[emu10k1]emu10k1_waveout_bh+1e/70>
   f: 8d 44 24 04 lea 0x4(%esp,1),%eax
Code; d7969422 <[emu10k1]emu10k1_waveout_bh+22/70>
  13: 50 push %eax

Kernel panic: Aiee, killing interrupt handler!

4 warnings issued. Results may not be reliable.
------------------------------------------------------

Second Oops (happened while trying to ksymoops the first oops):
------------------------------------------------------
ksymoops 2.4.0 on i686 2.4.8. Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.8/ (default)
     -m /boot/System.map-2.4.8 (default)

Warning: You did not tell me where to find symbol information. I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc. ksymoops -h explains the options.

Warning (compare_maps): mismatch on symbol tulip_max_interrupt_work ,
tulip says d190236c, /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
says d190192c. Ignoring /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
entry
Warning (compare_maps): mismatch on symbol tulip_rx_copybreak , tulip says
d1902370, /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o says
d1901930. Ignoring /lib/modules/2.4.8/kernel/drivers/net/tulip/tulip.o
entry
Warning (compare_maps): mismatch on symbol usb_devfs_handle , usbcore says
d18d12e0, /lib/modules/2.4.8/kernel/drivers/usb/usbcore.o says d18d0e00.
Ignoring /lib/modules/2.4.8/kernel/drivers/usb/usbcore.o entry
Unable to handle kernel NULL pointer dereference at virtual address
00000000
d796940f
*pde = 00000000
Oops: 0000
CPU: 0
EIP: 0010:[<d796940f>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010082
eax: c8df78c0 ebx: 00000000 ecx: 00000001 edx: 00000000
esi: 00000282 edi: 00000000 ebp: c02d0240 esp: c15e7fa8
ds: 0018 es: 0018 ss: 0018
Process ksoftirqd_CPU0 (pid: 3, stackpage=c15e7000)
Stack: c15e6000 cf16e2c4 00000000 c0116a34 c8df78c0 00000001 c02d0280
fffffffe
       00000246 c01167fb c02d0280 c15e6000 c15e6000 c0105000 0008e000
c0116ca5
       00010f00 c15f7fb0 c0105706 c15f7fc8 c0116c10 c15f7fc8
Call Trace: [<c0116a34>] [<c01167fb>] [<c0105000>] [<c0116ca5>]
[<c0105706>] [<c0116c10>]
Code: f6 03 02 75 04 56 9d eb 54 53 e8 e2 23 00 00 8d 44 24 04 50

>>EIP; d796940f <[emu10k1]emu10k1_waveout_bh+f/70> <=====
Trace; c0116a34 <tasklet_hi_action+64/90>
Trace; c01167fb <do_softirq+4b/90>
Trace; c0105000 <_stext+0/0>
Trace; c0116ca5 <ksoftirqd+95/c0>
Trace; c0105706 <kernel_thread+26/30>
Trace; c0116c10 <ksoftirqd+0/c0>
Code; d796940f <[emu10k1]emu10k1_waveout_bh+f/70>
00000000 <_EIP>:
Code; d796940f <[emu10k1]emu10k1_waveout_bh+f/70> <=====
   0: f6 03 02 testb $0x2,(%ebx) <=====
Code; d7969412 <[emu10k1]emu10k1_waveout_bh+12/70>
   3: 75 04 jne 9 <_EIP+0x9> d7969418
<[emu10k1]emu10k1_waveout_bh+18/70>
Code; d7969414 <[emu10k1]emu10k1_waveout_bh+14/70>
   5: 56 push %esi
Code; d7969415 <[emu10k1]emu10k1_waveout_bh+15/70>
   6: 9d popf
Code; d7969416 <[emu10k1]emu10k1_waveout_bh+16/70>
   7: eb 54 jmp 5d <_EIP+0x5d> d796946c
<[emu10k1]emu10k1_waveout_bh+6c/70>
Code; d7969418 <[emu10k1]emu10k1_waveout_bh+18/70>
   9: 53 push %ebx
Code; d7969419 <[emu10k1]emu10k1_waveout_bh+19/70>
   a: e8 e2 23 00 00 call 23f1 <_EIP+0x23f1> d796b800
<[emu10k1]emu10k1_waveout_update+0/a0>
Code; d796941e <[emu10k1]emu10k1_waveout_bh+1e/70>
   f: 8d 44 24 04 lea 0x4(%esp,1),%eax
Code; d7969422 <[emu10k1]emu10k1_waveout_bh+22/70>
  13: 50 push %eax

Kernel panic: Aiee, killing interrupt handler!

4 warnings issued. Results may not be reliable.
------------------------------------------------------

If there is any more needed information, please ask. I read linux-kernel
from work, not home, so please Cc: me at this address if you want any
replies during the weekend.

Thanks!

-- 
Manuel A. McLure KE6TAW | ...for in Ulthar, according to an ancient
<manuel@mclure.org>     | and significant law, no man may kill a cat.
<http://www.mclure.org> |             -- H.P. Lovecraft


- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Wed Aug 15 2001 - 21:00:40 EST