PROBLEM: crashes of 2.4.31hf2.6 kernel (oops included)

From: morrowc+kernel
Date: Mon Jul 03 2006 - 22:20:42 EST



I applied the HF2.6 patch to a current 2.4.31 kernel source tree, re-built/re-installed and recieve some 'new' errors... Below is the standard (from: http://kernel.org/pub/linux/docs/lkml/reporting-bugs.html) form + oops output. Hopefully this isn't a 'new' problem, though i'd rather like to not upgrade this box to 2.6 if possible (yet).

To: linux-kernel@xxxxxxxxxxxxxxx
From: morrowc+kernel@xxxxxxxxxxxxxx
Subject: PROBLEM: crashes of 2.4.31 kernel (oops included)


From URL - http://kernel.org/pub/linux/docs/lkml/reporting-bugs.html


[1.] One line summary of the problem: kernel panic/oops system hangs
[2.] Full description of the problem/report:
After some random period of time (from 1 hour to several days) kernel receives an 'oops' and dies. Requires power cycle to recover. I've a few of the 'oops' output saved and converted with ksymoops if required (included one below) Error looks like it might be the ethernet driver
or perhaps the tcp connection maintenance...

[3.] Keywords (i.e., modules, networking, kernel):
networking kernel oops
[4.] Kernel version (from /proc/version):
Linux version 2.4.31 (root@xxxxxxxxxxxxxxxxxxxxx) (gcc version 3.3.5 (Debian 1:3.3.5-13)) #2 SMP Tue Jun 27 00:59:16 GMT 2006

(actually 2.4.21HF2.6 patch applied)

[5.] Output of Oops.. message (if applicable) with symbolic information
resolved (see Documentation/oops-tracing.txt)

ksymoops 2.4.9 on i686 2.4.31. Options used
-v /usr/src/linux-2.4.31/vmlinux (specified)
-k /proc/ksyms (default)
-l /proc/modules (default)
-o /lib/modules/2.4.31/ (default)
-m /usr/src/linux-2.4.31/System.map (specified)

Unable to handle kernel paging request at virtual address 08040124
c024b03f
*pde = 20528067
Oops: 0000
CPU: 1
EIP: 0010:[<c024b03f>] Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246
eax: 08040120 ebx: e9850b00 ecx: 00000000 edx: ec606c00
esi: ee3c2500 edi: ee3c2000 ebp: ec606d3c esp: dcfbb95c
ds: 0018 es: 0018 ss: 0018
Process cleanup (pid: 10270, stackpage=dcfbb000)
Stack: ec606c00 e9850b00 00000000 00000011 0000013b 01e761f0 00000005 00000005
ec606c00 c024b200 ec606d3c 00000001 c024b4b6 ec606c00 c32de800 dcfbb9a4
ec606c00 c024b200 00000020 c012310e ec606c00 00000000 ec606bc0 d37fdf04 Call Trace: [<c024b200>] [<c024b4b6>] [<c024b200>] [<c012310e>] [<c011f205>]
[<c011f0b3>] [<c011ee66>] [<c010af26>] [<c010d5c8>] [<c0180309>] [<c016a422>]
[<c016d800>] [<c0116ac1>] [<c0116ac1>] [<c0177702>] [<c0176d3e>] [<c0178f1d>]
[<c01790dd>] [<c016dbf8>] [<c0179453>] [<c0184e0f>] [<c018541e>] [<c0173fc1>]
[<c018ba95>] [<c0175482>] [<c018bd5d>] [<c018c5ee>] [<c0140f0e>] [<c013fe12>]
[<c013e6bf>] [<c013e766>] [<c0108efb>]
Code: ff 50 04 85 c0 0f 85 8b 00 00 00 0f b6 43 10 fe c0 88 43 10


EIP; c024b03f <tcp_synack_timer+ff/1f0> <=====

ebx; e9850b00 <_end+29508560/307f8ac0>
edx; ec606c00 <_end+2c2be660/307f8ac0>
esi; ee3c2500 <_end+2e079f60/307f8ac0>
edi; ee3c2000 <_end+2e079a60/307f8ac0>
ebp; ec606d3c <_end+2c2be79c/307f8ac0>
esp; dcfbb95c <_end+1cc733bc/307f8ac0>

Trace; c024b200 <tcp_keepalive_timer+0/2ce>
Trace; c024b4b6 <tcp_keepalive_timer+2b6/2ce>
Trace; c024b200 <tcp_keepalive_timer+0/2ce>
Trace; c012310e <timer_bh+1be/3f0>
Trace; c011f205 <bh_action+45/70>
Trace; c011f0b3 <tasklet_hi_action+63/b0>
Trace; c011ee66 <do_softirq+d6/e0>
Trace; c010af26 <do_IRQ+e6/f0>
Trace; c010d5c8 <call_do_IRQ+5/d>
Trace; c0180309 <leaf_delete_items_entirely+d9/210>
Trace; c016a422 <balance_leaf_when_delete+82/470>
Trace; c016d800 <balance_leaf+2ff0/3010>
Trace; c0116ac1 <schedule+2c1/550>
Trace; c0116ac1 <schedule+2c1/550>
Trace; c0177702 <get_parents+d2/190>
Trace; c0176d3e <is_leaf_removable+fe/140>
Trace; c0178f1d <clear_all_dirty_bits+1d/30>
Trace; c01790dd <wait_tb_buffers_until_unlocked+1ad/350>
Trace; c016dbf8 <do_balance+98/110>
Trace; c0179453 <fix_nodes+1d3/3f0>
Trace; c0184e0f <reiserfs_cut_from_item+19f/4a0>
Trace; c018541e <reiserfs_do_truncate+29e/570>
Trace; c0173fc1 <reiserfs_truncate_file+e1/290>
Trace; c018ba95 <journal_end+25/30>
Trace; c0175482 <reiserfs_file_release+272/490>
Trace; c018bd5d <journal_end_sync+3d/90>
Trace; c018c5ee <__commit_trans_index+8e/90>
Trace; c0140f0e <fsync_buffers_list+ce/1b0>
Trace; c013fe12 <fput+112/140>
Trace; c013e6bf <filp_close+8f/d0>
Trace; c013e766 <sys_close+66/80>
Trace; c0108efb <system_call+33/38>

Code; c024b03f <tcp_synack_timer+ff/1f0>
00000000 <_EIP>:
Code; c024b03f <tcp_synack_timer+ff/1f0> <=====
0: ff 50 04 call *0x4(%eax) <=====
Code; c024b042 <tcp_synack_timer+102/1f0>
3: 85 c0 test %eax,%eax
Code; c024b044 <tcp_synack_timer+104/1f0>
5: 0f 85 8b 00 00 00 jne 96 <_EIP+0x96>
Code; c024b04a <tcp_synack_timer+10a/1f0>
b: 0f b6 43 10 movzbl 0x10(%ebx),%eax
Code; c024b04e <tcp_synack_timer+10e/1f0>
f: fe c0 inc %al
Code; c024b050 <tcp_synack_timer+110/1f0>
11: 88 43 10 mov %al,0x10(%ebx)

<0>Kernel panic: Aiee, killing interrupt handler!


[6.] A small shell script or example program which triggers the
problem (if possible)

N/A

[7.] Environment
[7.1.] Software (add the output of the ver_linux script here)

# sh ./scripts/ver_linux If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux neo-u2.ops-netman.net 2.4.31 #2 SMP Tue Jun 27 00:59:16 GMT 2006 i686 GNU/Linux

Gnu C 3.3.5
Gnu make 3.80
binutils 2.15
util-linux 2.12p
mount 2.12p
modutils 2.4.26
e2fsprogs 1.37
PPP 2.4.3
Linux C Library 2.3.2
Dynamic linker (ldd) 2.3.2
Procps 3.2.1
Net-tools 1.60
Console-tools 0.2.3
Sh-utils 5.2.1
Modules Loaded sd_mod scsi_mod ipt_LOG ipt_REJECT ipt_state ip_conntrack iptable_filter ip_tables ip6t_LOG ip6table_filter ip6_tables


[7.2.] Processor information (from /proc/cpuinfo):

# cat /proc/cpuinfo processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 8
model name : Pentium III (Coppermine)
stepping : 3
cpu MHz : 596.007
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1189.47

processor : 1
vendor_id : GenuineIntel
cpu family : 6
model : 8
model name : Pentium III (Coppermine)
stepping : 3
cpu MHz : 596.007
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1189.47



[7.3.] Module information (from /proc/modules):
# cat /proc/modules
sd_mod 10892 0 (autoclean) (unused)
scsi_mod 91636 1 (autoclean) [sd_mod]
ipt_LOG 3544 1 (autoclean)
ipt_REJECT 3384 1 (autoclean)
ipt_state 536 2 (autoclean)
ip_conntrack 22432 0 (autoclean) [ipt_state]
iptable_filter 1740 1 (autoclean)
ip_tables 12960 4 [ipt_LOG ipt_REJECT ipt_state iptable_filter]
ip6t_LOG 4312 1 (autoclean)
ip6table_filter 1836 1 (autoclean)
ip6_tables 13600 2 [ip6t_LOG ip6table_filter]


[7.4.] Loaded driver and hardware information (/proc/ioports, /proc/iomem)

# cat /proc/ioports 0000-001f : dma1
0020-003f : pic1
0040-0043 : timer0
0050-0053 : timer1
0060-006f : keyboard
0080-008f : dma page reg
00a0-00bf : pic2
00c0-00df : dma2
00f0-00ff : fpu
0170-0177 : ide1
01f0-01f7 : ide0
02f8-02ff : serial(set)
0376-0376 : ide1
03c0-03df : vga+
03f6-03f6 : ide0
03f8-03ff : serial(set)
0400-043f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI
0440-045f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI
0cf8-0cff : PCI conf1
c000-cfff : PCI Bus #01
d000-dfff : PCI Bus #02
df00-df3f : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#3)
df00-df3f : eepro100
ee80-eebf : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#2)
ee80-eebf : eepro100
ef00-ef3f : Intel Corp. 82557/8/9 [Ethernet Pro 100]
ef00-ef3f : eepro100
ef80-ef9f : Intel Corp. 82371AB/EB/MB PIIX4 USB
ffa0-ffaf : Intel Corp. 82371AB/EB/MB PIIX4 IDE
ffa0-ffa7 : ide0
ffa8-ffaf : ide1

# cat /proc/iomem 00000000-0009fbff : System RAM
0009fc00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000c9800-000ca7ff : Extension ROM
000f0000-000fffff : System ROM
00100000-2fffffff : System RAM
00100000-0026e798 : Kernel code
0026e799-002bf19f : Kernel data
fbd00000-fddfffff : PCI Bus #01
fc000000-fcffffff : Chips and Technologies F69000 HiQVideo
fde00000-fe2fffff : PCI Bus #02
fe100000-fe1fffff : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#3)
fe2ff000-fe2fffff : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#3)
fe2ff000-fe2fffff : eepro100
fe800000-fe8fffff : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#2)
fea00000-feafffff : Intel Corp. 82557/8/9 [Ethernet Pro 100]
febfe000-febfefff : Intel Corp. 82557/8/9 [Ethernet Pro 100] (#2)
febfe000-febfefff : eepro100
febff000-febfffff : Intel Corp. 82557/8/9 [Ethernet Pro 100]
febff000-febfffff : eepro100
fec00000-fec00fff : reserved
fee00000-fee00fff : reserved
ff000000-ff0fffff : PCI Bus #01
ff100000-ff1fffff : PCI Bus #02
ff400000-ff7fffff : Intel Corp. 440GX - 82443GX Host bridge
fffc0000-ffffffff : reserved



[7.5.] PCI information ('lspci -vvv' as root)
# lspci -vvv 0000:00:00.0 Host bridge: Intel Corp. 440GX - 82443GX Host bridge
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ >SERR- <PERR-
Latency: 64
Region 0: Memory at ff400000 (32-bit, prefetchable) [size=4M]
Capabilities: [a0] AGP version 1.0
Status: RQ=32 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 64bit- FW- AGP3- Rate=x1,x2
Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none>

0000:00:01.0 PCI bridge: Intel Corp. 440GX - 82443GX AGP bridge (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64
Bus: primary=00, secondary=01, subordinate=01, sec-latency=64
I/O behind bridge: 0000c000-0000cfff
Memory behind bridge: fbd00000-fddfffff
Prefetchable memory behind bridge: ff000000-ff0fffff
BridgeCtl: Parity- SERR- NoISA- VGA+ MAbort- >Reset- FastB2B+

0000:00:07.0 ISA bridge: Intel Corp. 82371AB/EB/MB PIIX4 ISA (rev 02)
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0

0000:00:07.1 IDE interface: Intel Corp. 82371AB/EB/MB PIIX4 IDE (rev 01) (prog-if 80 [Master])
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64
Region 4: I/O ports at ffa0 [size=16]

0000:00:07.2 USB Controller: Intel Corp. 82371AB/EB/MB PIIX4 USB (rev 01) (prog-if 00 [UHCI])
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64
Interrupt: pin D routed to IRQ 0
Region 4: I/O ports at ef80 [size=32]

0000:00:07.3 Bridge: Intel Corp. 82371AB/EB/MB PIIX4 ACPI (rev 02)
Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Interrupt: pin ? routed to IRQ 9

0000:00:11.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100] (rev 08)
Subsystem: Intel Corp. 82559 Fast Ethernet LAN on Motherboard
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (2000ns min, 14000ns max), Cache Line Size: 0x08 (32 bytes)
Interrupt: pin A routed to IRQ 11
Region 0: Memory at febff000 (32-bit, non-prefetchable) [size=4K]
Region 1: I/O ports at ef00 [size=64]
Region 2: Memory at fea00000 (32-bit, non-prefetchable) [size=1M]
Expansion ROM at fe900000 [disabled] [size=1M]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI+ D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=2 PME-

0000:00:12.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100] (rev 08)
Subsystem: Intel Corp. 82559 Fast Ethernet LAN on Motherboard
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (2000ns min, 14000ns max), Cache Line Size: 0x08 (32 bytes)
Interrupt: pin A routed to IRQ 10
Region 0: Memory at febfe000 (32-bit, non-prefetchable) [size=4K]
Region 1: I/O ports at ee80 [size=64]
Region 2: Memory at fe800000 (32-bit, non-prefetchable) [size=1M]
Expansion ROM at fe700000 [disabled] [size=1M]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI+ D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=2 PME-

0000:00:14.0 PCI bridge: Digital Equipment Corporation DECchip 21152 (rev 03) (prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64, Cache Line Size: 0x08 (32 bytes)
Bus: primary=00, secondary=02, subordinate=02, sec-latency=64
I/O behind bridge: 0000d000-0000dfff
Memory behind bridge: fde00000-fe2fffff
Prefetchable memory behind bridge: 00000000ff100000-00000000ff100000
BridgeCtl: Parity+ SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI- D1- D2- AuxCurrent=220mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Bridge: PM- B3+

0000:01:00.0 VGA compatible controller: Chips and Technologies F69000 HiQVideo (rev 64) (prog-if 00 [VGA])
Subsystem: Chips and Technologies F69000 HiQVideo
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping+ SERR+ FastB2B-
Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Interrupt: pin A routed to IRQ 0
Region 0: Memory at fc000000 (32-bit, non-prefetchable) [size=16M]
Expansion ROM at fddc0000 [disabled] [size=256K]

0000:02:0e.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100] (rev 08)
Subsystem: Intel Corp. EtherExpress PRO/100+ Management Adapter
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (2000ns min, 14000ns max), Cache Line Size: 0x08 (32 bytes)
Interrupt: pin A routed to IRQ 10
Region 0: Memory at fe2ff000 (32-bit, non-prefetchable) [size=4K]
Region 1: I/O ports at df00 [size=64]
Region 2: Memory at fe100000 (32-bit, non-prefetchable) [size=1M]
Expansion ROM at fe000000 [disabled] [size=1M]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI+ D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=2 PME-



[7.6.] SCSI information (from /proc/scsi/scsi)

# cat /proc/scsi/scsi
Attached devices: none


[7.7.] Other information that might be relevant to the problem
(please look in /proc and include all information that you
think to be relevant):
[X.] Other notes, patches, fixes, workarounds:


If more info is required I believe I can provide it if requested. Thanks!
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/