Lots of oopses on IBM Netfinity 5500

=?ISO-8859-1?Q?B=E5rd_Dahlmo?= (baard@funn.no)
Wed, 30 Jun 1999 16:53:30 +0200 (CEST)


[1.] One line summary of the problem:
Lots of oopses on IBM Netfinity 5500

[2.] Full description of the problem/report:

Running on a IBM Netfinity 5500 with a ServeRAID kontroller, I get
lots of oopses/hangs I believe are related to the filesystem.

[3.] Keywords (i.e., modules, networking, kernel):
ServeRAID module (ips.o)

[4.] Kernel version (from /proc/version):
Linux version 2.2.5-funn (root@balder.narviknett.no) (gcc version
egcs-2.91.66 19990314/Linux (egcs-1.1.2 release)) #3 fre jun 25 17:44:02
CEST 1999

[5.] Output of Oops.. message (if applicable) with symbolic information
resolved (see Documentation/oops-tracing.txt)

Here's one Oops - the others look similar.

Jun 30 13:10:38 balder kernel: Unable to handle kernel paging request at
virtual address 00010000
Jun 30 13:10:38 balder kernel: current->tss.cr3 = 06757000, %cr3 =
06757000
Jun 30 13:10:38 balder kernel: *pde = 00000000
Jun 30 13:10:38 balder kernel: Oops: 0000
Jun 30 13:10:38 balder kernel: CPU: 0
Jun 30 13:10:38 balder kernel: EIP: 0010:[find_buffer+36/76]
Jun 30 13:10:38 balder kernel: EFLAGS: 00010206
Jun 30 13:10:38 balder kernel: eax: 00010000 ebx: 00000902 ecx:
00000809 edx: 00010000
Jun 30 13:10:38 balder kernel: esi: 00000400 edi: c5cb2550 ebp:
00000809 esp: c66efdf8
Jun 30 13:10:38 balder kernel: ds: 0018 es: 0018 ss: 0018
Jun 30 13:10:38 balder kernel: Process procmail (pid: 671, process nr: 62,
stackpage=c66ef000)
Jun 30 13:10:38 balder kernel: Stack: c01267df 00000809 00000902 00000400
c0126b5f 00000809 00000902 00000400
Jun 30 13:10:38 balder kernel: 00000902 00000902 c5cb2550 c5cb2550
00000000 c013cf77 00000809 00000902
Jun 30 13:10:38 balder kernel: 00000400 00000000 00000902 000003b8
c66eff08 c013d5a5 c5cb2550 00000902
Jun 30 13:10:38 balder kernel: Call Trace: [get_hash_table+19/32]
[getblk+27/548] [ext2_alloc_block+103/328] [block_getblk+329/644]
[ext2_getblk+365/512] [ext2_file_write+1013/1288] [fcntl_setlk+332/356]
Jun 30 13:10:38 balder kernel: Code: 8b 00 39 5a 04 75 15 39 72 08 75 10
66 39 4a 0c 75 0a 89 d0

Code: 00000000 Before first symbol 00000000 <_IP>: <===
Code: 00000000 Before first symbol 0: 8b 00
movl (%eax),%eax <===
Code: 00000002 Before first symbol 2: 39 5a 04
cmpl %ebx,0x4(%edx)
Code: 00000005 Before first symbol 5: 75 15
jne 0000001c Before first symbol
Code: 00000007 Before first symbol 7: 39 72 08
cmpl %esi,0x8(%edx)
Code: 0000000a Before first symbol a: 75 10
jne 0000001c Before first symbol
Code: 0000000c Before first symbol c: 66 39 4a 0c
cmpw %cx,0xc(%edx)
Code: 00000010 Before first symbol 10: 75 0a
jne 0000001c Before first symbol
Code: 00000012 Before first symbol 12: 89 d0
movl %edx,%eax

[6.] A small shell script or example program which triggers the
problem (if possible)

n/a

[7.] Environment

[7.1.] Software (add the output of the ver_linux script here)
Linux balder.narviknett.no 2.2.5-funn #3 fre jun 25 17:44:02 CEST 1999 i686 unknown
Kernel modules 2.1.121
Gnu C egcs-2.91.66
Binutils 2.9.1.0.23
Linux C Library 2.1.1
Dynamic linker ldd (GNU libc) 2.1.1
Procps 2.0.2
Mount 2.9o
Net-tools (1999-04-20)
Kbd [option...]
Sh-utils 1.16
Modules Loaded nfsd lockd sunrpc pcnet32 ips

[7.2.] Processor information (from /proc/cpuinfo):
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 5
model name : Pentium II (Deschutes)
stepping : 2
cpu MHz : 348.450914
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
sep_bug : no
f00f_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca
cmov pat pse36 mmx osfxsr
bogomips : 347.34

[7.3.] Module information (from /proc/modules):

nfsd 151576 8 (autoclean)
lockd 31208 1 (autoclean) [nfsd]
sunrpc 52420 1 (autoclean) [nfsd lockd]
pcnet32 7592 1 (autoclean)
ips 16776 10

[7.4.] SCSI information (from /proc/scsi/scsi)

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: IBM Model: SERVERAID Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 01
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: IBM Model: SERVERAID Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 01

[7.5.] Other information that might be relevant to the problem
(please look in /proc and include all information that you
think to be relevant):

Kernel is stock RH 6.0 with ipv4-DoS-patch and IBM's ServeRAID driver.

/proc/scsi/ips/0:

IBM ServeRAID General Information:

Controller Type : ServeRAID on motherboard
IO port address : 0x2000
IRQ number : 11
BIOS Version : 3.10.05
Firmware Version : 2.86.03
Boot Block Version : 97139
Driver Version : 0.99.01c
Logical Drives/Offline/Critical : 2/0/0
Initiator IDs (Channel/ID) : 1/7 2/7
Rebuild Rate (Low/Mid/High) : High
Stripe Size : 8K
Max Physical Devices : 30
Defunct Drive Count : 0
Config Update Count : 18
Flash Reprogram Count : 57

output from df:

Filesystem 1k-blocks Used Available Use% Mounted on
/dev/sda5 402759 90633 291325 24% /
/dev/sda1 23208 7597 14413 35% /boot
/dev/sda10 991000 3480 936316 0% /tmp
/dev/sda6 991000 36986 902810 4% /var
/dev/sda7 991000 7378 932418 1% /var/tmp
/dev/sda11 1981000 537138 1341451 29% /usr
/dev/sda8 991000 399603 540193 43% /usr/src
/dev/sda9 991000 4334 935462 0% /var/spool/mail
/dev/sdb1 17060598 686887 16373711 4% /home

sda are 2 x 9 GB drives with raid0 (total size 18 GB)
sdb are 3 x 9 GB drives with raid5 /total size 18 GB)

[X.] Other notes, patches, fixes, workarounds:

Other info available on request. Test account could be arranged if that
helps.

-- 
Bård Dahlmo    90 10 76 13

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/