Re: ext2 corruption in 2.0.33

Joseph H. Buehler (jhpb@sarto.gaithersburg.md.us)
24 Jan 1998 19:25:11 -0500


Andries.Brouwer@cwi.nl writes:

> Below a report on ext2 corruption I got a moment ago.
> If noone else reports such things[*] then maybe my hardware
> is not 100%, but this is a 1-month-old machine that has never
> given any cause for suspicion. For the time being I suspect
> the kernel, a vanilla 2.0.33.

Someone on the linux SMP list was just complaining about filesystem
errors under 2.0.32 on an SMP machine.

I have a strange problem at the moment on an SMP machine running
Red Hat 5.0. Compiling egcs on one of my two SCSI disks always causes
various compile/link aborts. Compiling it on the root disk seems to
work OK (I only did it once, and it worked).

There was nothing in the system error log and forcing e2fsck to check
the filesystem did not discover any problems. Running a simple
program I wrote to read/write blocks of pseudo-random data to a big
file does not detect any corruption. (Does anyone know of a disk
diagnostic program I can use? I couldn't find anything on the net.)
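
For illustration, a write-then-verify test of the kind described above
might look like the Python sketch below. It is not the program referred
to in this message; the function names, the 4096-byte default block
size, and the seed are all assumptions made for the example. A seeded
generator lets the verify pass regenerate the expected data instead of
keeping a second copy of it.

```python
import os
import random


def make_block(rng, block_size):
    # Deterministic pseudo-random block: the same seed yields the same bytes.
    return rng.getrandbits(block_size * 8).to_bytes(block_size, "little")


def write_pattern(path, n_blocks, block_size=4096, seed=1998):
    # Write n_blocks of pseudo-random data to path.
    rng = random.Random(seed)
    with open(path, "wb") as f:
        for _ in range(n_blocks):
            f.write(make_block(rng, block_size))
        f.flush()
        os.fsync(f.fileno())  # ask the kernel to push the data to the device


def verify_pattern(path, n_blocks, block_size=4096, seed=1998):
    # Re-generate the same sequence and return the indices of bad blocks.
    rng = random.Random(seed)
    bad = []
    with open(path, "rb") as f:
        for i in range(n_blocks):
            expected = make_block(rng, block_size)
            if f.read(block_size) != expected:
                bad.append(i)
    return bad
```

Calling write_pattern() and then verify_pattern() with the same
arguments should return an empty list on a healthy disk. Note that
unless the file is much larger than RAM, the read pass may be served
from the buffer cache rather than from the disk, so a clean result
from a test like this does not rule out a hardware problem.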

Upgrading to 2.0.33 did not change anything. My dmesg output is
attached, in case it helps.

I notice the host adaptors report "Parity Checking". Is there
something that I may need to do to the drives to use parity?

Joe Buehler

BIOS revision 2.10 entry at 0xf04d0
Probing PCI hardware.
Calibrating delay loop.. ok - 199.07 BogoMIPS
Memory: 63116k/65536k available (676k kernel code, 384k reserved, 1172k data)
Swansea University Computer Society NET3.035 for Linux 2.0
NET3: Unix domain sockets 0.13 for Linux NET3.035.
Swansea University Computer Society TCP/IP for NET3.034
IP Protocols: ICMP, UDP, TCP
VFS: Diskquotas version dquot_5.6.0 initialized
Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
Checking 'hlt' instruction... Ok.
Linux version 2.0.33 (root@altera) (gcc version 2.7.2.3) #2 Thu Jan 22 23:57:49 EST 1998
Booting processor 0 stack 00002000: Calibrating delay loop.. ok - 199.07 BogoMIPS
Total of 2 processors activated (398.13 BogoMIPS).
Starting kswapd v 1.4.2.2
Serial driver version 4.13 with no serial options enabled
tty00 at 0x03f8 (irq = 4) is a 16550A
tty01 at 0x02f8 (irq = 3) is a 16550A
tty02 at 0x03e8 (irq = 4) is a 16550A
tty03 at 0x02e8 (irq = 3) is a 16550A
Ramdisk driver initialized : 16 ramdisks of 4096K size
ide: i82371 PIIX (Triton) on PCI bus 0 function 9
ide: ports are not enabled (BIOS)
ide2: ports already in use, skipping probe
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
scsi : 0 hosts.
scsi : detected total.
tulip.c:v0.79 9/3/97 becker@cesdis.gsfc.nasa.gov
eth0: DEC DS21041 Tulip at 0xe000, 21041 mode, 00 40 33 91 16 4f, IRQ 15.
The following verbose information is emitted for
bug reports on media selection.
eth0:21041 Media information at 30, default media 0800 (Autosense).
eth0: 21041 media 00 (10baseT), csr13 0401 csr14 0000 csr15 0000.
eth0: 21041 media 00 (10baseT), csr13 0000 csr14 0000 csr15 0000.
eth0: 21041 media 00 (10baseT), csr13 0000 csr14 0000 csr15 0000.
RAMDISK: Compressed image found at block 0
VFS: Mounted root (ext2 filesystem).
ncr53c8xx: at PCI bus 0, device 11, function 0
ncr53c8xx: 53c810a detected
ncr53c8xx: at PCI bus 0, device 12, function 0
ncr53c8xx: 53c810a detected
ncr53c810a-0: rev=0x11, base=0xe3000000, io_port=0xd800, irq=11
ncr53c810a-0: ID 7, Fast-10, Parity Checking
ncr53c810a-0: restart (scsi reset).
ncr53c810a-1: rev=0x11, base=0xe2800000, io_port=0xd400, irq=9
ncr53c810a-1: ID 7, Fast-10, Parity Checking
ncr53c810a-1: restart (scsi reset).
scsi0 : ncr53c8xx - revision 2.4a
scsi1 : ncr53c8xx - revision 2.4a
scsi : 2 hosts.
ncr53c810a-0-<0,0>: using tagged command queueing, up to 4 cmds/lun
Vendor: IBM Model: DORS-32160 Rev: S82C
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
ncr53c810a-0-<1,0>: using tagged command queueing, up to 4 cmds/lun
Vendor: MICROP Model: 2217-15MZ1001905 Rev: HQ30
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
Vendor: TEAC Model: CD-ROM CD-56S Rev: 1.0D
Type: CD-ROM ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi1, channel 0, id 1, lun 0
Vendor: WangDAT Model: Model 2600 Rev: 01.4
Type: Sequential-Access ANSI SCSI revision: 02
Vendor: HP Model: C1750A Rev: 3125
Type: Processor ANSI SCSI revision: 01
ncr53c810a-0-<0,0>: FAST-10 SCSI 10.0 MB/s (100 ns, offset 8)
SCSI device sda: hdwr sector= 512 bytes. Sectors= 4226725 [2063 MB] [2.1 GB]
Partition check:
sda: sda1 sda2 sda3
ncr53c810a-0-<1,0>: FAST-10 SCSI 10.0 MB/s (100 ns, offset 8)
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 3450902 [1685 MB] [1.7 GB]
sdb: sdb3 sdb4
VFS: Mounted root (ext2 filesystem) readonly.
Adding Swap: 114228k swap-space (priority -1)
Soundblaster audio driver Copyright (C) by Hannu Savolainen 1993-1996
SB 4.13 detected OK (220)
Installed 0
Detected scsi tape st0 at scsi1, channel 0, id 3, lun 0