Re: SCSI disk error in 2.0.30, data loss

Bob McElrath (mcelrath@draal.physics.wisc.edu)
Tue, 2 Mar 1999 14:22:24 -0600 (CST)


Looks like SCSI controller configuration problems. Disable
Disconnect/Reconnect, and Tagged Queuing. I saw errors like this with my
zip drive on the BT-958 controller, when it was trying to address it in ways
it didn't like (turning Tagged Queuing off for it helped). At any rate, it
can probably be fixed by futzing with the controller paramters (ctrl-A at
bootup), or kernel parameters (you can tell the BT-958 driver to disable
tagged queuing (etc.) for INDIVIDUAL targets).

On Tue, 2 Mar 1999, Michael Stutz wrote:

> I've been having trouble recently with a SCSI drive -- when writing to the
> disk I sometimes get the following errors (with data loss):
>
> ------
> Mar 1 21:06:48 erie kernel: scsi0 channel 0 : resetting for second half of
> retries.
> Mar 1 21:06:48 erie kernel: SCSI bus is being reset for host 0 channel 0.
> Mar 1 21:06:48 erie kernel: scsi0: Sending Bus Device Reset CCB #2666 to
> Target 0
> Mar 1 21:06:50 erie kernel: scsi0: Bus Device Reset CCB #2666 to Target 0
> Completed
> Mar 1 21:07:06 erie kernel: scsi : aborting command due to timeout : pid
> 2649, scsi0, channel 0, id 0, lun 0 Write (6) 18 33 11 24 00
> Mar 1 21:07:06 erie kernel: scsi0: Aborting CCB #2669 to Target 0
> Mar 1 21:07:06 erie kernel: SCSI host 0 channel 0 reset (pid 2644) timed
> out - trying harder
> Mar 1 21:07:06 erie kernel: SCSI bus is being reset for host 0 channel 0.
> Mar 1 21:07:06 erie kernel: scsi0: CCB #2669 to Target 0 Aborted
> Mar 1 21:07:06 erie kernel: scsi0: Resetting BusLogic BT-958 due to Target 0
> Mar 1 21:07:06 erie kernel: scsi0: *** BusLogic BT-958 Initialized
> Successfully ***
> Mar 1 21:08:25 erie kernel: scsi0: Tagged Queuing now active for Target 0
> Mar 1 21:16:01 erie kernel: scsi0 channel 0 : resetting for second half of
> retries.
> Mar 1 21:16:01 erie kernel: SCSI bus is being reset for host 0 channel 0.
> Mar 1 21:16:01 erie kernel: scsi0: Tagged Queuing now disabled for Target 0
> Mar 1 21:16:01 erie kernel: scsi0: Sending Bus Device Reset CCB #3094 to
> Target 0
> Mar 1 21:16:05 erie kernel: sidisk I/O error: dev 08:01, sector 1590410
> Mar 1 21:16:05 erie kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
> return code = 28000002
> Mar 1 21:16:05 erie kernel: extra data not valid Current error sd08:01:
> sense key Not Ready
> Mar 1 21:16:05 erie kernel: Additional sense indicates Logical unit is in
> process of becoming ready
> Mar 1 21:16:05 erie kernel: scsidisk I/O error: dev 08:01, sector 1590412
> Mar 1 21:16:05 erie kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
> return code = 28000002
> Mar 1 21:16:05 erie kernel: extra data not valid Current error sd08:01:
> sense key Not Ready
> Mar 1 21:16:05 erie kernel: Additional sense indicates Logical unit is in
> process of becoming ready
>
> [last three lines repeated many times]
>
> Mar 1 21:16:10 erie kernel: scsidisk I/O error: dev 08:01, sector 32
> Mar 1 21:16:10 erie kernel: Kernel panic: EXT2-fs panic (device 08:01):
> ext2_write_inode: unable to read i-node block - inode=33, block=16
> Mar 1 21:16:10 erie kernel:
> Mar 1 21:16:10 erie kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
> return code = 28000002
> Mar 1 21:16:10 erie kernel: extra data not valid Current error sd08:01:
> sense key Not Ready
> Mar 1 21:16:10 erie kernel: Additional sense indicates Logical unit is in
> process of becoming ready
> Mar 1 21:16:10 erie kernel: scsidisk I/O error: dev 08:01, sector 2
>
> ------
>
> At first I thought it was a bad disk, so I replaced it with a similar one (a
> Seagate ST32155W), did a "mke2fs -cv /dev/sda1", but I'm getting the same
> errors, and I don't know what to do. Any help is greatly appreciated.
>
> System info:
>
> Linux erie 2.0.30 #1 Tue Jul 29 10:01:26 EDT 1997 i586 unknown
> Kernel modules 2.1.85
> Gnu C 2.7.2.3
> Binutils 2.9.1
> Linux C Library 0.7.so
> Dynamic Linker (ld.so) 1.9.9
> Linux C++ Library 2.7.2
> Procps 1.2.7
> Mount 2.7l
> Net-tools (1998-03-02)
> Kbd 0.94
> Sh-utils 1.16
>
> Feb 15 17:36:32 erie kernel: scsi0 : BusLogic BT-958
> Feb 15 17:36:32 erie kernel: scsi : 1 host.
> Feb 15 17:36:32 erie kernel: Vendor: SEAGATE Model: ST32155W
> Rev: 0596
> Feb 15 17:36:32 erie kernel: Type: Direct-Access
> ANSI SCSI revision: 02
> Feb 15 17:36:32 erie kernel: Detected scsi disk sda at scsi0, channel 0, id
> 0, lun 0
> Feb 15 17:36:32 erie kernel: Vendor: HP Model: CD-Writer 6020
> Rev: 1.07
> Feb 15 17:36:32 erie kernel: Type: CD-ROM
> ANSI SCSI revision: 02
> Feb 15 17:36:32 erie kernel: Detected scsi CD-ROM sr0 at scsi0, channel 0,
> id 4, lun 0
> Feb 15 17:36:32 erie kernel: Vendor: UMAX Model: Astra 1220S
> Rev: V1.1
> Feb 15 17:36:32 erie kernel: Type: Scanner
> ANSI SCSI revision: 02
> Feb 15 17:36:32 erie kernel: Detected scsi generic sgc at scsi0, channel 0,
> id 5, lun 0
> Feb 15 17:36:32 erie kernel: scsi : detected 1 SCSI cdrom 1 SCSI disk total.
> Feb 15 17:36:32 erie kernel: SCSI device sda: hdwr sector= 512 bytes.
> Sectors= 4197405 [2049 MB] [2.0 GB]
>
> Output of /proc/scsi/BugLogic/0:
>
> -----
> ***** BusLogic SCSI Driver Version 2.0.9 of 29 March 1997 *****
> Copyright 1995 by Leonard N. Zubkoff <lnz@dandelion.com>
> Configuring BusLogic Model BT-958 PCI Wide Ultra SCSI Host Adapter
> Firmware Version: 5.05R, I/O Address: 0xFFF4, IRQ Channel: 10/Level
> PCI Bus: 0, Device: 9, Address: 0xFFBEF000, Host Adapter SCSI ID: 7
> Parity Checking: Enabled, Extended Translation: Enabled
> Synchronous Negotiation: Disabled, Wide Negotiation: Disabled
> Disconnect/Reconnect: Enabled, Tagged Queuing: Enabled
> Scatter/Gather Limit: 128 of 8192 segments, Mailboxes: 255
> Driver Queue Depth: 255, Host Adapter Queue Depth: 192
> Tagged Queue Depth: Automatic, Untagged Queue Depth: 3
> Error Recovery Strategy: Default, SCSI Bus Reset: Enabled
> SCSI Bus Termination: High Enabled, SCAM: Disabled
> *** BusLogic BT-958 Initialized Successfully ***
> Target 0: Asynchronous
> Target 4: Asynchronous
> Target 5: Asynchronous
>
> Current Driver Queue Depth: 255
> Currently Allocated CCBs: 64
>
>
> DATA TRANSFER STATISTICS
>
> Target Tagged Queuing Queue Depth Commands Attempted Commands Completed
> ====== ============== =========== ================== ==================
> 0 Active 28 49828 49825
> 4 Not Supported 3 2 2
> 5 Not Supported 3 2 2
>
> Target Read Commands Write Commands Total Bytes Read Total Bytes Written
> ====== ============= ============== =================== ===================
> 0 48797 1027 2242658304 5245952
> 4 0 0 0 0
> 5 0 0 0 0
>
> Target Command 0-1KB 1-2KB 2-4KB 4-8KB 8-16KB
> ====== ======= ========= ========= ========= ========= =========
> 0 Read 0 3311 3 2 257
> 0 Write 0 515 0 0 512
> 4 Read 0 0 0 0 0
> 4 Write 0 0 0 0 0
> 5 Read 0 0 0 0 0
> 5 Write 0 0 0 0 0
>
> Target Command 16-32KB 32-64KB 64-128KB 128-256KB 256KB+
> ====== ======= ========= ========= ========= ========= =========
> 0 Read 10 45214 0 0 0
> 0 Write 0 0 0 0 0
> 4 Read 0 0 0 0 0
> 4 Write 0 0 0 0 0
> 5 Read 0 0 0 0 0
> 5 Write 0 0 0 0 0
>
>
> ERROR RECOVERY STATISTICS
>
> Command Aborts Bus Device Resets Host Adapter Resets
> Target Requested Completed Requested Completed Requested Completed
> ID \\\\ Attempted //// \\\\ Attempted //// \\\\ Attempted ////
> ====== ===== ===== ===== ===== ===== ===== ===== ===== =====
> 0 0 0 0 1 1 1 0 0 0
> 4 0 0 0 0 0 0 0 0 0
> 5 0 0 0 0 0 0 0 0 0
>
> External Host Adapter Resets: 0
>
> -----
>
> Partition table:
>
> -----
> Disk /dev/sda: 255 heads, 63 sectors, 261 cylinders
> Units = cylinders of 16065 * 512 bytes
>
> Device Boot Begin Start End Blocks Id System
> /dev/sda1 1 1 261 2096451 83 Linux native
>
> -----
>
>
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.rutgers.edu
> Please read the FAQ at http://www.tux.org/lkml/
>

-- Bob

Bob McElrath (rsmcelrath@students.wisc.edu) Univ. of Wisconsin at Madison

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/