AIC7XXX + NFS Problem?

Shane Shrybman (shane@zeke.yi.org)
Wed, 1 Sep 1999 21:05:30 -0400 (EDT)


Hi,

I have a 2940UW adapter with one wide Seagate drive on it. It has been
working very well for me for a while now, and I have not had any SCSI
errors for many moons. But when moving 300-400 MB of files over NFS today
I got these:

Sep 1 20:26:38 mars kernel: scsi0 channel 0 : resetting for second half
of retries.
Sep 1 20:26:38 mars kernel: SCSI bus is being reset for host 0 channel 0.
Sep 1 20:26:41 mars kernel: (scsi0:0:2:0) Using asynchronous transfers.
Sep 1 20:26:41 mars kernel: (scsi0:0:2:0) CRC error during Data-In phase.
Sep 1 20:26:41 mars last message repeated 2 times
Sep 1 20:26:41 mars kernel: SCSI disk error : host 0 channel 0 id 2 lun 0
return code = 18000002
Sep 1 20:26:41 mars kernel: [valid=0] Info fld=0x0, Current sd08:01:
sense key Aborted Command
Sep 1 20:26:41 mars kernel: Additional sense indicates Initiator detected
error message received
Sep 1 20:26:41 mars kernel: scsidisk I/O error: dev 08:01, sector 6634170
Sep 1 20:26:41 mars kernel: (scsi0:0:2:0) CRC error during Data-In phase.
Sep 1 20:26:41 mars kernel: (scsi0:0:2:0) CRC error during Data-In phase.
Sep 1 20:26:42 mars kernel: scsi0 channel 0 : resetting for second half
of retries.
Sep 1 20:26:42 mars kernel: SCSI bus is being reset for host 0 channel 0.
Sep 1 20:26:45 mars kernel: (scsi0:0:2:0) Using asynchronous transfers.
Sep 1 20:26:45 mars kernel: (scsi0:0:2:0) CRC error during Data-In phase.
Sep 1 20:26:45 mars kernel: (scsi0:0:2:0) CRC error during Data-In phase.
Sep 1 20:26:45 mars kernel: SCSI disk error : host 0 channel 0 id 2 lun 0
return code = 18000002
Sep 1 20:26:45 mars kernel: [valid=0] Info fld=0x0, Current sd08:01:
sense key Aborted Command
Sep 1 20:26:45 mars kernel: Additional sense indicates Initiator detected
error message received
Sep 1 20:26:45 mars kernel: scsidisk I/O error: dev 08:01, sector 6634178
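
As far as I can tell, the "dev 08:01" in these messages is the major:minor
device number printed in hex. A minimal sketch of mapping it to a device
name, assuming the usual sd numbering (major 8, 16 minors per disk). The
function name decode_sd_dev is just something I made up for illustration:

```python
def decode_sd_dev(dev):
    """Map a 'major:minor' hex pair such as '08:01' to an sd device name.

    Assumes the conventional SCSI disk numbering: major 8, with each disk
    owning 16 minors (minor 0 of each group is the whole disk).
    """
    major_hex, minor_hex = dev.split(":")
    major = int(major_hex, 16)
    minor = int(minor_hex, 16)
    if major != 8:
        raise ValueError("not a SCSI disk major: %d" % major)
    disk = minor >> 4          # which disk: 0 -> sda, 1 -> sdb, ...
    partition = minor & 0x0F   # 0 means the whole disk
    name = "sd" + chr(ord("a") + disk)
    if partition:
        name += str(partition)
    return name


if __name__ == "__main__":
    print(decode_sd_dev("08:01"))
```

If that assumption holds, both failing sectors are on the first partition
of the first SCSI disk.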

I have run many benchmarks on the drive with bonnie, lmbench and hdparm
and never had any errors. The errors only seem to appear during lengthy
operations over NFS. Smaller NFS operations seem to be fine, and copying
files from box B to box A (see below) is also fine; box A to box B is the
problem. Could someone tell me what these error codes mean, or how I can
decipher them?
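
My best guess at taking the return code apart is below. It assumes the
value is the hex result word laid out as in the kernel's scsi.h (status,
message, host and driver bytes, from low to high). The function names and
the small lookup tables are mine and only cover the values seen here:

```python
# Sketch only: byte layout assumed from the kernel's scsi.h macros.
# The tables are deliberately partial (just what shows up in my logs).
STATUS = {0x00: "GOOD", 0x02: "CHECK CONDITION", 0x08: "BUSY"}
HOST = {0x00: "DID_OK"}
DRIVER = {0x00: "DRIVER_OK", 0x08: "DRIVER_SENSE"}   # low nibble
SUGGEST = {0x00: "none", 0x10: "SUGGEST_RETRY"}      # high nibble


def decode_scsi_result(result):
    """Split a 32-bit SCSI result word into its four bytes."""
    return {
        "status": result & 0xFF,
        "msg": (result >> 8) & 0xFF,
        "host": (result >> 16) & 0xFF,
        "driver": (result >> 24) & 0xFF,
    }


def lookup(table, value):
    """Return a name for value, or a marker if the table lacks it."""
    return table.get(value, "unknown 0x%02x" % value)


def describe(result):
    """Return one human-readable line per decoded field."""
    f = decode_scsi_result(result)
    return [
        "driver:  " + lookup(DRIVER, f["driver"] & 0x0F),
        "suggest: " + lookup(SUGGEST, f["driver"] & 0xF0),
        "host:    " + lookup(HOST, f["host"]),
        "status:  " + lookup(STATUS, f["status"]),
    ]


if __name__ == "__main__":
    # "return code = 18000002" from the log, read as hex
    for line in describe(int("18000002", 16)):
        print(line)
```

Read that way, 18000002 would be driver byte 0x18 (DRIVER_SENSE plus
SUGGEST_RETRY), host byte 0 and status 0x02 (CHECK CONDITION), which would
at least fit with the sense data printed right after it.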

The two machines are both running Linux:

Box A: mixed SCSI and IDE
kernel 2.2.11-ac1 + TCP patches
NFS client
Red Hat 6.0 i386
TCQ is on by default and commands/device is set to 32
The Seagate drive is SCA, with a converter to 68-pin HD
(I would not be at all surprised if this converter is to blame; I
just can't bring myself to spend any more money on SCSI cabling.)

Box B: IDE only
kernel 2.2.12 vanilla
NFS server
Red Hat 5.1 + many updates, i386
no errors in the logs here

Shane
