SCSI disk I/O error

Sean Farley (sean@farley.org)
Sat, 18 Apr 1998 11:36:54 -0500 (EST)


This message is in MIME format. The first part should be readable text,
while the remaining parts are likely unreadable without MIME-aware tools.
Send mail to mime@docserver.cac.washington.edu for more info.

---1463811583-1051598523-892916390=:419
Content-Type: TEXT/PLAIN; CHARSET=US-ASCII
Content-ID: <Pine.LNX.3.96.980418112120.700B@seen.farley.org>

At the moment I cannot think of a great subject that will catch everyone's
attention. :)

This morning when I sat down to my computer to see if some work I left for
it to think (compile) on had gone smoothly, I noticed that although
applications, under X, were still active, I was unable to start any new
applications.

I was able to flip over to a console (Ctrl-Alt-F1). Ctrl-ScrlLock gave me
pages of processes running. Except for a few processes that I knew were
running, most of the processes were xinted. I assume that the computer
had used up all of the process slots which is why I was unable to logon at
the console or start-up another xterm.

Come to think about it, before I went to bed, I was unable to start qps
(Qt process utility) to view the running process. I thought about looking
at it in the morning when I woke up. I don't know if the SCSI errors
caused the problem or something else did. The first SCSI error was well
after I had gone to bed and had had trouble with qps.

After rebooting (hard reboot), fsck proceeded to inform me that one of my
partitions needed serious checking.

Here are the assortment of errors syslog saved for me from the night
before:
...
Apr 18 02:48:48 seen kernel: scsi : aborting command due to timeout : pid
16668, scsi0, channel 0, id 1, lun 0 Write (6) 10 30 0a 02 00
Apr 18 02:48:48 seen kernel: scsi : aborting command due to timeout : pid
16669, scsi0, channel 0, id 1, lun 0 Write (6) 10 64 cc 02 00
Apr 18 02:48:48 seen kernel: scsi : aborting command due to timeout : pid
16670, scsi0, channel 0, id 1, lun 0 Write (6) 10 65 fa ec 00
Apr 18 02:48:49 seen kernel: SCSI host 0 abort (pid 16669) timed out -
resetting
Apr 18 02:48:49 seen kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 02:48:49 seen kernel: SCSI host 0 abort (pid 16670) timed out -
resetting
Apr 18 02:48:49 seen kernel: SCSI bus is being reset for host 0 channel 0.
Apr 18 02:49:05 seen kernel: SCSI disk error : host 0 channel 0 id 1 lun 0
return code = 26030000
Apr 18 02:49:05 seen kernel: scsidisk I/O error: dev 08:12, sector 357882,
absolute sector 1074682
Apr 18 02:49:05 seen kernel: SCSI disk error : host 0 channel 0 id 1 lun 0
return code = 26030000
Apr 18 02:49:05 seen kernel: scsidisk I/O error: dev 08:12, sector 344074,
absolute sector 1060874
...

After about 12 minutes of this, the file system decided to join in feeling
that it had been left out:
...
Apr 18 03:00:57 seen kernel: EXT2-fs error (device 08:12):
ext2_write_inode: unable to read inode block - inode=2059, block=8200
...

The odd thing is that last week I had just scanned the drive for bad
blocks and repartitioned it.

System info: Linux v2.0.33 (w/ memory detection patch)
Abit IT5H v2 (month old)
Pentium 233MHz (month old)
e2fsck 1.10, 24-Apr-97 for EXT2 FS 0.5b, 95/08/09
Using EXT2FS Library version 1.10, 24-Apr-97
THE drive -> Conner 1060W (patched firmware)
Seagate ST15230W

[ /home/sean 1 ] cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: SEAGATE Model: ST15230W SUN4.2G Rev: 0738
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: CONNER Model: CFP1060W 1.05GB Rev: 2035
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 03 Lun: 00
Vendor: PLEXTOR Model: CD-ROM PX-12TS Rev: 1.02
Type: CD-ROM ANSI SCSI revision: 02

[ /home/sean 2 ] cat /proc/scsi/aic7xxx/0
Adaptec AIC7xxx driver version: 4.1.1/3.2.1
Compile Options:
AIC7XXX_RESET_DELAY : 5
AIC7XXX_TAGGED_QUEUEING: Enabled
AIC7XXX_PAGE_ENABLE : Enabled
AIC7XXX_PROC_STATS : Disabled

Adapter Configuration:
SCSI Adapter: Adaptec AHA-294X SCSI host adapter
(AIC-787x chipset)
Host Bus: Wide
Base IO: 0x6000
Base IO Memory: 0xe4000000
IRQ: 9
SCBs: Used 9, HW 16, Page 255
Interrupts: 39393
Serial EEPROM: True
Extended Translation: Enabled
SCSI Bus Reset: Enabled
Ultra SCSI: Disabled
Disconnect Enable Flags: 0xffff

I hope I gave enough to go on; I am also including a full section of my
log file (gzipped). I doubt any hardware problems, but I can't guarantee
that my hardware is fully functional. Almost everything in this computer
has been replaced due to hardware problems. At this point I would rather
see a kernel bug. :(

If there are any questions, just ask.

Thanks,
Sean

P.S. Looking for pity. :) Here is my system and what I had to replace
since I first purchased it:

SuperMicro P55CMS w/P100 and 16Meg --> All DOA!
Conner 1060W --> Firmware patch failed
Iiyama 8617 --> Blurring
#9 771 --> Memory error
Plextor 12Plex --> Power failed
Plextor 12Plex --> Awful noise
Logitech Mouseman --> Button 1 died

Fortunately, they were all within the warranty period from the
manufacturers. It is still a pain. If any piece fails outside of
warranty, hand me a shotgun and stand back!
---------------
sean@farley.org

---1463811583-1051598523-892916390=:419
Content-Type: APPLICATION/OCTET-STREAM; NAME="scsi-errors.gz"
Content-Transfer-Encoding: BASE64
Content-ID: <Pine.LNX.3.96.980418111950.419D@seen.farley.org>
Content-Description: Syslog of SCSI errors

H4sICCPMODUCA3Njc2ktZXJyb3JzAO2dT28cuRHFc95PwaMN2Eo3WcU/A/hg
BDnsKUA2QHJbjKTWWvBYY8yMkuy3T8/I60jVLLinhm92gljwYa21+Fjkqx8p
TrH7/eeN67Pr/ILy+Mdth+HBfRw2D8Nq4bY323u3cMvr9WZ3//CLu1l/+rR8
uHW3j4Pbrd3u/tOwftyN/+Lz/a3rY4z5zeFnujfu5sPyYWzDjf+5/39v3Orx
wXXu75v73eBexdeu71zoXLccpV3XOffD+6Y9KUf1JJK7ucH0JHXH9YTd3dIN
N7WelJc9+elPP/3oPqy3u7GNQ3fcq6/Rvz505Nbtu/LWbYbtsDv0dkaL149b
d79118P+3x9+0t2tN78JfY3hyty71J27d2XRcaWt2/vtRzdsNuPPLyYtHKbo
ywxtht3j5mGc6tvBvXM+dqEbv74hsZ/0g8KPf/zLk8rC3Q7/dF1e9H70xHCz
G4UDp5zHvy6vt+vV4+iDL9/vu0Qx+/+ZMIjGDtfCiF1OJDUiPoxonA3OXX02
Qp5oJHwYyWyqTjOV1Oh7eBhS4ogwSAtDmqqnqcbzPi+egWaPjLGx9cjtD8vV
nVvf7ePY3A/bq2+0egqEqm0ZBzrXRpnMoxy1UY5Cw+NT10dzGFkLIwuNgPd8
MHu+aKlbJhr42Qjm2SjaslbkshYKInVDw/1Dta2GqVsbgZmjrAGySEAy3vNs
97xGoCIJxHjPs93zGoHKCwJxtxhFoWFUJGaHUTqFQGmq0X7VrbRqTl2trVap
q4zAzFH22ih7odFHuFn6aA6DtDBIaHi8573d81ELI0oNxofB5jCyFoYkkC+I
1PWlXer6gk1db111S68Bspca1MPNQr05DI1AvSQQ4T1PZs/3GoF6SSDGE4jN
BOo1AvWSQBwQqStbPSV1q201TN3aCMwcZQ2Q/UtA9vDTtorE/DC8RiAvNfoO
HkZv9rzXCOS91MDPRm+fDY1AnqRGbp+601btqau01Sx1x/bNq67XAOmj0Ah4
zwe75zUCeUmggPd8MHs+aAQKE42CD8NsqqARKEgCUUCkLoV2qUsBm7pkXnWD
BsggAcl4z7Pd8xqBgiQQ4z3Pds9rBAovCeSbnp5Uw/AnHKCQRiCaaABW3Wmr
9tRV2mqWuvURmDnKGiDJCw34+Y4/4XyHNAIRSQ28573d8xqBKEqNgg/DTCDS
CESSQOMyDEhd2eopqVttq2Hq1kZg3iizBkiWGoT3PJk9zxqBWBKI8J4ns+dZ
IxBLAjF+Ntg+GxqBWBKIEyJ1ObVLXU7Y1OVkHmUNkPwSkGHRoT0/lZgfRtQI
FKVGT/AwzAUyJWoEil5qZHwY5q1c1AgUSWh4wKo7bdWeukpbzVK3PgIzR1kD
ZIxCI+A9H+ye1wgUJYECnkD2852kEShJjZanJ0oY9gOUpBEoSQJRQqQupXap
SwmbumRedZMGyCQByXjP2893kkag9JJAtOjQnp9KHBGGRqCUpUbGh2FedbVS
7JSlRq1O+tTUpXqBtyl1qWmxeK6MsrlSvGi3KNLLWxSjhsd73ps9rxW8p5cF
73sNvOe93fMagbIkUMDPRrDPhkagLAlUq5M+PXWrBd7G1G1ZLF5LXXOleNEK
3lORGoT3PJk9rxW8pyIJxHjP2wtktIryVCSBGD8bbJ8NjUAvK8qZFx1g1Z22
ak9dpa1mqVsfgZmjrAFSFLzzokd7fioxN4z9kNTDyFONjA8jm8PwWhheaLQ8
PVHCsB6gjGGQFgZJjYhIXR/bpa6P2NS13pYb24raKEtABrzng93zWQtDEojw
niez57WK8txPNBI+jGQOQyNQLwlUq5M+PXWrBd7G1G1ZLF5LXWul+DjKGiBF
wXtcdGjPTyWOCEMjkCh4j/Cr9hWJI8LQCCQqyuP4g/AwevNsaBXl2U80uH3q
xnqBtyl1Y9Ni8VwbZTaPsgZIUfAeFx7veW/2vFbwnr0kUMB7Ptg9rxHISwIF
/GwE+2xoBPKSQNQhUpe6dqlLHTZ1ybzqagXvOUgNxnuezZ7XCt5zkARivOfZ
7HmtojyLivLU9PSkGkYyH6CMYWgEEhXlqXpX7NTUTfVLbqbUTU0vzOXaKJtX
Xa3gPYuC97ToE9wsvdnzWsF7Jqnh8Z73Zs9rFeVZVJQn+OOGKhJHhKERiCSB
EFftU8Or9gl81T6ZK8XHUdYASRKQhPc82T2vEYgkgQjveTJ7Xqsozyw1GD8b
bJ4NraI8syQQEyJ1mdqlLhM2da2V4uMoa4AUBe8Z/kTIisQRYWgEEgXvGf4o
xYrEEWFoBBIV5Rn/uKFsftxQ7rSK8hwnGoBVd9qqPXWVtpqlbjZftR9HWQOk
KHjP8EcpViSOCEMjUJQECnjPB7vnNQJFSSDq4GHYD1C0ivIcJYGIEKlL1C51
ibCpS+ZVVyt4z0lqMN7zbPa8VvCeRcF7wT9KsZgfpZg7raJcPBWb8Q/35nLC
AYpWUZ5FRXmBXLUvDa/aF/BV+2KuFB9HWQOkKHgv+Kv2xXzVPndawXvOEw28
573Z81pFec6SQAE/G8E+GxqBsiRQCIjUDaFd6oaATd1griDUCt5zloAkvOfJ
7nmNQFkSiPGetxfIaBXluUw08LPB5tnQKspzkQRiyKrLDVddBq+6bF51tYL3
/LzgPYx7QPD5TlXiiDA0Aj0veH/SYHwYfIbZKPgwzAcoWmF8fl4Yf9BoftW+
2qoxdfW22qSuOgKtzRLwnrc+SjH32vWDMtEgPIHsn3HNn42LDaOPVJuMFIKf
KhA+CDJbymuW8lIjIvhDsR1/KGL5Y/84dL7jsbVUVYn2jpcKf/7H3/zbu+2X
EF6Njd/fDE/tv1644d87//O/9m83/Pn+Yez5wj0+LK9Xh/ckboblrTt8112v
1jcf3dunv73zHZc3T997l2UPenAtV1WivRv66jMv579VMvlw3Js2Y3S3o/Tw
8q2S1Z68OjS7GLc+i+61e394c+PNcrUabg+pulztZ+7XfQc/r4bd+N0vXb2S
7Sb8RKUzTFTGh5GxaTtVOH/aNn+wRbVV+xrTIx9soY5Aa7N6gpvV9uDSyl4k
9TGLrUgPfqRFVQIyD5caxjHQ8L87NALezoHwbrjYMOZmZUgIeIfUDt4hYeEd
zrDToBO3hIm/tSX86z4P/7sj3Gevr+wICbQjxH5gUJVAzNPFhnEM3KXC+eGO
rT2pSkDccKlhzIU7FwTcubSDOxcs3LmgbeKrxcfHwH3/QWADuE870gbuHnyz
vSqBmKeLDWM+3KcK54a7Bz8kqSoBccOlhjEP7r56Re9UuPv63UIT3H3Te4q5
Mr6+x9uk9rv4EXAPXWkEd9mRVnCH/wbrz/CL+AWHcQzcpcL54U4dfBipw7vh
YsOYC/fmdySqrZ4Cd+AdCXUEWtuE8XsAzv/HYRzDPv6dj6QD+HWIVYn2brjg
MOaxL1Q/+j2VfaH+mbWJfaHp59+5Nr4Jb5P+tCPpsH90Q4ON7bQjbTa2AXyL
piqBmKeLDWM+3KcK54e7j/hhjGdww6WGMRfuwSPgHnw7uAePhXvw6GyT79M6
f7YRfg9C0D0IE8KmTO1syoS1KQPHl6pnKaeOL9UPgUzjS00PlKbjS2c4FKNq
mfwRezxqtMebdqTNHo/Aj0uvSkDm6VLDmL/qTBXOveoQ+MmsVQmEGzgi2Mix
HRs5YtmIL+5gfA07G3+Nn59zU4Vz5xz6zT9VCYQb4Ldm2Xhrdt6GivH3ZfmU
+7Kz5wFxX5brL4U0sY+bvmAyV0b5a/t/+P71/ev7V/3rt5TZP9Xl60dH21+3
q/UvY7JdhbfhQIrdcrO7+uE/jAvx5wirAAA=
---1463811583-1051598523-892916390=:419--

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu