Re: dell poweredge 2400 harddisks going into offline mode when heavyI/O occurs

From: Bill Davidsen
Date: Thu Oct 05 2006 - 23:46:33 EST


Matt Domsch wrote:
On Thu, Sep 28, 2006 at 04:19:23PM +0200, Folkert van Heusden wrote:
Hi,

I have a Dell Poweredge 2400 with 6 scsi harddisks in (hw-) raid 5.
512MB ram, 2x P3.
When heavy disk i/o occurs, the system puts the harddisks into offline
mode causing the filesystems to be put in readonly. The current kernel
is 2.6.8, with 2.4.27 this did not occure. Googling did not help. The
disks all have green lights (there's a special led for each to indicate
errors - that one is off).

[snip]
Sep 28 16:05:12 kasparov kernel: aacraid: Host adapter reset request. SCSI hang ?
Sep 28 16:06:12 kasparov kernel: aacraid: SCSI bus appears hung
Sep 28 16:06:12 kasparov kernel: scsi: Device offlined - not ready after error recovery: host 1 channel 0 id 0 lun 0

Yes, this is familiar. See:

http://lists.us.dell.com/pipermail/linux-poweredge/2004-May/014694.html

In addition, please consider mounting your file systems with
'noatime', as this reduces the number of small writes being sent to
the disks.

2.6.x kernels have the ability to swamp the RAID controller firmware
with requests where 2.4.x kernels couldn't so easily.

Can you configure the controller as JBOD and use software raid. Would the controller keep up with that?

Thanks,
Matt



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/