Re: Fw: [Bugme-new] [Bug 3993] New: sata_sx4 causes file corruptionduring simultaneous writes

From: David R
Date: Wed Jan 05 2005 - 17:27:35 EST


Andrew Morton wrote:

controller, using the libata sata_sx4 driver. Individual writes to the drives
are fine. When the drives are written to simultaneously, either by multiple cp
threads or assembling them in a raid 5, corruption occurs as evidenced by fsck
errors and inconsistent md5 sums.



FWIW at <$dayjob> we have had exactly the same issues using Win2k (ugh) and Promise's own drivers on a Dual Opteron system (Rioworks HDAMA) with an integrated Fastrak S150TX4 controller. Relatively stable using a single drive as a separate volume (our application prefers a RAID 0 stripe), but random subtle corruptions when using an array (striped or mirrored). This is both using the controller's embedded RAID and W2K's software RAID (with the Promise configured to present separate disks). Firmware upgrades/downgrades were tried with no luck. We have two identically configured machines that both exhibit the same problem.

Interestingly, the errors were always single flipped bit(s) at random offset(s) within the file. Different on each run. Sounds like a RAM issue but both machines memtest fine and run without issues when using a single drive.

We never found a solution (we simply use single (large) SATA drives instead) :-(

David




-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/