Re: raid is dangerous but that's secret (was Re: [patch] ext2/3:document conditions when reliable operation is possible)

From: Ric Wheeler
Date: Wed Sep 02 2009 - 22:01:09 EST


On 08/31/2009 09:21 AM, Christoph Hellwig wrote:
On Mon, Aug 31, 2009 at 09:19:58AM -0400, Mark Lord wrote:
In my opinion even that is too weak. We know how to control the cache
settings on all common disks (that is scsi and ata), so we should always
disable the write cache unless we know that the whole stack (filesystem,
raid, volume managers) supports barriers. And even then we should make
sure the filesystems does actually use barriers everywhere that's needed
which failed at for years.
..

That stack does not know that my MD device has full battery backup,
so it bloody well better NOT prevent me from enabling the write caches.

No one is going to prevent you from doing it. That question is one of
sane defaults. And always safe, but slower if you have advanced
equipment is a much better default than usafe by default on most of
the install base.


Just to add some support to this, all of the external RAID arrays that I know of normally run with write cache disabled on the component drives. In addition, many of them will disable their internal write cache if/when they detect that they have lost their UPS.

I think that if we had done this kind of sane default earlier for MD levels that do not handle barriers, we would not have left some people worried about our software RAID.

To be clear, if a sophisticated user wants to override this default, that should be supported. It is not (in my opinion) a safe default behaviour.

Ric

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/