Re: O_DIRECT to md raid 6 is slow

From: Stan Hoeppner
Date: Sun Aug 19 2012 - 19:34:27 EST


On 8/19/2012 9:01 AM, David Brown wrote:
> I'm sort of jumping in to this thread, so my apologies if I repeat
> things other people have said already.

I'm glad you jumped in David. You made a critical statement of fact
below which clears some things up. If you had stated it early on,
before Miquel stole the thread and moved it to LKML proper, it would
have short circuited a lot of this discussion. Which is:

> AFAIK, there is scope for a few performance optimisations in raid6. One
> is that for small writes which only need to change one block, raid5 uses
> a "short-cut" RMW cycle (read the old data block, read the old parity
> block, calculate the new parity block, write the new data and parity
> blocks). A similar short-cut could be implemented in raid6, though it
> is not clear how much a difference it would really make.

Thus my original statement was correct, or at least half correct[1], as
it pertained to md/RAID6. Then Miquel switched the discussion to
md/RAID5 and stated I was all wet. I wasn't, and neither was Dave
Chinner. I was simply unaware of this md/RAID5 single block write RMW
shortcut. I'm copying lkml proper on this simply to set the record
straight. Not that anyone was paying attention, but it needs to be in
the same thread in the archives. The takeaway:

md/RAID6 must read all devices in a RMW cycle.

md/RAID5 takes a shortcut for single block writes, and must only read
one drive for the RMW cycle.

[1}The only thing that's not clear at this point is if md/RAID6 also
always writes back all chunks during RMW, or only the chunk that has
changed.

--
Stan

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/