Re: document ext3 requirements

From: david
Date: Sun Jan 04 2009 - 22:00:28 EST


On Sun, 4 Jan 2009, Rob Landley wrote:

On Sunday 04 January 2009 17:13:08 Sitsofe Wheeler wrote:
Pavel Machek wrote:
Is there linux filesystem that can handle that? I know jffs2, but
that's unsuitable for stuff like USB thumb drives, right?

This raises the question that if nothing can handle it which FS is the
least bad? The last I heard people were saying that with cheap SSDs the
recommendation was FAT [1] but in the future btrfs, nilfs and logfs
would be better.

[1] http://lkml.org/lkml/2008/10/14/129

I wonder if the flash filesystems could be told via mount options that they're
to use a normal block device as if it was a flash with granularity X?

They can't explicitly control erase, but writing to any block in a block group
will erase and rewrite the whole group so they can just do large write
transactions close to each other and the device should aggregate enough for an
erase block. (Plus don't touch anything _outside_ where you guess an erase
block to be until you've finished writing the whole block, which they
presumably already do.)

this capability would help for raid arrays as well.

if you have a raid5/6 array writing one sector to a stripe results in you reading the pairity block for that stripe, reading the rest of the sectors for the block on that disk, recalculating the pairity information and writing the changed sectors out to both disks.

if you are writing the entire stripe, you could calculate the pairity and just write everything out (no reads nessasary).

this would make sequential writes to raid5/6 arrays almost as fast as if they were raid0 stripes.

if you could define 'erase block size' to be the raid stripe size the same approach would work for both systems.

when I asked about the on the md list a couple of years ago, the response that I got was that it was a good idea, but there was no way to get the information about the low-level topology to the higher levels that would need to act on the information. now that there is a second case where this is needed, any mechanism that gets created should be made so that it's useable for both.

David Lang
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/