Re: [Ext2-devel] ext2/ialloc.c cleanup

From: Andreas Dilger (adilger@turbolabs.com)
Date: Fri Nov 09 2001 - 01:56:32 EST


On Nov 08, 2001 22:15 -0800, Andrew Morton wrote:
> max_dirs = ndirs / ngroups + inodes_per_group / 16;
> min_inodes = avefreei - inodes_per_group / 4;
> min_blocks = avefreeb - EXT2_BLOCKS_PER_GROUP(sb) / 4;
>
> things start to get a bit confusing. A couple of 1-2 line comments
> which explain what the variables actually represent would help to
> clarify things. Also, an explanation of `debt' is needed.

Yes, my eyes glazed over at this point as well.

> Tree Stock Stock/ideal Patched Patched/stock Orlov Orlov/ideal
> (secs) (secs) (secs)

Andrew, could you stick with a single metric (either /ideal or /stock)?

> 12 54 4.15 101 187.04% 76 5.85
> 13 82 6.31 104 126.83% 78 6
> 14 89 6.85 103 115.73% 77 5.92
> 15 88 6.77 95 107.95% 77 5.92
> 16 106 8.15 99 93.40% 77 5.92

What else would be useful here is percent full for the filesystem.
Since stock ext2 preallocates up to 7 blocks for each file, this is good
for reducing fragmentation, but as the filesystem gets full it makes
fragmentation worse in the end.

> So I just don't know at this stage. Even after a single pass of the Smith
> workload, we're running at 3x to 5x worse than ideal. If that's simply
> the best we can do, then we need to compare stock 2.4.14 with Orlov
> partway through the progress of the Smith workload to evaluate how much
> more serious the fragmentation is. That's easy enough - I'll do it.
>
> The next task is to work out why a single pass of the Smith workload
> fragments the filesystem so much, and whether this can be improved.

After reading the paper, one possible cause of skewed results may be
a result of their "reverse engineering" of the filesystem layout from
the inode numbers. They state that they were not able to capture the
actual pathnames of the files in the workload, so they invented pathnames
that would put the files in the same cylinder groups based on the FFS
allocation algorithms then in use, assuming the test filesystem had an
identical layout.

It may be possible to hack the test data into ext2 by creating a filesystem
with the same number of block groups as the test FFS filesystem with the
Smith workload. It may also not be valid for our needs, as we are playing
with the actual group selection algorithm, so real pathnames may give us
a different layout.

Have you ever looked at the resulting data from a Smith test? Does it do
more than simply create 27 directories (the number of cylinder groups) and
dump files directly therein, as opposed to creating more subdirectories?
Remember that they were strictly concerned with block allocation for files,
and not cylinder group selection for directories.

Cheers, Andreas

--
Andreas Dilger
http://sourceforge.net/projects/ext2resize/
http://www-mddsp.enel.ucalgary.ca/People/adilger/

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Thu Nov 15 2001 - 21:00:21 EST