Re: [RFC PATCH 0/2] Union Mount: Directory listing in glibc

From: Erez Zadok
Date: Wed Apr 30 2008 - 12:39:27 EST


In message <c810d5090804292307y2884b169t5c27b50fb7c94c30@xxxxxxxxxxxxxx>, "NAGABHUSHAN BS" writes:
> On Tue, Apr 29, 2008 at 9:19 PM, Erez Zadok <ezk@xxxxxxxxxxxxx> wrote:
> > In message <20080429133201.GA9938@xxxxxxxxxxxxxxxxxxxxx>, bsn.0007@xxxxxxxxx writes:
[...]

> > You might consider using a hash table instead of a list; it'll be
> > faster in case where there are a lot of whiteouts/duplicates to
> > process.
> >
>
> Yes, hash table was one of the things i had considered to use as a
> cache. But i am not completely sure how this would support seekdir.
> Any suggestions are greatly welcome.
>
> Regards Nagabhushan

Here's one idea:

1. Your main data structure is a linear array of struct dirents. You
perform seekdir/readdir on this linear structure, which should be
trivial. Each time you find a dirent which definitely needs to go into
this structure, you append to it. If you're about to run out of space in
it, you can realloc it 2x its previous size (a common technique). This
structure remains cached in memory for as long as the directory is open.

Optimization 1: you can keep it cached past close(fd), b/c recursive
programs like /bin/find may close and reopen a directory several times.
But you'll need to determine if the directory's mtime had changed and if
so, purge the cached content and reconstruct it.

Optimization 2: if you can do a stat(2) on each directory in the union,
and sum their total size, that'll provide you with an upper bound on the
size of your dirent cache. In that case, you can forego the realloc I
mentioned, and just malloc one large array at once. This could be more
efficient than realloc'ing for two reasons: (a) realloc may have to
memcpy data, which you can avoid; and (b) the rest of the malloc'ed
array, near its end, may be unused, but as long as you don't touch it,
it'll be one large-ish extent of virtual memory for which physical page
frames won't be actually needed. Even better, you can realloc() that
large malloc'ed area to cut back on its size to exactly what you need.

2. During the construction of the dirent array above, you keep two hash
tables:

2a. HT1 is used for duplicate elimination. It contains POINTERS ONLY (or
offsets) into the dirent cache buffer. You add entries into it each
time you append a name to the dirent cache array. This HT1 allows you
to quickly find out if a string name had been seen before. When you're
done constructing the dirent cache, free(HT1).

2b. HT2 is used for whiteouts. It contains the actual strings of whited-out
entries you've seen, which you add each time you find a whiteout. You
look this HT2 each time you need to determine if an entry you're about
to add to the dirent cache had been whited-out or not. When you're done
constructing the dirent cache, free(HT2).

Hope you can use this as a starting point.

Cheers,
Erez.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/