Re: [PATCH v4 0/3] FUSE: Implement atomic lookup + open/create

From: Vivek Goyal
Date: Thu May 05 2022 - 08:54:13 EST


On Thu, May 05, 2022 at 11:42:51AM +0530, Dharmendra Hans wrote:
> On Thu, May 5, 2022 at 12:48 AM Vivek Goyal <vgoyal@xxxxxxxxxx> wrote:
> >
> > On Mon, May 02, 2022 at 03:55:18PM +0530, Dharmendra Singh wrote:
> > > In FUSE, as of now, uncached lookups are expensive over the wire.
> > > E.g additional latencies and stressing (meta data) servers from
> > > thousands of clients. These lookup calls possibly can be avoided
> > > in some cases. Incoming three patches address this issue.
> >
> > BTW, these patches are designed to improve performance by cutting down
> > on number of fuse commands sent. Are there any performance numbers
> > which demonstrate what kind of improvement you are seeing.
> >
> > Say, If I do kernel build, is the performance improvement observable?
>
> Here are the numbers I took last time. These were taken on tmpfs to
> actually see the effect of reduced calls. On local file systems it
> might not be that much visible. But we have observed that on systems
> where we have thousands of clients hammering the metadata servers, it
> helps a lot (We did not take numbers yet as we are required to change
> a lot of our client code but would be doing it later on).
>
> Note that for a change in performance number due to the new version of
> these patches, we have just refactored the code and functionality has
> remained the same since then.
>
> here is the link to the performance numbers
> https://lore.kernel.org/linux-fsdevel/20220322121212.5087-1-dharamhans87@xxxxxxxxx/

There is a lot going in that table. Trying to understand it.

- Why care about No-Flush. I mean that's independent of these changes,
right? I am assuming this means that upon file close do not send
a flush to fuse server. Not sure how bringing No-Flush into the
mix is helpful here.

- What is "Patched Libfuse"? I am assuming that these are changes
needed in libfuse to support atomic create + atomic open. Similarly
assuming "Patched FuseK" means patched kernel with your changes.

If this is correct, I would probably only be interested in
looking at "Patched Libfuse + Patched FuseK" numbers to figure out
what's the effect of your changes w.r.t vanilla kernel + libfuse.
Am I understanding it right?

- I am wondering why do we measure "Sequential" and "Random" patterns.
These optimizations are primarily for file creation + file opening
and I/O pattern should not matter.

- Also wondering why performance of Read/s improves. Assuming once
file has been opened, I think your optimizations get out of the
way (no create, no open) and we are just going through data path of
reading file data and no lookups happening. If that's the case, why
do Read/s numbers show an improvement.

- Why do we measure "Patched Libfuse". It shows performance regression
of 4-5% in table 0B, Sequential workoad. That sounds bad. So without
any optimization kicking in, it has a performance cost.

Thanks
Vivek