Re: [PATCH v7 4/5] rust: add find_bit_benchmark_rust module.

From: Burak Emir
Date: Thu Apr 24 2025 - 12:45:55 EST

Next message: Jacek Kowalski: "Re: [PATCH] e1000e: disregard NVM checksum on tgp when valid checksum mask is not set"
Previous message: tip-bot2 for Ard Biesheuvel: " [tip: x86/urgent] x86/boot: Work around broken busybox 'truncate' tool"
In reply to: Alice Ryhl: "Re: [PATCH v7 4/5] rust: add find_bit_benchmark_rust module."
Next in thread: Boqun Feng: "Re: [PATCH v7 4/5] rust: add find_bit_benchmark_rust module."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, Apr 23, 2025 at 6:56 PM Yury Norov <yury.norov@xxxxxxxxx> wrote:
> So? Can you show your numbers?

For now, I only have numbers that may not be very interesting:

- for find_next_bit, find_next_zero_bit and find_next_zero_bit (sparse):
22 ns/iteration in C, 32 ns/iteration in Rust.

- for sparse find_next_bit (sparse):
60 ns/iteration in C, 70 ns/iteration in Rust.

This is a VM running nested in a VM. More importantly: the C helper
method is not inlined.
So we are likely measuring the overhead (plus the extra bounds checking).

I would like to get cross-language inlining to work with thinLTO to
have a more realistic comparison.
However, that is not something that works out of the box.
I am looking at Gary Guo's patch for this:
https://lore.kernel.org/all/20250319205141.3528424-1-gary@xxxxxxxxxxx/
Currently, I get duplicate symbol errors.

> Can you print the existing C test output back to back with the new one?
> Can you also ask 0-day folks to enable your test in their rust config?

Will look into these. Rong (hi!) is working on LTO for kernel and will
know a lot more than me how Rust will fit in eventually.
IMHO, making cross-language inlining work out of the box will be a
necessary baseline to get Rust performance for hot code.

> > We add a fill_random() method protected by the config in order to
> > maintain the abstraction.
> >
> > Minor fix to the documentation of the corresponding C config
> > FIND_BIT_BENCHMARK, it was mentioning the wrong module name.
>
> Indeed. Can you make it a separate patch, please?

Will do.

Next message: Jacek Kowalski: "Re: [PATCH] e1000e: disregard NVM checksum on tgp when valid checksum mask is not set"
Previous message: tip-bot2 for Ard Biesheuvel: " [tip: x86/urgent] x86/boot: Work around broken busybox 'truncate' tool"
In reply to: Alice Ryhl: "Re: [PATCH v7 4/5] rust: add find_bit_benchmark_rust module."
Next in thread: Boqun Feng: "Re: [PATCH v7 4/5] rust: add find_bit_benchmark_rust module."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]