Re: [PATCH] smp: add a best_effort version of smp_call_function_many()

From: Luigi Rizzo
Date: Tue Apr 20 2021 - 10:40:33 EST

Next message: Calvin Walton: "Re: [PATCH v2] tools/power turbostat: Fix RAPL summary collection on AMD processors"
Previous message: Christoph Hellwig: "Re: [PATCH 1/3] nds32: Cleanup deprecated function strlen_user"
In reply to: Peter Zijlstra: "Re: [PATCH] smp: add a best_effort version of smp_call_function_many()"
Next in thread: kernel test robot: "Re: [PATCH] smp: add a best_effort version of smp_call_function_many()"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Tue, Apr 20, 2021 at 3:33 PM Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
>
> On Tue, Apr 20, 2021 at 12:41:08PM +0200, Luigi Rizzo wrote:
> > On Tue, Apr 20, 2021 at 11:14 AM Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
...
> > My case too requires that the request is eventually handled, but with
> > this non-blocking IPI the caller has a better option than blocking:
> > it can either retry the multicast IPI at a later time if conditions allow,
> > or it can post a dedicated CSD (with the advantage that being my
> > requests idempotent, if the CSD is locked there is no need to retry
> > because it means the handler has not started yet).
> >
> > In fact, if we had the option to use dedicated CSDs for multicast IPI,
> > we wouldn't even need to retry because we'd know that the posted CSD
> > is for our call back and not someone else's.
>
> What are you doing that CSD contention is such a problem?

Basically what I said in a previous email: send a targeted interrupt to a
subset of the CPUs (large enough that the multicast IPI makes sense) so
they can start doing some work that has been posted for them.
Not too different from RFS, in a way.

The sender doesn't need (or want, obviously) to block, but occasional
O(100+us) stalls were clearly visible, and trivial to reproduce in tests
(e.g. when the process on the target CPU runs getrusage() and has
a very large number of threads, even if idle ones).

Even the _cond() version is not a sufficient to avoid the stall:
I could in principle use the callback to skip CPUs for which I
have a request posted and not processed yet, but if the csd
is in use by another pending IPI I have no alternative but spin.

cheers
luigi

Next message: Calvin Walton: "Re: [PATCH v2] tools/power turbostat: Fix RAPL summary collection on AMD processors"
Previous message: Christoph Hellwig: "Re: [PATCH 1/3] nds32: Cleanup deprecated function strlen_user"
In reply to: Peter Zijlstra: "Re: [PATCH] smp: add a best_effort version of smp_call_function_many()"
Next in thread: kernel test robot: "Re: [PATCH] smp: add a best_effort version of smp_call_function_many()"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]