Re: [RFC][PATCH tip/sched/core] sched/rt: Simplify the IPI rt balancing logic

From: Peter Zijlstra
Date: Mon Apr 24 2017 - 08:52:17 EST


On Mon, Apr 24, 2017 at 08:43:18AM -0400, Steven Rostedt wrote:
> On Mon, 24 Apr 2017 10:51:54 +0200
> Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
>
> > On Fri, Apr 21, 2017 at 10:49:29PM -0400, Steven Rostedt wrote:
> > > When a CPU schedules in a lower priority task and wants to make sure
> > > overloaded CPUs know about it. It increments the rto_loop_next. Then it does
> > > an atomic_inc_return() on rto_loop_start. If the returned value is not "1",
> > > then it does atomic_dec() on rt_loop_start and returns. If the value is "1",
> > > then it will take the rto_lock to synchronize with a possible IPI being sent
> > > around to the overloaded CPUs.
> >
> > > + start = atomic_inc_return(&rq->rd->rto_loop_start);
> > > + if (start != 1)
> > > + goto out;
> >
> > > +out:
> > > + atomic_dec(&rq->rd->rto_loop_start);
> >
> >
> > Did you just write a very expensive test-and-set trylock?
>
> Probably. I didn't know we had a generic one. Where is it?
>

There isn't. What I was getting at though is that something like:

static inline bool rto_start_trylock(atomic_t *v)
{
int zero = 0;
return atomic_try_cmpxchg(v, &zero, 1);
}

static void rto_start_unlock(atomic_t *v)
{
atomic_set_release(v, 0);
}

Is more: clearer, faster and correct.


Clearer as that it better describes what it does, faster as that you
only have a single atomic, and more correct because it does a RELEASE in
the case we care about.