Re: [RFC PATCH v3 0/3] sched: simplify the select_task_rq_fair()

From: Peter Zijlstra
Date: Thu Feb 21 2013 - 05:21:15 EST


On Thu, 2013-02-21 at 12:51 +0800, Michael Wang wrote:
> The old logical when locate affine_sd is:
>
> if prev_cpu != curr_cpu
> if wake_affine()
> prev_cpu = curr_cpu
> new_cpu = select_idle_sibling(prev_cpu)
> return new_cpu
>
> The new logical is same to the old one if prev_cpu == curr_cpu, so
> let's
> simplify the old logical like:
>
> if wake_affine()
> new_cpu = select_idle_sibling(curr_cpu)
> else
> new_cpu = select_idle_sibling(prev_cpu)
>
> return new_cpu
>
> Actually that doesn't make sense.

It does :-)

> I think wake_affine() is trying to check whether move a task from
> prev_cpu to curr_cpu will break the balance in affine_sd or not, but
> why
> won't break balance means curr_cpu is better than prev_cpu for
> searching
> the idle cpu?

It doesn't, the whole affine wakeup stuff is meant to pull waking tasks
towards the cpu that does the wakeup, we limit this by putting bounds on
the imbalance this is may create.

The reason we want to run tasks on the cpu that does the wakeup is
because that cpu 'obviously' is running something related and it seems
like a good idea to run related tasks close together.

So look at affine wakeups as a force that groups related tasks.

> So the new logical in this patch set is:
>
> new_cpu = select_idle_sibling(prev_cpu)
> if idle_cpu(new_cpu)
> return new_cpu
>
> new_cpu = select_idle_sibling(curr_cpu)
> if idle_cpu(new_cpu) {
> if wake_affine()
> return new_cpu
> }
>
> return prev_cpu
>
> And now, unless we are really going to move load from prev_cpu to
> curr_cpu, we won't use wake_affine() any more.

That's completely breaks stuff, not cool.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/