Re: [PATCH v7 2/2] sched/fair: update scale invariance of PELT

From: Peter Zijlstra
Date: Thu Nov 29 2018 - 07:54:04 EST


On Wed, Nov 28, 2018 at 11:53:36AM +0000, Patrick Bellasi wrote:

> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index ac855b2f4774..93e0cf5d8a76 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -3661,6 +3661,10 @@ util_est_dequeue(struct cfs_rq *cfs_rq, struct task_struct *p, bool task_sleep)
> if (!task_sleep)
> return;
>
> + /* Skip samples which do not represent an actual utilization */
> + if (unlikely(task_util(p) > capacity_of(task_cpu(p))))
> + return;
> +
> /*
> * If the PELT values haven't changed since enqueue time,
> * skip the util_est update.

Would you not want something like:

min(task_util(p), capacity_of(task_cpu(p)))

And is this the only place where we need this?

OTOH, if the task is always running, it will be always running
irrespective of where it runs.

Not storing these samples seems weird though; this is the exact
condition you want to record -- the task is very active, if we skip
these, we'll come back at a low frequency on the next wakeup.