Re: [PATCH UPDATED 2/4] sched: adjust when cpu_active and cpuset configurations are updated during cpu on/offlining

From: Peter Zijlstra
Date: Fri Jun 04 2010 - 07:58:21 EST


On Wed, 2010-06-02 at 18:03 +0200, Tejun Heo wrote:
> Currently, when a cpu goes down, cpu_active is cleared before
> CPU_DOWN_PREPARE starts and cpuset configuration is updated from a
> default priority cpu notifier. When a cpu is coming up, cpu_active is
> set before CPU_ONLINE but cpuset configuration is again updated from
> the same cpu notifier.
>
> For cpu notifiers, this presents an inconsistent state. Threads which
> a CPU_DOWN_PREPARE notifier expects to be bound to the CPU can be
> migrated to other cpus because the cpu is no longer active.
>
> Fix it by updating cpu_active in the highest priority cpu notifier and
> cpuset configuration in the second highest when a cpu is coming up.
> Down path is updated similarly. This guarantees that all other cpu
> notifiers see consistent cpu_active and cpuset configuration.
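
The ordering described above can be sketched in user space. This is only a toy model, not the kernel code: the chain, the event enum and the register_notifier()/run_chain() helpers are hypothetical stand-ins, and the concrete priority values (INT_MAX, INT_MAX - 1 on the way up, INT_MIN + 1, INT_MIN on the way down) are assumptions chosen to illustrate "highest" and "second highest" and a mirrored down path. An ordinary, default priority notifier records what it observes when it runs.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical stand-ins for the hotplug events discussed in the patch. */
enum hp_event { HP_ONLINE, HP_DOWN_PREPARE };

struct notifier {
    int priority;                      /* higher priority runs earlier */
    void (*call)(enum hp_event ev);
};

static bool cpu_active;     /* models the cpu_active bit of one cpu */
static bool cpuset_active;  /* models what the cpuset configuration believes */
static bool seen_active;    /* what the ordinary notifier observed */
static bool seen_consistent;

/* Up path: cpu_active first, cpuset configuration second. */
static void sched_cpu_active(enum hp_event ev)
{
    if (ev == HP_ONLINE)
        cpu_active = true;
}

static void cpuset_cpu_active(enum hp_event ev)
{
    if (ev == HP_ONLINE)
        cpuset_active = cpu_active;
}

/* Down path (assumed to mirror the up path, running after everyone else). */
static void sched_cpu_inactive(enum hp_event ev)
{
    if (ev == HP_DOWN_PREPARE)
        cpu_active = false;
}

static void cpuset_cpu_inactive(enum hp_event ev)
{
    if (ev == HP_DOWN_PREPARE)
        cpuset_active = cpu_active;
}

/* A default priority notifier, e.g. one that binds a thread to the cpu. */
static void ordinary_notifier(enum hp_event ev)
{
    (void)ev;
    seen_active = cpu_active;
    seen_consistent = (cpu_active == cpuset_active);
}

#define MAX_NOTIFIERS 8
static struct notifier chain[MAX_NOTIFIERS];
static size_t nr_notifiers;

/* Sort descending by priority without risking integer overflow. */
static int by_priority_desc(const void *a, const void *b)
{
    int pa = ((const struct notifier *)a)->priority;
    int pb = ((const struct notifier *)b)->priority;
    return (pb > pa) - (pb < pa);
}

static void register_notifier(int priority, void (*call)(enum hp_event))
{
    assert(nr_notifiers < MAX_NOTIFIERS);
    chain[nr_notifiers].priority = priority;
    chain[nr_notifiers].call = call;
    nr_notifiers++;
    qsort(chain, nr_notifiers, sizeof(chain[0]), by_priority_desc);
}

static void setup_chain(void)
{
    nr_notifiers = 0;
    cpu_active = cpuset_active = false;
    register_notifier(0, ordinary_notifier);
    register_notifier(INT_MAX, sched_cpu_active);       /* assumed value */
    register_notifier(INT_MAX - 1, cpuset_cpu_active);  /* assumed value */
    register_notifier(INT_MIN + 1, sched_cpu_inactive); /* assumed value */
    register_notifier(INT_MIN, cpuset_cpu_inactive);    /* assumed value */
}

static void run_chain(enum hp_event ev)
{
    for (size_t i = 0; i < nr_notifiers; i++)
        chain[i].call(ev);
}
```

Calling setup_chain() and then run_chain(HP_ONLINE) followed by run_chain(HP_DOWN_PREPARE) lets the ordinary notifier see the cpu as active, with a matching cpuset view, in both cases. In this model that is the property a CPU_DOWN_PREPARE callback relies on when it binds a thread to the outgoing cpu.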
>
> The cpuset_track_online_cpus() notifier is converted to
> cpuset_update_active_cpus(), which just updates the configuration and
> is now called from the cpuset_cpu_[in]active() notifiers registered from
> sched_init_smp(). If cpuset is disabled, cpuset_update_active_cpus()
> degenerates into partition_sched_domains(), making a separate notifier
> for !CONFIG_CPUSETS unnecessary.
>
> This problem is triggered by cmwq. During CPU_DOWN_PREPARE, a hotplug
> callback creates a kthread and kthread_bind()s it to the target cpu,
> and the thread is expected to run on that cpu.
>
> Signed-off-by: Tejun Heo <tj@xxxxxxxxxx>
> Cc: Rusty Russell <rusty@xxxxxxxxxxxxxxx>
> Cc: Peter Zijlstra <a.p.zijlstra@xxxxxxxxx>
> Cc: Ingo Molnar <mingo@xxxxxxx>
> Cc: Paul Menage <menage@xxxxxxxxxx>
> ---
> Okay, how about this one? Using notifiers seems better for the
> following reasons.
>
> * Rollback on failure.
>
> * cpuset/sched_domain don't expect to be called before smp
> configuration is complete. If hardcoded into cpu_up/down(),
> condition checks need to be added so that they're skipped if the
> system is bringing up the cpus for the first time.
>
> Works fine here w/ CPUSET enabled and disabled.

OK, this looks good.

Thanks Tejun.