[PATCH 0/4] CPU hotplug,cpusets: Fix CPU online handling related to cpusets

From: Srivatsa S. Bhat
Date: Tue Feb 07 2012 - 13:55:57 EST


There is a very long standing issue related to how cpusets handle CPU
hotplug events. The problem is that when a CPU goes offline, it is removed
from all cpusets. However, when that CPU comes back online, it is added
*only* to the root cpuset. Which means, any task attached to a cpuset lower
in the hierarchy will have one CPU less in its cpuset, though it had this
CPU in its cpuset before the CPU went offline.

The issue gets enormously aggravated in the case of suspend/resume. During
suspend, all non-boot CPUs are taken offline. Which means, all those CPUs
get removed from all the cpusets. When the system resumes, all CPUs are
brought back online; however, the newly onlined CPUs get added only to the
root cpuset - and all other cpusets have cpuset.cpus = 0 (boot cpu alone)!
This means, (as is obvious), all those tasks attached to non-root cpusets
will be constrained to run only on one single cpu!

So, imagine the amount of performance degradation after suspend/resume!!

In particular, libvirt is one of the active users of cpusets. And apparently,
people hit this problem long ago:
https://bugzilla.redhat.com/show_bug.cgi?id=714271

But unfortunately this never got resolved since people probably thought that
the bug was in libvirt... and all this time the kernel was the culprit!

--
Srivatsa S. Bhat (4):
CPU hotplug, cpuset: Maintain a copy of the cpus_allowed mask before CPU hotplug
cpuset: Split up update_cpumask() so that its functionality can be reused
cpuset: Add function to introduce CPUs to cpusets during CPU online
CPU hotplug, cpusets: Differentiate the CPU online and CPU offline callbacks


include/linux/cpuset.h | 4 +
kernel/cpuset.c | 177 +++++++++++++++++++++++++++++++++++++++++-------
kernel/sched/core.c | 12 +++
3 files changed, 163 insertions(+), 30 deletions(-)


Regards,
Srivatsa S. Bhat
IBM Linux Technology Center

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/