Re: [RFC REPOST] cgroup: removing css reference drain wait duringcgroup removal

From: KAMEZAWA Hiroyuki
Date: Wed Mar 14 2012 - 20:18:29 EST


(2012/03/14 18:46), Glauber Costa wrote:

> On 03/14/2012 04:28 AM, KAMEZAWA Hiroyuki wrote:
>> IIUC, in general, even in the processes are in a tree, in major case
>> of servers, their workloads are independent.
>> I think FLAT mode is the dafault. 'heararchical' is a crazy thing which
>> cannot be managed.
>
> Better pay attention to the current overall cgroups discussions being
> held by Tejun then. ([RFD] cgroup: about multiple hierarchies)
>
> The topic of whether of adapting all cgroups to be hierarchical by
> deafult is a recurring one.
>
> I personally think that it is not unachievable to make res_counters
> cheaper, therefore making this less of a problem.
>


I thought of this a little yesterday. Current my idea is applying following
rule for res_counter.

1. All res_counter is hierarchical. But behavior should be optimized.

2. If parent res_counter has UNLIMITED limit, 'usage' will not be propagated
to its parent at _charge_.

3. If a res_counter has UNLIMITED limit, at reading usage, it must visit
all children and returns a sum of them.

Then,
/cgroup/
memory/ (unlimited)
libivirt/ (unlimited)
qeumu/ (unlimited)
guest/(limited)

All dir can show hierarchical usage and the guest will not have
any lock contention at runtime.


By this
1. no runtime overhead if the parent has unlimited limit.
2. All res_counter can show aggregate resource usage of children.

To do this
1. res_coutner should have children list by itself.

Implementation problem
- What should happens when a user set new limit to a res_counter which have
childrens ? Shouldn't we allow it ? Or take all locks of children and
update in atomic ?
- memory.use_hierarchy should be obsolete ?

Other problem I'm not sure at all
- blkcg doesn't support hierarchy at all.

Hmm.

Thanks,
-Kame








--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/