Re: [PATCH] subset zonelists and big numa friendly mempolicyMPOL_MBIND

From: Paul Jackson
Date: Wed Aug 04 2004 - 17:38:01 EST


A couple of days ago, Andi wrote:
> My first reaction that if you really want to do that, just pass
> the policy node bitmap to alloc_pages and try_to_free_pages
> and use the normal per node zone list with the bitmap as filter.

I have coded your suggestion up, and indeed, it is much simpler

I added an optional (#ifdef CONFIG_CPUSET) "mems_allowed" field to the
task_struct, and filter zones on this mask (if not in_interrupt()) in
the page allocation and free'ing code.

Thank-you. I think you're right, and it's the way to go.

I will be posting a full cpuset patch that includes this change on
lse-tech and lkml, hopefully in a few hours. I will be most grateful
for any further feedback.

Providing support for dividing very large systems into subsets of
nodes (cpu and memory), in order to obtain good memory locality and
good isolation of big jobs from each other, is essential to the success
of certain high end Linux users.

--
I won't rest till it's the best ...
Programmer, Linux Scalability
Paul Jackson <pj@xxxxxxx> 1.650.933.1373
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/