Re: [RFC][PATCH] page reclaim throttle take2

From: David Rientjes
Date: Wed Feb 27 2008 - 02:57:56 EST


On Wed, 27 Feb 2008, KAMEZAWA Hiroyuki wrote:

> Hmm, but kswapd, which is main worker of page reclaiming, is per-node.
> And reclaim is done based on zone.
> per-zone/per-node throttling seems to make sense.
>

That's another argument for not introducing the sysctl; the number of
nodes and zones are a static property of the machine that cannot change
without a reboot (numa=fake, mem=, introducing movable zones, etc). We
don't have node hotplug that can suddenly introduce additional zones from
which to reclaim.

My point was that there doesn't appear to be any use case for tuning this
via a sysctl that isn't simply attempting to workaround some other reclaim
problem when the VM is stressed. If that's agreed upon, then deciding
between a config option that is either per-cpu or per-node should be based
on the benchmarks that you've run. At this time, it appears that per-node
is the more advantageous.

> I know his environment has 4cpus per node but throttle to 3 was the best
> number in his measurement. Then it seems num-per-cpu is excessive.
> (At least, ratio(%) is better.)

That seems to indicate that the NUMA topology is more important than lock
contention for the reclaim throttle.

David
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/