Re: [PATCH 00/31] Move LRU page reclaim from zones to nodes v8

From: Dave Chinner
Date: Thu Jul 07 2016 - 19:27:30 EST


On Fri, Jul 01, 2016 at 04:37:15PM +0100, Mel Gorman wrote:
> Previous releases double accounted LRU stats on the zone and the node
> because it was required by should_reclaim_retry. The last patch in the
> series removes the double accounting. It's not integrated with the series
> as reviewers may not like the solution. If not, it can be safely dropped
> without a major impact to the results.
....
> tiobench on ext4
> ----------------

[snip other tests on ext4 which show good results]

.....
> This series is not without its hazards. There are at least three areas
> that I'm concerned with even though I could not reproduce any problems in
> that area.
>
> 1. Reclaim/compaction is going to be affected because the amount of reclaim is
> no longer targetted at a specific zone. Compaction works on a per-zone basis
> so there is no guarantee that reclaiming a few THP's worth page pages will
> have a positive impact on compaction success rates.
>
> 2. The Slab/LRU reclaim ratio is affected because the frequency the shrinkers
> are called is now different. This may or may not be a problem but if it
> is, it'll be because shrinkers are not called enough and some balancing
> is required.

Given that XFS has a much more complex set of shrinkers and has a
much more finely tuned balancing between LRU and shrinker reclaim,
I'd be interested to see if you get the same results on XFS for the
tests you ran on ext4. It might also be worth running some highly
concurrent inode cache benchmarks (e.g. the 50-million inode, 16-way
concurrent fsmark tests) to see what impact heavy slab cache
pressure has on shrinker behaviour and system balance...

Cheers,

Dave.
--
Dave Chinner
david@xxxxxxxxxxxxx