Re: [PATCH 1/9] ksm: fix mlockfreed to munlocked

From: KOSAKI Motohiro
Date: Mon Nov 30 2009 - 01:01:51 EST


> On Thu, 26 Nov 2009, Mel Gorman wrote:
> > On Tue, Nov 24, 2009 at 04:40:55PM +0000, Hugh Dickins wrote:
> > > When KSM merges an mlocked page, it has been forgetting to munlock it:
> > > that's been left to free_page_mlock(), which reports it in /proc/vmstat
> > > as unevictable_pgs_mlockfreed instead of unevictable_pgs_munlocked (and
> > > whinges "Page flag mlocked set for process" in mmotm, whereas mainline
> > > is silently forgiving). Call munlock_vma_page() to fix that.
> > >
> > > Signed-off-by: Hugh Dickins <hugh.dickins@xxxxxxxxxxxxx>
> >
> > Acked-by: Mel Gorman <mel@xxxxxxxxx>
>
> Rik & Mel, thanks for the Acks.
>
> But please clarify: that patch was for mmotm and hopefully 2.6.33,
> but the vmstat issue (minus warning message) is there in 2.6.32-rc.
> Should I
>
> (a) forget it for 2.6.32
> (b) rush Linus a patch for 2.6.32 final
> (c) send a patch for 2.6.32.stable later on

I personally prefer (3). though I don't know ksm so detail.


>
> ? I just don't have a feel for how important this is.
>
> Typically, these pages are immediately freed, and the only issue is
> which stats they get added to; but if fork has copied them into other
> mms, then such pages might stay unevictable indefinitely, despite no
> longer being in any mlocked vma.
>
> There's a remark in munlock_vma_page(), apropos a different issue,
> /*
> * We lost the race. let try_to_unmap() deal
> * with it. At least we get the page state and
> * mlock stats right. However, page is still on
> * the noreclaim list. We'll fix that up when
> * the page is eventually freed or we scan the
> * noreclaim list.
> */
> which implies that sometimes we scan the unevictable list and resolve
> such cases. But I wonder if that's nowadays the case?

We don't scan unevictable list at all. munlock_vma_page() logic is.

1) clear PG_mlock always anyway
2) isolate page
3) scan related vma and remark PG_mlock if necessary

So, as far as I understand, the above comment describe the case when (2) is
failed. it mean another task already isolated the page. it makes the task
putback the page to evictable list and vmscan's try_to_unmap() move
the page to unevictable list again.



--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/