Re: [RFC PATCH 4/4] mm: Add PG_zero support

From: Andrew Morton
Date: Thu Apr 23 2020 - 20:37:04 EST

Next message: Hyunki Koo: "RE: [PATCH v8 0/3] 32-bit access for TX/RX hold registers for samsung_tty driver"
Previous message: Gershwin Wanneburg: "Hi"
In reply to: Vlastimil Babka: "Re: [RFC PATCH 4/4] mm: Add PG_zero support"
Next in thread: Matthew Wilcox: "Re: [RFC PATCH 4/4] mm: Add PG_zero support"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, 22 Apr 2020 16:09:00 +0200 Vlastimil Babka <vbabka@xxxxxxx> wrote:

> On 4/13/20 11:05 PM, Andrew Morton wrote:
> > On Mon, 13 Apr 2020 08:11:59 -0700 Alexander Duyck <alexander.h.duyck@xxxxxxxxxxxxxxx> wrote:
> >
> >> In addition, unlike madvising the page away there is a pretty
> >> significant performance penalty for having to clear the page a second
> >> time when the page is split or merged.
> >
> > I wonder if there might be an issue with increased memory traffic (and
> > increased energy consumption, etc). If a page is zeroed immediately
> > before getting data written into it (eg, plain old file write(),
> > anonymous pagefault) then we can expect that those 4096 zeroes will be
> > in CPU cache and mostly not written back. But if that page was zeroed
> > a "long" time ago, the caches will probably have been written back.
> > Net result: we go from 4k of memory traffic for a 4k page up to 8k of
> > memory traffic?
>
> Heh, I was quite sure that this is not the first time background zeroing is
> proposed, so I went to google for it... and found that one BSD kernel actually
> removed this functionality in 2016 [1] and this was one of the reasons.
>
> [1]
> https://gitweb.dragonflybsd.org/dragonfly.git/commitdiff/afd2da4dc9056ea79cdf15e8a9386a3d3998f33e

Interesting.

However this:

- Pre-zeroing a page leads to a cold-cache case on-use, forcing the fault
source (e.g. a userland program) to actually get the data from main
memory in its likely immediate use of the faulted page, reducing
performance.

implies that BSD was zeroing with non-temporal stores which bypass the
CPU cache. And which presumably invalidate any part of the target
memory which was already in cache. We wouldn't do it that way so
perhaps the results would differ.

Next message: Hyunki Koo: "RE: [PATCH v8 0/3] 32-bit access for TX/RX hold registers for samsung_tty driver"
Previous message: Gershwin Wanneburg: "Hi"
In reply to: Vlastimil Babka: "Re: [RFC PATCH 4/4] mm: Add PG_zero support"
Next in thread: Matthew Wilcox: "Re: [RFC PATCH 4/4] mm: Add PG_zero support"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]