Re: [v3 0/9] parallelized "struct page" zeroing

From: David Miller
Date: Wed May 10 2017 - 11:19:51 EST


From: Michal Hocko <mhocko@xxxxxxxxxx>
Date: Wed, 10 May 2017 16:57:26 +0200

> Have you measured that? I do not think it would be super hard to
> measure. I would be quite surprised if this added much if anything at
> all as the whole struct page should be in the cache line already. We do
> set reference count and other struct members. Almost nobody should be
> looking at our page at this time and stealing the cache line. On the
> other hand a large memcpy will basically wipe everything away from the
> cpu cache. Or am I missing something?

I guess it might be clearer if you understand what the block
initializing stores do on sparc64. There are no memory accesses at
all.

The cpu just zeros out the cache line, that's it.

No L3 cache line is allocated. So this "wipe everything" behavior
will not happen in the L3.