Re: [PATCH] mm/mapping_dirty_helpers: Update huge page-table entry callbacks

From: Steven Price
Date: Fri Jan 31 2020 - 06:33:15 EST


On Fri, Jan 31, 2020 at 10:00:52AM +0000, Thomas Hellström (VMware) wrote:
> From: Thomas Hellstrom <thellstrom@xxxxxxxxxx>
>
> Following the update of pagewalk code
> commit a07984d48146 ("mm: pagewalk: add p4d_entry() and pgd_entry()")
> we can modify the mapping_dirty_helpers' huge page-table entry callbacks
> to avoid splitting when a huge pud or -pmd is encountered.
>
> Signed-off-by: Thomas Hellstrom <thellstrom@xxxxxxxxxx>
> Cc: Steven Price <steven.price@xxxxxxx>

LGTM

Reviewed-by: Steven Price <steven.price@xxxxxxx>

> ---
> mm/mapping_dirty_helpers.c | 42 ++++++++++++++++++++++++++++++++++----
> 1 file changed, 38 insertions(+), 4 deletions(-)
>
> diff --git a/mm/mapping_dirty_helpers.c b/mm/mapping_dirty_helpers.c
> index 71070dda9643..2c7d03675903 100644
> --- a/mm/mapping_dirty_helpers.c
> +++ b/mm/mapping_dirty_helpers.c
> @@ -111,26 +111,60 @@ static int clean_record_pte(pte_t *pte, unsigned long addr,
> return 0;
> }
>
> -/* wp_clean_pmd_entry - The pagewalk pmd callback. */
> +/*
> + * wp_clean_pmd_entry - The pagewalk pmd callback.
> + *
> + * Dirty-tracking should take place on the PTE level, so
> + * WARN() if encountering a dirty huge pmd.
> + * Furthermore, never split huge pmds, since that currently
> + * causes dirty info loss. The pagefault handler should do
> + * that if needed.
> + */
> static int wp_clean_pmd_entry(pmd_t *pmd, unsigned long addr, unsigned long end,
> struct mm_walk *walk)
> {
> - /* Dirty-tracking should be handled on the pte level */
> pmd_t pmdval = pmd_read_atomic(pmd);
>
> + if (!pmd_trans_unstable(&pmdval))
> + return 0;
> +
> + if (pmd_none(pmdval)) {
> + walk->action = ACTION_AGAIN;
> + return 0;
> + }
> +
> + /* Huge pmd, present or migrated */
> + walk->action = ACTION_CONTINUE;
> if (pmd_trans_huge(pmdval) || pmd_devmap(pmdval))
> WARN_ON(pmd_write(pmdval) || pmd_dirty(pmdval));
>
> return 0;
> }
>
> -/* wp_clean_pud_entry - The pagewalk pud callback. */
> +/*
> + * wp_clean_pud_entry - The pagewalk pud callback.
> + *
> + * Dirty-tracking should take place on the PTE level, so
> + * WARN() if encountering a dirty huge puds.
> + * Furthermore, never split huge puds, since that currently
> + * causes dirty info loss. The pagefault handler should do
> + * that if needed.
> + */
> static int wp_clean_pud_entry(pud_t *pud, unsigned long addr, unsigned long end,
> struct mm_walk *walk)
> {
> - /* Dirty-tracking should be handled on the pte level */
> pud_t pudval = READ_ONCE(*pud);
>
> + if (!pud_trans_unstable(&pudval))
> + return 0;
> +
> + if (pud_none(pudval)) {
> + walk->action = ACTION_AGAIN;
> + return 0;
> + }
> +
> + /* Huge pud */
> + walk->action = ACTION_CONTINUE;
> if (pud_trans_huge(pudval) || pud_devmap(pudval))
> WARN_ON(pud_write(pudval) || pud_dirty(pudval));
>
> --
> 2.21.1
>