Re: [PATCH v7] arm64/mm: Optimize loop to reduce redundant operations of contpte_ptep_get
From: Catalin Marinas
Date: Thu Jul 03 2025 - 15:05:08 EST
On Tue, 24 Jun 2025 23:25:49 +0800, Xavier Xia wrote:
> This commit optimizes the contpte_ptep_get and contpte_ptep_get_lockless
> function by adding early termination logic. It checks if the dirty and
> young bits of orig_pte are already set and skips redundant bit-setting
> operations during the loop. This reduces unnecessary iterations and
> improves performance.
>
> In order to verify the optimization performance, a test function has been
> designed. The function's execution time and instruction statistics have
> been traced using perf, and the following are the operation results on a
> certain Qualcomm mobile phone chip:
>
> [...]
Applied to arm64 (for-next/misc), thanks!
[1/1] arm64/mm: Optimize loop to reduce redundant operations of contpte_ptep_get
https://git.kernel.org/arm64/c/093ae7a033cf
--
Catalin