Re: [RFC PATCH v9 13/27] x86/mm: Shadow Stack page fault error checking

From: Dave Hansen
Date: Wed Feb 26 2020 - 17:47:20 EST


On 2/5/20 10:19 AM, Yu-cheng Yu wrote:
> If a page fault is triggered by a Shadow Stack (SHSTK) access
> (e.g. CALL/RET) or SHSTK management instructions (e.g. WRUSSQ), then bit[6]
> of the page fault error code is set.

How about starting with a definition:

Shadow stack accesses are those that are performed by the CPU
where it expects to encounter a shadow stack mapping. These
accesses are performed implicitly by CALL/RET at the site of the
shadow stack pointer. These accesses are made explicitly by
shadow stack management instructions like WRUSSQ.

> In access_error(), verify a SHSTK page fault is within a SHSTK memory area.
> It is always an error otherwise.

How about: Shadow stacks accesses to shadow-stack mapping can see faults
in normal, valid operation just like regular accesses to regular
mappings. Shadow stacks need some of the same features like delayed
allocation, swap and copy-on-write.

Shadow stack accesses can also result in errors, such as when a shadow
stack overflows, or if a shadow stack access occurs to a
non-shadow-stack mapping.

> For a valid SHSTK access, set FAULT_FLAG_WRITE to effect copy-on-write.

It seems rather odd to want copy-on-write behavior for read faults.
Could you elaborate on why, please?

> diff --git a/arch/x86/include/asm/traps.h b/arch/x86/include/asm/traps.h
> index 7ac26bbd0bef..8023d177fcd8 100644
> --- a/arch/x86/include/asm/traps.h
> +++ b/arch/x86/include/asm/traps.h
> @@ -169,6 +169,7 @@ enum {
> * bit 3 == 1: use of reserved bit detected
> * bit 4 == 1: fault was an instruction fetch
> * bit 5 == 1: protection keys block access
> + * bit 6 == 1: shadow stack access fault
> */
> enum x86_pf_error_code {
> X86_PF_PROT = 1 << 0,
> @@ -177,5 +178,6 @@ enum x86_pf_error_code {
> X86_PF_RSVD = 1 << 3,
> X86_PF_INSTR = 1 << 4,
> X86_PF_PK = 1 << 5,
> + X86_PF_SHSTK = 1 << 6,
> };
> #endif /* _ASM_X86_TRAPS_H */
> diff --git a/arch/x86/mm/fault.c b/arch/x86/mm/fault.c
> index 304d31d8cbbc..9c1243302663 100644
> --- a/arch/x86/mm/fault.c
> +++ b/arch/x86/mm/fault.c
> @@ -1187,6 +1187,17 @@ access_error(unsigned long error_code, struct vm_area_struct *vma)
> (error_code & X86_PF_INSTR), foreign))
> return 1;
>
> + /*
> + * Verify X86_PF_SHSTK is within a Shadow Stack VMA.
> + * It is always an error if there is a Shadow Stack
> + * fault outside a Shadow Stack VMA.
> + */

Nit: there was an access that caused the fault. We can be a bit more
broad in the implications from the comment if we say "access" instead of
"fault".

> + if (error_code & X86_PF_SHSTK) {
> + if (!(vma->vm_flags & VM_SHSTK))
> + return 1;
> + return 0;
> + }
> +
> if (error_code & X86_PF_WRITE) {
> /* write, present and write, not present: */
> if (unlikely(!(vma->vm_flags & VM_WRITE)))

Is there an analogous check for !X86_PF_SHSTK faults to VM_SHSTK VMAs?

> @@ -1344,6 +1355,13 @@ void do_user_addr_fault(struct pt_regs *regs,
>
> perf_sw_event(PERF_COUNT_SW_PAGE_FAULTS, 1, regs, address);
>
> + /*
> + * If the fault is caused by a Shadow Stack access,
> + * i.e. CALL/RET/SAVEPREVSSP/RSTORSSP, then set
> + * FAULT_FLAG_WRITE to effect copy-on-write.
> + */
> + if (hw_error_code & X86_PF_SHSTK)
> + flags |= FAULT_FLAG_WRITE;
> if (hw_error_code & X86_PF_WRITE)
> flags |= FAULT_FLAG_WRITE;
> if (hw_error_code & X86_PF_INSTR)

It would be great if you could also include the *why*. *Why* do read
faults need copy-on-write semantics?