Re: [PATCH v2] x86/kvm: Disable KVM_ASYNC_PF_SEND_ALWAYS

From: Paolo Bonzini
Date: Mon Mar 09 2020 - 04:40:15 EST


On 09/03/20 07:57, Thomas Gleixner wrote:
> Thomas Gleixner <tglx@xxxxxxxxxxxxx> writes:
>> Thomas Gleixner <tglx@xxxxxxxxxxxxx> writes:
>>> Andy Lutomirski <luto@xxxxxxxxxx> writes:
>>>> On Sat, Mar 7, 2020 at 7:47 AM Thomas Gleixner <tglx@xxxxxxxxxxxxx> wrote:
>>>>> If MCE, NMI trigger a real pagefault then the #PF injection needs to
>>>>> clear apf_reason and set the correct CR2. When that #PF returns then the
>>>>> old CR2 and apf_reason need to be restored.
>>>>
>>> The host does not care about the IRET. It solely has to check whether
>>> apf_reason is 0 or not. That way it knows that the guest has read CR2
>>> and apf_reason.
>
> Some hours or sleep and not staring at this meess later and while
> reading the leaves of my morning tea:
>
> guest side:
>
> nmi()/mce() ...
>
> stash_crs();
>
> + stash_and_clear_apf_reason();
>
> ....
>
> + restore_apf_reason();
>
> restore_cr2();
>
> Too obvious, isn't it?

Yes, this works but Andy was not happy about adding more
save-and-restore to NMIs. If you do not want to do that, I'm okay with
disabling async page fault support for now.

Storing the page fault reason in memory was not a good idea. Better
options would be to co-opt the page fault error code (e.g. store the
reason in bits 31:16, mark bits 15:0 with the invalid error code
RSVD=1/P=0), or to use the virtualization exception area.

Paolo