Re: [RFC PATCH v7 17/23] kernel/entry: Add support for core-wide protection of kernel-mode

From: Thomas Gleixner
Date: Thu Sep 03 2020 - 10:50:23 EST


On Thu, Sep 03 2020 at 00:34, Joel Fernandes wrote:
> On Wed, Sep 2, 2020 at 12:57 PM Dario Faggioli <dfaggioli@xxxxxxxx> wrote:
>> 2) protection of the kernel from the other thread running in userspace
>> may be achieved in different ways. This is one, sure. ASI will probably
>> be another. Hence if/when we'll have both, this and ASI, it would be
>> cool to be able to configure the system in such a way that there is
>> only one active, to avoid paying the price of both! :-)
>
> Actually, no. Part of ASI will involve exactly what this patch does -
> IPI-pausing siblings but ASI does so when they have no choice but to
> switch away from the "limited kernel" mapping, into the full host
> kernel mapping. I am not sure if they have yet implemented that part
> but they do talk of it in [1] and in their pretty LPC slides. It is
> just that ASI tries to avoid that scenario of kicking all siblings out
> of guest mode. So, maybe this patch can be a stepping stone to ASI.
> At least I got the entry hooks right, and the algorithm is efficient
> IMO (useless IPIs are avoided). ASI can then come in and avoid
> sending IPIs even more by doing their limited-kernel-mapping things if
> needed. So, it does not need to be this vs ASI, both may be needed.

Right. There are different parts which are seperate:

1) Core scheduling as a best effort feature (performance for certain use
cases)

2) Enforced core scheduling (utilizes #1 basics)

3) ASI

4) Kick sibling out of guest/host and wait mechanics

#1, #2, #3 can be used stand alone. #4 is a utility

Then you get combos:

A) #2 + #4:

core wide protection. i.e. what this series tries to achieve. #3
triggers the kick at the low level VMEXIT or entry from user mode
boundary. The wait happens at the same level

B) #3 + #4:

ASI plus kicking the sibling/wait mechanics independent of what's
scheduled. #3 triggers the kick at the ASI switch to full host
mapping boundary and the wait is probably the same as in #A

C) #2 + #3 + #4:

The full concert, but trigger/wait wise the same as #B

So we really want to make at least #4 an independent utility.

Thanks,

tglx