Re: [RFC PATCH 01/11] x86: kernel FineIBT

From: Joao Moreira
Date: Mon May 02 2022 - 13:18:08 EST


On 2022-04-28 18:37, Josh Poimboeuf wrote:
On Tue, Apr 19, 2022 at 05:42:31PM -0700, joao@xxxxxxxxxxxxxxxxxx wrote:
+void __noendbr __fineibt_handler(void){
+ unsigned i;
+ unsigned long flags;
+ bool skip;
+ void * ret;
+ void * caller;
+
+ DO_ALL_PUSHS;

So this function isn't C ABI compliant, right? e.g. the compiler just
calls the handler without regard for preserving registers?

If this function is going to be implemented in C, it should probably
have an asm thunk wrapper which can properly save/restore the registers
before calling into the C version.

Even better, if the compiler did an invalid op (UD2?), which I think you
mentioned elsewhere, instead of calling the handler directly, and there
were a way for the trap code to properly detect it as a FineIBT
violation, we could get rid of the pushes/pops, plus the uaccess objtool
warning from patch 7, plus I'm guessing a bunch of noinstr validation
warnings.

Cool, I'll try to come up with something!


+
+ spin_lock_irqsave(&fineibt_lock, flags);
+ skip = false;
+
+ asm("\t movq 0x90(%%rsp),%0" : "=r"(ret));
+ asm("\t movq 0x98(%%rsp),%0" : "=r"(caller));

This is making some questionable assumptions about the stack layout.

I assume this function is still in the prototype stage ;-)

Yeah, this is just a messy instrumentation to get reports about mismatching prototypes (as the ones reported further down the series).

The issue with having the call is that it bloats the binary, so the ud2 is 3-bytes-per-function better. Yet, we may consider a FINEIBT_DEBUG config, which can then enable a handler. This would be useful together with a fuzzer or a stress tool to cover possible control-flow paths within the kernel and map mismatching prototypes more properly I guess.


+ if(!skip) {
+ printk("FineIBT violation: %px:%px:%u\n", ret, caller,
+ vlts_next);
+ }
+ DO_ALL_POPS;
+}

Right now this handler just does a printk if it hasn't already for this
caller/callee combo, and then resumes control. Which is fine for
debugging, but it really needs to behave similarly to an IBT violation,
by panicking unless "ibt=warn" on the cmdline.

Not sure what would happen for "ibt=off"? Maybe apply_ibt_endbr() could
NOP out all the FineIBT stuff.

Either that, or...

I'm thinking about a way to have FineIBT interchangeable with KCFI. Currently KCFI adds a 4 byte hash + 2 byte nops before function entry, to allow for proper prototype checking. After that, there should be an ENDBR of 4 bytes. This gives us 10 bytes in total. Then, my yet to be properly thought idea would be patch these 10 bytes with:

endbr
call fineibt_handler_<$HASH>
nop

and then, on the caller side, patch the "cmp <$HASH>, -0x6(%r11); je; ud2; call" sequence with a "sub 0x6, r11; mov $HASH, %r10; call %r11, add 0x6 %r11". This would then allow the kernel to verify if the CPU is IBT capable on boot time and only then setting the proper scheme.

The downsides of having something like this would be that this sub r11/add r11 sequence is kinda meh. We can avoid that by having two padding nops after the original ENDBR, which will be skipped when the function is reached directly by the linker optimization I'm working on, and that we can convert into a JMP -offset that makes control flow reach the padding area before the prologue and from where we can call the fineibt_handler function. The resulting instrumentation would be something like:

1:
call fineibt_handler_<$HASH>
jmp 2f
<foo>
endbr
jmp 1b
2:

Also, it would prevent a paranoid user to have both schemes simultaneously (there are reasons why people could want that).

Any thoughts?