Re: [PATCH v2 2/5] x86/fpu: Fix FNSAVE usage in eagerfpu mode

From: Dave Hansen
Date: Mon Jan 25 2016 - 10:40:51 EST


On 01/24/2016 02:38 PM, Andy Lutomirski wrote:
> if (fpu->fpregs_active) {
> + /*
> + * Ignore return value -- we don't care if reg state
> + * is clobbered.
> + */
> copy_fpregs_to_fpstate(fpu);
> } else {
> this_cpu_write(fpu_fpregs_owner_ctx, NULL);
> @@ -189,8 +193,12 @@ void fpu__save(struct fpu *fpu)
>
> preempt_disable();
> if (fpu->fpregs_active) {
> - if (!copy_fpregs_to_fpstate(fpu))
> - fpregs_deactivate(fpu);
> + if (!copy_fpregs_to_fpstate(fpu)) {
> + if (use_eager_fpu())
> + copy_kernel_to_fpregs(&fpu->state);
> + else
> + fpregs_deactivate(fpu);
> + }
> }
> preempt_enable();

I wonder if we should just make the

> + if (use_eager_fpu())
> + copy_kernel_to_fpregs(&fpu->state);
> + else
> + fpregs_deactivate(fpu);

behavior the default _inside_ copy_fpregs_to_fpstate(fpu). We evidently
got it wrong in 2/3 of the call sites that needed it. It ends up being
an optimization for FNSAVE (because it allows us to avoid an FRSTOR),
but we only take advantage of that in cases of kernel_fpu_begin/end().

FXSAVE has been around since at _least_ 1999, and I'd expect it to get
used in place of FNSAVE everywhere that it is available.

If we don't want to do that, maybe we should add a "clobber" argument to
copy_fpregs_to_fpstate() for when it's allowed to clobber the register
state.

I just hate putting this logic at all the call sites.