Re: [PATCH v2] vfio: fix potential deadlock on vfio group lock

From: Alex Williamson
Date: Thu Jan 12 2023 - 19:57:48 EST


On Thu, 12 Jan 2023 23:29:53 +0000
Sean Christopherson <seanjc@xxxxxxxxxx> wrote:

> On Thu, Jan 12, 2023, Matthew Rosato wrote:
> > On 1/12/23 4:05 PM, Alex Williamson wrote:
> > > On Thu, 12 Jan 2023 15:38:44 -0500
> > > Matthew Rosato <mjrosato@xxxxxxxxxxxxx> wrote:
> > >> @@ -344,6 +345,35 @@ static bool vfio_assert_device_open(struct vfio_device *device)
> > >> return !WARN_ON_ONCE(!READ_ONCE(device->open_count));
> > >> }
> > >>
> > >> +static bool vfio_kvm_get_kvm_safe(struct kvm *kvm)
> > >> +{
> > >> + bool (*fn)(struct kvm *kvm);
> > >> + bool ret;
> > >> +
> > >> + fn = symbol_get(kvm_get_kvm_safe);
> > >> + if (WARN_ON(!fn))
>
> In a related vein to Alex's comments about error handling, this should not WARN.
> WARNing during vfio_kvm_put_kvm() makes sense, but the "get" is somewhat blind.

It's not exactly blind though, we wouldn't have a kvm pointer if the
kvm-vfio device hadn't stuffed one into the group. We only call into
here if we have a non-NULL pointer, so it wouldn't simply be that the
kvm module isn't available for this to fire, but more that we have an
API change to make the symbol no longer exist. A WARN for that doesn't
seem unreasonable. Thanks,

Alex

> > >> + return false;
> > >> +
> > >> + ret = fn(kvm);
> > >> +
> > >> + symbol_put(kvm_get_kvm_safe);
> > >> +
> > >> + return ret;
> > >> +}
> > >> +
> > >> +static void vfio_kvm_put_kvm(struct kvm *kvm)
> > >> +{
> > >> + void (*fn)(struct kvm *kvm);
> > >> +
> > >> + fn = symbol_get(kvm_put_kvm);
> > >> + if (WARN_ON(!fn))
> > >> + return;
> > >> +
> > >> + fn(kvm);
> > >> +
> > >> + symbol_put(kvm_put_kvm);
> > >> +}
>