Re: [PATCH v2] vfio: fix potential deadlock on vfio group lock

From: Sean Christopherson
Date: Thu Jan 12 2023 - 21:05:28 EST


On Thu, Jan 12, 2023, Alex Williamson wrote:
> On Thu, 12 Jan 2023 23:29:53 +0000
> Sean Christopherson <seanjc@xxxxxxxxxx> wrote:
>
> > On Thu, Jan 12, 2023, Matthew Rosato wrote:
> > > On 1/12/23 4:05 PM, Alex Williamson wrote:
> > > > On Thu, 12 Jan 2023 15:38:44 -0500
> > > > Matthew Rosato <mjrosato@xxxxxxxxxxxxx> wrote:
> > > >> @@ -344,6 +345,35 @@ static bool vfio_assert_device_open(struct vfio_device *device)
> > > >> return !WARN_ON_ONCE(!READ_ONCE(device->open_count));
> > > >> }
> > > >>
> > > >> +static bool vfio_kvm_get_kvm_safe(struct kvm *kvm)
> > > >> +{
> > > >> + bool (*fn)(struct kvm *kvm);
> > > >> + bool ret;
> > > >> +
> > > >> + fn = symbol_get(kvm_get_kvm_safe);
> > > >> + if (WARN_ON(!fn))
> >
> > In a related vein to Alex's comments about error handling, this should not WARN.
> > WARNing during vfio_kvm_put_kvm() makes sense, but the "get" is somewhat blind.
>
> It's not exactly blind though, we wouldn't have a kvm pointer if the
> kvm-vfio device hadn't stuffed one into the group. We only call into
> here if we have a non-NULL pointer, so it wouldn't simply be that the
> kvm module isn't available for this to fire, but more that we have an
> API change to make the symbol no longer exist. A WARN for that doesn't
> seem unreasonable. Thanks,

Hmm, I was thinking that it might be possible for kvm.ko to be on its way out,
but barring use of force module unload, which breaks things left and right, kvm.ko
can only be going if all VMs have been destroyed.

Agreed, WARN is fine.