RE: [PATCH net] ice: check whether AUX devices/drivers are supported in ice_rebuild

From: Liu, Yongxin
Date: Wed Sep 08 2021 - 15:15:05 EST


> -----Original Message-----
> From: Ertman, David M <david.m.ertman@xxxxxxxxx>
> Sent: Thursday, September 9, 2021 1:31 AM
> To: Leon Romanovsky <leon@xxxxxxxxxx>; Liu, Yongxin
> <Yongxin.Liu@xxxxxxxxxxxxx>
> Cc: Saleem, Shiraz <shiraz.saleem@xxxxxxxxx>; Nguyen, Anthony L
> <anthony.l.nguyen@xxxxxxxxx>; netdev@xxxxxxxxxxxxxxx; linux-
> kernel@xxxxxxxxxxxxxxx; davem@xxxxxxxxxxxxx; Brandeburg, Jesse
> <jesse.brandeburg@xxxxxxxxx>; intel-wired-lan@xxxxxxxxxxxxxxxx;
> kuba@xxxxxxxxxx
> Subject: RE: [PATCH net] ice: check whether AUX devices/drivers are supported
> in ice_rebuild
>
> > -----Original Message-----
> > From: Leon Romanovsky <leon@xxxxxxxxxx>
> > Sent: Sunday, September 5, 2021 12:24 AM
> > To: Yongxin Liu <yongxin.liu@xxxxxxxxxxxxx>
> > Cc: Ertman, David M <david.m.ertman@xxxxxxxxx>; Saleem, Shiraz
> > <shiraz.saleem@xxxxxxxxx>; Nguyen, Anthony L
> > <anthony.l.nguyen@xxxxxxxxx>; netdev@xxxxxxxxxxxxxxx; linux-
> > kernel@xxxxxxxxxxxxxxx; davem@xxxxxxxxxxxxx; Brandeburg, Jesse
> > <jesse.brandeburg@xxxxxxxxx>; intel-wired-lan@xxxxxxxxxxxxxxxx;
> > kuba@xxxxxxxxxx
> > Subject: Re: [PATCH net] ice: check whether AUX devices/drivers are
> > supported in ice_rebuild
> >
> > On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:
> > > In ice_rebuild(), check whether AUX devices/drivers are supported or
> > > not before calling ice_plug_aux_dev().
> > >
> > > Fix the following call trace, if RDMA functionality is not available.
> > >
> > > auxiliary ice.roce.0: adding auxiliary device failed!: -17
> > > sysfs: cannot create duplicate filename
> '/bus/auxiliary/devices/ice.roce.0'
> > > Workqueue: ice ice_service_task [ice]
> > > Call Trace:
> > > dump_stack_lvl+0x38/0x49
> > > dump_stack+0x10/0x12
> > > sysfs_warn_dup+0x5b/0x70
> > > sysfs_do_create_link_sd.isra.2+0xc8/0xd0
> > > sysfs_create_link+0x25/0x40
> > > bus_add_device+0x6d/0x110
> > > device_add+0x49d/0x940
> > > ? _printk+0x52/0x6e
> > > ? _printk+0x52/0x6e
> > > __auxiliary_device_add+0x60/0xc0
> > > ice_plug_aux_dev+0xd3/0xf0 [ice]
> > > ice_rebuild+0x27d/0x510 [ice]
> > > ice_do_reset+0x51/0xe0 [ice]
> > > ice_service_task+0x108/0xe70 [ice]
> > > ? __switch_to+0x13b/0x510
> > > process_one_work+0x1de/0x420
> > > ? apply_wqattrs_cleanup+0xc0/0xc0
> > > worker_thread+0x34/0x400
> > > ? apply_wqattrs_cleanup+0xc0/0xc0
> > > kthread+0x14d/0x180
> > > ? set_kthread_struct+0x40/0x40
> > > ret_from_fork+0x1f/0x30
> > >
> > > Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide
> > > RDMA")
> > > Signed-off-by: Yongxin Liu <yongxin.liu@xxxxxxxxxxxxx>
> > > ---
> > > drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
> > > 1 file changed, 3 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c
> > b/drivers/net/ethernet/intel/ice/ice_main.c
> > > index 0d6c143f6653..98cc708e9517 100644
> > > --- a/drivers/net/ethernet/intel/ice/ice_main.c
> > > +++ b/drivers/net/ethernet/intel/ice/ice_main.c
> > > @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf,
> > > enum
> > ice_reset_req reset_type)
> > > /* if we get here, reset flow is successful */
> > > clear_bit(ICE_RESET_FAILED, pf->state);
> > >
> > > - ice_plug_aux_dev(pf);
> > > + if (ice_is_aux_ena(pf))
> > > + ice_plug_aux_dev(pf);
> > > +
> >
> > The change is ok, but it hints that auxiliary bus is used horribly
> > wrong in this driver. In proper implementation, which should rely on
> > driver/core, every subdriver like ice.eth, ice.roce e.t.c is supposed
> > to be retriggered by the code and shouldn't ave "if (ice_is_aux_ena(pf))"
> checks.
> >
> > Thanks
>
> Hi Leon and Liu -
>
> First of all, thanks Liu for tracking this down - it is an issue that needs
> to be fixed. The ice_is_aux_ena() functions only purpose is to determine if
> this PF supports RDMA functionality. In probe(), the aux devices are not
> even initialized if this test returns false. If this check fixed the issue
> for you, the PF you are on does not currently support RDMA.
> The bit this test is based on is only set one place in the driver currently -
> at probe time when we are checking the capabilities (common_caps) of the
> device.
>
> That being said, the call to check this should be in ice_plug_aux_dev
> function itself. That way it is taken into account for all attempts to
> create the auxiliary device. There is another consideration about disabling
> RDMA that needs to also needs to be taken into account to avoid a similar
> situation.
>
> If it is acceptable, I will create a new patch today and get it out either
> this afternoon or tomorrow.

Thanks Leon and Dave E for your quick response and input.
Please go ahead and I am expecting this issue to be fixed in a more reasonable way.

Thanks,
Yongxin

>
> Thanks,
> Dave E
>
> >
> > > return;
> > >
> > > err_vsi_rebuild:
> > > --
> > > 2.14.5
> > >