Re: [PATCH] therm_throt.c: Fix error handling inthermal_throttle_add_dev

From: Fenghua Yu
Date: Fri Sep 03 2010 - 17:45:13 EST


On Fri, Sep 03, 2010 at 02:35:53AM -0700, Ingo Molnar wrote:
>
> * Fenghua Yu <fenghua.yu@xxxxxxxxx> wrote:
>
> > From: Fenghua Yu <fenghua.yu@xxxxxxxxx>
> >
> > Warn when sysfs_add_file_to_group fails.
> >
> > Also add missing curly braces.
> >
> > Signed-off-by: Fenghua Yu <fenghua.yu@xxxxxxxxx>
> > ---
> > arch/x86/kernel/cpu/mcheck/therm_throt.c | 14 ++++++++++----
> > 1 files changed, 10 insertions(+), 4 deletions(-)
>
> > + WARN_ON(err);
>
> Hm, we tend to use WARN_ON_ONCE(), to avoid repeat spamming of the
> syslog. Also, and perhaps more importantly, WARN_ON() is not a
> particularly smart way to handle errors. How do other drivers handle
> sysfs registration failures?

drivers/pci/pcie/aspm.c doesn't collect error from sysfs_add_file_to_group. This
driver is similar to our case because neither of sysfs_add_file_to_group error
is fatal and we can ignore the error and continue to add other files.

Other drivers exit function when sysfs_add_file_to_group fails. This assumes
there is dependency among each devices and an earlier failure prevents from
later sysfs_add_file_to_group. This is different from our case.

>
> Also, that's not the only thing the patch does:
>
> > @@ -211,20 +211,26 @@ static __cpuinit int thermal_throttle_add_dev(struct sys_device *sys_dev)
>
> > - return err;
> > + return 0;
>
> How is the ignoring of an error and turning it into a WARN_ON() a fix?
> Either it makes no sense to return errors - in which case the whole
> add_dev method needs to be fixed in all drivers - or it makes sense, in
> which case the behavior here is inconsistent.
>
> At minimum more explanation is needed in the changelog.

"return err" always returns the err value from the last sysfs_add_file_to_group
I.e. one earlier err value is not reflected by thermal_throttle_add_dev(). If
you think the original code is ok, I can keep the original error handling code
and just have the missing curly braces fix.

To log an error, the patch just warns on each error but returns 0 because
neither of the error is fatal. If you think this option is ok, I can change
WAR_ON to WARN_ON_ONCE.

Or other options could be:

1. Just calling sysfs_add_file_to_group() without collecting returned error and
return 0 at the end (driver/pci/pcie/aspm.c does like this). The drawback is
there is no error logged if an unlikely errorr occurs. But user can see some
files are missing in sysfs.
2. Or collect errors in err1, err2, etc for each sysfs_add_file_to_group. At
the end, return -ENODEV(??) if any err1, err2, etc is not 0. This option makes
code unreasonable complex to handle unlikely errors.

What do you think?

Thanks.

-Fenghua
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/