Re: [patch 1/2] infrastructure to debug (dynamic) objects

From: Thomas Gleixner
Date: Sat Mar 01 2008 - 06:46:01 EST


On Sat, 1 Mar 2008, Andrew Morton wrote:
> On Sat, 01 Mar 2008 10:24:56 -0000 Thomas Gleixner <tglx@xxxxxxxxxxxxx> wrote:
> > The debug code can be compiled in without being active. The runtime
> > overhead is minimal and could be optimized by asm alternatives. A
> > kernel command line option enables the debugging code.
> >
>
> hm.
>
> The enabled-via-command-line thing is a bit sad. I guess runtime
> switchability would be hard.

Yes, keeping track of such stuff from some random state is not really
a good idea.

> The proof of this pudding is "how many subsystems use it". If things like
> the fault-injection framework are a guide, the answer is "zero unless
> Thomas does it". Call me an experienced cynic.

Hey, I'm as much a cynic as you. It just bothered me to hack this
solely into the timer code with the knowledge that there are other
potential users lurking.

And I want to have it in the timer code for a simple reason: Everytime
when the timer list code explodes due to some stupid timer_list user,
I'm the moron who has to apply hackery to help the users to identify
the root cause. This happened more than 5 times in the last 3 month
and I'm fed up with keeping my ugly debug hackery up to date and
instruct bug reporters how to use it. The delta to get down to the
cause of the problem in the example case is:

one mail versus five.
zero patch vs. finding and updating the old version

I have better ways to waste my sparse leisure time :)

You can be sure, that I will add the specific debug code to other
parts of the kernel whenever I trap over a problem which smells like
the kfree'd or reinitialized timer wreckage.

> > +enum debug_obj_op {
> > + ODEBUG_INIT,
> > + ODEBUG_ADD,
> > + ODEBUG_DEL,
> > + ODEBUG_FREE,
> > + ODEBUG_TEST,
> > +};
>
> Interface nit: five different API calls would be preferable to one call
> with a mode argument.

Will do.

> > +enum {
> > + ODEBUG_TYPE_UNKNOWN,
> > + ODEBUG_MAX_TYPES
> > +};
>
> The need for each client to patch this table is regrettable.

I did not bother with a dynamic register interface before getting some
feedback on the idea in general. Will do when there is some agreement
that this is a useful infrastructure in general.

> > --- linux-2.6.orig/include/linux/mm.h
> > +++ linux-2.6/include/linux/mm.h
> > @@ -11,6 +11,7 @@
> > #include <linux/rbtree.h>
> > #include <linux/prio_tree.h>
> > #include <linux/debug_locks.h>
> > +#include <linux/debugobjects.h>
> > #include <linux/mm_types.h>
> >
> > struct mempolicy;
>
> I suspect this wasn't supposed to be here.

Well, I'm lazy as hell and I found the debug_locks.h include already,
which was added for the same reason: not adding the include to all the
affected files in mm/

Btw. DEBUG_LOCK_ALLOC is a perfect candidate to move over to this as
the current code only detects locks held by the calling thread, while
debugobjects detects also a lock held by any owner.

> > +
> > +#define ODEBUG_HASH_SIZE 4096
>
> power-of-2 is said to be a very poor size for a hash table.

The hash is not a randomized hash as one would expect. It's purely
generated from the object address to simplify the lookup during the
free check. So the power of 2 size is a good thing :) /me adds
comment.

> > +#define ODEBUG_HASH_MASK (ODEBUG_HASH_SIZE - 1)
> > +
> > +struct odebug {
> > + struct list_head list;
>
> hlist?

Good point.

> > + spinlock_t lock;
> > +};
> > +
> > +static struct odebug obj_hash[ODEBUG_HASH_SIZE];
> > +
> > +static int debug_objects_selftest __read_mostly;
> > +static int debug_objects_maxchain __read_mostly;
> > +static int debug_objects_fixups __read_mostly;
> > +static int debug_objects_enabled __read_mostly;
> > +
> > +static int __init enable_object_debug(char *str)
> > +{
> > + debug_objects_enabled = 1;
> > + return 0;
> > +}
> > +
> > +early_param("debug_objects", enable_object_debug);
>
> I can never remember whether these things need to return 0 or 1 on success.

Same here. As usual I copied it from some other place assuming that it
was correct there. Will check.

> <looks at debug_kernel(), quiet_kernel() and loglevel(). Shudders.
> Moves on.>

Hehehe.

Thanks for the review,

tglx
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/