Re: [PATCHSET] mempool, percpu, blkcg: fix percpu stat allocationand remove stats_lock

From: Vivek Goyal
Date: Wed Feb 29 2012 - 14:04:06 EST


On Mon, Feb 27, 2012 at 01:22:20PM -0500, Vivek Goyal wrote:
> On Thu, Feb 23, 2012 at 03:12:04PM -0800, Tejun Heo wrote:
> > On Thu, Feb 23, 2012 at 03:01:23PM -0800, Tejun Heo wrote:
> > > Hmmm... going through the thread again, ah, okay, I forgot about that
> > > completely. Yeah, that is an actual problem. Both __GFP_WAIT which
> > > isn't GFP_KERNEL and GFP_KERNEL are valid use cases. I guess we'll be
> > > building async percpu pool in blkcg then. Great. :(
> >
> > Vivek, you win. :) Can you please refresh the async alloc patch on top
> > of blkcg-stacking branch? I'll rool that into this series and drop
> > the mempool stuff.
>
> Hi Tejun,
>
> Looks like there is one more issue. After booting kernel, I tried to read
> one of the cgroup files.
>
> cat /sys/fs/cgroup/blkio/blkio.throttle.read_bps_device and it oopses.
>
> Basically it looks like that we are getting a null dev pointer while we
> do following.
>
> dev_name(blkg->q->backing_dev_info.dev);
>
> I put some printk to verify backing_dev_info.dev is null. That means
> somehow we have a request queue and we did not call add_disk() on that?
>
> I checked that queue is not in bypass mode or dead mode. So nobody called
> blk_cleanup_queue() on that.

Ok, more debugging revealed that culprit here is "floppy" driver module.
It instanciates a queue but never registers a disk against it. Better
skip such queues/groups. Here is the patch.


blk-cgroup: Skip a group if there is no disk associated with the queue

blk-cgroup code currently assumes that there is a device/disk associated
with every queue in the system. But modules like floppy, can instanciate
bunch of request queue and not register disk against all the queues. This
can lead to blkcg crashing.

Skip the queue/blkg which don't have any dev/disk associated with them.

Signed-off-by: Vivek Goyal <vgoyal@xxxxxxxxxx>
---
block/blk-cgroup.c | 24 ++++++++++++++++++++++--
1 file changed, 22 insertions(+), 2 deletions(-)

Index: tejun-misc/block/blk-cgroup.c
===================================================================
--- tejun-misc.orig/block/blk-cgroup.c 2012-02-28 00:33:36.000000000 -0500
+++ tejun-misc/block/blk-cgroup.c 2012-03-01 00:51:15.276346953 -0500
@@ -1136,9 +1136,22 @@ static void blkio_print_group_conf(struc
int plid = BLKIOFILE_POLICY(cft->private);
int fileid = BLKIOFILE_ATTR(cft->private);
struct blkg_policy_data *pd = blkg->pd[plid];
- const char *dname = dev_name(blkg->q->backing_dev_info.dev);
+ const char *dname;
+ struct device *dev;
int rw = WRITE;

+ dev = blkg->q->backing_dev_info.dev;
+
+ /*
+ * Some drivers (like floppy), might instanciate a request queue
+ * and not register any disk against that queue. So dev can be
+ * null
+ */
+ if (!dev)
+ return;
+
+ dname = dev_name(dev);
+
switch (plid) {
case BLKIO_POLICY_PROP:
if (pd->conf.weight)
@@ -1230,9 +1243,16 @@ static int blkio_read_blkg_stats(struct
spin_lock_irq(&blkcg->lock);

hlist_for_each_entry(blkg, n, &blkcg->blkg_list, blkcg_node) {
- const char *dname = dev_name(blkg->q->backing_dev_info.dev);
+ struct device *dev;
+ const char *dname;
+
int plid = BLKIOFILE_POLICY(cft->private);

+ dev = blkg->q->backing_dev_info.dev;
+ if (!dev)
+ continue;
+ dname = dev_name(dev);
+
if (pcpu) {
cgroup_total += blkio_get_stat_cpu(blkg, plid,
cb, dname, type);
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/