Re: [PATCH 07/17] hpsa: do not give up retry of driver cmds afteronly 3 retries

From: Andi Shyti
Date: Wed May 02 2012 - 16:28:08 EST


On Wed, May 02, 2012 at 11:30:09AM -0500, scameron@xxxxxxxxxxxxxxxxxx wrote:
> On Tue, May 01, 2012 at 11:39:34PM +0200, Andi Shyti wrote:
> > Hi Steve,
> >
> > Thanks for the walk-through, but still some doubts...
> >
> > On Tue, May 01, 2012 at 01:20:11PM -0500, scameron@xxxxxxxxxxxxxxxxxx wrote:
> > > On Tue, May 01, 2012 at 07:26:34PM +0200, Andi Shyti wrote:
> > > > >
> > > > > do {
> > > > > memset(c->err_info, 0, sizeof(*c->err_info));
> > > > > hpsa_scsi_do_simple_cmd_core(h, c);
> > > > > retry_count++;
> > > > > + if (retry_count > 3) {
> > > > > + msleep(backoff_time);
> > > >
> > > > for 10ms isn't it better to avoid using msleep?
> > >
> > > [...]
> > > > > + if (backoff_time < 1000)
> > > > > + backoff_time *= 2;
> > >
> > > Eh, maybe. from Documentation/timers-howto.txt
> > >
> > > msleep(1~20) may not do what the caller intends, and
> > > will often sleep longer (~20 ms actual sleep for any
> > > value given in the 1~20ms range). In many cases this
> > > is not the desired behavior.
> > >
> > > Sleeping longer (~20ms instead of 10ms) in this instance is fine, as I don't
> > > really care too much exactly how long it sleeps, and it backs off to up to
> > > 1280ms eventually anyway. The idea is, "wait a bit, and retry, and then if
> > > that doesn't work, wait twice as long, and retry, etc." *exactly* how long
> > > "a bit" is is not super important. I could change the initial back_off time
> > > to 20 or 30 to satisfy the letter of the advice in Documentation/timers-howto.txt,
> > > if doing so is important.
> >
> > No, you're right, it should not really matter, but here in the
> > worst case you put the driver on sleep for almost 22 seconds,
> > that is a huge difference compared to the original
> > implementation.
> >
> > > This is kind of a corner case of a corner case, I don't expect
> > > things will ordinarily end up waiting that long, because normally
> > > one of the 1st 3 retries will succeed. I just wanted to make it
> > > a little more robust and not just give up immediately if the 3
> > > initial retries don't succeed, the specific number of retries,
> > > wait times, etc, I just made up.
> >
> > Premising that I don't know the device, therefore I could be
> > totally wrong, if you don't expect things to wait so long, why not
> > to decrease the MAX_DRIVER_CMD_RETRIES and sleep increasingly (as
> > you did) but for shorter period?
>
> [...]
>
> So yeah, the "echo 1 > /sys/blah/blah/rescan" process will sleep for up to ~20 seconds
> in the event of some (presumably very rare) 20 second BUSY condition, but it will work in
> cases the old code will not, and so it sleeps 20 secs, I don't think that's really a
> problem, esp. compared to the alternative of just failing.
>

Uhh... that was a good explanation :)
Thanks a lot! You convinced me :)

If you want you can add

Reviewed-by: Andi Shyti <andi.shyti@xxxxxxxxx>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/