Re: [RFC][PATCH] Dynamic Tick: Allow 32-bit machines to sleep formorethan2.15 seconds

From: Jon Hunter
Date: Wed May 13 2009 - 13:54:50 EST



John Stultz wrote:
Alternatively instead of NSEC_PER_SEC/HZ, we could always drop the
larger of NSEC_PER_SEC/HZ or max_deferment/10? That way we should scale
up without a problem.
Yes, may be this would be a safer option. Thinking about this I was wondering if we should always use max_deferement/10, because I did not think that there would ever be a case where NSEC_PER_SEC/HZ would be greater. If NSEC_PER_SEC/HZ was greater than max_deferement/10 this would imply that the clocksource would wrap after only 10 jiffies, if I have the math right...

Right, but even with such limitiations, if an arch can skip every 5
ticks, they probably will try, right? :)

Sure, but I guess I was wondering if there would ever be a clocksource that would overflow in 10-20 ticks? If not then it would be safe to always use -10% or -5% margin and we can forget about NSEC_PER_SEC/HZ.

Unless I am understanding this wrong, but I thought we are just trying to make sure we never sleep for a time longer than the total time a clocksource can count.

That sounds reasonable to me.

Great.

One final question, I noticed in clocksource.h that the definition of function cyc2ns returns a type of s64, however, in the function itself a variable of type u64 is used and returned. Should this function be modified as follows?

static inline s64 cyc2ns(struct clocksource *cs, cycle_t cycles)
{
- u64 ret = (u64)cycles;
+ s64 ret = (s64)cycles;
ret = (ret * cs->mult) >> cs->shift;
return ret;
}

Damn. So this brings up an issue I had missed prior.

Any comments on whether this should be u64 versus s64?

I'll have to think about how that would change
timekeeping_max_deferment() and how we'd have to calculate a reasonable
max efficiently.

Other then this issue (which is my fault for not noticing it earlier),
you're patch looks great. I just feel badly for making you rev this
thing over and over.

No problem, its fine. Its more important for us to get this right so I am happy to help where I can.

One option if you're itching to push it in and be done with it: Make
timekeeping_max_deferment() return just 1 second for now. Your patch
provides the right infrastructure for the timekeeping code to provide
its limits to the clockevents code. So you can use a safe short constant
value for now, and we can extend that out correctly in a future patch.

How about going back to your original thought and making it 50% margin for now? In other words, use max_deferment/2? Therefore, for clocksource that can count for 10s of years before overflowing it will not be as severe.

Sorry again for not catching this until now. :(

No problem at all. Thanks for all the inputs.

Cheers
Jon


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/