Re: [topic change] jiffies as a time value
From: George Anzinger
Date: Tue Mar 15 2005 - 18:15:29 EST
john stultz wrote:
On Mon, 2005-03-14 at 15:40 -0800, George Anzinger wrote:
john stultz wrote:
On Sat, 2005-03-12 at 16:49 -0800, Matt Mackall wrote:
+ /* finally, update legacy time values */
+ write_seqlock_irqsave(&xtime_lock, x_flags);
+ xtime = ns2timespec(system_time + wall_time_offset);
+ wall_to_monotonic = ns2timespec(wall_time_offset);
+ wall_to_monotonic.tv_sec = -wall_to_monotonic.tv_sec;
+ wall_to_monotonic.tv_nsec = -wall_to_monotonic.tv_nsec;
+ /* XXX - should jiffies be updated here? */
Excellent question.
Indeed. Currently jiffies is used as both a interrupt counter and a
time unit, and I'm trying make it just the former. If I emulate it then
it stops functioning as a interrupt counter, and if I don't then I'll
probably break assumptions about jiffies being a time unit. So I'm not
sure which is the easiest path to go until all the users of jiffies are
audited for intent.
Really? Who counts interrupts??? The timer code treats jiffies as a unit of
time. You will need to rewrite that to make it otherwise.
Ug. I'm thin on time this week, so I was hoping to save this discussion
for later, but I guess we can get into it now.
Well, assuming timer interrupts actually occur HZ times a second, yes
one could (and current practice, one does) implicitly interpret jiffies
as being a valid notion of time. However with SMIs, bad drivers that
disable interrupts for too long, and virtualization the reality is that
that assumption doesn't hold.
We do have the lost-ticks compensation code that tries to help this, but
that conflicts with some virtualization implementations. Suspend/resume
tries to compensate jiffies for ticks missed over time suspended, but
I'm not sure how accurate it really is (additionally, looking at it now,
it assumes jiffies is only 32bits).
Adding to that, the whole jiffies doesn't really increment at HZ, but
ACTHZ confusion, or bad drivers that assume HZ=100, we get a fair amount
of trouble stemming from folks using jiffies as a time value. Because
in reality, it is just a interrupt counter.
Well, currently, in x86 systems it causes wall clock to advance a very well
defined amount. That it is not exactly 1/HZ is something we need to live with...
So now, if new timeofday code emulates jiffies, we have to decide if it
emulates jiffies at HZ or ACTHZ? Also there could be issues with jiffies
possibly jittering from it being incremented every tick and then set to
the proper time when the timekeeping code runs.
I think your overlooking timers. We have a given resolution for timers and some
code, at least, expects timers to run with that resolution. This REQUIRES
interrupts at resolution frequency. We can argue about what that interrupt
event is called (currently a jiffies interrupt) and disparage the fact that
hardware can not give us "nice" numbers for the resolution, but we do need the
interrupts. That there are bad places in the code where interrupts are delayed
is not really important in this discussion. For what it worth, the RT patch
Ingo is working on is getting latencies down in the 10s of microseconds region.
We also need, IMNSHO to recognize that, at lest with some hardware, that
interrupt IS in fact the clock and is the only reasonable way we have of reading
it. This is true, for example, on the x86. The TSC we use as a fill in for
between interrupts is not stable in the long term and should only be used to
interpolate over 1 to 10 ticks or so.
I'm not sure which is the best way to go, but it sounds that emulating
it is probably the easiest. I just deferred the question with a comment
until now because its not completely obvious. Any suggestions on the
above questions (I'm guessing the answers are: use ACTHZ, and the jitter
won't hurt that bad).
But then you have
another problem. To correctly function, times need to expire on time (hay how
bout that) not some time later. To do this we need an interrupt source. To
this point in time, the jiffies interrupt has been the indication that one or
more timer may have expired. While we don't need to "count" the interrupts, we
DO need them to expire the timers AND they need to be on time.
Well, something Nish Aravamudan has been working on is converting the
common users of jiffies (drivers) to start using human time units. These
very well understood units (which avoid HZ/ACTHZ/HZ=100 assumptions) can
then be accurately changed to jiffies (or possibly some other time unit)
internally. It would even be possible for soft-timers to expire based
upon the actual high-res time value, rather then the low-res tick-
counter(which is something else Nish has been playing with). When that
occurs we can easily start doing other interesting things that I believe
you've already been working on in your HRT code, such as changing the
timer interrupt frequency dynamically, or working with multiple timer
interrupt sources.
This is also what is done in things like posix timers. The fact remains that,
at least in the posix timers case, the resolution is exported to the user and
implies certain things. I am not sure we are explicitly exporting the
resolution in the kernel, but, down under the code, there is a resolution AND it
impacts what one should expect of timers.
So basically, lots of interesting questions and possibilities and I very
much look forward to your input and suggestions.
It may help to understand that MOST internal timers (i.e. timers the kernel
sets) never expire. They are canceled by the caller because they were really
"dead man" timers, i.e. "it better happen be this time or we are hurting".
Users, on the other hand, for the most part set up timers to allow bits of code
to run either periodically or at some specified time. These are the folks who
care about latency and being on time.
User also, of course, also use "dead man" timers such as, for example, the time
out on select. By their nature the "dead man" timer, usually, does not have
strict on time requirements.
--
George Anzinger george@xxxxxxxxxx
High-res-timers: http://sourceforge.net/projects/high-res-timers/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/