Re: [PATCH 2/6] RFC perf_counter: singleshot support

From: Ingo Molnar
Date: Thu Apr 02 2009 - 06:52:24 EST



* Peter Zijlstra <a.p.zijlstra@xxxxxxxxx> wrote:

> By request, provide a way for counters to disable themselves and
> signal at the first counter overflow.
>
> This isn't complete, we really want pending work to be done ASAP
> after queueing it. My preferred method would be a self-IPI, that
> would ensure we run the code in a usable context right after the
> current (IRQ-off, NMI) context is done.

Hm. I do think self-IPIs can be fragile but the more work we do in
NMI context the more compelling of a case can be made for a
self-IPI. So no big arguments against that.

Regarding single-shot - i think the code you posted fails to do the
single-shot aspect: the counter does not stop, to wait for the
signal handler to get its act together.

So i'd suggest to separate the two concepts: add signal support and
single-shot via two attributes. User-space might decide to use
single-shot if it does not want to deal with queued signals or lost
events due to not processing the previous signal fast enough.

Also, user-space might want to use single-shot _without_ signals
perhaps.

And that brings up that user-space might want to generate N events
and stop then, reliably. I.e. single-shot is a specific form of a
"trigger limit".

Plus the question comes up: dont we need an ioctl to let user-space
refill/re-enable the trigger limit?

That is an issue for the signal case as well: we stop the counter
... what restarts it? Does the signal handler have to close the
counter and re-create it just to get the next single-shot event?

So i think we need 3 separate things:

- the ability to set a signal attribute of the counter (during
creation) via a (signo,tid) pair.

Semantics:

- it can be a regular signal (signo < 32),
or an RT/queued signal (signo >= 32).

- It may be sent to the task that generated the event (tid == 0),
or it may be sent to a specific task (tid > 0),
or it may be sent to a task group (tid < 0).

- 'event limit' attribute: the ability to pause new events after N
events. This limit auto-decrements on each event.
limit==1 is the special case for single-shot.

- new ioctl method to refill the limit, when user-space is ready to
receive new events. A special-case of this is when a signal
handler calls ioctl(refill_limit, 1) in the single-shot case -
this re-enables events after the signal has been handled.

Another observation: i think perf_counter_output() needs to depend
on whether the counter is signalling, not on the single-shot-ness of
the counter.

A completely valid use of this would be for user-space to create an
mmap() buffer of 1024 events, then set the limit to 1024, and wait
for the 1024 events to happen - process them and close the counter.
Without any signalling.

Basically the 'limit' allows counter events to be used as a tracer
in essence. So i think it's a nice touch of the whole scheme, and it
should be decoupled from the signal attribute.

Hm?

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/