Re: [patch 05/24] perfmon: X86 generic code (x86)

From: stephane eranian
Date: Thu Nov 27 2008 - 06:36:14 EST


On Thu, Nov 27, 2008 at 12:31 PM, Andi Kleen <andi@xxxxxxxxxxxxxx> wrote:
>> The only reason why I have to deal with NMI is not so much to allow
>> for profiling irq-off regions but because I have to share the PMU with
>> the NMI watchdog. Otherwise I'd have to fail or disable the NMI watchdog
>> on the fly.
>
> The NMI watchdog is now off by default so failing with it enabled
> is fine.

Yes, but most likely it is on in distro kernels.

>
> Longer term having NMI profiling is still a useful feature I think,
> but of course it needs to be implemented cleanly.
>

The difficulty with NMI is locking. Unlike Oprofile, perfmon code needs
locking. If you recall our discussion yesterday about passing the file
descriptor
around or even using it with multiple threads inside the same process.
You have to handle the case where the NMI fires while you are holding
a perfmon lock. What you have in the patch (and the the fully-featured version)
is that we get the NMI and we stop the PMU WITHOUT grabbing any perfmon
lock, and the we repost the interrupt with the regular vector. We also make sure
we grab the RIP at NMI. That is the one we want to see reported in the sampling
buffer.

I am still wondering how Oprofile handles the case where multiple processes or
threads access the same file descriptor.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/