Re: [PATCH v2] NOHZ: fix nohz on cpu unplug

From: Mark Lord
Date: Fri Feb 06 2009 - 09:52:21 EST


Suresh Siddha wrote:
> On Thu, 2009-02-05 at 11:54 -0800, Mark Lord wrote:
>> How far back in (kernel release) time does this problem exist?
>> Candidate for -stable ?
>
> Problem is present for a while now. But I don't think this is a common
> case scenario (as the issue happens only for the duration when we leave
> a cpu offline, and it should get fixed the moment that logical cpu is
> back online).
..

There is an existing bug (for some time now) in kernel shutdown
on multi-CPU machines. Once in a while, perhaps once every 20-30 halts,
the kernel fails to power off the machine.

I've seen this problem here since 2.6.18 or so, on multiple different
machines with Core 2 Duo and Core 2 Quad processors.

It comes and goes, depending upon the kernel version and the exact .config
that is used. Any attempt to instrument it generally changes the timing
enough that the race stops happening.

I'm just wondering if this bug might explain some of those failures.

Cheers