Re: Scaling noise

From: Daniel Phillips
Date: Wed Sep 03 2003 - 20:48:21 EST


On Wednesday 03 September 2003 17:31, Steven Cole wrote:
> On Wed, 2003-09-03 at 06:47, Antonio Vargas wrote:
> > As you may probably know, CC-clusters were heavily advocated by the
> > same Larry McVoy who has started this thread.
>
> Yes, thanks. I'm well aware of that. I would like to get a discussion
> going again on CC-clusters, since that seems to be a way out of the
> scaling spiral. Here is an interesting link:
> http://www.opersys.com/adeos/practical-smp-clusters/

As you know, the argument is that locking overhead grows by some factor worse
than linear as the size of an SMP cluster increases, so that the locking
overhead explodes at some point, and thus it would be more efficient to
eliminate the SMP overhead entirely and run a cluster of UP kernels,
communicating through the high bandwidth channel provided by shared memory.

There are other arguments, such as how complex locking is, and how it will
never work correctly, but those are noise: it's pretty much done now, the
complexity is still manageable, and Linux has never been more stable.

There was a time when SMP locking overhead actually cost something in the high
single digits on Linux, on certain loads. Today, you'd have to work at it to
find a real load where the 2.5/6 kernel spends more than 1% of its time in
locking overhead, even on a large SMP machine (sample size of one: I asked
Bill Irwin how his 32 node Numa cluster is running these days). This blows
the ccCluster idea out of the water, sorry. The only way ccCluster gets to
live is if SMP locking is pathetic and it's not.

As for Karim's work, it's a quintessentially flashy trick to make two UP
kernels run on a dual processor. It's worth doing, but not because it blazes
the way forward for ccClusters. It can be the basis for hot kernel swap:
migrate all the processes to one of the two CPUs, load and start a new kernel
on the other one, migrate all processes to it, and let the new kernel restart
the first processor, which is now idle.

Regards,

Daniel

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/