Re: [patch][rfc] x86, mutex: non-atomic unlock (and a rant)

From: Linus Torvalds
Date: Mon Nov 02 2009 - 10:20:29 EST

Next message: Ben Dooks: "Re: Using statically allocated memory for platform_data."
Previous message: Avi Kivity: "Re: [PATCH 08/11] Add "wait for page" hypercall."
In reply to: Nick Piggin: "[patch][rfc] x86, mutex: non-atomic unlock (and a rant)"
Next in thread: Nick Piggin: "Re: [patch][rfc] x86, mutex: non-atomic unlock (and a rant)"

On Mon, 2 Nov 2009, Nick Piggin wrote:
>
> Non-atomic unlock for mutexes maybe? I do this by relying on cache
> coherence on a cacheline basis for ordering rather than the memory
> consistency of the x86. Linus, I know you've told me this is an incorrect
> assumption in the past, but I'm not so sure.

I'm sure.

This is simply buggy:

> + atomic_set(&lock->count, 1);
> + barrier();
> + if (unlikely(lock->waiters))
> + fail_fn(lock);

because it doesn't matter one whit whether 'lock->count' and
'lock->waiters' are in the same cacheline or not.

The cache coherency deals in cachelines, but the instruction re-ordering
logic does not. It's entirely possible that the CPU will turn this into

	tmp = lock->waiters;
	...
	atomic_set(&lock->count, 1);
	if (tmp)
		fail_fn(lock);

and your "barrier()" did absolutely nothing.
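
For illustration only, here is a minimal userspace sketch in C11 of what an
unlock with a real store->load barrier could look like. It is not the kernel
code: "toy_mutex", "toy_slowpath" and "toy_unlock" are hypothetical names, and
the fence stands in for whatever full barrier the kernel would use. The
waiter side would need a matching full fence between setting "waiters" and
re-checking "count" for the ordering argument to hold.

```c
#include <stdatomic.h>

/* Hypothetical userspace stand-in for the mutex fields in the patch. */
struct toy_mutex {
	atomic_int count;    /* 1 = unlocked, 0 = locked */
	atomic_int waiters;  /* nonzero if somebody may be sleeping */
	int wakeups;         /* counts slow-path calls, illustration only */
};

/* Stand-in for fail_fn(): a real implementation would wake a waiter. */
static void toy_slowpath(struct toy_mutex *m)
{
	m->wakeups++;
}

static void toy_unlock(struct toy_mutex *m)
{
	atomic_store_explicit(&m->count, 1, memory_order_relaxed);

	/*
	 * Full fence: keeps the load below from being satisfied before
	 * the store above is globally visible. A compiler-only barrier
	 * (atomic_signal_fence(), or the kernel's barrier()) constrains
	 * the compiler but not the CPU.
	 */
	atomic_thread_fence(memory_order_seq_cst);

	if (atomic_load_explicit(&m->waiters, memory_order_relaxed))
		toy_slowpath(m);
}
```

The fence is exactly the cost the patch was trying to avoid, which is why the
cheaper version is tempting and why it is not safe.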

The fact that it may _work_ in almost all circumstances (and perhaps even
"always" on some microarchitectures) is irrelevant. It's simply not
guaranteed to work. Yes, you need just the right timings, and yes, it's
probably hard to hit. And yes, I can well imagine that some micro-
architecture will even guarantee the write->read ordering, and that it
would _always_ work on that micro-architecture.

But I can see your thing failing even on an in-order CPU. It literally
doesn't even need OoO to fail, all it needs is a sufficiently deep write
buffer on an in-order core. And to fail in practice, maybe there needs to
be lots of writes in that buffer, and some bad luck, but the thing is,
write buffers are not coherent between cores - so the write may have
happened as far as the core that does it is concerned, but other cores
(or even HT) may not see the new value until after the read has taken
effect.
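
The write-buffer scenario can be sketched as a deterministic toy model in C.
This is an assumption-laden illustration, not a description of any real
microarchitecture: each "core" has a one-entry private write buffer, loads
only see the core's own buffered store or shared memory, and the waiter
protocol (set "waiters", then re-check "count") is assumed for the example.
All names are hypothetical.

```c
#include <stdbool.h>

/* Toy model: shared memory plus one pending store per core. */
enum { COUNT, WAITERS, NLOC };

struct toy_core {
	bool pending;   /* is a store sitting in the write buffer? */
	int  loc, val;  /* which location it targets, and the value */
};

static int toy_mem[NLOC];

/* A store goes into the core's private write buffer first. */
static void toy_store(struct toy_core *c, int loc, int val)
{
	c->pending = true;
	c->loc = loc;
	c->val = val;
}

/* A load sees the core's own buffered store, otherwise shared memory. */
static int toy_load(const struct toy_core *c, int loc)
{
	if (c->pending && c->loc == loc)
		return c->val;
	return toy_mem[loc];
}

/* Draining the buffer is what makes the store visible to other cores. */
static void toy_drain(struct toy_core *c)
{
	if (c->pending) {
		toy_mem[c->loc] = c->val;
		c->pending = false;
	}
}

/*
 * Returns true if the wakeup is lost: the unlocker saw no waiter and
 * the waiter saw the lock still held. With 'fence' set, each core
 * drains its buffer right after its store, modelling a full barrier.
 */
static bool toy_lost_wakeup(bool fence)
{
	struct toy_core unlocker = { false, 0, 0 };
	struct toy_core waiter = { false, 0, 0 };
	int saw_waiters, saw_count;

	toy_mem[COUNT] = 0;     /* lock is held */
	toy_mem[WAITERS] = 0;

	toy_store(&unlocker, COUNT, 1);   /* unlock */
	if (fence)
		toy_drain(&unlocker);
	toy_store(&waiter, WAITERS, 1);   /* announce intent to sleep */
	if (fence)
		toy_drain(&waiter);

	saw_waiters = toy_load(&unlocker, WAITERS);
	saw_count = toy_load(&waiter, COUNT);

	toy_drain(&unlocker);
	toy_drain(&waiter);

	return saw_waiters == 0 && saw_count == 0;
}
```

In this model toy_lost_wakeup(false) reports a lost wakeup and
toy_lost_wakeup(true) does not. Nothing is executed out of order in either
case; the buffered stores alone are enough.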

Linus