Nick Piggin <nickpiggin@xxxxxxxxxxxx> wrote:
Ah, but if the cache is on the CPU side of the dotted line, does that thenI don't think it has to[*]. It would guarantee the _order_ in which "global
mean that a write memory barrier guarantees the CPU's cache to have
updated memory?
memory" of this model ie. visibility for other "CPUs" see the writes,
whether that visibility ultimately be implemented by cache coherency
protocol or something else, I don't think matters (for a discussion of
memory ordering).
It does matter, because I have to make it clear that the effect of the memory
barrier usually stops at the cache, and in fact memory barriers may have no
visibility at all on another CPU because it's all done inside a CPU's cache,
until that other CPU tries to observe the results.
If anything it confused the matter for the case of Alpha.
Nah... Alpha is self-confusing:-)
All the programmer needs to know is that there is some horizon (memory)
beyond which stores are visible to other CPUs, and stores can travel there
at different speeds so later ones can overtake earlier ones. And likewise
loads can come from memory to the CPU at different speeds too, so later
loads can contain earlier results.
They also need to know that memory barriers don't imply an ordering on the
cache.
[*] Nor would your model require a smp_wmb() to update CPU caches either, I
think: it wouldn't have to flush the store buffer, just order it.
Exactly.
But in your diagram, given that it doesn't show the cache, you don't know that
the memory barrier doesn't extend through the cache and all the way to memory.