Re: LFENCE instruction (was: [rfc][patch 3/3] x86: optimise barriers)

From: Nick Piggin
Date: Mon Oct 15 2007 - 20:22:41 EST


On Tue, Oct 16, 2007 at 12:08:01AM +0200, Mikulas Patocka wrote:
> > On Mon, 15 Oct 2007 22:47:42 +0200 (CEST)
> > Mikulas Patocka <mikulas@xxxxxxxxxxxxxxxxxxxxxxxx> wrote:
> >
> > > > According to latest memory ordering specification documents from
> > > > Intel and AMD, both manufacturers are committed to in-order loads
> > > > from cacheable memory for the x86 architecture. Hence, smp_rmb()
> > > > may be a simple barrier.
> > > >
> > > > http://developer.intel.com/products/processor/manuals/318147.pdf
> > > > http://www.amd.com/us-en/assets/content_type/white_papers_and_tech_docs/24593.pdf
> > >
> > > Hi
> > >
> > > I'm just wondering about one thing --- what is LFENCE instruction
> > > good for?
> > >
> > > SFENCE is for enforcing ordering in write-combining buffers (it
> > > doesn't have sense in write-back cache mode).
> > > MFENCE is for preventing of moving stores past loads.
> > >
> > > But what is LFENCE for? I read the above documents and they already
> > > say that CPUs have ordered loads.
> > >
> >
> > The cpus also have an explicit set of instructions that deliberately do
> > unordered stores/loads, and s/lfence etc are mostly designed for those.
>
> I know about unordered stores (movnti & similar) --- they basically use
> write-combining method on memory that is normally write-back --- and they
> need sfence. But which one instruction does unordered load and needs
> lefence?

Also, for non-wb memory. I don't think the Intel document referenced
says anything about this, but the AMD document says that loads can pass
loads (page 8, rule b).

This is why our rmb() is still an lfence.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/