Re: XDP socket rings, and LKMM litmus tests

From: Alan Stern
Date: Wed Mar 03 2021 - 22:23:25 EST


On Wed, Mar 03, 2021 at 02:03:48PM -0800, Paul E. McKenney wrote:
> On Wed, Mar 03, 2021 at 03:22:46PM -0500, Alan Stern wrote:
> > On Wed, Mar 03, 2021 at 09:40:22AM -0800, Paul E. McKenney wrote:
> > > On Wed, Mar 03, 2021 at 12:12:21PM -0500, Alan Stern wrote:
> >
> > > > Local variables absolutely should be treated just like CPU registers, if
> > > > possible. In fact, the compiler has the option of keeping local
> > > > variables stored in registers.
> > > >
> > > > (Of course, things may get complicated if anyone writes a litmus test
> > > > that uses a pointer to a local variable, Especially if the pointer
> > > > could hold the address of a local variable in one execution and a
> > > > shared variable in another! Or if the pointer is itself a shared
> > > > variable and is dereferenced in another thread!)
> > >
> > > Good point! I did miss this complication. ;-)
> >
> > I suspect it wouldn't be so bad if herd7 disallowed taking addresses of
> > local variables.
> >
> > > As you say, when its address is taken, the "local" variable needs to be
> > > treated as is it were shared. There are exceptions where the pointed-to
> > > local is still used only by its process. Are any of these exceptions
> > > problematic?
> >
> > Easiest just to rule out the whole can of worms.
>
> Good point, given that a global can be used instead of a local for
> any case where an address must be taken.

Another thing to consider: Almost all marked accesses involve using the
address of the storage location (for example, smp_load_acquire's first
argument must be a pointer). As far as I can remember at the moment,
the only ones that don't are READ_ONCE and WRITE_ONCE. So although we
might or might not want to allow READ_ONCE or WRITE_ONCE on a local
variable, we won't have to worry about any of the other kinds of marked
accesses.

> > > > But even if local variables are treated as non-shared storage locations,
> > > > we should still handle this correctly. Part of the problem seems to lie
> > > > in the definition of the to-r dependency relation; the relevant portion
> > > > is:
> > > >
> > > > (dep ; [Marked] ; rfi)
> > > >
> > > > Here dep is the control dependency from the READ_ONCE to the
> > > > local-variable store, and the rfi refers to the following load of the
> > > > local variable. The problem is that the store to the local variable
> > > > doesn't go in the Marked class, because it is notated as a plain C
> > > > assignment. (And likewise for the following load.)
> > > >
> > > > Should we change the model to make loads from and stores to local
> > > > variables always count as Marked?
> > >
> > > As long as the initial (possibly unmarked) load would be properly
> > > complained about.
> >
> > Sorry, I don't understand what you mean.
>
> I was thinking in terms of something like this in one of the processes:
>
> p = gp; // Unmarked!
> r1 = p;
> q = r1; // Implicitly marked now?
> if (q)
> WRITE_ONCE(x, 1); // ctrl dep from gp???

I hope we won't have to worry about this! :-) Treating local variable
accesses as if they are always marked looks wrong.

> > > And I cannot immediately think of a situation where
> > > this approach would break that would not result in a data race being
> > > flagged. Or is this yet another failure of my imagination?
> >
> > By definition, an access to a local variable cannot participate in a
> > data race because all such accesses are confined to a single thread.
>
> True, but its value might have come from a load from a shared variable.

Then that load could have participated in a data race. But the store to
the local variable cannot.

> > However, there are other aspects to consider, in particular, the
> > ordering relations on local-variable accesses. But if, as Luc says,
> > local variables are treated just like registers then perhaps the issue
> > doesn't arise.
>
> Here is hoping!
>
> > > > What should have happened if the local variable were instead a shared
> > > > variable which the other thread didn't access at all? It seems like a
> > > > weak point of the memory model that it treats these two things
> > > > differently.
> > >
> > > But is this really any different than the situation where a global
> > > variable is only accessed by a single thread?
> >
> > Indeed; it is the _same_ situation. Which leads to some interesting
> > questions, such as: What does READ_ONCE(r) mean when r is a local
> > variable? Should it be allowed at all? In what way is it different
> > from a plain read of r?
> >
> > One difference is that the LKMM doesn't allow dependencies to originate
> > from a plain load. Of course, when you're dealing with a local
> > variable, what matters is not the load from that variable but rather the
> > earlier loads which determined the value that had been stored there.
> > Which brings us back to the case of the
> >
> > dep ; rfi
> >
> > dependency relation, where the accesses in the middle are plain and
> > non-racy. Should the LKMM be changed to allow this?
>
> It would be nice, give or take the potential side effects. ;-)
> As in it would be nice, but might not be worthwhile.

Treating local variables like registers will automatically bring this
behavior. So I think we'll be good.

> > There are other differences to consider. For example:
> >
> > r = READ_ONCE(x);
> > smp_wmb();
> > WRITE_ONCE(y, 1);
> >
> > If the write to r were treated as a marked store, the smp_wmb would
> > order it (and consequently the READ_ONCE) before the WRITE_ONCE.
> > However we don't want to do this when r is a local variable. Indeed, a
> > plain store wouldn't be ordered this way because the compiler might
> > optimize the store away entirely, leaving the smp_wmb nothing to act on.
>
> Agreed, having smp_wmb() order things due to a write to a local variable
> would not be what we want.
>
> > So overall the situation is rather puzzling. Treating local variables
> > as registers is probably the best answer.
>
> That is sounding quite appealing at the moment.

Agreed.

Alan