Re: [PATCH 1/1] VSOCK: Introduce VM Sockets

From: Andy King
Date: Wed Feb 13 2013 - 22:20:32 EST

> I've seen you have a notify_ops in the vmci bits. Do you have different
> notify ops depending on socket type or something? Does it make sense to
> move the notify ops ptr into "struct vsock_sock" maybe?

The notify stuff only applies to STREAMs. However, we have two different
notify impls, one for legacy ESX and one for newer, and we figure out at
runtime which protocol we're using with the hypervisor and set the
callbacks appropriately. The difference between the two is that the
newer one is much smarter and knows not to signal (the peer) quite so much,
i.e., it has some basic but sensible flow-control, which improves
performance quite a bit. Again, that might not make any sense at all
for virtio. Do you need to signal when you enqueue to a ring? And is
there coalescing? Dunno...

> And can we make it optional please (i.e. allow the function pointers to
> be NULL)?

They were originally allowed to be NULL, but I changed it in the last
round of patches while moving them into the transport, since I disliked
the NULL checks so much. I can put them back, but that's a bigger
change, and I'm not sure we want to push large patches to Dave right
now :)

> Which problem you are trying to tackle with the notifications?

It's to do with signaling the peer, or more appropriately, trying to
avoid signaling the peer when possible. The naive impl. is to signal
every time we enqueue or dequeue data (into our VMCI queuepairs).
But signaling is slow, since it involves a world exit, so we prefer
not to. Which means we need to keep track of rate of flow and figure
out when we should and should not, and that's what all the notification
stuff does. It's...ugly...

> > For the VMCI transport, it indicates if the underlying queuepair is
> > still around (i.e., make sure we haven't torn it down while sleeping
> > in a blocking send or receive). Perhaps it's not the best name?
> How you'd hit that? Peer closing the socket while sleeping? Other
> thread closing the socket wile sleeping? Both?
> I think a state field in struct vsock_sock would be a better solution here.

Hrm, lemme think about this one.

> >> What is *_allow?
> >
> > It's very basic filtering. We have specific addresses that we don't
> > allow, and we look for them in the allow() functions. You can just
> > return true if you like.
> Can we make those calls optional too please?


> The notify_*_init could return a opaque pointer instead. But then
> you'll need a notify_*_free too. And you can't place it on the stack
> and thus have a allocation in the hot path, which I guess you are trying
> to avoid in the first place.

Avoiding allocation is good, but at the same time, I wonder if it's not
a big deal compared with the time spent copying out of the buffers, in
which case it won't matter. So maybe we can do this.

- Andy
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at
Please read the FAQ at