Re: Xen & VMI?

From: Jeremy Fitzhardinge
Date: Tue Mar 06 2007 - 16:46:55 EST


Ingo Molnar wrote:
> 1 sane ABI instead of 4 parallel ones? It's the same difference as the
> difference between 300 system calls and 1200 system calls. Alot more
> focus, alot more integration, alot less pain. Every time i change a
> detail in Linux i have to update (and think about) 1 virtualization
> aspect - not 4 (or more).
>

But you're just pushing the problem off; it doesn't go away, and it
doesn't really stop being your problem. If you change the way you use
the abi and something breaks in one of its implementations, you're still
going to get "wahh, you broke Xen/ESX/whatever". And then you either
have to wait for the ABI implementation to get fixed, or work around it
on the Linux side.

I don't see why you think an ABI is easier to reason about than the
pv_ops api? If you're going to make a kernel change that has an effect
on the API, its also going to have an effect on the ABI, but with the
ABI you're stuck waiting for the ABI to catch up before you can do
something. If your kernel change relies on pv_ops but won't need to
change it, then reasoning about the pv_ops API is enough to make things
work.

pvops has, what, about 90 calls in it, which break down into a few broad
classes:

1. various setup things, which more or less correspond to subsystems
(interrupts, time, memory, etc)
2. calls which are direct analogues of hardware operations
3. pagetable/tlb operations
4. random corner cases (like the apic stuff to plug into ESX's apic
emulation)

The (functionally) large missing component is SMP support, though I
expect that will only come down to a handful of extra operations.

All of these operations are pretty easily understood in their own terms
at the pv_ops interface level. Knowing how the pv_ops backends map
these onto a particular hypercall interface is not really necessary to
understand the pv_ops interface, though its readily visible by reading
the source. But if you need to look and change it, you can; hiding it
under a layer of magic ABI dust isn't going to fix that.

The other big part that's missing from paravirt_ops is all the hooks for
calling back into the kernel - because they're not necessary. You
skipped that point in my earlier mail, but it is important. A basic
element in Xen's design is that it interacts with the guest kernel in a
fairly high level, and makes use of existing kernel mechanisms whereever
possible. How do you see that working in your proposal?

J
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/