Re: generic DMA bypass flag

From: Robin Murphy
Date: Fri Nov 15 2019 - 13:12:53 EST


On 14/11/2019 7:41 am, Christoph Hellwig wrote:
On Wed, Nov 13, 2019 at 02:45:15PM +0000, Robin Murphy wrote:
In all honesty, this seems silly. If we can set a per-device flag to say
"oh, bypass these ops and use some other ops instead", then we can just as
easily simply give the device the appropriate ops in the first place.
Because, y'know, the name of the game is "per-device ops".

Except that we can't do it _that_ easily. The problem is that for both
the powerpc and intel case the selection is dynamic. If a device is in
the identify domain with intel-iommu (or the equivalent on powerpc which
doesn't use the normal iommu framework), we still want to use the iommu
to be able to map memory for devices with a too small dma mask using
the iommu instead of using swiotlb bouncing. So to "just" use the
per-device dma ops we'd need:

a) a hook in dma_direct_supported to pick another set of ops for
small dma masks
b) a hook in the IOMMU ops to propagate to the direct ops for full
64-bit masks

And is that any different from where you would choose to "just" set a generic bypass flag?


diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
index d658c7c6a2ab..40e096d3dbc5 100644
--- a/drivers/iommu/iommu.c
+++ b/drivers/iommu/iommu.c
@@ -2242,12 +2242,14 @@ request_default_domain_for_dev(struct device *dev, unsigned long type)
/* Request that a device is direct mapped by the IOMMU */
int iommu_request_dm_for_dev(struct device *dev)
{
+ set_dma_ops(dev, NULL);
return request_default_domain_for_dev(dev, IOMMU_DOMAIN_IDENTITY);
}

/* Request that a device can't be direct mapped by the IOMMU */
int iommu_request_dma_domain_for_dev(struct device *dev)
{
+ set_dma_ops(dev, &iommu_dma_ops);
return request_default_domain_for_dev(dev, IOMMU_DOMAIN_DMA);
}


If intel-iommu gets fully converted such that everyone using default domains is also using iommu-dma, that should be it as far as the generic DMA ops are concerned (ultimately we might even be able to do it in __iommu_attach_device() based on the domain type). Like I said, the hard part is deciding when to make *these* calls, but apparently intel-iommu can already do that.

I looked into that for powerpc a while ago and it wasn't pretty at all.
Compared to that just checking another flag for the DMA direct calls
is relatively clean and trivial as seens in the diffstat for this series
alone.

I don't see a great benefit to pulling legacy cruft out into common code
instead of just working to get rid of it in-place, when said cruft pulls in
the opposite direction to where we're taking the common code (i.e. it's
inherently based on the premise of global ops).

I'm not sure what legacy cruft it pull in. I think it actually fits very
much into a mental model of "direct mapping is the default, to be overriden
if needed" which is pretty close to what we have at the moment. Just
with a slightly more complicated processing of the override.

Because the specific "slightly more complicated" currently used by the existing powerpc code will AFAICS continue to be needed only by the existing powerpc code, thus I don't see any benefit to cluttering up the common code with it today. You may as well just refactor powerpc to swizzle its own ops to obviate archdata.iommu_bypass, and delete a fair chunk of old code.

Robin.