Re: [PATCH 4/7] dm: prevent DAX mounts if not supported

From: Ross Zwisler
Date: Fri May 25 2018 - 17:36:55 EST


On Fri, May 25, 2018 at 03:54:10PM -0400, Mike Snitzer wrote:
> On Thu, May 24 2018 at 10:55pm -0400,
> Ross Zwisler <ross.zwisler@xxxxxxxxxxxxxxx> wrote:
>
> > Currently the code in dm_dax_direct_access() only checks whether the target
> > type has a direct_access() operation defined, not whether the underlying
> > block devices all support DAX. This latter property can be seen by looking
> > at whether we set the QUEUE_FLAG_DAX request queue flag when creating the
> > DM device.
> >
> > This is problematic if we have, for example, a dm-linear device made up of
> > a PMEM namespace in fsdax mode followed by a ramdisk from BRD.
> > QUEUE_FLAG_DAX won't be set on the dm-linear device's request queue, but
> > we have a working direct_access() entry point and the first member of the
> > dm-linear set *does* support DAX.
> >
> > This allows the user to create a filesystem on the dm-linear device, and
> > then mount it with DAX. The filesystem's bdev_dax_supported() test will
> > pass because it'll operate on the first member of the dm-linear device,
> > which happens to be a fsdax PMEM namespace.
> >
> > All DAX I/O will then fail to that dm-linear device because the lack of
> > QUEUE_FLAG_DAX prevents fs_dax_get_by_bdev() from working. This means that
> > the struct dax_device isn't ever set in the filesystem, so
> > dax_direct_access() will always return -EOPNOTSUPP.
> >
> > By failing out of dm_dax_direct_access() if QUEUE_FLAG_DAX isn't set we let
> > the filesystem know we don't support DAX at mount time. The filesystem
> > will then silently fall back and remove the dax mount option, causing it to
> > work properly.
> >
> > Signed-off-by: Ross Zwisler <ross.zwisler@xxxxxxxxxxxxxxx>
> > Fixes: commit 545ed20e6df6 ("dm: add infrastructure for DAX support")
> > ---
> > drivers/md/dm.c | 5 ++---
> > 1 file changed, 2 insertions(+), 3 deletions(-)
> >
> > diff --git a/drivers/md/dm.c b/drivers/md/dm.c
> > index 0a7b0107ca78..9728433362d1 100644
> > --- a/drivers/md/dm.c
> > +++ b/drivers/md/dm.c
> > @@ -1050,14 +1050,13 @@ static long dm_dax_direct_access(struct dax_device *dax_dev, pgoff_t pgoff,
> >
> > if (!ti)
> > goto out;
> > - if (!ti->type->direct_access)
> > + if (!blk_queue_dax(md->queue))
> > goto out;
> > len = max_io_len(sector, ti) / PAGE_SECTORS;
> > if (len < 1)
> > goto out;
> > nr_pages = min(len, nr_pages);
> > - if (ti->type->direct_access)
> > - ret = ti->type->direct_access(ti, pgoff, nr_pages, kaddr, pfn);
> > + ret = ti->type->direct_access(ti, pgoff, nr_pages, kaddr, pfn);
>
> So I followed all the rationale for this patch. But the last change
> doesn't make any sense. We should still verify that the target has
> ti->type->direct_access before calling it. So please reinstate that
> check before calling it.

You know that type has direct_access() via the blk_queue_dax() check. This
tells you not only that the target has direct_access(), but also that you've
successfully checked all members of that DM device and they all have working
DAX I/O paths, etc. This is all done via the bdev_dax_supported() check and
the rest of the code in dm_table_supports_dax() and device_supports_dax().

If this is too subtle I can add a comment or add the check back.