Re: remove exofs, the T10 OSD code and block/scsi bidi support V3

From: Douglas Gilbert
Date: Wed Dec 19 2018 - 21:10:42 EST


On 2018-12-19 9:43 a.m., Christoph Hellwig wrote:
On Mon, Nov 26, 2018 at 07:11:10PM +0200, Boaz Harrosh wrote:
On 11/11/18 15:32, Christoph Hellwig wrote:
The only real user of the T10 OSD protocol, the pNFS object layout
driver never went to the point of having shipping products, and we
removed it 1.5 years ago. Exofs is just a simple example without
real life users.


You have failed to say what is your motivation for this patchset? What
is it you are trying to fix/improve.

Drop basically unused support, which allows us to

1) reduce the size of every kernel with block layer support, and
even more for every kernel with scsi support

By proposing the removal of bidi support from the block layer, it isn't
just the SCSI subsystem that will be impacted. Those NVMe documents
that you referred me to earlier in the year, in the command tables
in 1.3c and earlier you have noticed the 2 bit direction field and
what 11b means? Even if there aren't any bidi NVMe commands *** yet,
the fact that NVMe's 64 byte command format has provision for 4
(not 2) independent data transfers (data + meta, for each direction).
Surely NVMe will sooner or later take advantage of those ... a
command like READ GATHERED comes to mind.

2) reduce the size of the critical struct request structure by
128 bits, thus reducing the memory used by every blk-mq driver
significantly, never mind the cache effects

Hmm, one pointer (that is null in the non-bidi case) should be enough,
that's 64 or 32 bits.

3) stop having the maintainance overhead for this code in the
block layer, which has been rather painful at times

You won't get any sympathy from me :-) The sg driver is trying to
inject _SCSI_ commands into the SCSI mid-level for onward processing
by SCSI LLDs. So WTF does it have to deal with the block layer.


While on the subject of bidi, the order of transfers: is the data-out
(to the target) always before the data-in or is it the target device
that decides (depending on the semantics of the command) who is first?

Doug Gilbert

*** there could already be vendor specific bidi NVMe commands out
there (ditto for SCSI)