public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* SCSI Generic version 4 interface, release 1.2
@ 2007-03-14 16:18 Douglas Gilbert
  2007-03-17 21:15 ` Pete Wyckoff
  2007-03-19 10:49 ` FUJITA Tomonori
  0 siblings, 2 replies; 5+ messages in thread
From: Douglas Gilbert @ 2007-03-14 16:18 UTC (permalink / raw)
  To: linux-scsi; +Cc: fujita.tomonori, pw, jens.axboe

[-- Attachment #1: Type: text/plain, Size: 563 bytes --]

After reviewing this post by Pete Wyckoff:
http://marc.theaimsgroup.com/?l=linux-scsi&m=117278879816029&w=2

I decided to update my sg v4 interface document originally
posted 20061106 which I will now call release 1.1 :
http://lwn.net/Articles/208082/

Pete was proposing to put back din_iovec_count and
dout_iovec_count that had been dropped out of bsg but
had been in release 1.1 . Hmm.

Some other items have been picked up from the bsg
implementation plus the suggestion from LSF'07 to
add dout_resid.

See the attachment, comments welcome.

Doug Gilbert





[-- Attachment #2: sg4_proposal.txt --]
[-- Type: text/plain, Size: 6123 bytes --]

                      SCSI Generic version 4 interface structure
                      ==========================================
                                                      Release 1.2

Goals:
  - handle both generalized request/response and data_out/data_in
    independently in same invocation (i.e. synchronous usage).
  - alternatively the request and data_out could be instigated in one
    invocation with pointers given for the incoming response and data_in.
    Then a second invocation (as a result of polling or asynchronous
    notification) reports the response and/or data_in is done, plus
    provides error/resid/timing information. This is asynchronous usage.
    This allows for the most complicated SCSI commands: tagged, variable
    length cdbs with bidirectional data transfers.
  - support multiple protocols. If they are generalized request-response
    protocols then they can choose either the request/response part of the
    interface or the data_out/data_in part.
  - layered error/condition reporting: (OS) driver, transport and device
    (logical unit). Method used to present this struct to OS (e.g. ioctl())
    may also report error (e.g. EPERM).
  - allow for auxiliary information to be passed back for the application
    client to consider
  - same structure can be used for a synchronous (e.g. interruptible ioctl)
    or asynchronous (e.g. ioctl()/read() ) pass through.
  - leave device (lu) or target addressing issues to some other mechanism
    (what SCSI standards call the I_T_L or the I_T nexus respectively) as
    they are transport dependent. However do include the tag level (the
    "_Q" part of a I_T_L_Q nexus).
  - stay close enough to struct sg_io_hdr (sg version 3 interface) to use
    with existing SG_IO ioctls, current implementations expect 'S' in
    'guard'


Comments:
  - unsigned 64 bit integers used as pointer carriers to ease 32/64
    bit code interworking (e.g. 32 bit app on 64 bit kernel)
  - should there be more (or less) spare fields?
  - the write() usage in the sg driver's asynchronous interface has
    caused problems when mistakenly applied to a block device node
    rather than a sg device node. Using an ioctl(flag_async) followed
    by a read() for asynchronous work offers similar functionality and
    is safer. Using ioctl(flag_async_start) and ioctl(flag_async_finish)
    is another possibility.
  - rather than have a separate ATA pass through mechanism, the SAT
    defined ATA PASS THROUGH SCSI commands could be used with the
    driver implementation routing the ATA commands to their
    subsystem. This could be flagged so it didn't preclude a SAT layer
    in a SCSI transport (e.g. MPT SAS HBA firmware).
  - if SAM/SPC does not define an enumeration for lesser used input
    fields, then use the value 0 for inert/off/don't_care .
  - the SCSI command tag field as currently defined in SAM-4 can be
    up to 64 bits (with a proposal to increase that to 96 bits for FCP)
    Should we let the transport layer/LLD worry about that?


ChangeLog for release 1.2 [20070314]
  - add dout_resid
  - re-arrange uint64_t types (i.e. pointer carriers) to be on a
    8 byte boundary
  - reinstate dout_iovec_count and din_iovec_count (they were in
    release 1.1 but bsg dropped them)
  - change name: response_len_wr to response_len
  - pick up some descriptions from bsg

ChangeLog for release 1.1 [20061106]
  - was called sg version 4 interface, version 1.1
    so change the second "version" to "release"


-------------------------------------------------------------------
#include <stdint.h>


struct sg_io_v4
{
        int32_t guard;          /* [i] 'Q' to differentiate from v3 */
        uint32_t protocol;      /* [i] 0 -> SCSI , .... */
        uint32_t subprotocol;   /* [i] 0 -> SCSI command, 1 -> SCSI task
                                   management function, .... */

        uint32_t request_len;   /* [i] in bytes {SCSI: cdb length} */
        uint64_t request;       /* [i], [*i] {SCSI: cdb} */
        uint32_t request_attr;  /* [i] {SCSI: task attribute} */
        uint32_t request_tag;   /* [i] {SCSI: task tag (only if flagged)} */
        uint32_t request_priority; /* [i] {SCSI: task priority} */
        uint32_t max_response_len; /* [i] in bytes */
        uint64_t response;      /* [i], [*o]  {SCSI: (auto)sense data} */

        /* "dout_": data out (to device); "din_": data in (from device) */
        uint32_t dout_iovec_count;      /* [i] 0 -> "flat" dout transfer */
                                /* else dout_xfer points to array of iovec */
        uint32_t dout_xfer_len; /* [i] bytes to be transferred to device */
        uint32_t din_iovec_count;       /* [i] 0 -> "flat" din transfer */
        uint32_t din_xfer_len;  /* [i] bytes to be transferred from device */
        uint64_t dout_xfer;     /* [i], [*i]  */
        uint64_t din_xfer;      /* [i], [*o]  */

        uint32_t timeout;       /* [i] units: milliseconds */
        uint32_t flags;         /* [i] additional control: bit mask */
        uint64_t usr_ptr;       /* [i->o] unused internally */
        uint32_t spare_in;      /* [i] */

        uint32_t driver_status; /* [o] 0 -> ok */
        uint32_t transport_status;      /* [o] 0 -> ok */
        uint32_t device_status; /* [o] {SCSI: command completion status} */
        uint32_t retry_delay;   /* [o] {SCSI: status+, units: 100 ms} */
        uint32_t info;          /* [o] additional information, bit mask */
        uint32_t duration;      /* [o] time to complete, in milliseconds */
        uint32_t response_len;  /* [o] bytes of response actually written */
        int32_t din_resid;      /* [o] actual_din_xfer_len - din_xfer_len */
        int32_t dout_resid;     /* [o] actual_dout_xfer_len - dout_xfer_len */
        uint32_t generated_tag; /* [o] {SCSI: transport generated task tag} */
        uint32_t spare_out;     /* [o] */

        /* uint32_t padding[1];   */

};

/* sizeof(sg_io_v4)=144 [divisible by 8 (and 16)] */


-------------------------------------------------------------------------

Doug Gilbert
14th March 2007

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: SCSI Generic version 4 interface, release 1.2
  2007-03-14 16:18 SCSI Generic version 4 interface, release 1.2 Douglas Gilbert
@ 2007-03-17 21:15 ` Pete Wyckoff
  2007-03-17 23:13   ` Mike Christie
  2007-03-19 10:49 ` FUJITA Tomonori
  1 sibling, 1 reply; 5+ messages in thread
From: Pete Wyckoff @ 2007-03-17 21:15 UTC (permalink / raw)
  To: Douglas Gilbert; +Cc: linux-scsi, fujita.tomonori, jens.axboe

dougg@torque.net wrote on Wed, 14 Mar 2007 12:18 -0400:
> After reviewing this post by Pete Wyckoff:
> http://marc.theaimsgroup.com/?l=linux-scsi&m=117278879816029&w=2
> 
> I decided to update my sg v4 interface document originally
> posted 20061106 which I will now call release 1.1 :
> http://lwn.net/Articles/208082/
> 
> Pete was proposing to put back din_iovec_count and
> dout_iovec_count that had been dropped out of bsg but
> had been in release 1.1 . Hmm.
> 
> Some other items have been picked up from the bsg
> implementation plus the suggestion from LSF'07 to
> add dout_resid.
> 
> See the attachment, comments welcome.

Do you want to define the iovec format too?  As I commented in my
patch, v3 sg_iovec has pointers with 32/64-bit issues.  Would be
nice to see you declare v4 sg_iovec as pure u64.  (By the way, don't
use the patch:  casting from the new to the old can put junk in the
top half of the 64-bit sg_iovec.iov_len).

Another issue I wonder about is queue DMA alignment.  In bsg,
blk_rq_map_user will use a bounce buffer if the user-supplied start
and end addresses are not aligned.  sg will happily map user pages
at any offset without checking, although I haven't checked if Mike's
patches change this.  ll_rw_blk.c says regarding blk_rq_map_user:

    We don't allow misaligned data like bio_map_user() does.  If the
    user is using sg, they're expected to know the alignment
    constraints and respect them accordingly.

Should this still be true for both iovec and non-iovec uses of sgv4?
I modified bsg to use iovec and ignore alignment issues, just like
sg, but left in the bounce buffer for non-iovec usage.  Seems
awkward.  scsi_ioctl's sg_io has the same odd situation:  non-iovec
is bounced, iovec must be aligned by user.

		-- Pete


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: SCSI Generic version 4 interface, release 1.2
  2007-03-17 21:15 ` Pete Wyckoff
@ 2007-03-17 23:13   ` Mike Christie
  2007-03-18 17:48     ` Pete Wyckoff
  0 siblings, 1 reply; 5+ messages in thread
From: Mike Christie @ 2007-03-17 23:13 UTC (permalink / raw)
  To: Pete Wyckoff; +Cc: Douglas Gilbert, linux-scsi, fujita.tomonori, jens.axboe

Pete Wyckoff wrote:
> dougg@torque.net wrote on Wed, 14 Mar 2007 12:18 -0400:
>> After reviewing this post by Pete Wyckoff:
>> http://marc.theaimsgroup.com/?l=linux-scsi&m=117278879816029&w=2
>>
>> I decided to update my sg v4 interface document originally
>> posted 20061106 which I will now call release 1.1 :
>> http://lwn.net/Articles/208082/
>>
>> Pete was proposing to put back din_iovec_count and
>> dout_iovec_count that had been dropped out of bsg but
>> had been in release 1.1 . Hmm.
>>
>> Some other items have been picked up from the bsg
>> implementation plus the suggestion from LSF'07 to
>> add dout_resid.
>>
>> See the attachment, comments welcome.
> 
> Do you want to define the iovec format too?  As I commented in my
> patch, v3 sg_iovec has pointers with 32/64-bit issues.  Would be
> nice to see you declare v4 sg_iovec as pure u64.  (By the way, don't
> use the patch:  casting from the new to the old can put junk in the
> top half of the 64-bit sg_iovec.iov_len).
> 
> Another issue I wonder about is queue DMA alignment.  In bsg,
> blk_rq_map_user will use a bounce buffer if the user-supplied start
> and end addresses are not aligned.  sg will happily map user pages
> at any offset without checking, although I haven't checked if Mike's
> patches change this.  ll_rw_blk.c says regarding blk_rq_map_user:
> 

Currently and before any of my patches, sg checked the start
        if (((unsigned long)hp->dxferp &
                        queue_dma_alignment(sdev->request_queue)) != 0)
It did not check the start and end like blk_rq_map_user.

With my updated patches, blk_rq_map_user is just checking the start.
There was a thread about not checking the end on linux-scsi. I will try
to dig it up. A couple kernels ago the bio map helpers were modified to
only check the start.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: SCSI Generic version 4 interface, release 1.2
  2007-03-17 23:13   ` Mike Christie
@ 2007-03-18 17:48     ` Pete Wyckoff
  0 siblings, 0 replies; 5+ messages in thread
From: Pete Wyckoff @ 2007-03-18 17:48 UTC (permalink / raw)
  To: Mike Christie; +Cc: Douglas Gilbert, linux-scsi, fujita.tomonori, jens.axboe

michaelc@cs.wisc.edu wrote on Sat, 17 Mar 2007 18:13 -0500:
> Pete Wyckoff wrote:
> > dougg@torque.net wrote on Wed, 14 Mar 2007 12:18 -0400:
> >> After reviewing this post by Pete Wyckoff:
> >> http://marc.theaimsgroup.com/?l=linux-scsi&m=117278879816029&w=2
> >>
> >> I decided to update my sg v4 interface document originally
> >> posted 20061106 which I will now call release 1.1 :
> >> http://lwn.net/Articles/208082/
> >>
> >> Pete was proposing to put back din_iovec_count and
> >> dout_iovec_count that had been dropped out of bsg but
> >> had been in release 1.1 . Hmm.
> >>
> >> Some other items have been picked up from the bsg
> >> implementation plus the suggestion from LSF'07 to
> >> add dout_resid.
> >>
> >> See the attachment, comments welcome.
> > 
> > Do you want to define the iovec format too?  As I commented in my
> > patch, v3 sg_iovec has pointers with 32/64-bit issues.  Would be
> > nice to see you declare v4 sg_iovec as pure u64.  (By the way, don't
> > use the patch:  casting from the new to the old can put junk in the
> > top half of the 64-bit sg_iovec.iov_len).
> > 
> > Another issue I wonder about is queue DMA alignment.  In bsg,
> > blk_rq_map_user will use a bounce buffer if the user-supplied start
> > and end addresses are not aligned.  sg will happily map user pages
> > at any offset without checking, although I haven't checked if Mike's
> > patches change this.  ll_rw_blk.c says regarding blk_rq_map_user:
> 
> Currently and before any of my patches, sg checked the start
>         if (((unsigned long)hp->dxferp &
>                         queue_dma_alignment(sdev->request_queue)) != 0)
> It did not check the start and end like blk_rq_map_user.
> 
> With my updated patches, blk_rq_map_user is just checking the start.
> There was a thread about not checking the end on linux-scsi. I will try
> to dig it up. A couple kernels ago the bio map helpers were modified to
> only check the start.

Oh, I missed that.  There's this thread about USB needing end
alignment too, on all but the last iovec entry:

    http://thread.gmane.org/gmane.linux.usb.devel/16230/focus=16409

		-- Pete

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: SCSI Generic version 4 interface, release 1.2
  2007-03-14 16:18 SCSI Generic version 4 interface, release 1.2 Douglas Gilbert
  2007-03-17 21:15 ` Pete Wyckoff
@ 2007-03-19 10:49 ` FUJITA Tomonori
  1 sibling, 0 replies; 5+ messages in thread
From: FUJITA Tomonori @ 2007-03-19 10:49 UTC (permalink / raw)
  To: dougg; +Cc: linux-scsi, fujita.tomonori, pw, jens.axboe

From: Douglas Gilbert <dougg@torque.net>
Subject: SCSI Generic version 4 interface, release 1.2
Date: Wed, 14 Mar 2007 12:18:16 -0400

>   - the write() usage in the sg driver's asynchronous interface has
>   caused problems when mistakenly applied to a block device node
>   rather than a sg device node. Using an ioctl(flag_async) followed
>   by a read() for asynchronous work offers similar functionality and
>   is safer. Using ioctl(flag_async_start) and
>   ioctl(flag_async_finish) is another possibility.

I'm not sure about this because users read/write via /dev/bsgX not
/dev/sdX.

Anyway, I'm not sure the read/write interface is good for bsg.


>   - the SCSI command tag field as currently defined in SAM-4 can be
>     up to 64 bits (with a proposal to increase that to 96 bits for FCP)
>     Should we let the transport layer/LLD worry about that?

SRP uses 64-bits tags.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2007-03-19 10:49 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-03-14 16:18 SCSI Generic version 4 interface, release 1.2 Douglas Gilbert
2007-03-17 21:15 ` Pete Wyckoff
2007-03-17 23:13   ` Mike Christie
2007-03-18 17:48     ` Pete Wyckoff
2007-03-19 10:49 ` FUJITA Tomonori

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox