qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Peter Lieven <pl@kamp.de>
To: ronnie sahlberg <ronniesahlberg@gmail.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
	Paolo Bonzini <pbonzini@redhat.com>,
	qemu-devel <qemu-devel@nongnu.org>,
	Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: [Qemu-devel] [PATCH] block/iscsi: use 16 byte CDBs only when necessary
Date: Wed, 04 Jun 2014 16:43:32 +0200	[thread overview]
Message-ID: <538F3094.8040204@kamp.de> (raw)
In-Reply-To: <CAN05THTaDzwHf-A0SmAGgmODhoqsGoyyn7apiBmnwYeFBQc63Q@mail.gmail.com>

Am 04.06.2014 16:00, schrieb ronnie sahlberg:
> Looks good.
>
> As an alternative, you could do the 10 vs 16 decision based on the LBA
> instead of the size of the device :
>
> -    if (use_16_for_ws) {
> +   if (lba >= 0x100000000) {
>         iTask.task = iscsi_writesame16_task(iscsilun->iscsi, iscsilun->lun, lba,
>                                             iscsilun->zeroblock,
> iscsilun->block_size,
>                                             nb_blocks, 0, !!(flags &
> BDRV_REQ_MAY_UNMAP),
>                                             0, 0, iscsi_co_generic_cb, &iTask);
>     } else {
>         iTask.task = iscsi_writesame10_task(iscsilun->iscsi, iscsilun->lun, lba,
>                                             iscsilun->zeroblock,
> iscsilun->block_size,
>                                             nb_blocks, 0, !!(flags &
> BDRV_REQ_MAY_UNMAP),
>                                             0, 0, iscsi_co_generic_cb, &iTask);
>     }
>
> That would mean you get to use the 10 version of the cdb even for very
> large devices (as long as the IO is for blocks at the beginning of the
> device) and thus provide partial avoidance of this issue for those
> large devices.

I like that idea, however I fear that this would introduce additional bugs.
- Using 10 Byte CDBs where the target might expect 16 Byte CDBs?!
- What if lba + num_blocks > 2^32-1 ?

The switch I added is like Linux does it - as Paolo pointed out earlier.

In my case the number of >2TB Targets is not that big so I can work with
the switch based on the capacity. Until the bug is fixed I just can't move
those 2TB volumes around on the storage array.

Peter

>
>
> ronnie shalberg
>
>
> On Wed, Jun 4, 2014 at 6:47 AM, Peter Lieven <pl@kamp.de> wrote:
>> this patch changes the driver to uses 16 Byte CDBs for
>> READ/WRITE only if the target requires 64bit lba addressing.
>>
>> On one hand this saves 6 bytes in each PDU on the other
>> hand it seems that 10 Byte CDBs seems to be much better
>> supported and tested as a recent issue I had with a
>> major storage supplier lined out.
>>
>> For WRITESAME the logic is a bit more tricky as WRITESAME10
>> with UNMAP was added really late. Thus a fallback to WRITESAME16
>> is possible if it supports UNMAP and WRITESAME10 not.
>>
>> Signed-off-by: Peter Lieven <pl@kamp.de>
>> ---
>>  block/iscsi.c |   58 +++++++++++++++++++++++++++++++++++++++------------------
>>  1 file changed, 40 insertions(+), 18 deletions(-)
>>
>> diff --git a/block/iscsi.c b/block/iscsi.c
>> index d241e83..019b324 100644
>> --- a/block/iscsi.c
>> +++ b/block/iscsi.c
>> @@ -65,6 +65,7 @@ typedef struct IscsiLun {
>>      unsigned char *zeroblock;
>>      unsigned long *allocationmap;
>>      int cluster_sectors;
>> +    bool use_16_for_rw;
>>  } IscsiLun;
>>
>>  typedef struct IscsiTask {
>> @@ -368,10 +369,17 @@ static int coroutine_fn iscsi_co_writev(BlockDriverState *bs,
>>      num_sectors = sector_qemu2lun(nb_sectors, iscsilun);
>>      iscsi_co_init_iscsitask(iscsilun, &iTask);
>>  retry:
>> -    iTask.task = iscsi_write16_task(iscsilun->iscsi, iscsilun->lun, lba,
>> -                                    data, num_sectors * iscsilun->block_size,
>> -                                    iscsilun->block_size, 0, 0, 0, 0, 0,
>> -                                    iscsi_co_generic_cb, &iTask);
>> +    if (iscsilun->use_16_for_rw) {
>> +        iTask.task = iscsi_write16_task(iscsilun->iscsi, iscsilun->lun, lba,
>> +                                        data, num_sectors * iscsilun->block_size,
>> +                                        iscsilun->block_size, 0, 0, 0, 0, 0,
>> +                                        iscsi_co_generic_cb, &iTask);
>> +    } else {
>> +        iTask.task = iscsi_write10_task(iscsilun->iscsi, iscsilun->lun, lba,
>> +                                        data, num_sectors * iscsilun->block_size,
>> +                                        iscsilun->block_size, 0, 0, 0, 0, 0,
>> +                                        iscsi_co_generic_cb, &iTask);
>> +    }
>>      if (iTask.task == NULL) {
>>          g_free(buf);
>>          return -ENOMEM;
>> @@ -545,20 +553,17 @@ static int coroutine_fn iscsi_co_readv(BlockDriverState *bs,
>>
>>      iscsi_co_init_iscsitask(iscsilun, &iTask);
>>  retry:
>> -    switch (iscsilun->type) {
>> -    case TYPE_DISK:
>> +    if (iscsilun->use_16_for_rw) {
>>          iTask.task = iscsi_read16_task(iscsilun->iscsi, iscsilun->lun, lba,
>>                                         num_sectors * iscsilun->block_size,
>>                                         iscsilun->block_size, 0, 0, 0, 0, 0,
>>                                         iscsi_co_generic_cb, &iTask);
>> -        break;
>> -    default:
>> +    } else {
>>          iTask.task = iscsi_read10_task(iscsilun->iscsi, iscsilun->lun, lba,
>>                                         num_sectors * iscsilun->block_size,
>>                                         iscsilun->block_size,
>>                                         0, 0, 0, 0, 0,
>>                                         iscsi_co_generic_cb, &iTask);
>> -        break;
>>      }
>>      if (iTask.task == NULL) {
>>          return -ENOMEM;
>> @@ -864,19 +869,27 @@ coroutine_fn iscsi_co_write_zeroes(BlockDriverState *bs, int64_t sector_num,
>>      struct IscsiTask iTask;
>>      uint64_t lba;
>>      uint32_t nb_blocks;
>> +    bool use_16_for_ws = iscsilun->use_16_for_rw;
>>
>>      if (!is_request_lun_aligned(sector_num, nb_sectors, iscsilun)) {
>>          return -EINVAL;
>>      }
>>
>> -    if ((flags & BDRV_REQ_MAY_UNMAP) && !iscsilun->lbp.lbpws) {
>> -        /* WRITE SAME with UNMAP is not supported by the target,
>> -         * fall back and try WRITE SAME without UNMAP */
>> -        flags &= ~BDRV_REQ_MAY_UNMAP;
>> +    if (flags & BDRV_REQ_MAY_UNMAP) {
>> +        if (!use_16_for_ws && !iscsilun->lbp.lbpws10) {
>> +            /* WRITESAME10 with UNMAP is unsupported try WRITESAME16 */
>> +            use_16_for_ws = true;
>> +        }
>> +        if (use_16_for_ws && !iscsilun->lbp.lbpws) {
>> +            /* WRITESAME16 with UNMAP is not supported by the target,
>> +             * fall back and try WRITESAME10/16 without UNMAP */
>> +            flags &= ~BDRV_REQ_MAY_UNMAP;
>> +            use_16_for_ws = iscsilun->use_16_for_rw;
>> +        }
>>      }
>>
>>      if (!(flags & BDRV_REQ_MAY_UNMAP) && !iscsilun->has_write_same) {
>> -        /* WRITE SAME without UNMAP is not supported by the target */
>> +        /* WRITESAME without UNMAP is not supported by the target */
>>          return -ENOTSUP;
>>      }
>>
>> @@ -889,10 +902,18 @@ coroutine_fn iscsi_co_write_zeroes(BlockDriverState *bs, int64_t sector_num,
>>
>>      iscsi_co_init_iscsitask(iscsilun, &iTask);
>>  retry:
>> -    if (iscsi_writesame16_task(iscsilun->iscsi, iscsilun->lun, lba,
>> -                               iscsilun->zeroblock, iscsilun->block_size,
>> -                               nb_blocks, 0, !!(flags & BDRV_REQ_MAY_UNMAP),
>> -                               0, 0, iscsi_co_generic_cb, &iTask) == NULL) {
>> +    if (use_16_for_ws) {
>> +        iTask.task = iscsi_writesame16_task(iscsilun->iscsi, iscsilun->lun, lba,
>> +                                            iscsilun->zeroblock, iscsilun->block_size,
>> +                                            nb_blocks, 0, !!(flags & BDRV_REQ_MAY_UNMAP),
>> +                                            0, 0, iscsi_co_generic_cb, &iTask);
>> +    } else {
>> +        iTask.task = iscsi_writesame10_task(iscsilun->iscsi, iscsilun->lun, lba,
>> +                                            iscsilun->zeroblock, iscsilun->block_size,
>> +                                            nb_blocks, 0, !!(flags & BDRV_REQ_MAY_UNMAP),
>> +                                            0, 0, iscsi_co_generic_cb, &iTask);
>> +    }
>> +    if (iTask.task == NULL) {
>>          return -ENOMEM;
>>      }
>>
>> @@ -1087,6 +1108,7 @@ static void iscsi_readcapacity_sync(IscsiLun *iscsilun, Error **errp)
>>                      iscsilun->num_blocks = rc16->returned_lba + 1;
>>                      iscsilun->lbpme = rc16->lbpme;
>>                      iscsilun->lbprz = rc16->lbprz;
>> +                    iscsilun->use_16_for_rw = (rc16->returned_lba > 0xffffffff);
>>                  }
>>              }
>>              break;
>> --
>> 1.7.9.5
>>

  reply	other threads:[~2014-06-04 14:43 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-06-04 13:47 [Qemu-devel] [PATCH] block/iscsi: use 16 byte CDBs only when necessary Peter Lieven
2014-06-04 14:00 ` ronnie sahlberg
2014-06-04 14:43   ` Peter Lieven [this message]
2014-06-04 14:54     ` ronnie sahlberg
2014-06-05  9:12   ` Michael Tokarev
2014-06-05  9:27     ` Peter Lieven
2014-06-17  6:14     ` Peter Lieven
2014-06-17 11:15       ` Paolo Bonzini
2014-06-17 11:37         ` Peter Lieven
2014-06-17 11:46           ` Paolo Bonzini
2014-06-17 11:50             ` Peter Lieven
2014-06-17 13:45             ` Peter Lieven
2014-09-01 15:21             ` Peter Lieven
2014-09-02 15:28               ` ronnie sahlberg
2014-09-02 18:14                 ` Peter Lieven
2014-09-02 19:30                 ` Peter Lieven
2014-09-03  8:09                   ` Peter Lieven
2014-09-03 12:31                     ` Stefan Hajnoczi
2014-09-03 13:13                       ` Peter Lieven
2014-09-03 14:17                     ` ronnie sahlberg
2014-09-03 14:18                       ` Paolo Bonzini
2014-09-03 14:48                         ` ronnie sahlberg
2014-09-03 19:29                           ` Peter Lieven
2014-06-04 15:31 ` Paolo Bonzini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=538F3094.8040204@kamp.de \
    --to=pl@kamp.de \
    --cc=kwolf@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=ronniesahlberg@gmail.com \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).