Date: Mon, 01 Sep 2014 17:21:12 +0200
From: Peter Lieven
To: Paolo Bonzini, Michael Tokarev, ronnie sahlberg
Cc: Kevin Wolf, qemu-devel, Stefan Hajnoczi
Subject: Re: [Qemu-devel] [PATCH] block/iscsi: use 16 byte CDBs only when necessary
Message-ID: <54048EE8.3040305@kamp.de>
In-Reply-To: <53A02AA6.2060006@redhat.com>

On 17.06.2014 13:46, Paolo Bonzini wrote:
> On 17/06/2014 13:37, Peter Lieven wrote:
>> On 17.06.2014 13:15, Paolo Bonzini wrote:
>>> On 17/06/2014 08:14, Peter Lieven wrote:
>>>> BTW, while debugging a case with a bigger storage supplier I found
>>>> that open-iscsi seems to show exactly this non-deterministic behaviour.
>>>> I have a 3TB LUN. If I access sectors below 2TB it uses READ10/WRITE10,
>>>> and if I go beyond 2TB it switches to READ16/WRITE16.
>>>
>>> Isn't that exactly what your latest patch does for >64K sector writes? :)
>>
>> Not exactly. We choose the default by checking the LUN size: 10-byte CDBs
>> for LUNs smaller than 2TB and 16-byte CDBs otherwise.
>
> Yeah, I meant introducing the non-determinism.
>
>> My latest patch makes an exception if a request is bigger than 64K sectors
>> and switches to 16-byte requests. These would otherwise end in an I/O error.
>
> It could also be split at the block layer, like we do for unmap.

I think there's also a maximum transfer size somewhere in the VPD; we could switch to READ16/WRITE16 if a request is >64K sectors.

It seems there is a real-world case where Linux issues >32MB write requests. Maybe someone familiar with btrfs can advise. I see iSCSI protocol errors in my logs:

Sep  1 10:10:14 libiscsi:0 PDU header: 01 a1 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00 00 00 07 06 8f 30 00 00 00 00 06 00 00 00 0a 2a 00 01 09 9e 50 00 47 98 00 00 00 00 00 00 00 [XXX]
Sep  1 10:10:14 qemu-2.0.0: iSCSI: Failed to write10 data to iSCSI lun. Request was rejected with reason: 0x04 (Protocol Error)

Looking at the headers, the xferlen in the iSCSI PDU is 110047232 bytes, which is 214936 sectors. 214936 % 65536 = 18328, which is exactly the number of blocks in the SCSI WRITE10 CDB.

Can someone advise whether this is something btrfs can cause, or whether I have to blame the customer for issuing very big write requests with direct I/O?
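The selection rule discussed above (10-byte CDBs by default on LUNs below 2TB, escalating to 16-byte CDBs when a request does not fit) can be sketched as follows. This is a minimal illustration, not the actual qemu code; the 512-byte sector size is an assumption:

```python
# Sketch of the CDB-size selection discussed above (not the actual
# qemu code). READ10/WRITE10 carry a 32-bit LBA and a 16-bit
# transfer length; anything that does not fit needs READ16/WRITE16.
MAX_LBA_10 = 0xFFFFFFFF      # highest LBA addressable by a 10-byte CDB
MAX_BLOCKS_10 = 0xFFFF       # largest transfer length (64K - 1 sectors)

def cdb_size(lba, num_blocks):
    """Return the CDB size (10 or 16) needed for this request."""
    if lba + num_blocks - 1 > MAX_LBA_10:
        return 16            # request touches sectors beyond the 32-bit LBA range
    if num_blocks > MAX_BLOCKS_10:
        return 16            # transfer length would overflow the 16-bit field
    return 10

print(cdb_size(0, 214936))   # the failing request from the log -> 16
print(cdb_size(0, 1024))     # ordinary small request -> 10
```

With a LUN larger than 2TB (highest LBA above 0xFFFFFFFF at 512-byte sectors), even a small request near the end of the disk forces a 16-byte CDB, which is the non-determinism open-iscsi shows.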
The user sees something like this in the log:

[34640.489284] BTRFS: bdev /dev/vda2 errs: wr 8232, rd 0, flush 0, corrupt 0, gen 0
[34640.490379] end_request: I/O error, dev vda, sector 17446880
[34640.491251] end_request: I/O error, dev vda, sector 5150144
[34640.491290] end_request: I/O error, dev vda, sector 17472080
[34640.492201] end_request: I/O error, dev vda, sector 17523488
[34640.492201] end_request: I/O error, dev vda, sector 17536592
[34640.492201] end_request: I/O error, dev vda, sector 17599088
[34640.492201] end_request: I/O error, dev vda, sector 17601104
[34640.685611] end_request: I/O error, dev vda, sector 15495456
[34640.685650] end_request: I/O error, dev vda, sector 7138216

Thanks,
Peter
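The truncation arithmetic above can be reproduced from the PDU dump; the 512-byte sector size and the WRITE10 CDB layout (16-bit transfer length in bytes 7-8) are read off the dumped header:

```python
# Reproduce the arithmetic from the PDU dump quoted above.
xferlen = 110047232            # iSCSI expected data transfer length, in bytes
blocks = xferlen // 512        # sectors the guest actually asked for
wrapped = blocks % 0x10000     # what survives WRITE10's 16-bit field

# The WRITE10 CDB as it appears in the dump: opcode 0x2a, a 32-bit
# LBA in bytes 2-5, then a 16-bit transfer length in bytes 7-8.
cdb = bytes.fromhex("2a0001099e5000479800")
cdb_blocks = int.from_bytes(cdb[7:9], "big")

print(blocks)        # 214936
print(wrapped)       # 18328
print(cdb_blocks)    # 18328 -- the CDB carries the truncated count
```

So the CDB transfer length (0x4798 = 18328) disagrees with the PDU's xferlen, which is exactly the mismatch the target rejects as a protocol error.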