qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Max Reitz <mreitz@redhat.com>
To: Eric Blake <eblake@redhat.com>, Qemu-block <qemu-block@nongnu.org>
Cc: Kevin Wolf <kwolf@redhat.com>,
	"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] Unaligned images with O_DIRECT
Date: Tue, 14 May 2019 19:28:16 +0200	[thread overview]
Message-ID: <20a7f01f-894c-e121-afc9-03415f55aa82@redhat.com> (raw)
In-Reply-To: <ce551fef-3987-a5fd-7280-e406226c6a20@redhat.com>

[-- Attachment #1: Type: text/plain, Size: 3831 bytes --]

On 14.05.19 18:15, Max Reitz wrote:
> On 14.05.19 17:45, Eric Blake wrote:
>> On 5/14/19 10:06 AM, Max Reitz wrote:
>>> Hi,
>>>
>>> Unaligned images don’t work so well with O_DIRECT:
>>>
>>> $ echo > foo
>>> $ qemu-img map --image-opts driver=file,filename=foo,cache.direct=on
>>> Offset          Length          Mapped to       File
>>> qemu-img: block/io.c:2093: bdrv_co_block_status: Assertion `*pnum &&
>>> QEMU_IS_ALIGNED(*pnum, align) && align > offset - aligned_offset' failed.
>>> [1]    10954 abort (core dumped)  qemu-img map --image-opts
>>> driver=file,filename=foo,cache.direct=on
>>>
>>> (compare https://bugzilla.redhat.com/show_bug.cgi?id=1588356)
>>>
>>> This is because the request_alignment is 512 (in my case), but the EOF
>>> is not aligned accordingly, so raw_co_block_status() returns an aligned
>>> *pnum.
>>
>> Uggh. Yet another reason why I want qemu to support byte-accurate
>> sizing, instead of rounding up. The rounding keeps raising its head in
>> more and more places. I have pending patches that are trying to improve
>> block status to round driver answers up to match request_alignment (when
>> the protocol layer has finer granularity than the format layer); but
>> this sounds like it is a bug in the file driver itself for returning an
>> answer that is not properly rounded according to its own
>> request_alignment boundary, and not one where my pending patches would help.
> 
> Yes, I think so, too.
> 
>>> I suppose having an unaligned tail is not so bad and maybe we can just
>>> adjust the assertion accordingly.  On the other hand, this has been
>>> broken for a while.  Does it even make sense to use O_DIRECT with
>>> unaligned images?  Shouldn’t we just reject them outright?
>>
>> The tail of an unaligned file is generally inaccessible to O_DIRECT,
> 
> Especially with this.
> 
>> where it is easier to use ftruncate() up to an aligned boundary if you
>> really must play with that region of the file, and then ftruncate() back
>> to the intended size after I/O. But that sounds hairy.  We could also
>> round down and silently ignore the tail of the file, but that is at odds
>> with our practice of rounding size up.  So for the short term, I'd be
>> happy with a patch that just rejects any attempt to use cache.direct=on
>> (O_DIRECT) with a file that is not already a multiple of the alignment
>> required thereby. (For reference, that's what qemu as NBD client
>> recently did when talking to a server that advertises a size
>> inconsistent with forced minimum block access: commit 3add3ab7)
> 
> OK, I’ll send a patch.  Thanks for you explanation!
Well, or maybe not.

$ ./qemu-img create -f qcow2 foo.qcow2 64M
$ ./qemu-img map --image-opts \
    driver=qcow2,file.filename=foo.qcow2,cache.direct=on
qemu-img: Could not open
'driver=qcow2,file.filename=foo.qcow2,cache.direct=on': File length
(196616 bytes) is not a multiple of the O_DIRECT alignment (512 bytes)
Try cache.direct=off, or increasing the file size to match the alignment

That may be considered a bug in qcow2.  Maybe it should always fill all
clusters.  But even if we did so and fixed it now, we can’t disallow
qemu from opening such images.

Also, well, the tail is accessible, we just need to access it with the
proper alignment (and then we get a short read).  This seems to be
handled just fine.

So I think file-posix should just return a rounded result.  Well, or
bdrv_co_Block_status() could ignore it for the EOF, because it throws
away everything past the EOF anyway with:

    if (*pnum > bytes) {
        *pnum = bytes;
    }

On one hand, I agree that file-posix should return an aligned result.
On the other, it doesn’t make a difference, so I don’t think we need to
enforce it (at EOF).

Max


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

  reply	other threads:[~2019-05-14 17:29 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-05-14 15:06 [Qemu-devel] Unaligned images with O_DIRECT Max Reitz
2019-05-14 15:45 ` Eric Blake
2019-05-14 16:15   ` Max Reitz
2019-05-14 17:28     ` Max Reitz [this message]
2019-05-14 21:36       ` Eric Blake

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20a7f01f-894c-e121-afc9-03415f55aa82@redhat.com \
    --to=mreitz@redhat.com \
    --cc=eblake@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=qemu-block@nongnu.org \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).