All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josh Durgin <josh.durgin@inktank.com>
To: Andrey Korolyov <andrey@xdel.ru>
Cc: ceph-devel <ceph-devel@vger.kernel.org>
Subject: Re: Different geoms for an rbd block device
Date: Tue, 30 Oct 2012 14:58:34 -0700	[thread overview]
Message-ID: <50904D8A.80405@inktank.com> (raw)
In-Reply-To: <CABYiri8KcpMt=+1WHWvxkDuMF1=f-=oB7oJDNDhVgaAZF_PF7Q@mail.gmail.com>

On 10/30/2012 02:41 PM, Andrey Korolyov wrote:
> On Wed, Oct 31, 2012 at 1:07 AM, Josh Durgin <josh.durgin@inktank.com> wrote:
>> On 10/28/2012 03:02 AM, Andrey Korolyov wrote:
>>>
>>> Hi,
>>>
>>> Should following behavior considered to be normal?
>>>
>>> $ rbd map test-rack0/debiantest --user qemukvm --secret qemukvm.key
>>> $ fdisk /dev/rbd1
>>>
>>> Command (m for help): p
>>>
>>> Disk /dev/rbd1: 671 MB, 671088640 bytes
>>> 255 heads, 63 sectors/track, 81 cylinders, total 1310720 sectors
>>> Units = sectors of 1 * 512 = 512 bytes
>>> Sector size (logical/physical): 512 bytes / 512 bytes
>>> I/O size (minimum/optimal): 4194304 bytes / 4194304 bytes
>>> Disk identifier: 0x00056f14
>>>
>>>        Device Boot      Start         End      Blocks   Id  System
>>> /dev/rbd1p1            2048       63487       30720   82  Linux swap /
>>> Solaris
>>> Partition 1 does not start on physical sector boundary.
>>> /dev/rbd1p2           63488     1292287      614400   83  Linux
>>> Partition 2 does not start on physical sector boundary.
>>>
>>> Meanwhile, in the guest vm over same image:
>>>
>>> fdisk /dev/vda
>>>
>>> Command (m for help): p
>>>
>>> Disk /dev/vda: 671 MB, 671088640 bytes
>>> 16 heads, 63 sectors/track, 1300 cylinders, total 1310720 sectors
>>
>>
>> I'm guessing the reported number of cylinders is the issue?
>> You can control that with a qemu option. I think
>>
>> -drive ...cyls=81
>>
>> will do it. You can also set the min/opt i/o sizes via
>> qemu device properties min_io_size and opt_io_size in
>> the same way you can adjust discard granularity:
>>
>> http://ceph.com/docs/master/rbd/qemu-rbd/#enabling-discard-trim
>>
>> Unfortunately min_io_size is a uint16 in qemu, so it won't
>> be able to store 4194304.
>>
>>
>>> Units = sectors of 1 * 512 = 512 bytes
>>> Sector size (logical/physical): 512 bytes / 512 bytes
>>> I/O size (minimum/optimal): 512 bytes / 512 bytes
>>> Disk identifier: 0x00056f14
>>>
>>>      Device Boot      Start         End      Blocks   Id  System
>>> /dev/vda1            2048       63487       30720   82  Linux swap /
>>> Solaris
>>> /dev/vda2           63488     1292287      614400   83  Linux
>>>
>>> The real pain starts when I try to repartition disk from after 'rbd
>>> map' using its geometry - it simply broke partition layout, for
>>> example, first block offset moves from 2048b to 8192. Of course I can
>>> specify geometry by hand, but before that I may need to start vm at
>>> least once or do something else which will print me out actual layout.
>>>
>>> Thanks!
>>
>>
>> Setting the geometry at qemu boot time should work, and is a bit easier.
>> qemu actually has code to try to guess disk geometry from a partition
>> table, but perhaps it doesn't support the format you're using.
>>
>> Josh
>
> So preferable geometry is one provided by kernel client, right? Is
> there any advantages of using large blocks for I/O with discard(ofc,
> not right now, I`ll wait for virtio bus support :) )?  At first sight,
> TCP transfers should not differ by resulting speed on typical
> workloads, but only on exotic ones - like delayed commit on the guest
> FS + intensive writes.

Generally larger I/Os are better, but the kernel in the guest will
probably restrict them to less than the full 4MB. I'm not sure how
large discard operations will get, but if they span an entire object
the object will be deleted instead of needing to zero out a chunk of it.

      reply	other threads:[~2012-10-30 21:58 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-10-28 10:02 Different geoms for an rbd block device Andrey Korolyov
2012-10-30 21:07 ` Josh Durgin
2012-10-30 21:41   ` Andrey Korolyov
2012-10-30 21:58     ` Josh Durgin [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50904D8A.80405@inktank.com \
    --to=josh.durgin@inktank.com \
    --cc=andrey@xdel.ru \
    --cc=ceph-devel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.