From: PANKAJ RAWAT
Date: Thu, 23 Feb 2012 15:32:19 +0530
Subject: Re: [Qemu-devel] Cluster_size parameter issue on qcow2 image format
To: Stefan Hajnoczi
Cc: qemu-devel@nongnu.org

Thanks for the reply.

I am not using a backing file; my concern is the growing image file. The
performance with a 64K cluster size is better than with 1M, 2M, or 32K.

Is the degradation in performance only due to the allocation of large
clusters while the qcow2 image expands?

The trend is the same for both sequential writes and random writes of
1 GB of data. For random writes I can understand it, given the sparseness
of the data, but for sequential writes I don't, since the writes are
performed sequentially. Is there a reason behind this, or am I not
getting it right?

On Thu, Feb 23, 2012 at 2:02 PM, Stefan Hajnoczi wrote:
> On Thu, Feb 23, 2012 at 11:01:46AM +0530, PANKAJ RAWAT wrote:
> > In theory, regarding cluster size, it is written that as the cluster
> > size increases, performance should increase.
> >
> > But something surprising happened: performance degrades as the
> > cluster size is increased from 64K to 1M (during expansion of the
> > qcow2 image).
>
> It's not true that performance should increase by raising the cluster
> size, otherwise the default would be infinity.  It's an algorithms/data
> structure trade-off.
>
> What matters most is the relative latency between a small guest I/O
> request (e.g. 4 KB) and the cluster size (e.g. 64 KB).  If the cluster
> size latency is orders of magnitude larger than a small guest I/O
> request, then be prepared to see the extreme effects described below:
>
> * Bigger clusters decrease the frequency of metadata operations and
>   increase metadata cache hit rates.  Bigger clusters mean less
>   metadata, so qcow2 performs fewer metadata operations overall.
>
>   Performance boost.
>
> * Bigger clusters increase the cost of allocating a new cluster.  For
>   example, an 8 KB write to a new cluster will incur a 1 MB write to
>   the image file (the untouched regions are filled with zeros).  This
>   can be optimized in some cases but not everywhere (e.g. reallocating
>   a data cluster versus extending the image file size and relying on
>   the file system to provide zeroed space).  This is especially
>   expensive when a backing file is in use, because up to 1 MB of the
>   backing file needs to be read to populate the newly allocated
>   cluster!
>
>   Performance loss.
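To make that allocation cost concrete, here is a rough back-of-the-envelope
model in Python (purely illustrative; the function and numbers are
assumptions for this sketch, not QEMU code):

    # Simplified model of an allocating write in qcow2 (illustrative only).
    # A guest write into an unallocated cluster forces the whole cluster to
    # be populated: zero-filled, or copied up from the backing file.
    def allocating_write_cost(cluster_size, backing_file):
        written = cluster_size                      # full cluster written to image
        read = cluster_size if backing_file else 0  # copy-up read from backing file
        return written, read

    for cs in (64 * 1024, 1024 * 1024):             # 64 KB vs 1 MB clusters
        w, r = allocating_write_cost(cs, backing_file=True)
        print(f"8 KB guest write, {cs // 1024:4d} KB cluster: "
              f"write {w // 1024:4d} KB, read {r // 1024:4d} KB")

An 8 KB write is amplified 8x with 64 KB clusters but 128x with 1 MB
clusters, which is the performance loss described above.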
> * Bigger clusters can reduce fragmentation of data on the physical
>   disk.  The file system sees fewer, bigger allocating writes and is
>   therefore able to allocate more contiguous data - less fragmentation.
>
>   Performance boost.
>
> * Bigger clusters reduce the compactness of sparse files.  You use more
>   image file space on the host file system when the cluster size is
>   large.
>
>   Space efficiency loss.
>
> Here's a scenario where a 1 MB cluster size is great compared to a
> small cluster size:
>
> You have a fully allocated qcow2 image; you will never need to do any
> allocating writes.
>
> Here's a scenario where a 1 MB cluster size is terrible compared to a
> small cluster size:
>
> You have an empty qcow2 file and perform 4 KB writes to the first
> sector of each 1 MB chunk, and there is a backing file.
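To put numbers on that worst case, a quick worked example (illustrative
arithmetic only; the 1 GB image size is an assumption for the sketch):

    # Worst case: empty 1 GB qcow2 with a backing file, 1 MB clusters,
    # one 4 KB guest write to the first sector of each 1 MB chunk.
    GiB, MiB, KiB = 1024 ** 3, 1024 ** 2, 1024

    clusters_touched = GiB // MiB               # 1024 allocating writes
    guest_bytes = clusters_touched * 4 * KiB    # 4 MiB of useful guest data
    image_writes = clusters_touched * MiB       # 1 GiB written to the image
    backing_reads = clusters_touched * MiB      # up to 1 GiB read for copy-up

    print(f"guest data:          {guest_bytes // MiB} MiB")
    print(f"image file writes:   {image_writes // GiB} GiB")
    print(f"backing file reads:  {backing_reads // GiB} GiB (worst case)")
    print(f"write amplification: {image_writes // guest_bytes}x")

4 MiB of guest data costs roughly 1 GiB of image writes plus up to 1 GiB
of backing-file reads - 256x write amplification. The same pattern with
64 KB clusters would cost 1024 * 64 KB = 64 MiB of writes, i.e. 16x.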

> So it depends on the application.
>
> Stefan

--
Pankaj Rawat