From: "Benoît Canet" <benoit.canet@irqsave.net>
To: Stefan Hajnoczi <stefanha@gmail.com>
Cc: "Benoît Canet" <benoit.canet@irqsave.net>,
	"Kevin Wolf" <kwolf@redhat.com>,
	qemu-devel <qemu-devel@nongnu.org>,
	"Stefan Hajnoczi" <stefanha@redhat.com>,
	"Paolo Bonzini" <pbonzini@redhat.com>
Subject: Re: [Qemu-devel] QCOW2 deduplication design
Date: Thu, 10 Jan 2013 16:18:12 +0100
Message-ID: <20130110151812.GA3457@irqsave.net>
In-Reply-To: <CAJSP0QXvT=3L8NY+VebECwZeS4CSmFG3+F8h2scg2CrWsUdviQ@mail.gmail.com>

> Now I understand.  This case covers overwriting existing data with new
> contents.  That is common :).
> 
> But are you seeing a cluster with refcount > 1 being overwritten
> often?  If so, it's worth looking into why that happens.  It may be a
> common pattern for certain file systems or applications to write
> initial data 'A' first and then change it later.  This actually
> suggests against online dedup, or at least for something like qcow2
> delayed write where we don't "commit" yet because the guest will
> probably still modify or append to the data.

I apologize for the bogus information I gave earlier.

The deduplication metrics accounting code was confusing the delete cluster
operation with the more common operation of removing a hash from the tree.
With the metrics code fixed, common file manipulations on the guest generate
only a few delete cluster operations.
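
To illustrate the accounting fix, here is a minimal Python sketch. It is not
the actual QEMU code: the class, the counter names, and the event strings are
all hypothetical, chosen only to show the two operations being counted
separately instead of under one shared counter.

```python
from dataclasses import dataclass


@dataclass
class DedupMetrics:
    """Hypothetical counters; the names are not taken from the QEMU patches."""
    hash_removals: int = 0      # a hash node was dropped from the lookup tree
    cluster_deletions: int = 0  # a cluster was actually deleted

    def record_hash_removal(self) -> None:
        self.hash_removals += 1

    def record_cluster_deletion(self) -> None:
        self.cluster_deletions += 1


def replay(events, metrics):
    """Count each event under its own counter rather than a shared one."""
    for event in events:
        if event == "hash_removal":
            metrics.record_hash_removal()
        elif event == "cluster_deletion":
            metrics.record_cluster_deletion()
        else:
            raise ValueError(f"unknown event: {event!r}")
    return metrics


# Made-up event trace, for illustration only (not measured data).
metrics = replay(
    ["hash_removal", "hash_removal", "cluster_deletion", "hash_removal"],
    DedupMetrics(),
)
print(metrics)
```

With the two counters kept apart, a workload dominated by hash removals no
longer looks like a workload that deletes many clusters.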

The cases where a lot of clusters are deleted are overwriting the image
with zeroes and reformatting a partition with ext3.

Regards

Benoît
