linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Ted Ts'o <tytso@mit.edu>
To: Eric Sandeen <sandeen@redhat.com>
Cc: Carlos Maiolino <cmaiolino@redhat.com>,
	Andreas Dilger <adilger@dilger.ca>,
	linux-ext4@vger.kernel.org
Subject: Re: [PATCH] Add better example about how to compress e2image raw image
Date: Mon, 26 Sep 2011 15:23:58 -0400	[thread overview]
Message-ID: <20110926192358.GC3282@thunk.org> (raw)
In-Reply-To: <4E80A73F.2070205@redhat.com>

On Mon, Sep 26, 2011 at 11:24:31AM -0500, Eric Sandeen wrote:
> > 
> >     bunzip2 < hda1.e2i.bz2 | make-sparse hda1.e2i
> > 
> > ... and this creates a sparse file in hda1.e2i.
> 
> or | cp --sparse=always /dev/stdin sparse.img works too.
> 
> But have you ever tried this with a multi-terabyte image?
> 
> It takes -forever- to process all those 0s, with cpus pegged.

Yeah, I didn't realize until I read another message on this thread
that bzip2's CPU problems were causing problems.  Is gzip sufficiently
better, I wonder, or is it still problematic?

> Ted, your concern about space - it doesn't take the full fs size worth
> of space, right, just the metadata space?  So in general it should not
> be THAT much ...

Yes, it's just the metadata space that I was worried about.  So it's
not *that* much, but it still adds up on large systems.  But then
again, on large systems we precisely have the problem of bzip2 taking
forever.

If we decide that we're OK with not compressing qcow2, we could use
qcow2.  But note that the qcow2 format is still very compressible ---
it looks like it could do a better job removing zero blocks.  (I had a
256meg qcow2 e2image file compress down to 9 megs.)  Unfortunately we
can't do stream compression with qcow2.

Long run I think we should make the qcow2 support better (by dropping
all-zero blocks, and adding support for qcow2 to
debugfs/dumpe2fs/e2fsck, and perhaps adding support for native
compression).  Anyone looking for a project?  :-)

						- Ted

  reply	other threads:[~2011-09-26 19:24 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-09-23 18:47 [PATCH] Add better example about how to compress e2image raw image Carlos Maiolino
2011-09-23 20:24 ` Andreas Dilger
2011-09-23 20:51   ` Carlos Maiolino
2011-09-24 16:31     ` Ted Ts'o
2011-09-26 12:13       ` Carlos Maiolino
2011-09-26 12:22       ` Bernd Schubert
2011-09-26 16:25         ` Eric Sandeen
2011-09-26 16:24       ` Eric Sandeen
2011-09-26 19:23         ` Ted Ts'o [this message]
2011-09-26 19:36           ` Carlos Maiolino
2011-09-26 20:01           ` Amir Goldstein

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20110926192358.GC3282@thunk.org \
    --to=tytso@mit.edu \
    --cc=adilger@dilger.ca \
    --cc=cmaiolino@redhat.com \
    --cc=linux-ext4@vger.kernel.org \
    --cc=sandeen@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).