All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ric Wheeler <ric@emc.com>
To: Zach Brown <zab@zabbo.net>
Cc: David Chinner <dgc@sgi.com>, linux-fsdevel@vger.kernel.org
Subject: Re: bdar: efficiently backup allocated bytes in file systems
Date: Wed, 19 Mar 2008 20:32:15 -0400	[thread overview]
Message-ID: <47E1B08F.5090706@emc.com> (raw)
In-Reply-To: <47E03CE3.3080903@zabbo.net>

Zach Brown wrote:
>> Neat, Zach. You should look at xfs_copy - it does pretty much this for XFS
>> filesystems....
> 
> haha, yet another round of the -fsdevel XFS drinking game :)
> 
> Does xfs_copy tend to assert the XFS file format in the backup files it
> generates?  One of the things I was hoping for with bdar was to have the
> resulting copy image be agnostic.  It's just a sparse map with some
> checksumming, really.
> 
> That limits what we can do, of course.  The current trivial format only
> has one address space which doesn't fit well with the plans file systems
> have of working with multiple addressable block ranges.
> 
> But I think I'm fine with that.  The value:complexity ratio of this
> trivial version is refreshingly large.
> 
> - z

About a year back, I was trying various ways to read every file on a 
fairly massive (reiserfs v3) file system (order of tens of millions of 
files).

I don't recall how close I came to native dd speed, but I could get a 
substantial win by grabbing a substantial chunk of files (say 5000), 
sort them by either inode number or creation time, and then read them in 
that order.

We had some good experience with this, but our use case has no sparse 
files and tends to have lots of little or medium sized files.  This only 
did the read phase, but the basic assumption is that the file system 
will tend to allocate disk sectors in sequential order over time and 
this gave a fairly close approximation of that ;-)

ric


  parent reply	other threads:[~2008-03-20  0:32 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-03-18  1:13 bdar: efficiently backup allocated bytes in file systems Zach Brown
2008-03-18  7:47 ` Sitsofe Wheeler
2008-03-20 16:25   ` Zach Brown
2008-03-18 21:35 ` David Chinner
2008-03-18 22:06   ` Zach Brown
2008-03-18 23:52     ` David Chinner
2008-03-20  0:26       ` Szabolcs Szakacsits
2008-03-20  1:13       ` Andreas Dilger
2008-03-20  0:32     ` Ric Wheeler [this message]
2008-03-19  2:58 ` Andreas Dilger
2008-03-19  3:10   ` Zach Brown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=47E1B08F.5090706@emc.com \
    --to=ric@emc.com \
    --cc=dgc@sgi.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=zab@zabbo.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.