From: Sunil Mushran <sunil.mushran@oracle.com>
To: Micah Dombrowski <mpdwibble@gmail.com>
Cc: linux-fsdevel@vger.kernel.org
Subject: Re: 'Subset' Hard Links
Date: Thu, 17 Sep 2009 18:31:10 -0700 [thread overview]
Message-ID: <4AB2E2DE.2030800@oracle.com> (raw)
In-Reply-To: <AB75EE84-3054-4D17-8F62-61FF96D0E162@gmail.com>
Micah Dombrowski wrote:
> I couldn't think of anywhere else to ask such a question, and google
> is useless as I have no unique keywords. I am wondering if it is
> possible with some/any filesystems to have multiple hard links to a
> file, some of which only point to a subset of the file's data.
>
> Eg:
> firstname -> all data bytes 1 to 10
> secondname -> bytes 3 to 10
> thirdname -> bytes 5 to 7
>
> This would clearly require some interesting locking of the file WRT
> writes, but it seems like it should be possible, and even easy for
> read-only access. I deal with moderately large data files (50+GB),
> and such a thing would be incredibly useful to me for pulling out
> interesting bits of my data without having to make copies of the data
> itself.
>
> A related method I was wishing existed would allow concatenation of
> files simply by deleting all but one hard link, and changing the
> remaining one to point to all of the original files' data as
> fragments. This would be great, as 'cat'ing together six 10GB files
> is pretty slow, and it seems silly to be copying all of that data
> around when I only need one actual instance of the full data on disk,
> and that instance already exists, albeit in a fragmented manner.
>
> Do any tools for doing this sort of thing exist?
>
btrfs should able to handle most of this.
http://git.kernel.org/?p=linux/kernel/git/mason/btrfs-unstable.git;a=commitdiff;h=c5c9cd4d1b827fe545ed2a945e91e3a6909f3886
However, note that file systems operate in terms of blocks. So the start
offset
would need to be block aligned.
next prev parent reply other threads:[~2009-09-18 1:31 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-09-18 0:02 'Subset' Hard Links Micah Dombrowski
2009-09-18 1:31 ` Sunil Mushran [this message]
2009-09-18 4:26 ` Sage Weil
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4AB2E2DE.2030800@oracle.com \
--to=sunil.mushran@oracle.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=mpdwibble@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.