All of lore.kernel.org
 help / color / mirror / Atom feed
From: Sunil Mushran <sunil.mushran@oracle.com>
To: Micah Dombrowski <mpdwibble@gmail.com>
Cc: linux-fsdevel@vger.kernel.org
Subject: Re: 'Subset' Hard Links
Date: Thu, 17 Sep 2009 18:31:10 -0700	[thread overview]
Message-ID: <4AB2E2DE.2030800@oracle.com> (raw)
In-Reply-To: <AB75EE84-3054-4D17-8F62-61FF96D0E162@gmail.com>

Micah Dombrowski wrote:
> I couldn't think of anywhere else to ask such a question, and google 
> is useless as I have no unique keywords.  I am wondering if it is 
> possible with some/any filesystems to have multiple hard links to a 
> file, some of which only point to a subset of the file's data.
>
> Eg:
> firstname -> all data bytes 1 to 10
> secondname -> bytes 3 to 10
> thirdname -> bytes 5 to 7
>
> This would clearly require some interesting locking of the file WRT 
> writes, but it seems like it should be possible, and even easy for 
> read-only access.  I deal with moderately large data files (50+GB), 
> and such a thing would be incredibly useful to me for pulling out 
> interesting bits of my data without having to make copies of the data 
> itself.
>
> A related method I was wishing existed would allow concatenation of 
> files simply by deleting all but one hard link, and changing the 
> remaining one to point to all of the original files' data as 
> fragments.  This would be great, as 'cat'ing together six 10GB files 
> is pretty slow, and it seems silly to be copying all of that data 
> around when I only need one actual instance of the full data on disk, 
> and that instance already exists, albeit in a fragmented manner.
>
> Do any tools for doing this sort of thing exist?
>

btrfs should able to handle most of this.

http://git.kernel.org/?p=linux/kernel/git/mason/btrfs-unstable.git;a=commitdiff;h=c5c9cd4d1b827fe545ed2a945e91e3a6909f3886

However, note that file systems operate in terms of blocks. So the start 
offset
would need to be block aligned.

  reply	other threads:[~2009-09-18  1:31 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-09-18  0:02 'Subset' Hard Links Micah Dombrowski
2009-09-18  1:31 ` Sunil Mushran [this message]
2009-09-18  4:26   ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4AB2E2DE.2030800@oracle.com \
    --to=sunil.mushran@oracle.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=mpdwibble@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.