All of lore.kernel.org
 help / color / mirror / Atom feed
From: Peter Braam <Peter.Braam@Sun.COM>
To: lustre-devel@lists.lustre.org
Subject: [Lustre-devel] How store HSM metadata in MDT ?
Date: Tue, 08 Jul 2008 11:41:10 -0600	[thread overview]
Message-ID: <C49902D6.4E45%peter.braam@sun.com> (raw)
In-Reply-To: <48732AB1.10802@cea.fr>

I think we have come to the following conclusions:

1. The HSM or a database associated with it implements a table to map FIDs
to stored HSM versions of a file, with other metadata it may need to
maintain its archives.

2. An HSM utility can query and learn about the versions stored for a fid
(or file name).  A "restore" function can copy any version out of the HSM
and place it in the file system.  This is similar to restoring a file from a
backup archive.

3. The file system only has attributes to indicate the state of the primary
archived copy (probably the last fully archived copy of the file), and can
retrieve that file on demand (without user intervention).

4. The HSM database will allow files in snapshots to be encoded with (fsid,
fid) or something similar.

5. for now we ignore block level dedup in the HSM

Can the owner of the HLD make updates?  Please also read on - I have some
more questions below.


On 7/8/08 2:52 AM, "Aurelien Degremont" <aurelien.degremont@cea.fr> wrote:

> Lee Ward a ?crit :
>> If HSM, then, do you intend that the user be allowed to specify *which*
>> version of the file content is desired?
> 
> User could say:
>    "overwrite the current version of this file with this older copies
> which was made few time ago."
> 
> -The current file content is lost.
> -That is the only way to access the older copies content.

Yes, that is reasonable.
 
> There is no namespace tricks, no huge API changes, always one version of
> a file in Lustre, just few functions added to 'lfs' command.

NO - this will not be an lfs command.  This is an HSM command.

> The purpose is just, using the HSM infrastructure, simply add few
> feature to help people asking us for backup features, but this will not
> be a true backup system. This kind of utility requires much more
> development.

I think it would be good to review one more time the following aspects of
the design:

1. how is a bare metal restore arranged (ie. How is metadata moved into the
HSM)?  Can this restore put files in a file system different than Lustre?

2. how are small files grouped then "tar'd up" and how are we setting the
attributes of the inodes of the files that have been placed in the HSM after
this?  How does the index entry for the fids in the HSM database function?

3. how are multiple coordinators and agents utilized to distribute load so
that the HSM can keep up with massive small file creation?

For all of these we have seen sketchy answers in the past, let's dig in and
make sure that we have this right.

Regards,

Peter

  reply	other threads:[~2008-07-08 17:41 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-07-03 11:43 [Lustre-devel] How store HSM metadata in MDT ? Aurelien Degremont
2008-07-03 21:10 ` Peter Braam
2008-07-04 14:37   ` Aurelien Degremont
2008-07-05 16:50     ` Andreas Dilger
2008-07-06  3:20       ` Peter Braam
2008-07-06  3:24     ` Peter Braam
2008-07-06 19:24       ` Lee Ward
2008-07-06 22:53         ` Peter Braam
2008-07-08 12:06           ` Rick Matthews
2008-07-08  8:52         ` Aurelien Degremont
2008-07-08 17:41           ` Peter Braam [this message]
2008-07-09 13:25             ` Aurelien Degremont
2008-07-09 13:49               ` Peter Braam
2008-07-11 14:32                 ` Jacques-Charles Lafoucriere
2008-07-11 22:03                   ` Peter Braam
2008-07-11 14:37                 ` Jacques-Charles Lafoucriere
2008-07-11 22:12                   ` Peter Braam
2008-07-11 14:31       ` Jacques-Charles Lafoucriere
2008-07-11 21:57         ` Peter Braam
2008-07-16 10:26           ` Jacques-Charles Lafoucriere
2008-07-16 19:00             ` Peter Braam

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=C49902D6.4E45%peter.braam@sun.com \
    --to=peter.braam@sun.com \
    --cc=lustre-devel@lists.lustre.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.