All of lore.kernel.org
 help / color / mirror / Atom feed
From: Peter Braam <Peter.Braam@Sun.COM>
To: lustre-devel@lists.lustre.org
Subject: [Lustre-devel] How store HSM metadata in MDT ?
Date: Sat, 05 Jul 2008 21:24:55 -0600	[thread overview]
Message-ID: <C4959727.4DA5%peter.braam@sun.com> (raw)
In-Reply-To: <486E35AB.7010405@cea.fr>




On 7/4/08 8:37 AM, "Aurelien Degremont" <aurelien.degremont@cea.fr> wrote:

> Peter Braam a ?crit :
>> If there is more than one copy in the archive, it would be preferable if the
>> archive could maintain a mapping from the Lustre fid of the file to the
>> archived copies.  Associated with the FID of the data would then be a list
>> of archived copies, timestamps etc.
> 
> Do you mean that the HSM will be aware of various versions of one same
> file, identified in Lustre by a FID ?
> Or this will be masked by the archiving tool , doing some tricks to
> simulate it ?
> 
>> Can that be done in HPSS?
> 
> HPSS alone cannot do versioning on its files presently.

But your archiving utility that copies from Lustre to HPSS can maintain
database of these objects - no need to store anything in Lustre.


> 
> 
>> If not, policy related operations like purging older files etc will become
>> very complex and not scalable.  For example, a search to find older files in
>> the archive would require an e2scan operation to find the inodes and then
>> the objects in the archive.  If the file system was not available anymore
>> (for whatever reason), it is not even clear that such a purge could still
>> happen.
>> 
>> With an archive based database this can be an indexed search in the archive,
>> which is faster and more appropriate.
> 
> By purgin do mean purging in Lustre or in the HSM?

The HSM.

> There's no issue with purging in Lustre because this do not imply the HSM.
> And removal of oldest copies in the HSM could be done asynchronously,
> slowly.

There is a rule in Lustre - no scanning, ever.  This rule will not be broken
by HSM.  

So, you have to move your management of ID's of the archvied copies outside
of Lustre, in some database.  This will actually save you time - doing this
in the MDS will be no fun.

The MDS should only get attributes to indicate if and what version of a file
is in the archive and a cursor (maybe other information) in relation with
ongoing restores.

Peter


> 
> I'm not sure I see what you mean here
> 

  parent reply	other threads:[~2008-07-06  3:24 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-07-03 11:43 [Lustre-devel] How store HSM metadata in MDT ? Aurelien Degremont
2008-07-03 21:10 ` Peter Braam
2008-07-04 14:37   ` Aurelien Degremont
2008-07-05 16:50     ` Andreas Dilger
2008-07-06  3:20       ` Peter Braam
2008-07-06  3:24     ` Peter Braam [this message]
2008-07-06 19:24       ` Lee Ward
2008-07-06 22:53         ` Peter Braam
2008-07-08 12:06           ` Rick Matthews
2008-07-08  8:52         ` Aurelien Degremont
2008-07-08 17:41           ` Peter Braam
2008-07-09 13:25             ` Aurelien Degremont
2008-07-09 13:49               ` Peter Braam
2008-07-11 14:32                 ` Jacques-Charles Lafoucriere
2008-07-11 22:03                   ` Peter Braam
2008-07-11 14:37                 ` Jacques-Charles Lafoucriere
2008-07-11 22:12                   ` Peter Braam
2008-07-11 14:31       ` Jacques-Charles Lafoucriere
2008-07-11 21:57         ` Peter Braam
2008-07-16 10:26           ` Jacques-Charles Lafoucriere
2008-07-16 19:00             ` Peter Braam

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=C4959727.4DA5%peter.braam@sun.com \
    --to=peter.braam@sun.com \
    --cc=lustre-devel@lists.lustre.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.