All of lore.kernel.org
 help / color / mirror / Atom feed
From: Malcolm Haak <malcolm@sgi.com>
To: Sage Weil <sage@inktank.com>, ceph-devel@vger.kernel.org
Subject: Re: HSM
Date: Mon, 11 Nov 2013 09:17:07 +1000	[thread overview]
Message-ID: <528013F3.1010300@sgi.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1311090027210.8192@cobra.newdream.net>

Hi All,

If you are talking specifically about Lustre HSM, its really an 
interface to add HSM functionality by leveraging existing HSM's (DMF for 
example)

So with Lustre HSM you have a policy engine that triggers the migrations 
out of the filesystem. Rules are based around size, last accessed and 
target state (online, dual and offline).

There is a 'coordinator' process involved here as well, it (from what I 
understand) runs on MDS nodes. It handles the interaction with the 
copytool. The copytool is provided by the HSM solution you are acutally 
using.

For recalls when caps are aquired on the MDS for an exported file the 
resposible MSD contacts the coordinator, which in-turn uses the copytool 
to pull the required file out of the HSM.

In the Lustre HSM, the objects that make up a file are all recalled and 
the file, not the objects, are handed to the HSM.

For Lustre all it needs to keep track of is the current state of the 
file and the correct ID to reqest from the HSM. This is done inside the 
normal metadata storage.

So there aren't really any hooks in that exports are triggered by the 
policy engine after a scan of the metadata, and the recalls are 
triggered when caps are requested on offline files. Then its just 
standard POSIX blocking until the file is available.

Most of the state and ID stuff could be stored as XATTRS in cephfs. I'm 
not as sure how to do it for other things but as long as you could store 
some kind of extended metadata about whole objects, it could use the 
same interfaces as well.

Hope that was acutually helpful and not just an obvious rehash...

Regards

Malcolm Haak

On 09/11/13 18:33, Sage Weil wrote:
> The latest Lustre just added HSM support:
>
> 	http://archive.hpcwire.com/hpcwire/2013-11-06/lustre_scores_business_class_upgrade_with_hsm.html
>
> Here is a slide deck with some high-level detail:
> 	
> 	https://jira.hpdd.intel.com/secure/attachment/13185/Lustre_HSM_Design.pdf
>
> Is anyone familiar with the interfaces and requirements of the file system
> itself?  I don't know much about how these systems are implemented, but I
> would guess there are relatively lightweight requirements on the fs (ceph
> mds in our case) to keep track of file state (online or archived
> elsewhere).  And some hooks to trigger migrations?
>
> If anyone is interested in this area, I would be happy to help figure out
> how to integrate things cleanly!
>
> sage
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

  parent reply	other threads:[~2013-11-10 23:17 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-11-09  8:33 HSM Sage Weil
2013-11-09 14:20 ` HSM Tim Bell
2013-11-11  9:58   ` HSM Sebastien Ponce
2013-11-10 23:17 ` Malcolm Haak [this message]
2013-11-11 11:04   ` HSM John Spray
2013-11-12  0:13     ` HSM Gregory Farnum
2013-11-12  0:57       ` HSM Malcolm Haak
2013-11-11  9:50 ` HSM Sebastien Ponce
2013-11-12  9:47 ` HSM Andreas Joachim Peters
2013-11-18 19:22   ` HSM Dmitry Borodaenko
2013-11-20 12:09     ` HSM Malcolm Haak
  -- strict thread matches above, loose matches on Subject: below --
2013-11-11 16:05 HSM bernhard glomm

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=528013F3.1010300@sgi.com \
    --to=malcolm@sgi.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=sage@inktank.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.