From: Sebastien Ponce <sebastien.ponce@cern.ch>
To: Sage Weil <sage@inktank.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: HSM
Date: Mon, 11 Nov 2013 10:50:47 +0100 [thread overview]
Message-ID: <1384163447.6296.33.camel@sebmain.cern.ch> (raw)
In-Reply-To: <alpine.DEB.2.00.1311090027210.8192@cobra.newdream.net>
It may be even more lightweight than that depending on the Mass Storage
system intrinsic capabilities.
Here at CERN, we are testing the use of ceph in the disk cache in our
Mass Storage system and the only requirements we had was to store and
retrieve a file (put/get functionality). We've thus used the rados
library with a small extension implementing striping of big files the
way you've done it in cephFS. I will actually submit a blueprint for
this extension in case you want to integrate it.
This supposed that we have our own tools and protocols to migrate/recall
files and query the status of them.
In the general case, you're pretty right that you need to store the
state of the file (some use a stub file when the file is not on disk)
and hooks after the writing of a new file and in case of read failure
(cache miss).
One difficult point though, as Tim mentioned already, is to reach
optimal usage of your tape backend. This may end up in long queuing
times (hours) for recalls and one has to handle waiting clients during
that time.
Sebastien
On Sat, 2013-11-09 at 00:33 -0800, Sage Weil wrote:
> The latest Lustre just added HSM support:
>
> http://archive.hpcwire.com/hpcwire/2013-11-06/lustre_scores_business_class_upgrade_with_hsm.html
>
> Here is a slide deck with some high-level detail:
>
> https://jira.hpdd.intel.com/secure/attachment/13185/Lustre_HSM_Design.pdf
>
> Is anyone familiar with the interfaces and requirements of the file system
> itself? I don't know much about how these systems are implemented, but I
> would guess there are relatively lightweight requirements on the fs (ceph
> mds in our case) to keep track of file state (online or archived
> elsewhere). And some hooks to trigger migrations?
>
> If anyone is interested in this area, I would be happy to help figure out
> how to integrate things cleanly!
>
> sage
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2013-11-11 10:00 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-11-09 8:33 HSM Sage Weil
2013-11-09 14:20 ` HSM Tim Bell
2013-11-11 9:58 ` HSM Sebastien Ponce
2013-11-10 23:17 ` HSM Malcolm Haak
2013-11-11 11:04 ` HSM John Spray
2013-11-12 0:13 ` HSM Gregory Farnum
2013-11-12 0:57 ` HSM Malcolm Haak
2013-11-11 9:50 ` Sebastien Ponce [this message]
2013-11-12 9:47 ` HSM Andreas Joachim Peters
2013-11-18 19:22 ` HSM Dmitry Borodaenko
2013-11-20 12:09 ` HSM Malcolm Haak
-- strict thread matches above, loose matches on Subject: below --
2013-11-11 16:05 HSM bernhard glomm
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1384163447.6296.33.camel@sebmain.cern.ch \
--to=sebastien.ponce@cern.ch \
--cc=ceph-devel@vger.kernel.org \
--cc=sage@inktank.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.