* Re: How are you using Ceph?
@ 2012-09-17 23:57 Nick Couchman
  2012-09-18  6:35 ` John Axel Eriksson
  0 siblings, 1 reply; 29+ messages in thread
From: Nick Couchman @ 2012-09-17 23:57 UTC
  To: mark.nelson, john; +Cc: ross, ceph-devel

John,
I'd be really interested to hear how Btrfs holds up over time.  I tried it out a few kernel versions ago and regretted it - I lost some data.  Hopefully the stability is better than it was before, and inline compression is always great!

-Nick

>>> John Axel Eriksson  09/17/12 5:26 PM >>>
Our use of Ceph started pretty recently (this summer). We only use
rados together with the radosgw. We moved from another distributed
storage solution that had failed us more than once and we lost data.
Since the old system had an http interface (not S3 compatible though)
we looked around for another similar system. In the end we chose Ceph
since it had been in development for quite some time, had been
incorporated in the kernel (well, the client for the fs that is) and
recently got a company behind it. Ceph felt pretty solid, even though
it's still early days I guess.

We obviously liked the fact that it has an S3 compatible interface,
especially since we started backing up data to Amazon S3 some time ago
- having the same interface simplified our client code tremendously.
We don't actually need extreme throughput (yet anyway :-) but we do
need replication. We're quite happy with the performance so far since
it's better than our old system.
We store medical data for archival and conversion from and to
different formats. Since we (after previous failures in the old
storage system) store everything in Amazon S3 as well, we made a bet
on Kernel 3.5 and Btrfs with compression for some quite dramatic space
savings - the data we store often compresses really well. So far we
haven't regretted that choice, but we've only been running it in
production for about two months while slowly phasing out the old
storage system.

John

On Tue, Sep 18, 2012 at 12:53 AM, Mark Nelson  wrote:
> Hi Nick,
>
> All I have to say, is that is totally awesome and scary at the same time. :)
>
> Glad to hear that it recovers well when people shut their desktops off!
>
> Mark
>
>
> On 09/17/2012 05:47 PM, Nick Couchman wrote:
>>
>> My use of Ceph is probably pretty unique in some of the aspects of
>> where/how I'm using it.  I run an IT department for a medium-sized
>> engineering firm.  One of my goals is to try to make the best possible use
>> of the hardware we're deploying to users' desktops.  Often times users
>> cannot get by with a thin client and a VM somewhere, they actually need
>> decent hardware on the desktop.  However, when the hardware isn't being
>> used, it's nice to be able to have access to some of the free disk space,
>> I/O bandwidth, memory, and CPU cycles available on the hardware.  So, Ceph
>> is part of an overall strategy for making use of the hardware.  I'm guessing
>> most folks run it on racked servers in datacenters, but I'm distributing it
>> across desktops.
>>
>> I've started by rolling out Linux to the desktop bare metal rather than
>> Windows.  I run openSuSE 12.1, probably moving to 12.2 here in the
>> near-future (I have Ceph packages available and built for openSuSE 11.4,
>> 12.1, and 12.2 on my OBS project).  I run the Xen kernel on this hardware so
>> that I can run VMs on top of it for various purposes.  For folks who need
>> Windows, I use Windows-based VMs on Xen.  For the types who are comfortable
>> with switching between Linux and Windows, I use a Windows VM and then
>> rdesktop to connect from the Linux desktop/window manager.  For the types
>> who are only comfortable in Windows, I use VGA and PCI pass-through in Xen
>> to pass the video card and the USB controllers to the Windows guest, making
>> the Linux base install transparent to the end-user.
>>
>> To make use of free CPU cycles, in addition to VMs, I use the latest
>> freely-available version of the software formerly known as the Sun Grid
>> Engine to make these desktop systems part of the batching system that allows
>> engineers to run HPC jobs.  They mount various filesystems from our NFS
>> servers and jobs can execute on these systems on evenings and weekends.
>>
>> Ceph is a pretty recent addition to these configurations.  I wanted to
>> find an easy way to make use of the free disk space on these systems, but in
>> a useful way that aggregates it all together.  After looking at several
>> distributed filesystems, Ceph came up as the one with the feature sets that
>> made the most sense for me.  So, I've spent a bunch of time building
>> packages, testing out Ceph, and have finally rolled it out on these two
>> dozen Linux desktops, aggregating 100GB from each desktop's 250GB drive into
>> a single pool that adds up to roughly 2.2TB of raw storage.  I currently do
>> 3 replications for all of my pools in Ceph to try to protect against a
>> desktop machine going down, getting shut down, etc., which does happen from
>> time-to-time.  So far this has worked out pretty well, and Ceph seems to
>> recover pretty well from these failures, moving blocks to different systems
>> when necessary, then re-doing that when the systems come back online.
>>
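
[Editor's sketch, not part of the original message.] The capacity figures in the paragraph above can be checked with a little arithmetic. The snippet below is a minimal sketch that assumes "two dozen" means exactly 24 desktops, that "100GB" means decimal gigabytes, and that usable space is simply raw space divided by the replica count (real clusters lose more to filesystem overhead and full-ratio limits). The function name is made up for illustration.

```python
def pool_capacity(desktops, gb_per_desktop, replicas):
    """Return (raw_gb, usable_gb, raw_tib) for a replicated pool.

    Assumes decimal gigabytes in and ignores all overhead.
    """
    raw_gb = desktops * gb_per_desktop
    usable_gb = raw_gb / replicas      # each object is stored `replicas` times
    raw_tib = raw_gb * 1e9 / 2**40     # same raw space in binary terabytes
    return raw_gb, usable_gb, raw_tib


# Assumed inputs: 24 desktops, 100 GB each, 3 replicas.
raw_gb, usable_gb, raw_tib = pool_capacity(24, 100, 3)
print(f"raw: {raw_gb} GB ({raw_tib:.2f} TiB), usable: {usable_gb:.0f} GB")
```

Under these assumptions the raw total is 2400 GB, which is about 2.18 TiB. That may be where the "roughly 2.2TB" figure comes from, though the message does not say. With three replicas, roughly a third of the raw space is available for data.
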
>> My next steps for this setup, including Ceph, really get into more of a
>> private cloud infrastructure using desktop commodity hardware.  I'd like to
>> be able to install something like Openstack or the XAPI/XCP software on
>> these systems and centrally manage the aggregated storage along with memory
>> and CPU with a tool like that.  This would give me the ability to deploy
>> these inexpensive systems across the organization, but make sure they're
>> used to their best capacity, and it also allows for great flexibility when
>> users move from machine to machine, or VMs need to move from place to place.
>> I do keep a lot of my critical infrastructure in my datacenter on more
>> traditional compute systems - a SAN, XenServer, fileservers/NAS with
>> NFS/CIFS, etc. - but this is a good way for me to prove out the usefulness
>> and reliability of systems like Ceph and other cloud-computing concepts and
>> then take those and apply them to increasingly complex and critical needs in
>> my organization.
>>
>> For Ceph improvements that would help me out, the ability to support POSIX
>> and NFSv4 ACLs would be a fantastic addition.  We use these types of
>> permissions on our main filesystems to control access better than the
>> traditional UGO-style permissions, and I already miss it while using Ceph.
>> Also, I know the concept of deduplication has been discussed, and this, too,
>> would be great.  I was actually wondering about the feasibility of
>> implementing post-processing deduplication on Ceph, first, rather than
>> inline deduplication - obviously this increases disk space requirements
>> since there has to be enough to store the duplicated data, but still seems
>> to beat no deduplication at all.  Not a huge requirement at this point, but
>> playing with FSs that support deduplication makes me want it everywhere :-).
>>
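
[Editor's sketch, not part of the original message.] The trade-off described above, where post-processing deduplication needs room for the duplicates before they are removed, can be illustrated with a toy example. This is not how Ceph works or would implement it; it is a generic content-hash pass over blocks that have already been written in full, and every name in it is hypothetical.

```python
import hashlib


def post_process_dedup(blocks):
    """Deduplicate already-stored blocks by content hash.

    Returns (store, refs): `store` maps digest -> one retained copy,
    `refs` lists the digest for each original block, in order.
    """
    store = {}
    refs = []
    for block in blocks:
        digest = hashlib.sha256(block).hexdigest()
        store.setdefault(digest, block)   # keep only the first copy
        refs.append(digest)
    return store, refs


# Four 4 KiB blocks, three of them identical, all written before the pass runs.
blocks = [b"A" * 4096, b"B" * 4096, b"A" * 4096, b"A" * 4096]
store, refs = post_process_dedup(blocks)

space_before = sum(len(b) for b in blocks)         # peak space needed up front
space_after = sum(len(b) for b in store.values())  # space left after the pass
print(space_before, space_after)
```

The point of the sketch is that `space_before` must fit on disk first; only after the pass does usage drop to `space_after`. An inline scheme would hash each block on the write path and never store the duplicates at all.
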
>> -Nick
>>
>>>>> On 2012/09/17 at 16:14, Ross Turk  wrote:
>>>
>>> Hi, all!
>>>
>>> One of the most important parts of Inktank's mission is to spread the
>>> word about Ceph. We want everyone to know what it is and how to use
>>> it.
>>>
>>> In order to tell a better story to potential new users, I'm trying to
>>> get a sense for today's deployments. We've spent the last few months
>>> talking to folks around the world, but I'm sure there are a few great
>>> stories we haven't heard yet!
>>>
>>> If you've got a spare five minutes, I would love to hear what you're
>>> up to. What kind of projects are you working on, and in what stage?
>>> What is your workload? Are you using Ceph alongside other
>>> technologies? How has your experience been?
>>>
>>> This is also a good opportunity for me to introduce myself to those I
>>> haven't met yet! Feel free to copy the list if you think others would
>>> be interested (and you don't mind sharing).
>>>
>>> Cheers,
>>> Ross
>>>
>>> --
>>> Ross Turk
>>> Ceph Community Guy
>>>
>>> "Any sufficiently advanced technology is indistinguishable from magic."
>>> -- Arthur C. Clarke
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>>
>>
>>
>> --------
>> This e-mail may contain confidential and privileged material for the sole
>> use of the intended recipient.  If this email is not intended for you, or
>> you are not responsible for the delivery of this message to the intended
>> recipient, please note that this message may contain SEAKR Engineering
>> (SEAKR) Privileged/Proprietary Information.  In such a case, you are
>> strictly prohibited from downloading, photocopying, distributing or
>> otherwise using this message, its contents or attachments in any way.  If
>> you have received this message in error, please notify us immediately by
>> replying to this e-mail and delete the message from your mailbox.
>> Information contained in this message that does not relate to the business
>> of SEAKR is neither endorsed by nor attributable to SEAKR.




* Re: How are you using Ceph?
@ 2012-09-17 23:55 Nick Couchman
  0 siblings, 0 replies; 29+ messages in thread
From: Nick Couchman @ 2012-09-17 23:55 UTC
  To: mark.nelson; +Cc: ross, ceph-devel

We actually ask people to not shut off their desktops, so it doesn't happen very often :-).  Also, I run the MDS and MON systems inside my datacenter, so only the OSDs are out there on the desktops.

-Nick

>>> Mark Nelson  09/17/12 4:53 PM >>>
Hi Nick,

All I have to say, is that is totally awesome and scary at the same time. :)

Glad to hear that it recovers well when people shut their desktops off!

Mark

On 09/17/2012 05:47 PM, Nick Couchman wrote:
> My use of Ceph is probably pretty unique in some of the aspects of where/how I'm using it.  I run an IT department for a medium-sized engineering firm.  One of my goals is to try to make the best possible use of the hardware we're deploying to users' desktops.  Often times users cannot get by with a thin client and a VM somewhere, they actually need decent hardware on the desktop.  However, when the hardware isn't being used, it's nice to be able to have access to some of the free disk space, I/O bandwidth, memory, and CPU cycles available on the hardware.  So, Ceph is part of an overall strategy for making use of the hardware.  I'm guessing most folks run it on racked servers in datacenters, but I'm distributing it across desktops.
>
> I've started by rolling out Linux to the desktop bare metal rather than Windows.  I run openSuSE 12.1, probably moving to 12.2 here in the near-future (I have Ceph packages available and built for openSuSE 11.4, 12.1, and 12.2 on my OBS project).  I run the Xen kernel on this hardware so that I can run VMs on top of it for various purposes.  For folks who need Windows, I use Windows-based VMs on Xen.  For the types who are comfortable with switching between Linux and Windows, I use a Windows VM and then rdesktop to connect from the Linux desktop/window manager.  For the types who are only comfortable in Windows, I use VGA and PCI pass-through in Xen to pass the video card and the USB controllers to the Windows guest, making the Linux base install transparent to the end-user.
>
> To make use of free CPU cycles, in addition to VMs, I use the latest freely-available version of the software formerly known as the Sun Grid Engine to make these desktop systems part of the batching system that allows engineers to run HPC jobs.  They mount various filesystems from our NFS servers and jobs can execute on these systems on evenings and weekends.
>
> Ceph is a pretty recent addition to these configurations.  I wanted to find an easy way to make use of the free disk space on these systems, but in a useful way that aggregates it all together.  After looking at several distributed filesystems, Ceph came up as the one with the feature sets that made the most sense for me.  So, I've spent a bunch of time building packages, testing out Ceph, and have finally rolled it out on these two dozen Linux desktops, aggregating 100GB from each desktop's 250GB drive into a single pool that adds up to roughly 2.2TB of raw storage.  I currently do 3 replications for all of my pools in Ceph to try to protect against a desktop machine going down, getting shut down, etc., which does happen from time-to-time.  So far this has worked out pretty well, and Ceph seems to recover pretty well from these failures, moving blocks to different systems when necessary, then re-doing that when the systems come back online.
>
> My next steps for this setup, including Ceph, really get into more of a private cloud infrastructure using desktop commodity hardware.  I'd like to be able to install something like Openstack or the XAPI/XCP software on these systems and centrally manage the aggregated storage along with memory and CPU with a tool like that.  This would give me the ability to deploy these inexpensive systems across the organization, but make sure they're used to their best capacity, and it also allows for great flexibility when users move from machine to machine, or VMs need to move from place to place.  I do keep a lot of my critical infrastructure in my datacenter on more traditional compute systems - a SAN, XenServer, fileservers/NAS with NFS/CIFS, etc. - but this is a good way for me to prove out the usefulness and reliability of systems like Ceph and other cloud-computing concepts and then take those and apply them to increasingly complex and critical needs in my organization.
>
> For Ceph improvements that would help me out, the ability to support POSIX and NFSv4 ACLs would be a fantastic addition.  We use these types of permissions on our main filesystems to control access better than the traditional UGO-style permissions, and I already miss it while using Ceph.  Also, I know the concept of deduplication has been discussed, and this, too, would be great.  I was actually wondering about the feasibility of implementing post-processing deduplication on Ceph, first, rather than inline deduplication - obviously this increases disk space requirements since there has to be enough to store the duplicated data, but still seems to beat no deduplication at all.  Not a huge requirement at this point, but playing with FSs that support deduplication makes me want it everywhere :-).
>
> -Nick
>
>>>> On 2012/09/17 at 16:14, Ross Turk  wrote:
>> Hi, all!
>>
>> One of the most important parts of Inktank's mission is to spread the
>> word about Ceph. We want everyone to know what it is and how to use
>> it.
>>
>> In order to tell a better story to potential new users, I'm trying to
>> get a sense for today's deployments. We've spent the last few months
>> talking to folks around the world, but I'm sure there are a few great
>> stories we haven't heard yet!
>>
>> If you've got a spare five minutes, I would love to hear what you're
>> up to. What kind of projects are you working on, and in what stage?
>> What is your workload? Are you using Ceph alongside other
>> technologies? How has your experience been?
>>
>> This is also a good opportunity for me to introduce myself to those I
>> haven't met yet! Feel free to copy the list if you think others would
>> be interested (and you don't mind sharing).
>>
>> Cheers,
>> Ross
>>
>> --
>> Ross Turk
>> Ceph Community Guy
>>
>> "Any sufficiently advanced technology is indistinguishable from magic."
>> -- Arthur C. Clarke





* How are you using Ceph?
@ 2012-09-17 22:14 Ross Turk
  2012-09-17 22:47 ` Nick Couchman
                   ` (2 more replies)
  0 siblings, 3 replies; 29+ messages in thread
From: Ross Turk @ 2012-09-17 22:14 UTC
  To: ceph-devel

Hi, all!

One of the most important parts of Inktank's mission is to spread the
word about Ceph. We want everyone to know what it is and how to use
it.

In order to tell a better story to potential new users, I'm trying to
get a sense for today's deployments. We've spent the last few months
talking to folks around the world, but I'm sure there are a few great
stories we haven't heard yet!

If you've got a spare five minutes, I would love to hear what you're
up to. What kind of projects are you working on, and in what stage?
What is your workload? Are you using Ceph alongside other
technologies? How has your experience been?

This is also a good opportunity for me to introduce myself to those I
haven't met yet! Feel free to copy the list if you think others would
be interested (and you don't mind sharing).

Cheers,
Ross

--
Ross Turk
Ceph Community Guy

"Any sufficiently advanced technology is indistinguishable from magic."
-- Arthur C. Clarke


end of thread, other threads:[~2012-09-18 17:09 UTC | newest]

Thread overview: 29+ messages
     [not found] <1784724793.100.1347938272315.JavaMail.root@thunderbeast.private.linuxbox.com>
2012-09-18  3:19 ` How are you using Ceph? Matt W. Benjamin
2012-09-18  5:44   ` Ian Pye
2012-09-18  6:06     ` Yehuda Sadeh
2012-09-18 16:13   ` Sage Weil
2012-09-18 16:21     ` Matt W. Benjamin
2012-09-17 23:57 Nick Couchman
2012-09-18  6:35 ` John Axel Eriksson
  -- strict thread matches above, loose matches on Subject: below --
2012-09-17 23:55 Nick Couchman
2012-09-17 22:14 Ross Turk
2012-09-17 22:47 ` Nick Couchman
2012-09-17 22:53   ` Mark Nelson
2012-09-17 23:26     ` John Axel Eriksson
2012-09-18  7:47       ` Plaetinck, Dieter
2012-09-18 14:34         ` John Axel Eriksson
2012-09-18 14:51           ` Plaetinck, Dieter
2012-09-18 14:56             ` Mark Nelson
2012-09-18 15:19               ` Plaetinck, Dieter
2012-09-18 15:27                 ` Mark Nelson
2012-09-18 15:46                   ` Plaetinck, Dieter
2012-09-18 16:20           ` Xiaopong Tran
2012-09-18 17:09             ` John Axel Eriksson
2012-09-18  0:05 ` Smart Weblications GmbH - Florian Wiessner
2012-09-18  0:18   ` Tren Blackburn
2012-09-18  2:32     ` Sage Weil
2012-09-18 11:48       ` Smart Weblications GmbH - Florian Wiessner
2012-09-18 16:20         ` Sage Weil
2012-09-18 16:35       ` Tren Blackburn
2012-09-18 17:00         ` Sage Weil
2012-09-18 16:01 ` Travis Rhoden
