All of lore.kernel.org
 help / color / mirror / Atom feed
From: Sebastian Riemer <sebastian.riemer@profitbricks.com>
To: Roy Sigurd Karlsbakk <roy@karlsbakk.net>
Cc: Linux RAID <linux-raid@vger.kernel.org>,
	Jeff Johnson <jeff.johnson@aeoncomputing.com>
Subject: Re: Best practice for large storage?
Date: Mon, 18 Feb 2013 11:35:57 +0100	[thread overview]
Message-ID: <5122040D.9040404@profitbricks.com> (raw)
In-Reply-To: <10665743.0.1361018900938.JavaMail.root@zimbra>

On 16.02.2013 13:48, Roy Sigurd Karlsbakk wrote:
>> On 14.02.2013 18:48, Jeff Johnson wrote:
>>> Stable enough where it is being used at Lawrence Livermore Nat'l
>>> Labs on
>>> a 55PB Lustre resource.
>>>
>>> I've been using it on a pre-release Lustre 2.4 and I have not had
>>> any
>>> issues.
>>
>> ZFS completely fragments if you've got massive parallel write IO -
>> especially with Solaris 11. You'll get only 2..3 MiB/s after some time
>> as everything is stored completely random then. So if you don't really
>> need these snapshots you shouldn't use ZFS. NILFS is also good for
>> snapshots.
> 
> This won't be massive parallel I/O, just a fileserver with a limited amount of users. Also, can you document this claim?

Of cause, we had ZFS in production in our IaaS public cloud. In such a
cloud nearly everything is random. Customers create and delete their
storage from time to time and some of them do a lot of small writes. It
fills up quite fast. We even had no snapshots and we already had the ZIL
dedicated on enterprise SSDs.

http://thomas.gouverneur.name/2011/06/20110609zfs-fragmentation-issue-examining-the-zil/
http://www.racktopsystems.com/dedicated-zfs-intent-log-aka-slogzil-and-data-fragmentation/
http://www.eall.com.br/blog/?p=2481
http://www.techforce.com.br/news/layout/set/print/linux_blog/zfs_part_4_sustained_random_small_files_sync_write_iops

ZFS as block device with COMSTAR exports is really crap. You've got
mostly synchronous (and small database) IO. This is why we switched to a
Linux storage with LVM (without thin stuff). The customers run their
file systems in their VMs anyway.

Cheers,
Sebastian

  reply	other threads:[~2013-02-18 10:35 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-02-14 17:28 Best practice for large storage? Roy Sigurd Karlsbakk
2013-02-14 17:33 ` Jeff Johnson
2013-02-14 17:39   ` Roy Sigurd Karlsbakk
2013-02-14 17:48     ` Jeff Johnson
2013-02-15  9:51       ` Sebastian Riemer
2013-02-16 12:48         ` Roy Sigurd Karlsbakk
2013-02-18 10:35           ` Sebastian Riemer [this message]
2013-02-16  5:40       ` Stan Hoeppner
2013-02-15  2:18 ` Chris Murphy
2013-02-16  7:09 ` Stan Hoeppner
     [not found] <CAH3kUhFbR3coJSwPvqqOGrqcsvoJpdAPgAAiAgc_th1ym33DzA@mail.gmail.com>
2013-02-14 18:27 ` Roy Sigurd Karlsbakk
     [not found] <CAH3kUhGVt1iyn9tt=2-+f6H++obOGSK3x0pBBPZV8CFUXjp5yw@mail.gmail.com>
2013-02-14 23:23 ` Roy Sigurd Karlsbakk
2013-02-14 23:42   ` Roberto Spadim
2013-02-15  1:01     ` Adam Goryachev
2013-02-15  1:13       ` Roberto Spadim

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5122040D.9040404@profitbricks.com \
    --to=sebastian.riemer@profitbricks.com \
    --cc=jeff.johnson@aeoncomputing.com \
    --cc=linux-raid@vger.kernel.org \
    --cc=roy@karlsbakk.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.