linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Ted Ts'o <tytso@mit.edu>
To: Andreas Dilger <adilger@dilger.ca>
Cc: "i@coly.li" <i@coly.li>, Andreas Dilger <adilger@whamcloud.com>,
	linux-ext4 development <linux-ext4@vger.kernel.org>,
	Alex Zhuravlev <bzzz@whamcloud.com>, Tao Ma <tm@tao.ma>,
	"hao.bigrat@gmail.com" <hao.bigrat@gmail.com>
Subject: Re: bigalloc and max file size
Date: Mon, 31 Oct 2011 12:22:23 -0400	[thread overview]
Message-ID: <20111031162223.GD16825@thunk.org> (raw)
In-Reply-To: <422BEB28-76D0-4FD8-B7AE-130C9AAE10C0@dilger.ca>

On Mon, Oct 31, 2011 at 10:08:20AM -0600, Andreas Dilger wrote:
> On 2011-10-31, at 4:22 AM, Theodore Tso <tytso@MIT.EDU> wrote:
> For cluster file systems, such as when you might build Hadoop on top
> > of ext4, there's no real advantage of using RAID arrays as opposed
> > to having single file systems on each disk.  In fact, due to the
> > specd of being able to check multiple disk spindles in parallel,
> > it's advantageous to build cluster file systems on single disk
> > file systems.
> 
> For Lustre at least there are a number of reasons why it uses large
> RAID devices to store the data instead of many small devices: -
> fewer devices that need to be managed. Lustre runs on systems with
> more than 13000 drives, and having to manage connection state for
> that many internal devices is a lot of overhead.

Well, per the discussion on the ext4 call, with Lustre hardware
multiple RAID LUN's get used, so while they might have tens of
petabytes of data, it is still split across a thousand hardware LUN's
or so.  So there is a middle ground between "put all of your 13000
devices on a single hardware RAID LUN", and "use 13000 file systems".
And in that middle ground, it seems surprising that someone would be
bumping into the the 1EB file system limit offered by ext4.

I'm curious why TaoBao is so interested in changing the extent
encoding for bigalloc file systems.  Currently we can support up to 1
EB worth of physical block numbers, and 16TB of logical block numbers.
Are you concerned about bumping into the 1 EB file system limit?  Or
the 16 TB file size limit?  Or something else?

Regards,

						- Ted

  reply	other threads:[~2011-10-31 16:22 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-10-26 23:36 bigalloc and max file size Andreas Dilger
2011-10-27  1:05 ` Tao Ma
2011-10-27  6:35 ` Theodore Tso
     [not found]   ` <EB03FF23-73BC-4FDC-B991-5EB3FEEB8DAE@whamcloud.com>
2011-10-27 11:48     ` Theodore Tso
     [not found]       ` <97D9C5CC-0F22-4BC7-BDFA-7781D33CA7F3@whamcloud.com>
2011-10-27 21:42         ` Theodore Tso
2011-10-28  3:31           ` Tao Ma
2011-10-31 10:15             ` Theodore Tso
2011-10-31 10:27               ` Tao Ma
2011-10-31 18:53                 ` Sunil Mushran
2011-10-31 19:09                   ` Andreas Dilger
2011-10-31 20:00                 ` Ted Ts'o
2011-11-01  4:06                   ` Tao Ma
2011-10-30  5:37           ` Coly Li
2011-10-30 19:49             ` Theodore Tso
2011-10-31  9:35               ` Coly Li
2011-10-31 10:22                 ` Theodore Tso
2011-10-31 16:08                   ` Andreas Dilger
2011-10-31 16:22                     ` Ted Ts'o [this message]
2011-10-31 17:39                       ` Coly Li
2011-10-31 19:38                         ` Ted Ts'o
2011-11-01  1:10                           ` Coly Li
2011-11-01 11:47                             ` Theodore Tso
2011-11-01 12:22                               ` Coly Li
2011-10-31 16:34                     ` Andreas Dilger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20111031162223.GD16825@thunk.org \
    --to=tytso@mit.edu \
    --cc=adilger@dilger.ca \
    --cc=adilger@whamcloud.com \
    --cc=bzzz@whamcloud.com \
    --cc=hao.bigrat@gmail.com \
    --cc=i@coly.li \
    --cc=linux-ext4@vger.kernel.org \
    --cc=tm@tao.ma \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).