From: Ric Wheeler <rwheeler@redhat.com>
To: Andreas Dilger <adilger@dilger.ca>
Cc: Stephan Boettcher <boettcher@physik.uni-kiel.de>,
ext4 development <linux-ext4@vger.kernel.org>
Subject: Re: 20TB ext4
Date: Mon, 13 Dec 2010 22:27:16 -0500 [thread overview]
Message-ID: <4D06E414.6060600@redhat.com> (raw)
In-Reply-To: <D46B0853-722D-4337-9561-42CECF7DAF47@dilger.ca>
On 12/13/2010 04:57 PM, Andreas Dilger wrote:
> On 2010-12-13, at 09:23, Stephan Boettcher wrote:
>> A raid1 (/dev/md1) over three 20GB partitions is the root filesystem,
>> three 20GB partitions for swap, and a RAID5 (/dev/md0) from the six big
>> partitions.
>>
>> The 10TB /dev/md0 is exported via nbd. I had to patch nbd-client to
>> import this on a 32-bit machine, so that part works.
>>
>> The intention was to export two (later three) via nbd to one of the
>> servers, which combines them to a RAID5² with net capacity 20TB. With
>> e2fsprogs master branch I could make a filesystem, but dumpe2fs and
>> fsck failed. Mounting the filesystem said: EFBIG.
> RAID-5 on top of RAID-5 is going to be VERY SLOW... Also note that only a single "nbd client" system will be able to use this storage at one time. If you have dedicated server nodes, and you want to be able to use these 20TB from multiple clients, you might consider using Lustre, which uses ext4 as the back-end storage, and can scale to many PB filesystems (largest known filesystem is 20PB, from 1344 * 8TB separate ext4 filesystems).
>
>> Obviously, with 32-bit pgoff_t this will not work, and it was said
>> elsewhere that making pgoff_t 64-bit on i386 will require a lot of faith
>> and luck, since there are more than 3000 unsigned longs in the fs tree.
> I don't think that is going to happen any time soon. Lustre _can_ export from a 32-bit server, though it definitely isn't very common anymore. For the cost of a single 2TB drive you can likely get a new motherboard + 64-bit CPU + RAM...
>
>> I'd prefer to run the setup selfcontained without an extra 64-bit head.
>> Maybe I will partition it down to a 16TB and a 4TB partition. Maybe I
>> just dare to compile a kernel with typedef unsigned long long pgoff_t
>> and see what happens, maybe I can help fixing that kind of configuration.
> I would suggest you examine what it is you are really trying to get out of this system? Is it just for fun, to test ext4 with> 16TB filesystems? Great, you can probably do that with the 64-bit nbd client. Do you actually want to use this for some data you care about? Then trying to get 32-bit kernels to handle> 16TB block devices is a risky strategy to take for a few hundred USD. Given that you are willing to spend a few thousand USD for the 2TB drives, you should consider just getting a 64-bit CPU + RAM to handle it.
>
> Also note that running e2fsck on such a large filesystem will need 6-8GB of RAM at a minimum, and can be a lot more if there are serious problems (e.g. duplicate blocks). Recently I saw a report of 22GB of RAM needed for e2fsck to complete, which is just impossible on a 32-bit machine.
>
>
> Cheers, Andreas
>
I have to agree here - I do not see this as being a great investment of time.
Even low powered CPU's can often run in 64 bit mode these days and as Andreas
says, you will need a lot of DRAM to fsck this box :)
Ric
--
To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2010-12-14 3:27 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <s6nzks9y74l.fsf@falbala.ieap.uni-kiel.de>
2010-12-13 18:12 ` 20TB ext4 Lukas Czerner
2010-12-13 21:57 ` Andreas Dilger
2010-12-14 3:27 ` Ric Wheeler [this message]
2010-12-14 8:59 ` Stephan Boettcher
2010-12-14 20:51 ` Stephan Boettcher
2010-12-15 9:21 ` Andreas Dilger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4D06E414.6060600@redhat.com \
--to=rwheeler@redhat.com \
--cc=adilger@dilger.ca \
--cc=boettcher@physik.uni-kiel.de \
--cc=linux-ext4@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox