* Network-based RAID6
@ 2011-03-30 7:11 Roman Mamedov
2011-03-30 7:20 ` CoolCold
0 siblings, 1 reply; 13+ messages in thread
From: Roman Mamedov @ 2011-03-30 7:11 UTC (permalink / raw)
To: linux-raid
[-- Attachment #1: Type: text/plain, Size: 721 bytes --]
Hello,
Let's say I have 10 machines, each having 1TB of free disk space. They
communicate over a gigabit network.
I would like to create a fault-tolerant array from these machines and their
storage, that would match RAID6 in overhead and fault resilience.
That is, it should provide 8 TB of usable space and tolerate a 2-member
failure without data loss.
Does anyone know if any of the current distributed filesystems will meet these
requirements? Working/stable/non-experimental level code is preferred.
The simple and obvious solution is mdadm over AoE/iSCSI, but maybe there is
something else built with network awareness from ground up, that would be a
better choice?
--
With respect,
Roman
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 7:11 Network-based RAID6 Roman Mamedov
@ 2011-03-30 7:20 ` CoolCold
2011-03-30 8:49 ` Roman Mamedov
0 siblings, 1 reply; 13+ messages in thread
From: CoolCold @ 2011-03-30 7:20 UTC (permalink / raw)
To: Roman Mamedov; +Cc: linux-raid
On Wed, Mar 30, 2011 at 11:11 AM, Roman Mamedov <rm@romanrm.ru> wrote:
> Hello,
>
> Let's say I have 10 machines, each having 1TB of free disk space. They
> communicate over a gigabit network.
>
> I would like to create a fault-tolerant array from these machines and their
> storage, that would match RAID6 in overhead and fault resilience.
> That is, it should provide 8 TB of usable space and tolerate a 2-member
> failure without data loss.
>
> Does anyone know if any of the current distributed filesystems will meet these
> requirements? Working/stable/non-experimental level code is preferred.
>
> The simple and obvious solution is mdadm over AoE/iSCSI, but maybe there is
> something else built with network awareness from ground up, that would be a
> better choice?
Looking on your host I think you speak russian, so may be this will be
somehow helpful for you
http://community.livejournal.com/ru_root/2216389.html
>
> --
> With respect,
> Roman
>
--
Best regards,
[COOLCOLD-RIPN]
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 7:20 ` CoolCold
@ 2011-03-30 8:49 ` Roman Mamedov
2011-03-30 13:35 ` Stan Hoeppner
0 siblings, 1 reply; 13+ messages in thread
From: Roman Mamedov @ 2011-03-30 8:49 UTC (permalink / raw)
To: CoolCold; +Cc: linux-raid
[-- Attachment #1: Type: text/plain, Size: 862 bytes --]
On Wed, 30 Mar 2011 11:20:41 +0400
CoolCold <coolthecold@gmail.com> wrote:
> Looking on your host I think you speak russian, so may be this will be
> somehow helpful for you
> http://community.livejournal.com/ru_root/2216389.html
Thanks -- I have looked through the websites of some distributed filesystems
(Ceph, GlusterFS, MooseFS etc) and checked this thread too, but from what I
could find, all filesystems I read about so far are at most capable of RAID0 or
RAID1-like modes, where fault-tolerance is either not provided, or achieved
only by "all data is replicated across N nodes", which of course divides the
total usable space by N. I haven't found any FS which would do block-level
replication relying not on dumb copies, but on RAID5/6-like parity algorithms
for fault-tolerance. Maybe I missed something?
--
With respect,
Roman
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 8:49 ` Roman Mamedov
@ 2011-03-30 13:35 ` Stan Hoeppner
2011-03-30 14:24 ` Roberto Spadim
2011-03-30 17:50 ` Roman Mamedov
0 siblings, 2 replies; 13+ messages in thread
From: Stan Hoeppner @ 2011-03-30 13:35 UTC (permalink / raw)
To: Roman Mamedov; +Cc: CoolCold, linux-raid
Roman Mamedov put forth on 3/30/2011 3:49 AM:
> On Wed, 30 Mar 2011 11:20:41 +0400
> CoolCold <coolthecold@gmail.com> wrote:
>
>> Looking on your host I think you speak russian, so may be this will be
>> somehow helpful for you
>> http://community.livejournal.com/ru_root/2216389.html
>
> Thanks -- I have looked through the websites of some distributed filesystems
> (Ceph, GlusterFS, MooseFS etc) and checked this thread too, but from what I
> could find, all filesystems I read about so far are at most capable of RAID0 or
> RAID1-like modes, where fault-tolerance is either not provided, or achieved
> only by "all data is replicated across N nodes", which of course divides the
> total usable space by N. I haven't found any FS which would do block-level
> replication relying not on dumb copies, but on RAID5/6-like parity algorithms
> for fault-tolerance. Maybe I missed something?
You likely won't find any distributed filesystem that performs block
level replication over the network, at least not a FOSS one. These are
filesystems, mind you, not distributed block device drivers. If they
perform any replication to afford a level of fault tolerance, it will be
at the file level, not the block level.
If you want true block level replication over a network, look into DRBD.
However, it is also limited to mirroring.
--
Stan
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 13:35 ` Stan Hoeppner
@ 2011-03-30 14:24 ` Roberto Spadim
2011-03-30 14:43 ` Miles Fidelman
2011-03-30 17:50 ` Roman Mamedov
1 sibling, 1 reply; 13+ messages in thread
From: Roberto Spadim @ 2011-03-30 14:24 UTC (permalink / raw)
To: Stan Hoeppner; +Cc: Roman Mamedov, CoolCold, linux-raid
OCFS with DRBD could work very nice in linux
i didn't tested mdadm with ndb in a production enviroment
DRBD have brainsplit solutions, since you will run a complex
filesystem, i consider using working solutions, DRBD and OCFS is nice
yes, mdadm can run under DRBD
DRBD = raid1 over network
OCFS = oracle filesystem (cluster filesystem)
it works
2011/3/30 Stan Hoeppner <stan@hardwarefreak.com>:
> Roman Mamedov put forth on 3/30/2011 3:49 AM:
>> On Wed, 30 Mar 2011 11:20:41 +0400
>> CoolCold <coolthecold@gmail.com> wrote:
>>
>>> Looking on your host I think you speak russian, so may be this will be
>>> somehow helpful for you
>>> http://community.livejournal.com/ru_root/2216389.html
>>
>> Thanks -- I have looked through the websites of some distributed filesystems
>> (Ceph, GlusterFS, MooseFS etc) and checked this thread too, but from what I
>> could find, all filesystems I read about so far are at most capable of RAID0 or
>> RAID1-like modes, where fault-tolerance is either not provided, or achieved
>> only by "all data is replicated across N nodes", which of course divides the
>> total usable space by N. I haven't found any FS which would do block-level
>> replication relying not on dumb copies, but on RAID5/6-like parity algorithms
>> for fault-tolerance. Maybe I missed something?
>
> You likely won't find any distributed filesystem that performs block
> level replication over the network, at least not a FOSS one. These are
> filesystems, mind you, not distributed block device drivers. If they
> perform any replication to afford a level of fault tolerance, it will be
> at the file level, not the block level.
>
> If you want true block level replication over a network, look into DRBD.
> However, it is also limited to mirroring.
>
> --
> Stan
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 14:24 ` Roberto Spadim
@ 2011-03-30 14:43 ` Miles Fidelman
2011-03-30 14:58 ` Roberto Spadim
0 siblings, 1 reply; 13+ messages in thread
From: Miles Fidelman @ 2011-03-30 14:43 UTC (permalink / raw)
Cc: linux-raid
I wonder how OCFS (or some other distributed file system) would run over
btfrs (raid10, no raid6), over AOE or iSCSI block devices.
Roberto Spadim wrote:
> OCFS with DRBD could work very nice in linux
> i didn't tested mdadm with ndb in a production enviroment
> DRBD have brainsplit solutions, since you will run a complex
> filesystem, i consider using working solutions, DRBD and OCFS is nice
> yes, mdadm can run under DRBD
> DRBD = raid1 over network
> OCFS = oracle filesystem (cluster filesystem)
> it works
>
> 2011/3/30 Stan Hoeppner<stan@hardwarefreak.com>:
>
>> Roman Mamedov put forth on 3/30/2011 3:49 AM:
>>
>>> On Wed, 30 Mar 2011 11:20:41 +0400
>>> CoolCold<coolthecold@gmail.com> wrote:
>>>
>>>
>>>> Looking on your host I think you speak russian, so may be this will be
>>>> somehow helpful for you
>>>> http://community.livejournal.com/ru_root/2216389.html
>>>>
>>> Thanks -- I have looked through the websites of some distributed filesystems
>>> (Ceph, GlusterFS, MooseFS etc) and checked this thread too, but from what I
>>> could find, all filesystems I read about so far are at most capable of RAID0 or
>>> RAID1-like modes, where fault-tolerance is either not provided, or achieved
>>> only by "all data is replicated across N nodes", which of course divides the
>>> total usable space by N. I haven't found any FS which would do block-level
>>> replication relying not on dumb copies, but on RAID5/6-like parity algorithms
>>> for fault-tolerance. Maybe I missed something?
>>>
>> You likely won't find any distributed filesystem that performs block
>> level replication over the network, at least not a FOSS one. These are
>> filesystems, mind you, not distributed block device drivers. If they
>> perform any replication to afford a level of fault tolerance, it will be
>> at the file level, not the block level.
>>
>> If you want true block level replication over a network, look into DRBD.
>> However, it is also limited to mirroring.
>>
>> --
>> Stan
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>>
>
>
>
--
In theory, there is no difference between theory and practice.
In<fnord> practice, there is. .... Yogi Berra
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 14:43 ` Miles Fidelman
@ 2011-03-30 14:58 ` Roberto Spadim
0 siblings, 0 replies; 13+ messages in thread
From: Roberto Spadim @ 2011-03-30 14:58 UTC (permalink / raw)
To: Miles Fidelman; +Cc: linux-raid
ocfs is a file system, btrfs too, why using two filesystems? only ocfs
would be used when cluster filesystem
you want more disks per computer using others computers as disks?
or you want more disks in cluster using others computers as nodes of a cluster?
first is a single writer filesystem (ext4,reiserfs,xfs)
second is a cluster filesystem (ocfs, and others, check wikipedia for
filesystem lists)
you can use raid1 as block device for filesystem, but i remember that
some cluster filesystems have replication (raid1)
2011/3/30 Miles Fidelman <mfidelman@meetinghouse.net>:
> I wonder how OCFS (or some other distributed file system) would run over
> btfrs (raid10, no raid6), over AOE or iSCSI block devices.
>
> Roberto Spadim wrote:
>>
>> OCFS with DRBD could work very nice in linux
>> i didn't tested mdadm with ndb in a production enviroment
>> DRBD have brainsplit solutions, since you will run a complex
>> filesystem, i consider using working solutions, DRBD and OCFS is nice
>> yes, mdadm can run under DRBD
>> DRBD = raid1 over network
>> OCFS = oracle filesystem (cluster filesystem)
>> it works
>>
>> 2011/3/30 Stan Hoeppner<stan@hardwarefreak.com>:
>>
>>>
>>> Roman Mamedov put forth on 3/30/2011 3:49 AM:
>>>
>>>>
>>>> On Wed, 30 Mar 2011 11:20:41 +0400
>>>> CoolCold<coolthecold@gmail.com> wrote:
>>>>
>>>>
>>>>>
>>>>> Looking on your host I think you speak russian, so may be this will be
>>>>> somehow helpful for you
>>>>> http://community.livejournal.com/ru_root/2216389.html
>>>>>
>>>>
>>>> Thanks -- I have looked through the websites of some distributed
>>>> filesystems
>>>> (Ceph, GlusterFS, MooseFS etc) and checked this thread too, but from
>>>> what I
>>>> could find, all filesystems I read about so far are at most capable of
>>>> RAID0 or
>>>> RAID1-like modes, where fault-tolerance is either not provided, or
>>>> achieved
>>>> only by "all data is replicated across N nodes", which of course divides
>>>> the
>>>> total usable space by N. I haven't found any FS which would do
>>>> block-level
>>>> replication relying not on dumb copies, but on RAID5/6-like parity
>>>> algorithms
>>>> for fault-tolerance. Maybe I missed something?
>>>>
>>>
>>> You likely won't find any distributed filesystem that performs block
>>> level replication over the network, at least not a FOSS one. These are
>>> filesystems, mind you, not distributed block device drivers. If they
>>> perform any replication to afford a level of fault tolerance, it will be
>>> at the file level, not the block level.
>>>
>>> If you want true block level replication over a network, look into DRBD.
>>> However, it is also limited to mirroring.
>>>
>>> --
>>> Stan
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>>
>>>
>>
>>
>>
>
>
> --
> In theory, there is no difference between theory and practice.
> In<fnord> practice, there is. .... Yogi Berra
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 13:35 ` Stan Hoeppner
2011-03-30 14:24 ` Roberto Spadim
@ 2011-03-30 17:50 ` Roman Mamedov
2011-03-30 18:17 ` Roberto Spadim
2011-03-31 4:56 ` Stan Hoeppner
1 sibling, 2 replies; 13+ messages in thread
From: Roman Mamedov @ 2011-03-30 17:50 UTC (permalink / raw)
To: Stan Hoeppner; +Cc: CoolCold, linux-raid
[-- Attachment #1: Type: text/plain, Size: 465 bytes --]
On Wed, 30 Mar 2011 08:35:04 -0500
Stan Hoeppner <stan@hardwarefreak.com> wrote:
> If you want true block level replication over a network, look into DRBD.
> However, it is also limited to mirroring.
So I guess a viable and interesting solution could be an mdadm stripe over two
or more DRBD mirrors, effectively providing RAID10. Although that's not quite
what I am looking for (RAID5/6-levels of overhead and resilience).
--
With respect,
Roman
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 17:50 ` Roman Mamedov
@ 2011-03-30 18:17 ` Roberto Spadim
2011-03-31 4:56 ` Stan Hoeppner
1 sibling, 0 replies; 13+ messages in thread
From: Roberto Spadim @ 2011-03-30 18:17 UTC (permalink / raw)
To: Roman Mamedov; +Cc: Stan Hoeppner, CoolCold, linux-raid
DRBD - computer replication
raid1 - disk 'replication'
what your needs?
2011/3/30 Roman Mamedov <rm@romanrm.ru>:
> On Wed, 30 Mar 2011 08:35:04 -0500
> Stan Hoeppner <stan@hardwarefreak.com> wrote:
>
>> If you want true block level replication over a network, look into DRBD.
>> However, it is also limited to mirroring.
>
> So I guess a viable and interesting solution could be an mdadm stripe over two
> or more DRBD mirrors, effectively providing RAID10. Although that's not quite
> what I am looking for (RAID5/6-levels of overhead and resilience).
>
> --
> With respect,
> Roman
>
--
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-30 17:50 ` Roman Mamedov
2011-03-30 18:17 ` Roberto Spadim
@ 2011-03-31 4:56 ` Stan Hoeppner
2011-03-31 5:16 ` Roman Mamedov
1 sibling, 1 reply; 13+ messages in thread
From: Stan Hoeppner @ 2011-03-31 4:56 UTC (permalink / raw)
To: Roman Mamedov; +Cc: CoolCold, linux-raid
Roman Mamedov put forth on 3/30/2011 12:50 PM:
> On Wed, 30 Mar 2011 08:35:04 -0500
> Stan Hoeppner <stan@hardwarefreak.com> wrote:
>
>> If you want true block level replication over a network, look into DRBD.
>> However, it is also limited to mirroring.
>
> So I guess a viable and interesting solution could be an mdadm stripe over two
> or more DRBD mirrors, effectively providing RAID10. Although that's not quite
> what I am looking for (RAID5/6-levels of overhead and resilience).
The mdraid driver sits beneath DRBD. What you suggest above is impossible.
--
Stan
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-31 4:56 ` Stan Hoeppner
@ 2011-03-31 5:16 ` Roman Mamedov
2011-03-31 5:35 ` Stan Hoeppner
2011-03-31 7:59 ` hansbkk
0 siblings, 2 replies; 13+ messages in thread
From: Roman Mamedov @ 2011-03-31 5:16 UTC (permalink / raw)
To: Stan Hoeppner; +Cc: CoolCold, linux-raid
[-- Attachment #1: Type: text/plain, Size: 755 bytes --]
On Wed, 30 Mar 2011 23:56:26 -0500
Stan Hoeppner <stan@hardwarefreak.com> wrote:
> Roman Mamedov put forth on 3/30/2011 12:50 PM:
> > So I guess a viable and interesting solution could be an mdadm stripe over
> > two or more DRBD mirrors, effectively providing RAID10. Although that's
> > not quite what I am looking for (RAID5/6-levels of overhead and
> > resilience).
>
> The mdraid driver sits beneath DRBD. What you suggest above is impossible.
I thought DRBD presents just a regular kernel-level block device in /dev, and
seeing how one can create mdraid out of just any kind of block device,
including those provided by AoE, iSCSI, LVM, dmcrypt or even 'loop', are you
really sure this matters here?
--
With respect,
Roman
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-31 5:16 ` Roman Mamedov
@ 2011-03-31 5:35 ` Stan Hoeppner
2011-03-31 7:59 ` hansbkk
1 sibling, 0 replies; 13+ messages in thread
From: Stan Hoeppner @ 2011-03-31 5:35 UTC (permalink / raw)
To: Roman Mamedov; +Cc: CoolCold, linux-raid
Roman Mamedov put forth on 3/31/2011 12:16 AM:
> On Wed, 30 Mar 2011 23:56:26 -0500
> Stan Hoeppner <stan@hardwarefreak.com> wrote:
>
>> Roman Mamedov put forth on 3/30/2011 12:50 PM:
>>> So I guess a viable and interesting solution could be an mdadm stripe over
>>> two or more DRBD mirrors, effectively providing RAID10. Although that's
>>> not quite what I am looking for (RAID5/6-levels of overhead and
>>> resilience).
>>
>> The mdraid driver sits beneath DRBD. What you suggest above is impossible.
>
> I thought DRBD presents just a regular kernel-level block device in /dev, and
> seeing how one can create mdraid out of just any kind of block device,
> including those provided by AoE, iSCSI, LVM, dmcrypt or even 'loop', are you
> really sure this matters here?
Do it and report your results.
--
Stan
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Network-based RAID6
2011-03-31 5:16 ` Roman Mamedov
2011-03-31 5:35 ` Stan Hoeppner
@ 2011-03-31 7:59 ` hansbkk
1 sibling, 0 replies; 13+ messages in thread
From: hansbkk @ 2011-03-31 7:59 UTC (permalink / raw)
To: Roman Mamedov; +Cc: linux-raid
On Thu, Mar 31, 2011 at 12:16 PM, Roman Mamedov <rm@romanrm.ru> wrote:
> On Wed, 30 Mar 2011 23:56:26 -0500
> I thought DRBD presents just a regular kernel-level block device in /dev, and
> seeing how one can create mdraid out of just any kind of block device,
> including those provided by AoE, iSCSI, LVM, dmcrypt or even 'loop', are you
> really sure this matters here?
The only advantage of RAID5/6 over mirroring is cost savings,
certainly not greater fault tolerance taking rebuilding times into
account - if you're planning to set up separate whole servers for each
component of your redundancy strategy, it seems odd to me you're
trying to saving a few bucks on hard drives.
DRBD is a well-regarded solution for this application, but it is
oriented toward mirroring whole filesystems (usually those of
mission-critical servers) that may themselves already be protected
locally with RAID. I would advise you follow their standard
recommendations at first, don't get too "creative" if you're actually
looking for fault-tolerance rather than just experimenting around.
All that said, if you're looking for a fun experiment and go ahead,
please do document your results and report back here!
^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2011-03-31 7:59 UTC | newest]
Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-30 7:11 Network-based RAID6 Roman Mamedov
2011-03-30 7:20 ` CoolCold
2011-03-30 8:49 ` Roman Mamedov
2011-03-30 13:35 ` Stan Hoeppner
2011-03-30 14:24 ` Roberto Spadim
2011-03-30 14:43 ` Miles Fidelman
2011-03-30 14:58 ` Roberto Spadim
2011-03-30 17:50 ` Roman Mamedov
2011-03-30 18:17 ` Roberto Spadim
2011-03-31 4:56 ` Stan Hoeppner
2011-03-31 5:16 ` Roman Mamedov
2011-03-31 5:35 ` Stan Hoeppner
2011-03-31 7:59 ` hansbkk
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).