All of lore.kernel.org
 help / color / mirror / Atom feed
From: Miles Fidelman <mfidelman@meetinghouse.net>
Cc: "linux-raid@vger.kernel.org" <linux-raid@vger.kernel.org>
Subject: Re: possibly silly configuration question
Date: Thu, 27 Dec 2012 11:02:22 -0500	[thread overview]
Message-ID: <50DC710E.7040402@meetinghouse.net> (raw)
In-Reply-To: <50DBD1E5.8000206@websitemanagers.com.au>

Adam,

Thanks for the suggestions.  The thing I'm worried about is how much 
traffic gets generated as I start wiring together more complex 
configurations, and the kind of performance hits involved (particularly 
if a node goes down and things start getting re-syncd).

Miles

Adam Goryachev wrote:
> On 27/12/12 15:16, Miles Fidelman wrote:
>> Hi Folks,
>>
>> I find myself having four servers, each with 4 large disks, that I'm
>> trying to assemble into a high-availability cluster.  (Note: I've got
>> 4 gigE ports on each box, 2 set aside for outside access, 2 for
>> inter-node clustering)
>>
>> Now it's easy enough to RAID disks on each server, and/or mirror disks
>> pair-wise with DRBD, but DRBD doesn't work as well with >2 servers.
>>
>> No what I really should do is separate storage nodes from compute
>> nodes - but I'm limited by rack space and chassis configuration of the
>> hardware I've got, and I've been thinking through various
>> configurations to make use of the resources at hand.
>>
>> One option is to put all the drives into one large pool managed by
>> gluster - but I expect that would result in some serious performance
>> hits (and gluster's replicated/distributed mode is fairly new).
>>
>> It's late at night and a thought occurred to me that is probably
>> wrongheaded (or at least silly) - but maybe I'm too tired to see any
>> obvious problems.  So I'd welcome 2nd (and 3rd) opinions.
>>
>> The basic notion:
>> - mount all 16 drives as network block devices via iSCSI or AoE
>> - build 4 RAID10 volumes - each volume consisting of one drive from
>> each server
>> - run LVM on top of the RAID volumes
>> - then use NFS or maybe OCFS2 to make volumes available across nodes
>> - of course md would be running on only one node (for each array), so
>> if a node goes down, use pacemaker to startup md on another node,
>> reassemble the array, and remount everything
>>
>> Does this make sense, or is it totally crazy?
>>
> Not entirely crazy... but, how about another option:
> On each node:
> 1) Partition each drive into two halves
> 2) Create two RAID arrays using each half of the 4 drives (ie, sd[abcd]1
> in one RAID and sd[abcd]2 in the second RAID)
> 3) Create 4 x DRBD volumes where
> drbd0 uses server1_raid1 and server2_raid1
> drbd1 uses server2_raid2 and server3_raid2
> drbd2 uses server3_raid1 and server4_raid1
> drbd3 uses server4_raid2 and server1_raid2
>
> Now you can run iscsi on all servers, where each server will export one
> DRBD device:
> iscsi server1 drbd0
> iscsi server2 drbd1
> iscsi server3 drbd2
> iscsi server4 drbd3
>
> If a server goes down, you need to use pacemaker to start iscsi (and
> steal the virtual IP) on the "partner" server.
> In this way, you can lose any one server, or you can lose two servers
> (if they are the right two).
>
> You could adjust this further to have a third drbd host, and reduce the
> total number of iscsi exported devices to 3.
>
> Each VM config would use the specific virtual IP/iSCSI exported location.
>
> Maybe that will provide some ideas.... It is slightly better than two
> storage + two working nodes, and gives the added reliability of
> potentially losing two servers without losing any services....
>
> PS, I'd probably put LVM2 on top of each drbd device, to divide the
> storage for each VM, and export each VM over iscsi individually.
>
> Regards,
> Adam
>


-- 
In theory, there is no difference between theory and practice.
In practice, there is.   .... Yogi Berra


  reply	other threads:[~2012-12-27 16:02 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-12-27  4:16 possibly silly configuration question Miles Fidelman
2012-12-27  4:43 ` Adam Goryachev
2012-12-27 16:02   ` Miles Fidelman [this message]
2012-12-27 16:21     ` Adam Goryachev
2012-12-27 16:44       ` Miles Fidelman
2012-12-27 21:11 ` Roy Sigurd Karlsbakk

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50DC710E.7040402@meetinghouse.net \
    --to=mfidelman@meetinghouse.net \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.