From: Alex Netes <alexne-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
To: Ira Weiny <weiny2-i2BcT+NCU+M@public.gmane.org>
Cc: Brian Ginsbach <ginsbach-WVYJKLFxKCc@public.gmane.org>,
Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
Hal Rosenstock <hal-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
"linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: [Patch opensm] Allow for easily configuring multiple fabrics on one opensm server
Date: Fri, 2 Mar 2012 12:13:24 +0200 [thread overview]
Message-ID: <20120302101324.GA6644@calypso> (raw)
In-Reply-To: <20120301144645.09aa0d80.weiny2-i2BcT+NCU+M@public.gmane.org>
On 14:46 Thu 01 Mar , Ira Weiny wrote:
> On Wed, 29 Feb 2012 20:15:02 -0600
> Brian Ginsbach <ginsbach-WVYJKLFxKCc@public.gmane.org> wrote:
>
> > On Wed, Feb 29, 2012 at 02:47:00PM -0500, Doug Ledford wrote:
> > > On 02/29/2012 02:22 PM, Ira Weiny wrote:
> > > > Doug,
> > > >
> > > > First thanks for this. Some comments below.
> > > >
> > > > On Wed, 29 Feb 2012 00:01:16 -0500
> > > > Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote:
> > > >
> > > >> There are two things that stand in the way of opensm being run on
> > > >> redundant fabrics easily:
> > > >>
> > > >> 1) The opensm init script only starts one instance of opensm and opensm
> > > >> will only work on one fabric per instance
> > > >> 2) Even if you start multiple instances, you have to hand modify config
> > > >> files for each instance and then when you upgrade the opensm rpm you
> > > >> either loose your modifications or loose getting new default settings
> > > >>
> > > >> I worked around both of these issues, I've attached the files I used to
> > > >> do so.
> > > >>
> > > >> First, I have an opensm init script that allows starting multiple opensm
> > > >> instances. It supports configuring this in one of two ways:
> > > >>
> > > >> 1) Create multiple opensm.conf files, each with a numbered suffix (so
> > > >> opensm.conf.1, opensm.conf.2, etc.) and it will start one opensm
> > > >> instance per config file. This allows an admin to copy the default
> > > >> config over and edit the things they need, and on rpm upgrade there will
> > > >> be a new default opensm.conf file so they can diff between their edited
> > > >> version and the new default and see if there are changes they need to
> > > >> bring back in. This also allows for complete flexibility in setting up
> > > >> the different fabrics, for instance you could use one type of routing on
> > > >> one and a totally different type on the others.
> > > >>
> > > >> 2) Edit the file /etc/sysconfig/opensm and define more than one GUID in
> > > >> the GUIDs variable. This will cause the opensm init script to
> > > >> automatically start one instance per GUID, passing the GUID in on the
> > > >> command line.
> > > >
> > > > I know you are going for ease of use here, which is good, however, I worry about this file becoming a redefinition of opensm.conf.
> > >
> > > Hehehe, I don't think you'll ever have to worry about that. You have
> > > looked at opensm.conf in recent times I take it? Replacing that with
> > > command line options in a shell startup script isn't reasonable.
> > >
> > > However, if you are going to run a redundant fabric setup, then the two
> > > things you *know* you will have to set are the guid and subnet_prefix
> > > (assuming you want to use openmpi). If you are going to run
> >
> > Assuming you are doing this for openmpi. The subnet_prefix should
> > not be needed if the separate subnets are for disjoint networks
> > (mpi and storage) or multiple storage networks.
> >
> > > master/slave setup, then the one thing you *know* you will have to set
> > > is the priority. Supporting setting those items in an init script is
> > > reasonable. Beyond that, I would agree, you should just edit the config
> > > files.
> > >
> >
> > Not everything can be done in the config files. I'm not sure that
> > it is a good idea to have every opensm instance using the same
> > temporary and cache directories (OSM_TMP_DIR and OSM_CACHE_DIR
> > environment variables). Seems like these fall into the *know* you
> > will have to set category.
>
> Brian brings up a really good point. Even though some things can't be configured now, opensm.conf is the better way to configure log file placement etc. So in my mind this re-emphasises the need to simply allow for multiple opensm.conf's and not introduce another config file. But as I said before it is Alex's call.
>
> Ira
>
I agree with Ira on that point. sysconfig/opensm is being used for syncing
guid2lid cache file between remote SMs, which is out of scope of opensm
configuration and it's completely external mechanism.
Regarding the first approach (using multiple opensm.conf files), I think that
beyond starting all opensm instances, the init script should be able to
manipulate (restart/stop/signal HUP,USR1) to any of the instances of opensm
separately.
Moreover, OSM_TMP_DIR and OSM_CACHE_DIR can be added as options to opensm.conf
so if configured will overwrite the global setting or alternatively add pid or
opensm id as a suffix to the log/state files.
> >
> > You'd also want to make sure that other potentially very useful
> > things are configured in the config files (e.g. log_file and
> > log_prefix). Aren't these also things you *know* you will have to
> > set.
> >
> > --
> > Brian Ginsbach Cray Inc.
>
>
> --
> Ira Weiny
> Member of Technical Staff
> Lawrence Livermore National Lab
> 925-423-8008
> weiny2-i2BcT+NCU+M@public.gmane.org
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
-- Alex
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2012-03-02 10:13 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-02-29 5:01 [Patch opensm] Allow for easily configuring multiple fabrics on one opensm server Doug Ledford
[not found] ` <4F4DB11C.5080203-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2012-02-29 19:22 ` Ira Weiny
[not found] ` <20120229112229.136f25b7.weiny2-i2BcT+NCU+M@public.gmane.org>
2012-02-29 19:47 ` Doug Ledford
[not found] ` <20120301021501.GB961@bukharin.us.cray.com>
[not found] ` <20120301021501.GB961-7GFyYy+Av7rWWZS0+0nfmVaTQe2KTcn/@public.gmane.org>
2012-03-01 13:31 ` Doug Ledford
[not found] ` <4F4F7A4B.4060007-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2012-03-05 12:52 ` Hal Rosenstock
[not found] ` <4F54B707.1070606-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2012-03-05 15:28 ` Doug Ledford
[not found] ` <2962b1d0-a679-45d0-a82b-5d624e2081f9-HOthUlaS0a9+R5eDjrG6zsCp5Q1pQRjfhaY/URYTgi6ny3qCrzbmXA@public.gmane.org>
2012-03-05 15:53 ` Hal Rosenstock
[not found] ` <4F54E177.9030302-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2012-03-05 17:25 ` Doug Ledford
2012-03-01 22:46 ` Ira Weiny
[not found] ` <20120301144645.09aa0d80.weiny2-i2BcT+NCU+M@public.gmane.org>
2012-03-02 10:13 ` Alex Netes [this message]
2012-03-02 10:30 ` Alex Netes
2012-03-02 15:31 ` Doug Ledford
[not found] ` <4F50E7CE.6050204-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2012-03-02 15:47 ` Doug Ledford
2012-03-05 20:51 ` Ira Weiny
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120302101324.GA6644@calypso \
--to=alexne-vpraknaxozvwk0htik3j/w@public.gmane.org \
--cc=dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org \
--cc=ginsbach-WVYJKLFxKCc@public.gmane.org \
--cc=hal-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=weiny2-i2BcT+NCU+M@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox