public inbox for linux-rdma@vger.kernel.org
From: Ira Weiny <weiny2-i2BcT+NCU+M@public.gmane.org>
To: "Heinz,
	Michael William"
	<michael.william.heinz-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
Cc: "linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
	<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
	Bob Ciotti <Bob.Ciotti-NSQ8wuThN14@public.gmane.org>,
	James Silva <jsilva-i2BcT+NCU+M@public.gmane.org>
Subject: Re: [RFC] Proposal to change Node Description naming scheme for HCA's
Date: Mon, 2 Apr 2012 13:58:08 -0700
Message-ID: <20120402135808.df12d36b.weiny2@llnl.gov>
In-Reply-To: <F93D748CDCC46C42B498F40A38D90369641004-AtyAts71sc9zLByeVOV5+bfspsVTdybXVpNB7YpNyf8@public.gmane.org>

On Mon, 2 Apr 2012 15:27:35 +0000
"Heinz, Michael William" <michael.william.heinz-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org> wrote:

> I like this idea - but it reminds me of a related issue I raised a while back: nodes can often set the HCA description before they receive a hostname from DHCP - in which case you end up with saquery output full of "localhost HCA-1".
> 
> At the time, QLogic's proposal was to modify the kernel stack so that it extracted the hostname at the time of the query instead of at boot time - but the linux-rdma list did not like that solution.
> 

That is not acceptable because it hard-codes the hostname at a low level.  We should allow the user to override the Node Description if they so desire.  If the SM is faster than the boot scripts (rdma, openibd, udev, dhcp, or whatever), it is better that the SM see the Node Description change trap and update the ND then.  As a final "fix", OpenSM has an "update_desc" option in the console to force a resweep if the names from the SA are wrong.
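
To make the userspace approach concrete, here is a minimal sketch of what a boot script could do, assuming the writable node_desc attribute under /sys/class/infiniband.  It is not the actual rdma or openibd code; the function name and the SYSFS_IB and NODE_DESC_PREFIX variables are made up for illustration.

```shell
#!/bin/sh
# Sketch only: name each HCA "<hostname> <hca name>" instead of
# "<hostname> HCA-<num>".
# SYSFS_IB and NODE_DESC_PREFIX are hypothetical knobs invented for
# this example; they are not part of any existing script.

set_node_descs() {
    # Optional first argument: hostname to use.
    # Defaults to the current short hostname.
    host="${1:-$(hostname -s 2>/dev/null || hostname)}"
    sysfs_ib="${SYSFS_IB:-/sys/class/infiniband}"

    for dev in "$sysfs_ib"/*; do
        # Skip anything without a writable node_desc attribute
        # (this also covers the case where the glob matched nothing).
        [ -w "$dev/node_desc" ] || continue

        hca="${dev##*/}"    # e.g. mlx4_0, qib0

        # A user-supplied prefix overrides the hostname.
        printf '%s %s' "${NODE_DESC_PREFIX:-$host}" "$hca" > "$dev/node_desc"
    done
}

# Example (as root, from a boot script):
#   set_node_descs
```

Because this is only a sysfs write from userspace, nothing is hard coded: an admin can rerun it once DHCP has assigned the real hostname, or write any other string to node_desc by hand.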

> Any ideas on how we could solve the hostname problem while we're changing the description?
> 
> As for what installs the openibd script, I'm pretty sure that's part of ofa_kernel.

Thanks, I will look for that.

Ira

> 
> -----Original Message-----
> From: linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org [mailto:linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org] On Behalf Of Ira Weiny
> Sent: Friday, March 30, 2012 8:39 PM
> To: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> Cc: Doug Ledford; Bob Ciotti; James Silva
> Subject: [RFC] Proposal to change Node Description naming scheme for HCA's
> 
> First, a question: what package installs the openibd script in OFED?  For the life of me I can't find this script in 1.5.4.1 or 3.2 ...  :-/  [*]
> 
> Right now the "standard" for node description is, AFAIK, "<hostname> HCA-<num>", where num is simply a counter for the HCAs as they are found in /sys/class/infiniband.
> 
> The problem is resolving this "random" HCA number to an actual HCA on the host.  I thought about including the node description in ibstat but that seems a bit short-sighted.  I think the better solution would be to append the HCA name (i.e. mlx4_X, qibX, etc.) to the hostname for the Node Description.
> 
> Hacking the RHEL start up script to do this is really easy, and it results in nice names on the fabric which are easily resolved by the infiniband-diags, ibverbs, and perftest utilities.
> 
> bash-4.1# ibhosts
> Ca      : 0x0002c90300325280 ports 1 "ending mlx4_2"
> Ca      : 0x001175000079da38 ports 1 "happy qib0"
> Ca      : 0x0002c90300108f2e ports 1 "ending mlx4_1"
> Ca      : 0x001175000077d90e ports 2 "ending qib0"
> Ca      : 0x0002c903004bebda ports 2 "happy mlx4_0"
> bash-4.1# hostname
> happy
> bash-4.1# ibstat mlx4_0
> CA 'mlx4_0'
>         CA type: MT26428
>         Number of ports: 2
>         Firmware version: 2.8.600
> ...
> bash-4.1# ibv_rc_pingpong -d mlx4_0
>   local address:  LID 0x0008, QPN 0x16004a, PSN 0x2e8316 ...
> bash-4.1# rdma_bw -d mlx4_0
> 6089: | port=18515 | ib_port=1 | size=65536 | tx_depth=100 | sl=0 | iters=1000 | duplex=0 | cma=0 | ...
> 
> I realize this is really a distro thing but it would be nice if we could agree to change the current "standard".
> 
> I can send a patch for RHEL and OFED (if someone can point me to the openibd script or srpm).
> 
> Thoughts?
> Ira
> 
> 
> [*] Last I knew openibd does the same as RHEL's rdma start up script in this regard.
> 
> --
> Ira Weiny
> Member of Technical Staff
> Lawrence Livermore National Lab
> 925-423-8008
> weiny2-i2BcT+NCU+M@public.gmane.org
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at  http://vger.kernel.org/majordomo-info.html


-- 
Ira Weiny
Member of Technical Staff
Lawrence Livermore National Lab
925-423-8008
weiny2-i2BcT+NCU+M@public.gmane.org
