* OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards?
@ 2012-06-22 0:46 Chet Murthy
2012-06-22 8:43 ` Sebastian Riemer
0 siblings, 1 reply; 4+ messages in thread
From: Chet Murthy @ 2012-06-22 0:46 UTC (permalink / raw)
To: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Hi,
A long while ago, I got OFED 1.5.2 working on Ubuntu 10.04 (Lucid) on
Opterons with Mellanox DDR cards. It was a little messy, getting the
RPMs compiled, but it was pretty straightforward. Basically, I (a)
built a kernel with neither infiniband nor mellanox ethernet drivers,
and (b) ran the OFED install.pl with some minor modifications to
convert the RPMs into DEBs as they were built. And everything worked,
smooth as a whistle.
Today, I tried to do the same thing with OFED 1.5.4.1, and while the
process of -building- was straightforward, once I get done, the card's
state is all zeroes:
chet@memstore3:~$ sudo ibstatus
Infiniband device 'mlx4_0' port 1 status:
default gid: 0000:0000:0000:0000:0000:0000:0000:0000
base lid: 0x0
sm lid: 0x0
state: 1: DOWN
phys state: 3: Disabled
rate: 2.5 Gb/sec (1X)
link_layer: Ethernet
Infiniband device 'mlx4_0' port 2 status:
default gid: 0000:0000:0000:0000:0000:0000:0000:0000
base lid: 0x0
sm lid: 0x0
state: 1: DOWN
phys state: 3: Disabled
rate: 2.5 Gb/sec (1X)
link_layer: Ethernet
The card's a modern ConnectX
1f:00.0 Ethernet controller: Mellanox Technologies MT26448 [ConnectX EN
10GigE, PCIe 2.0 5GT/s] (rev b0)
and on identical RedHat machines, the card's status is quite
different:
[root@memstore4 chet]# ibstatus
Infiniband device 'mlx4_0' port 1 status:
default gid: fe80:0000:0000:0000:0202:c9ff:fe4b:5890
base lid: 0x0
sm lid: 0x0
state: 1: DOWN
phys state: 3: Disabled
rate: 10 Gb/sec (1X QDR)
link_layer: Ethernet
Infiniband device 'mlx4_0' port 2 status:
default gid: fe80:0000:0000:0000:0202:c9ff:fe4b:5891
base lid: 0x0
sm lid: 0x0
state: 4: ACTIVE
phys state: 5: LinkUp
rate: 10 Gb/sec (1X QDR)
link_layer: Ethernet
I'm not even sure how to go about debugging this. Has anybody gotten
OFED to work on Ubuntu with such modern cards?
Thanks,
--chet--
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards?
2012-06-22 0:46 OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards? Chet Murthy
@ 2012-06-22 8:43 ` Sebastian Riemer
[not found] ` <4FE4304E.3090002-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
0 siblings, 1 reply; 4+ messages in thread
From: Sebastian Riemer @ 2012-06-22 8:43 UTC (permalink / raw)
To: chet-aZOuKsOsJu3MbYB6QlFGEg
Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
[-- Attachment #1: Type: text/plain, Size: 3933 bytes --]
Hi Chet,
the trick is to check out the latest pkg-ofed source from debian SVN
(svn://svn.debian.org/svn/pkg-ofed/) and to update the upstream source
by merging the stuff by extracting the source RPMs or even better by
importing the source directly from the git repos of the OFED user space.
In the "debian" directory there are some patches e.g. which change some
stuff in shell scripts for the dash. These need to be adopted.
But you'll have to ensure that the kernel code matches the OFED user
space. The kernel stuff included in OFED doesn't support latest kernels
and is based on an older code base (e.g. OFED 1.5.4 kernel stuff is
based on 2.6.30). I hope that you don't need iSER. The open-iscsi kernel
stuff in there is also based on 2.6.30 which means that you would need
old open-iscsi user space.
This is why we've decided to follow what they call "upstream" in this
list. This means: Use the OFED kernel code from the matching vanilla
kernel from kernel.org.
Here a simple list of matching code:
OFED-1.5.4 ---> kernel 3.2.x
OFED-1.5.4.1 ---> kernel 3.3.x
I've attached the IB user space HOWTO from Or Gerlitz for the git repos.
Some of the git repos already have a debinan directory.
Do you know how to build Debian packages?
Cheers,
Sebastian
On 22/06/12 02:46, Chet Murthy wrote:
>
> Hi,
>
> A long while ago, I got OFED 1.5.2 working on Ubuntu 10.04 (Lucid) on
> Opterons with Mellanox DDR cards. It was a little messy, getting the
> RPMs compiled, but it was pretty straightforward. Basically, I (a)
> built a kernel with neither infiniband nor mellanox ethernet drivers,
> and (b) ran the OFED install.pl with some minor modifications to
> convert the RPMs into DEBs as they were built. And everything worked,
> smooth as a whistle.
>
> Today, I tried to do the same thing with OFED 1.5.4.1, and while the
> process of -building- was straightforward, once I get done, the card's
> state is all zeroes:
>
> chet@memstore3:~$ sudo ibstatus
> Infiniband device 'mlx4_0' port 1 status:
> default gid: 0000:0000:0000:0000:0000:0000:0000:0000
> base lid: 0x0
> sm lid: 0x0
> state: 1: DOWN
> phys state: 3: Disabled
> rate: 2.5 Gb/sec (1X)
> link_layer: Ethernet
>
> Infiniband device 'mlx4_0' port 2 status:
> default gid: 0000:0000:0000:0000:0000:0000:0000:0000
> base lid: 0x0
> sm lid: 0x0
> state: 1: DOWN
> phys state: 3: Disabled
> rate: 2.5 Gb/sec (1X)
> link_layer: Ethernet
>
> The card's a modern ConnectX
>
> 1f:00.0 Ethernet controller: Mellanox Technologies MT26448 [ConnectX EN
> 10GigE, PCIe 2.0 5GT/s] (rev b0)
>
> and on identical RedHat machines, the card's status is quite
> different:
>
>
> [root@memstore4 chet]# ibstatus
> Infiniband device 'mlx4_0' port 1 status:
> default gid: fe80:0000:0000:0000:0202:c9ff:fe4b:5890
> base lid: 0x0
> sm lid: 0x0
> state: 1: DOWN
> phys state: 3: Disabled
> rate: 10 Gb/sec (1X QDR)
> link_layer: Ethernet
>
> Infiniband device 'mlx4_0' port 2 status:
> default gid: fe80:0000:0000:0000:0202:c9ff:fe4b:5891
> base lid: 0x0
> sm lid: 0x0
> state: 4: ACTIVE
> phys state: 5: LinkUp
> rate: 10 Gb/sec (1X QDR)
> link_layer: Ethernet
>
> I'm not even sure how to go about debugging this. Has anybody gotten
> OFED to work on Ubuntu with such modern cards?
>
> Thanks,
> --chet--
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
[-- Attachment #2: IB-user-space-howto.txt --]
[-- Type: text/plain, Size: 3720 bytes --]
IB user space HOWTO
June 2012
Or Gerlitz <ogerlitz-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
This little note attempts to get you through how to get the upstream
user-space IB packages, specifically libibverbs/libmlx4/librdmacm and/or
opensm and the IB diags.
Under Fedora / RHEL, installing the INBOX user-space IB/RDMA offering is easy as
# yum groupinstall "Infiniband Support"
The IB service is called rdma (vs. openibd which used to be the name in older RHEL/Fedora
releases) and there is an rpm named "rdma" with various scripts. Note that this will
not install opensm/diags (see below).
If you are seeking the latest RELEASE done by the maintainers, its also trivial,
the releases are provided in the form of tar balls which you plug into
"rpmbuild -ts" and you have fresh source RPM to build and later install.
Going more hackish, you would need to build the sources from the maintainers
git, the git trees contain spec files, so the process would be to create
the tarballs and then repeat the rpmbuild excercise.
See below links to where there are tarball releases and the git trees where
here gitweb links are provided, they have the git pointer to clone from inside.
Here's the list of maintainers
Roland Dreier <roland-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> for libibverbs/libmlx4
Sean Hefty <sean.hefty-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org> for librdmacm
Alex Netes <alexne-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org> for opensm and libibumad
Ira Weiny <weiny2-i2BcT+NCU+M@public.gmane.org> for libibmad and the IB diags
Ido Shamai <idos-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org> for perftest (ib_send_bw and friends)
Roland and Sean are also the maintainers of the IB kernel stack.
The mailing list for reporting issues is <linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
there's no need for subscription, please make sure to CC the list when
you send email to the maintainer.
Last but not least, make sure to enjoy your work, its really simple,
and please do let me know if/what something is missing here.
tarballs based releases
libibverbs https://openfabrics.org/downloads/verbs
libmlx4 https://openfabrics.org/downloads/mlx4
librdmacm https://openfabrics.org/downloads/rdmacm
perftest https://openfabrics.org/downloads/perftest
opensm, libibumad, libibmad and infiniband-diags under
https://openfabrics.org/downloads/management
GIT
libibverbs http://git.kernel.org/?p=libs/infiniband/libibverbs.git;a=summary
libmlx4 http://git.kernel.org/?p=libs/infiniband/libmlx4.git;a=summary
librdmacm http://git.openfabrics.org/git?p=~shefty/librdmacm.git;a=summary
opensm http://git.openfabrics.org/git?p=~alexnetes/opensm.git;a=summary
libibumad http://git.openfabrics.org/git?p=~alexnetes/libibumad.git;a=shortlog
ib-diags http://git.openfabrics.org/git?p=~iraweiny/infiniband-diags.git;a=summary
libibmad http://git.openfabrics.org/git?p=~iraweiny/libibmad.git;a=summary
diags look for the git trees under which belong to Ira Weiny
# yum groupinfo "Infiniband Support"
Loaded plugins: product-id, security, subscription-manager
Updating certificate-based repositories.
Setting up Group Process
Group: Infiniband Support
Description: Software designed for supporting clustering and grid connectivity using RDMA-based InfiniBand and iWARP fabrics.
Mandatory Packages:
libibcm
libibverbs
libibverbs-utils
librdmacm
librdmacm-utils
rdma
Default Packages:
dapl
ibsim
ibutils
libcxgb3
libibmad
libibumad
libipathverbs
libmlx4
libmthca
libnes
rds-tools
Optional Packages:
compat-dapl
infiniband-diags
libibcommon
mstflint
opensm
perftest
qperf
srptools
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards?
[not found] ` <4FE4304E.3090002-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
@ 2012-06-22 19:02 ` Chet Murthy
2012-06-25 9:35 ` Sebastian Riemer
0 siblings, 1 reply; 4+ messages in thread
From: Chet Murthy @ 2012-06-22 19:02 UTC (permalink / raw)
To: Sebastian Riemer; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Sebastian,
Thank you for taking the time to explain these things! It's a little
confusing ....
> Here a simple list of matching code:
> OFED-1.5.4 ---> kernel 3.2.x
> OFED-1.5.4.1 ---> kernel 3.3.x
(1) Is there a more-exhaustive list of the right kernel to use with
each OFED release? I was going by the OFED docs (e.g. release notes),
which seemed to indicate that for 1.5.4.1, the "right" range of
kernels was (kernel.org: 2.6.30 - 3.1), and specific kernel versions
for various distros.
(2) I'm pretty familiar with adminstering Debian systems and building
debian packages, hacking their insides, alienizing, hacking that
process, etc.
(I -think- ;-) The only real question for me is, which versions, with
which patches, of the various bits, will work together with this RoCEE
card.
(3) I'm -not at all- familiar with the workflow/process that Debian
Developers use. For instance, I don't really understand what you men
below:
> But you'll have to ensure that the kernel code matches the OFED user
> space. The kernel stuff included in OFED doesn't support latest kernels
> and is based on an older code base (e.g. OFED 1.5.4 kernel stuff is
> based on 2.6.30).
Do you mean that the kernel-ib RPM in 1.5.4 is the code form the
2.6.30 kernel? But then the list below doesn't seem to make sense.
> Here a simple list of matching code:
> OFED-1.5.4 ---> kernel 3.2.x
> OFED-1.5.4.1 ---> kernel 3.3.x
(4) I think what you're saying here
> the trick is to check out the latest pkg-ofed source from debian SVN
> (svn://svn.debian.org/svn/pkg-ofed/) and to update the upstream source
> by merging the stuff by extracting the source RPMs or even better by
> importing the source directly from the git repos of the OFED user space.
> In the "debian" directory there are some patches e.g. which change some
> stuff in shell scripts for the dash. These need to be adopted.
is:
(a) check out the stuff from svn.debian.org
(b) pull source from the OFED repos user-space
(c) -copy- that (latest) OFED source into the tree I checked-out
from debian
(d) make sure that the patches in the debian directories apply
properly to the various shellscripts
(e) build debian packages per usual
And per your instructions above, I believe you're saying I should be
using a 3.3.x kernel?
Thanks,
--chet--
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards?
2012-06-22 19:02 ` Chet Murthy
@ 2012-06-25 9:35 ` Sebastian Riemer
0 siblings, 0 replies; 4+ messages in thread
From: Sebastian Riemer @ 2012-06-25 9:35 UTC (permalink / raw)
To: Chet Murthy; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Hi Chet,
On 22/06/12 21:02, Chet Murthy wrote:
>
> Sebastian,
>
> Thank you for taking the time to explain these things! It's a little
> confusing ....
>
>> Here a simple list of matching code:
>> OFED-1.5.4 ---> kernel 3.2.x
>> OFED-1.5.4.1 ---> kernel 3.3.x
>
> (1) Is there a more-exhaustive list of the right kernel to use with
> each OFED release? I was going by the OFED docs (e.g. release notes),
> which seemed to indicate that for 1.5.4.1, the "right" range of
> kernels was (kernel.org: 2.6.30 - 3.1), and specific kernel versions
> for various distros.
Unfortunately, there is no more-exhaustive list for matching the kernel
code with the OFED user space. It's a matter of comparing dates - kernel
release and OFED release.
O.K., here is how they put the OFA kernel code into OFED:
- kernel developers develop for the latest kernel release cycle (here 3.3)
- OFED packagers use an older kernel as basis (2.6.30) and forward port
the OFA kernel stuff to the current kernel release (here 3.3) by patches
for kernels (2.6.30..3.1) - this leaves space for failures (e.g. that
they don't port the open-iscsi kernel code correctly)
- this is why they say that they don't support the mainline kernels
completely
We at ProfitBricks need latest kernels anyway. This is why we match it
from upstream (OFA kernel stuff from kernel.org). And we don't have to
build the OFA kernel modules from out-of-tree which simplifies our
kernel build chain. We have OFED-1.5.4 with OFA kernel code from kernel
3.2 at the moment.
But there is also a new OFED release approach:
Perhaps you've seen the OFED-3.2 already?! This is the OFED especially
for kernel 3.2. This makes it easier to match OFED user space and kernel
code. Here they just backport the OFA kernel stuff e.g. from 3.4 to 3.2.
Looks promising, but I have no experience with that, yet.
> (2) I'm pretty familiar with adminstering Debian systems and building
> debian packages, hacking their insides, alienizing, hacking that
> process, etc.
>
> (I -think- ;-) The only real question for me is, which versions, with
> which patches, of the various bits, will work together with this RoCEE
> card.
Your issue can be something with the shell scripts, kernel code to user
space matching or plain that you don't have the opensm running. Without
a running instance of a subnet manager your card will get no LID
assigned, no partition key, etc. IPoIB, MPI, iSER, SRP, etc. won't work.
Check with "ibdiagnet -r" if your master subnet manager is running. IB
is self-managed by the subnet manager. Make sure that your opensm
configuration is correct.
We have big deployments and don't want to have "rpm" installed on Debian
systems. This is why we've taken OFED-1.5.2 stuff from debian
experimental from "pkg-ofed". We've converted the SVN stuff into git
repos for OFED, imported the OFED-1.5.4 upstream code and adopted the
modifications by Debian (e.g. shell code changes). Now, we can build
OFED with "git-buildpackage" and upload the deb packages to our debian
repository.
> (3) I'm -not at all- familiar with the workflow/process that Debian
> Developers use. For instance, I don't really understand what you men
> below:
>
>> But you'll have to ensure that the kernel code matches the OFED user
>> space. The kernel stuff included in OFED doesn't support latest kernels
>> and is based on an older code base (e.g. OFED 1.5.4 kernel stuff is
>> based on 2.6.30).
>
> Do you mean that the kernel-ib RPM in 1.5.4 is the code form the
> 2.6.30 kernel? But then the list below doesn't seem to make sense.
>
>> Here a simple list of matching code:
>> OFED-1.5.4 ---> kernel 3.2.x
>> OFED-1.5.4.1 ---> kernel 3.3.x
I've explained this above.
> (4) I think what you're saying here
>
>> the trick is to check out the latest pkg-ofed source from debian SVN
>> (svn://svn.debian.org/svn/pkg-ofed/) and to update the upstream source
>> by merging the stuff by extracting the source RPMs or even better by
>> importing the source directly from the git repos of the OFED user space.
>> In the "debian" directory there are some patches e.g. which change some
>> stuff in shell scripts for the dash. These need to be adopted.
>
> is:
>
> (a) check out the stuff from svn.debian.org
>
> (b) pull source from the OFED repos user-space
>
> (c) -copy- that (latest) OFED source into the tree I checked-out
> from debian
>
> (d) make sure that the patches in the debian directories apply
> properly to the various shellscripts
>
> (e) build debian packages per usual
>
> And per your instructions above, I believe you're saying I should be
> using a 3.3.x kernel?
Yes, this is exactly what I would suggest to you if you want to have a
really working solution without "rpm". You should at least have a look
at this or try it to see if this fixes your issues and if this gives you
advantages.
Cheers,
Sebastian
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2012-06-25 9:35 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-06-22 0:46 OFED 1.5.4.1 on Ubuntu 10.04 with Mellanox cards? Chet Murthy
2012-06-22 8:43 ` Sebastian Riemer
[not found] ` <4FE4304E.3090002-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2012-06-22 19:02 ` Chet Murthy
2012-06-25 9:35 ` Sebastian Riemer
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox