* mode connected infiniband
@ 2010-01-14 8:11 Nico
[not found] ` <8fd3bb681001140011k5ec9492eg19b59b110e45a2b5-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
0 siblings, 1 reply; 10+ messages in thread
From: Nico @ 2010-01-14 8:11 UTC (permalink / raw)
To: linux-rdma-u79uwXL29TY76Z2rM5mHXA
Hello,
I'am using an infiniband kernel 2.6.32-trunk-686 from debian,
and it seems /sys/class/net/ib0/mode is missing :
vza:/sys/class/net/ib0# ls -l
total 0
-r--r--r-- 1 root root 4096 janv. 14 08:55 address
-r--r--r-- 1 root root 4096 janv. 14 08:55 addr_len
-r--r--r-- 1 root root 4096 janv. 14 08:55 broadcast
-r--r--r-- 1 root root 4096 janv. 14 08:55 carrier
--w--w--w- 1 root root 4096 janv. 14 08:55 create_child
--w--w--w- 1 root root 4096 janv. 14 08:55 delete_child
lrwxrwxrwx 1 root root 0 janv. 14 08:55 device -> ../../../0000:05:00.0
-r--r--r-- 1 root root 4096 janv. 14 08:55 dev_id
-r--r--r-- 1 root root 4096 janv. 14 08:55 dormant
-r--r--r-- 1 root root 4096 janv. 14 08:55 features
-rw-r--r-- 1 root root 4096 janv. 14 08:55 flags
-rw-r--r-- 1 root root 4096 janv. 14 08:55 ifalias
-r--r--r-- 1 root root 4096 janv. 14 08:55 ifindex
-r--r--r-- 1 root root 4096 janv. 14 08:55 iflink
-r--r--r-- 1 root root 4096 janv. 14 08:55 link_mode
-rw-r--r-- 1 root root 4096 janv. 14 08:55 mtu
-r--r--r-- 1 root root 4096 janv. 14 08:55 operstate
-r--r--r-- 1 root root 4096 janv. 14 08:55 pkey
drwxr-xr-x 2 root root 0 janv. 14 08:55 power
drwxr-xr-x 2 root root 0 janv. 14 08:55 statistics
lrwxrwxrwx 1 root root 0 janv. 14 08:55 subsystem ->
../../../../../../../../class/net
-rw-r--r-- 1 root root 4096 janv. 14 08:55 tx_queue_len
-r--r--r-- 1 root root 4096 janv. 14 08:55 type
-rw-r--r-- 1 root root 4096 janv. 14 08:55 uevent
-rw-r--r-- 1 root root 4096 janv. 14 08:55 umcast
as I am doing some benchs and wanted to size up MTU with connected mode,
* is there another way to use connected mode ?
* is it related to debian packaging ?
Regards.
Nicolas
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread[parent not found: <8fd3bb681001140011k5ec9492eg19b59b110e45a2b5-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <8fd3bb681001140011k5ec9492eg19b59b110e45a2b5-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2010-01-14 16:06 ` Frank Leers 2010-01-14 17:22 ` Roland Dreier 1 sibling, 0 replies; 10+ messages in thread From: Frank Leers @ 2010-01-14 16:06 UTC (permalink / raw) To: Nico; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA On Jan 14, 2010, at 12:11 AM, Nico wrote: > Hello, > > I'am using an infiniband kernel 2.6.32-trunk-686 from debian, > and it seems /sys/class/net/ib0/mode is missing : > It might be that when OFED was built, the "CONFIG_INFINIBAND_IPOIB_CM=y" was set to 'n' in your 'configure.mk.kernel' or on the command line during the ./configure step - --with-ipoib-cm make CONFIG_INFINIBAND_IPOIB_CM=y [yes] --without-ipoib-cm [no] -frank > vza:/sys/class/net/ib0# ls -l > total 0 > -r--r--r-- 1 root root 4096 janv. 14 08:55 address > -r--r--r-- 1 root root 4096 janv. 14 08:55 addr_len > -r--r--r-- 1 root root 4096 janv. 14 08:55 broadcast > -r--r--r-- 1 root root 4096 janv. 14 08:55 carrier > --w--w--w- 1 root root 4096 janv. 14 08:55 create_child > --w--w--w- 1 root root 4096 janv. 14 08:55 delete_child > lrwxrwxrwx 1 root root 0 janv. 14 08:55 device -> ../../../0000:05:00.0 > -r--r--r-- 1 root root 4096 janv. 14 08:55 dev_id > -r--r--r-- 1 root root 4096 janv. 14 08:55 dormant > -r--r--r-- 1 root root 4096 janv. 14 08:55 features > -rw-r--r-- 1 root root 4096 janv. 14 08:55 flags > -rw-r--r-- 1 root root 4096 janv. 14 08:55 ifalias > -r--r--r-- 1 root root 4096 janv. 14 08:55 ifindex > -r--r--r-- 1 root root 4096 janv. 14 08:55 iflink > -r--r--r-- 1 root root 4096 janv. 14 08:55 link_mode > -rw-r--r-- 1 root root 4096 janv. 14 08:55 mtu > -r--r--r-- 1 root root 4096 janv. 14 08:55 operstate > -r--r--r-- 1 root root 4096 janv. 14 08:55 pkey > drwxr-xr-x 2 root root 0 janv. 14 08:55 power > drwxr-xr-x 2 root root 0 janv. 14 08:55 statistics > lrwxrwxrwx 1 root root 0 janv. 14 08:55 subsystem -> > ../../../../../../../../class/net > -rw-r--r-- 1 root root 4096 janv. 14 08:55 tx_queue_len > -r--r--r-- 1 root root 4096 janv. 14 08:55 type > -rw-r--r-- 1 root root 4096 janv. 14 08:55 uevent > -rw-r--r-- 1 root root 4096 janv. 14 08:55 umcast > > as I am doing some benchs and wanted to size up MTU with connected mode, > > * is there another way to use connected mode ? > * is it related to debian packaging ? > > Regards. > > Nicolas > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: mode connected infiniband [not found] ` <8fd3bb681001140011k5ec9492eg19b59b110e45a2b5-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2010-01-14 16:06 ` Frank Leers @ 2010-01-14 17:22 ` Roland Dreier [not found] ` <adabpgwmwiw.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org> 1 sibling, 1 reply; 10+ messages in thread From: Roland Dreier @ 2010-01-14 17:22 UTC (permalink / raw) To: Nico; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA > I'am using an infiniband kernel 2.6.32-trunk-686 from debian, I guess the problem is that standard don't enable CONFIG_INFINIBAND_IPOIB_CM. It's probably worth filing a Debian bug for that, and we should probably change the upstream kernel default to "yes" as well. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
[parent not found: <adabpgwmwiw.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <adabpgwmwiw.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org> @ 2010-01-16 13:42 ` Nico 2010-01-17 8:03 ` Tziporet Koren 1 sibling, 0 replies; 10+ messages in thread From: Nico @ 2010-01-16 13:42 UTC (permalink / raw) To: Roland Dreier; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA Hi, I just created a debian bug : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=565494 Regards. Nicolas 2010/1/14 Roland Dreier <rdreier-FYB4Gu1CFyUAvxtiuMwx3w@public.gmane.org>: > > > I'am using an infiniband kernel 2.6.32-trunk-686 from debian, > > I guess the problem is that standard don't enable > CONFIG_INFINIBAND_IPOIB_CM. It's probably worth filing a Debian bug for > that, and we should probably change the upstream kernel default to "yes" > as well. > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: mode connected infiniband [not found] ` <adabpgwmwiw.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org> 2010-01-16 13:42 ` Nico @ 2010-01-17 8:03 ` Tziporet Koren [not found] ` <4B52C456.5030501-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> 1 sibling, 1 reply; 10+ messages in thread From: Tziporet Koren @ 2010-01-17 8:03 UTC (permalink / raw) To: Roland Dreier; +Cc: Nico, linux-rdma-u79uwXL29TY76Z2rM5mHXA On 1/14/2010 7:22 PM, Roland Dreier wrote: > and we should probably change the upstream kernel default to "yes" > as well. > > I agree IPoIB CM is very stable now and has a better performance Tziporet -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
[parent not found: <4B52C456.5030501-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <4B52C456.5030501-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> @ 2010-01-18 2:56 ` Jason Gunthorpe [not found] ` <20100118025626.GG9059-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org> 0 siblings, 1 reply; 10+ messages in thread From: Jason Gunthorpe @ 2010-01-18 2:56 UTC (permalink / raw) To: Tziporet Koren; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA On Sun, Jan 17, 2010 at 10:03:34AM +0200, Tziporet Koren wrote: > IPoIB CM is very stable now and has a better performance The last time I tried to use it the kernel began reporting lots of OOM events (2.6.30 stock). I thought this was well known because CM mode uses high order allocations?? Jason -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
[parent not found: <20100118025626.GG9059-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <20100118025626.GG9059-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org> @ 2010-01-20 1:09 ` Roland Dreier [not found] ` <ab66a8181001200002n120af0e7m3733b42a945dc06b@mail.gmail.com> [not found] ` <adar5pl37ky.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org> 2010-01-20 1:10 ` Roland Dreier 1 sibling, 2 replies; 10+ messages in thread From: Roland Dreier @ 2010-01-20 1:09 UTC (permalink / raw) To: Jason Gunthorpe; +Cc: Tziporet Koren, linux-rdma-u79uwXL29TY76Z2rM5mHXA > The last time I tried to use it the kernel began reporting lots of > OOM events (2.6.30 stock). I thought this was well known because CM > mode uses high order allocations?? That's not well-known to me. What's the backtrace for those high-order allocations? I thought the CM code was careful to allocate receive buffers using page-size fragments but maybe there's some other path (skb rings, QP/CQ structures?) that does a higher-order alloc. - R. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
[parent not found: <ab66a8181001200002n120af0e7m3733b42a945dc06b@mail.gmail.com>]
[parent not found: <ab66a8181001200002n120af0e7m3733b42a945dc06b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <ab66a8181001200002n120af0e7m3733b42a945dc06b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2010-01-20 19:21 ` Roland Dreier 0 siblings, 0 replies; 10+ messages in thread From: Roland Dreier @ 2010-01-20 19:21 UTC (permalink / raw) To: Gennadiy Nerubayev Cc: Jason Gunthorpe, Tziporet Koren, linux-rdma-u79uwXL29TY76Z2rM5mHXA > I'll contribute my 2c here, as I have experienced some page allocation > errors in the past when using drbd and cm. I think drbd in particular > might trigger it since it's pushes a lot of data through the link, but > regardless the circumstances are somewhat difficult to replicate. I'm not > completely certain they are relevant, but here are some of the traces that > I've noticed then; these were observed on some version of OFED 1.4 if I > recall. Setting /proc/sys/vm/min_free_kbytes to some ridiculous number does > work around this. > > http://www.mail-archive.com/drbd-user-cunTk1MwBs8qoQakbn7OcQ@public.gmane.org/msg00441.html > http://lists.linbit.com/pipermail/drbd-user/2009-April/011880.html Thanks. I notice that neither of these tracebacks actually have any IPoIB functions (or indeed anything RDMA-related). So it seems the problem is probably somewhere more generic, triggered by the large MTU allowed by IPoIB CM. One traceback starts with drbd0_worker: page allocation failure. order:5, mode:0x10 so someone is trying to do an order 5 allocation on the send path. And the other one starts with drbd0_receiver: page allocation failure. order:4, mode:0x20 So similarly, the receive path is trying an order 4 allocation. Are both of these tracebacks using IPoIB CM with a ~64K MTU? - R. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
[parent not found: <adar5pl37ky.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>]
* Re: mode connected infiniband [not found] ` <adar5pl37ky.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org> @ 2010-01-20 19:29 ` Jason Gunthorpe 0 siblings, 0 replies; 10+ messages in thread From: Jason Gunthorpe @ 2010-01-20 19:29 UTC (permalink / raw) To: Roland Dreier; +Cc: Tziporet Koren, linux-rdma-u79uwXL29TY76Z2rM5mHXA On Tue, Jan 19, 2010 at 05:09:49PM -0800, Roland Dreier wrote: > > > The last time I tried to use it the kernel began reporting lots of > > OOM events (2.6.30 stock). I thought this was well known because CM > > mode uses high order allocations?? > > That's not well-known to me. What's the backtrace for those high-order > allocations? I thought the CM code was careful to allocate receive > buffers using page-size fragments but maybe there's some other path (skb > rings, QP/CQ structures?) that does a higher-order alloc. Hmm, I'll keep an eye out then. I've seen it several times in very different systems, usually just turn CM mode off and it goes away. Unfortunately I have no systems like that now and I didn't keep the traces... The traces did have ipoib function calls. Jason -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: mode connected infiniband [not found] ` <20100118025626.GG9059-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org> 2010-01-20 1:09 ` Roland Dreier @ 2010-01-20 1:10 ` Roland Dreier 1 sibling, 0 replies; 10+ messages in thread From: Roland Dreier @ 2010-01-20 1:10 UTC (permalink / raw) To: Jason Gunthorpe; +Cc: Tziporet Koren, linux-rdma-u79uwXL29TY76Z2rM5mHXA > > IPoIB CM is very stable now and has a better performance > > The last time I tried to use it the kernel began reporting lots of > OOM events (2.6.30 stock). I thought this was well known because CM > mode uses high order allocations?? And in any case changing the kernel config option default just means that the code gets built, but we start in datagram mode by default anyway. So it just means people can enable it at runtime without having to rebuild the kernel. - R. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2010-01-20 19:29 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-01-14 8:11 mode connected infiniband Nico
[not found] ` <8fd3bb681001140011k5ec9492eg19b59b110e45a2b5-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-14 16:06 ` Frank Leers
2010-01-14 17:22 ` Roland Dreier
[not found] ` <adabpgwmwiw.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>
2010-01-16 13:42 ` Nico
2010-01-17 8:03 ` Tziporet Koren
[not found] ` <4B52C456.5030501-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>
2010-01-18 2:56 ` Jason Gunthorpe
[not found] ` <20100118025626.GG9059-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
2010-01-20 1:09 ` Roland Dreier
[not found] ` <ab66a8181001200002n120af0e7m3733b42a945dc06b@mail.gmail.com>
[not found] ` <ab66a8181001200002n120af0e7m3733b42a945dc06b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-20 19:21 ` Roland Dreier
[not found] ` <adar5pl37ky.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>
2010-01-20 19:29 ` Jason Gunthorpe
2010-01-20 1:10 ` Roland Dreier
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox