From: Rick Jones <rick.jones2@hp.com>
To: Bryan Lawver <lawver1@llnl.gov>
Cc: "Michael S. Tsirkin" <mst@dev.mellanox.co.il>,
general@lists.openfabrics.org,
Linux Network Development list <netdev@vger.kernel.org>
Subject: Re: [ofa-general] Re: IPoIB forwarding
Date: Fri, 27 Apr 2007 15:32:39 -0700 [thread overview]
Message-ID: <46327A07.1000404@hp.com> (raw)
In-Reply-To: <6.1.2.0.2.20070427152027.13fe46d0@mail.llnl.gov>
Bryan Lawver wrote:
> I hit the IP NIC over the head with a hammer and turned off all offload
> features and I no longer get the super jumbo packet and I have symmetric
> performance. This NIC supported "ethtool -K ethx tso/tx/rx/sg on/off"
> and I am not sure at this time which one I needed to whack but all off
> solved the problem.
Yeah, that does seem like a rather broad remedy, but I guess if it works... :)
And I suppose most of those offloads don't matter for a NIC being used in a router.
Only problem is we don't know if it worked because it slowed-down the 10G side
or because it had LRO disabling as a side-effect. If I were to guess, of those
things listed, I'd guess that receive cko would have that as a side effect.
Just what sort of 10G NIC was this anyway? With that knowledge we could
probably narrow things down to a more specific modprobe setting, or maybe even
an ethtool command, for some suitable revision of ethtool.
rick jones
>
> Thanks for listening and re enforcing my search process.
>
> bryan
>
> At 01:32 PM 4/27/2007, Rick Jones wrote:
>
>> Bryan Lawver wrote:
>>
>>> Your right about the ipoib module not combining packets (I believed
>>> you without checking) but I did never the less. The ipoib_start_xmit
>>> routine is definitely handed a "double packet" which means that the
>>> IP NIC driver or the kernel is combining two packets into a single
>>> super jumbo packet. This issue is irrespective of the IP MTU setting
>>> because I have set all interfaces to 9000k yet ipoib accepts and
>>> forwards this 17964 packet to the next IB node and onto the TCP stack
>>> where it is never acknowledged. This may not have come up in prior
>>> testing because I am using some of the fastest IP NICs which have no
>>> trouble keeping up with or exceeding the bandwidth of the IB side.
>>> This issue arises exactly every 8 packets...(ring buffer overrun??)
>>> I will be at Sonoma for the next few days as many on this list will be.
>>
>>
>>
>> Some NICs (esp 10G) support large receive offload - they coalesce TCP
>> segments from the wire/fiber into larger ones they pass up the stack.
>> Perhaps that is happening here?
>>
>> I'm going to go out a bit on a limb, cross the streams, and include
>> netdev, because I suspect that if a system is acting as an IP router,
>> one doesn't want large receive offload enabled. That may need some
>> discussion in netdev - it may then require some changes to default
>> settings or some documentation enhancements. That or I'll learn that
>> the stack is already dealing with the issue...
>>
>> rick jones
>>
>>> bryan
>>>
>>> At 11:06 AM 4/26/2007, Michael S. Tsirkin wrote:
>>>
>>>> > Quoting Bryan Lawver <lawver1@llnl.gov>:
>>>> > Subject: Re: IPoIB forwarding
>>>> >
>>>> > Here's a tcpdump of the same sequence. The TCP MSS is 8960 and it
>>>> appears
>>>> > that two payloads are queued at ipoib which combines them into a
>>>> single
>>>> > 17920 payload with assumingly correct IP header (40) and IB header
>>>> > (4). The application or TCP stack does not acknowledge this
>>>> double packet
>>>> > ie. it does not ACK until each of the 8960 packets are resent
>>>> > individually. Being an IB newbie, I am guessing this combining is
>>>> > allowable but may violate TCP protocol.
>>>>
>>>> IPoIB does nothing like this - it's just a network device so
>>>> it sends all packets out as is.
>>>>
>>>> --
>>>> MST
>>>
>>>
>>> _______________________________________________
>>> general mailing list
>>> general@lists.openfabrics.org
>>> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>>> To unsubscribe, please visit
>>> http://openib.org/mailman/listinfo/openib-general
next prev parent reply other threads:[~2007-04-27 22:32 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <6.1.2.0.2.20070423160212.12db6400@mail.llnl.gov>
[not found] ` <20070425124652.GG1624@mellanox.co.il>
[not found] ` <6.1.2.0.2.20070426083410.1389d9e0@mail.llnl.gov>
[not found] ` <20070426161409.GF15540@mellanox.co.il>
[not found] ` <6.1.2.0.2.20070426095112.138e9a68@mail.llnl.gov>
[not found] ` <20070426180618.GJ15540@mellanox.co.il>
[not found] ` <6.1.2.0.2.20070427115435.13ea5ec0@mail.llnl.gov>
2007-04-27 20:32 ` [ofa-general] Re: IPoIB forwarding Rick Jones
2007-04-27 22:26 ` Bryan Lawver
2007-04-27 22:32 ` Rick Jones [this message]
2007-04-27 22:43 ` Bryan Lawver
2007-04-27 23:37 ` Rick Jones
2007-04-27 23:39 ` David Miller
2007-04-27 23:48 ` Rick Jones
2007-04-27 23:52 ` David Miller
2007-04-30 17:16 ` Rick Jones
2007-05-01 22:43 ` [PATCH] make myri10ge use default MTU of 1500 bytes Loic Prylli
2007-04-28 6:51 ` [ofa-general] Re: IPoIB forwarding Bill Fink
2007-04-29 19:40 ` Loic Prylli
2007-04-30 21:12 ` Rick Jones
2007-05-01 22:05 ` Loic Prylli
2007-05-01 22:12 ` Rick Jones
2007-05-03 23:37 ` Bryan Lawver
2007-04-30 17:07 ` Rick Jones
2007-05-01 5:57 ` Bill Fink
2007-05-01 16:26 ` Loic Prylli
2007-04-28 2:35 ` parks
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=46327A07.1000404@hp.com \
--to=rick.jones2@hp.com \
--cc=general@lists.openfabrics.org \
--cc=lawver1@llnl.gov \
--cc=mst@dev.mellanox.co.il \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.