netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Rick Jones <rick.jones2@hp.com>
To: Bryan Lawver <lawver1@llnl.gov>
Cc: Linux Network Development list <netdev@vger.kernel.org>,
	"Michael S. Tsirkin" <mst@dev.mellanox.co.il>,
	general@lists.openfabrics.org
Subject: Re: [ofa-general] Re: IPoIB forwarding
Date: Fri, 27 Apr 2007 13:32:51 -0700	[thread overview]
Message-ID: <46325DF3.2050203@hp.com> (raw)
In-Reply-To: <6.1.2.0.2.20070427115435.13ea5ec0@mail.llnl.gov>

Bryan Lawver wrote:
> Your right about the ipoib module not combining packets (I believed you 
> without checking) but I did never the less.  The ipoib_start_xmit 
> routine is definitely handed a "double packet"  which means that the IP 
> NIC driver or the kernel is combining two packets into a single super 
> jumbo packet.  This issue is irrespective of the IP MTU setting because 
> I have set all interfaces to 9000k yet  ipoib accepts and forwards this 
> 17964 packet to the next IB node and onto the TCP stack where it is 
> never acknowledged.  This may not have come up in prior testing because 
> I am using some of the fastest IP NICs which have no trouble keeping up 
> with or exceeding the bandwidth of the IB side.  This issue arises 
> exactly every 8 packets...(ring buffer overrun??)
> 
> I will be at Sonoma for the next few days as many on this list will be.


Some NICs (esp 10G) support large receive offload - they coalesce TCP segments 
from the wire/fiber into larger ones they pass up the stack.  Perhaps that is 
happening here?

I'm going to go out a bit on a limb, cross the streams, and include netdev, 
because I suspect that if a system is acting as an IP router, one doesn't want 
large receive offload enabled.  That may need some discussion in netdev - it may 
then require some changes to default settings or some documentation 
enhancements.  That or I'll learn that the stack is already dealing with the 
issue...

rick jones

> bryan
> 
> 
> 
> At 11:06 AM 4/26/2007, Michael S. Tsirkin wrote:
> 
>> > Quoting Bryan Lawver <lawver1@llnl.gov>:
>> > Subject: Re: IPoIB forwarding
>> >
>> > Here's a tcpdump of the same sequence.  The TCP MSS is 8960 and it 
>> appears
>> > that two payloads are queued at ipoib which combines them into a single
>> > 17920 payload with assumingly correct IP header (40) and IB header
>> > (4).  The application or TCP stack does not acknowledge this double 
>> packet
>> > ie. it does not ACK until each of the 8960 packets are resent
>> > individually.  Being an IB newbie, I am guessing this combining is
>> > allowable but may violate TCP protocol.
>>
>> IPoIB does nothing like this - it's just a network device so
>> it sends all packets out as is.
>>
>> -- 
>> MST
> 
> 
> _______________________________________________
> general mailing list
> general@lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
> 
> To unsubscribe, please visit 
> http://openib.org/mailman/listinfo/openib-general

       reply	other threads:[~2007-04-27 20:32 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <6.1.2.0.2.20070423160212.12db6400@mail.llnl.gov>
     [not found] ` <20070425124652.GG1624@mellanox.co.il>
     [not found]   ` <6.1.2.0.2.20070426083410.1389d9e0@mail.llnl.gov>
     [not found]     ` <20070426161409.GF15540@mellanox.co.il>
     [not found]       ` <6.1.2.0.2.20070426095112.138e9a68@mail.llnl.gov>
     [not found]         ` <20070426180618.GJ15540@mellanox.co.il>
     [not found]           ` <6.1.2.0.2.20070427115435.13ea5ec0@mail.llnl.gov>
2007-04-27 20:32             ` Rick Jones [this message]
2007-04-27 22:26               ` [ofa-general] Re: IPoIB forwarding Bryan Lawver
2007-04-27 22:32                 ` Rick Jones
2007-04-27 22:43                   ` Bryan Lawver
2007-04-27 23:37                     ` Rick Jones
2007-04-27 23:39                       ` David Miller
2007-04-27 23:48                         ` Rick Jones
2007-04-27 23:52                           ` David Miller
2007-04-30 17:16                             ` Rick Jones
2007-05-01 22:43                               ` [PATCH] make myri10ge use default MTU of 1500 bytes Loic Prylli
2007-04-28  6:51                       ` [ofa-general] Re: IPoIB forwarding Bill Fink
2007-04-29 19:40                         ` Loic Prylli
2007-04-30 21:12                           ` Rick Jones
2007-05-01 22:05                             ` Loic Prylli
2007-05-01 22:12                               ` Rick Jones
2007-05-03 23:37                               ` Bryan Lawver
2007-04-30 17:07                         ` Rick Jones
2007-05-01  5:57                           ` Bill Fink
2007-05-01 16:26                             ` Loic Prylli
2007-04-28  2:35                 ` parks

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=46325DF3.2050203@hp.com \
    --to=rick.jones2@hp.com \
    --cc=general@lists.openfabrics.org \
    --cc=lawver1@llnl.gov \
    --cc=mst@dev.mellanox.co.il \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).