netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Ben Greear <greearb@candelatech.com>
To: NetDev <netdev@vger.kernel.org>
Subject: Re: TCP connection hang problem with 2.6.16.16, e1000.
Date: Wed, 31 May 2006 12:37:41 -0700	[thread overview]
Message-ID: <447DF085.2050604@candelatech.com> (raw)
In-Reply-To: <447DD9B5.9050100@candelatech.com>

Ben Greear wrote:

> I haven't seen this problem on 2.6.13, so I'm now starting a manual bisect
> to see if I can narrow down where the problem appeared.

Turns out, I can reproduce it in 2.6.13, and 2.6.9.  I haven't tried anything
older.

I also tried to reproduce it using a simpler traffic generation tool, but could
not reproduce the problem with it.  That points to something wierd that my application
is doing, but I can't imagine what user-space could do to screw up a TCP connection
like this.

In all cases, there is a lot of data in the send-queue, but for whatever
reason, the connection will not make progress.  To user-space, it appears
that poll returns neither readable nor writable for the sockets.

I notice that if I increase the send-buffer-size while the connection is
in the hung state, my app will quickly fill the larger send buffer, but
still receives nothing new.

Starting a new connection on the same interfaces works for a few seconds
and then hangs as well, so the NICs can pass traffic.

Here is output from /proc/net/tcp and netstat from the 2.6.16.16 kernel.

netstat info:

tcp        0 5635368 172.1.5.169:33058       172.1.5.168:33057       ESTABLISHED
tcp        0 5987504 172.1.5.168:33057       172.1.5.169:33058       ESTABLISHED

/proc/net/tcp:

   20: A90501AC:8122 A80501AC:8121 01 0055FD28:00000000 01:00001A9F 0000000A
0        0 21309 2 f36d8580 120000 40 0 1 58
   21: A80501AC:8121 A90501AC:8122 01 005B5CB0:00000000 01:00001C9D 0000000A
0        0 21226 3 ef7bfa80 120000 40 0 1 35


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


  reply	other threads:[~2006-05-31 19:37 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-05-31 16:45 TCP connection hang problem with 2.6.16.16, e1000 Ben Greear
2006-05-31 17:26 ` Ben Greear
2006-05-31 17:54 ` Auke Kok
2006-05-31 18:00   ` Ben Greear
2006-05-31 19:37     ` Ben Greear [this message]
2006-05-31 21:45       ` TCP connection hang problem with 2.6.16.16, e1000 (found work-around) Ben Greear
2006-05-31 22:10         ` Stephen Hemminger
2006-05-31 22:38           ` Ben Greear
2006-06-01  2:56 ` TCP connection hang problem with 2.6.16.16, e1000 Herbert Xu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=447DF085.2050604@candelatech.com \
    --to=greearb@candelatech.com \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).