From: Ben Greear <greearb@candelatech.com>
To: NetDev <netdev@vger.kernel.org>
Subject: Re: TCP connection hang problem with 2.6.16.16, e1000.
Date: Wed, 31 May 2006 12:37:41 -0700
Message-ID: <447DF085.2050604@candelatech.com>
In-Reply-To: <447DD9B5.9050100@candelatech.com>

Ben Greear wrote:

> I haven't seen this problem on 2.6.13, so I'm now starting a manual bisect
> to see if I can narrow down where the problem appeared.

Turns out, I can reproduce it in 2.6.13, and 2.6.9.  I haven't tried anything
older.

I also tried to reproduce it using a simpler traffic generation tool, but could
not reproduce the problem with it.  That points to something weird that my application
is doing, but I can't imagine what user-space could do to screw up a TCP connection
like this.

In all cases, there is a lot of data in the send-queue, but for whatever
reason, the connection will not make progress.  To user-space, it appears
that poll returns neither readable nor writable for the sockets.
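
For anyone who wants to see what that poll state looks like from user-space,
here is a minimal sketch.  It is not my application's code: it uses a local
AF_UNIX socketpair whose peer never reads, which is an assumption made only
so the example is self-contained, and the helper names poll_events and
fill_send_queue are made up for illustration.  It shows a socket with a full
send-queue reporting neither POLLIN nor POLLOUT; it does not reproduce the
TCP hang itself.

```python
import select
import socket


def poll_events(sock):
    """Return (readable, writable) for sock using a zero-timeout poll()."""
    poller = select.poll()
    poller.register(sock, select.POLLIN | select.POLLOUT)
    events = poller.poll(0)  # do not block; just sample the current state
    mask = events[0][1] if events else 0
    return bool(mask & select.POLLIN), bool(mask & select.POLLOUT)


def fill_send_queue(sock, chunk=b"x" * 65536):
    """Write until the kernel refuses more data; return bytes queued."""
    sock.setblocking(False)
    total = 0
    try:
        while True:
            total += sock.send(chunk)
    except BlockingIOError:
        pass  # send queue is full
    return total


# Illustrative setup: the peer end (b) never reads, so a's queue fills up.
a, b = socket.socketpair()
queued = fill_send_queue(a)
readable, writable = poll_events(a)
print("queued bytes:", queued, "readable:", readable, "writable:", writable)
```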

I notice that if I increase the send-buffer-size while the connection is
in the hung state, my app will quickly fill the larger send buffer, but
still receives nothing new.

Starting a new connection on the same interfaces works for a few seconds
and then hangs as well, so the NICs can pass traffic.

Here is output from /proc/net/tcp and netstat from the 2.6.16.16 kernel.

netstat info:

tcp        0 5635368 172.1.5.169:33058       172.1.5.168:33057       ESTABLISHED
tcp        0 5987504 172.1.5.168:33057       172.1.5.169:33058       ESTABLISHED

/proc/net/tcp:

   20: A90501AC:8122 A80501AC:8121 01 0055FD28:00000000 01:00001A9F 0000000A 0        0 21309 2 f36d8580 120000 40 0 1 58
   21: A80501AC:8121 A90501AC:8122 01 005B5CB0:00000000 01:00001C9D 0000000A 0        0 21226 3 ef7bfa80 120000 40 0 1 35
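
To make the /proc/net/tcp lines easier to compare with the netstat output,
here is a small sketch that decodes the first few fields.  The field meanings
(address:port, state, tx_queue:rx_queue, timer:expiry, retransmit count) are
my reading of the kernel's proc_net_tcp documentation, and the address
decoding assumes a little-endian host such as x86.  The function names are
made up for this example.

```python
import socket
import struct

# Entry 20 from above, with the wrapped line joined back together.
ENTRY = ("20: A90501AC:8122 A80501AC:8121 01 0055FD28:00000000 "
         "01:00001A9F 0000000A 0        0 21309 2 f36d8580 120000 40 0 1 58")

TCP_STATES = {0x01: "ESTABLISHED", 0x06: "TIME_WAIT", 0x0A: "LISTEN"}


def decode_addr(field):
    """Turn 'A90501AC:8122' into ('172.1.5.169', 33058)."""
    ip_hex, port_hex = field.split(":")
    # Assumption: little-endian host, so the hex word is byte-reversed.
    ip = socket.inet_ntoa(struct.pack("<I", int(ip_hex, 16)))
    return ip, int(port_hex, 16)


def decode_entry(line):
    """Decode the leading fields of one /proc/net/tcp line."""
    f = line.split()
    tx_hex, rx_hex = f[4].split(":")
    timer_hex, when_hex = f[5].split(":")
    state = int(f[3], 16)
    return {
        "local": decode_addr(f[1]),
        "remote": decode_addr(f[2]),
        "state": TCP_STATES.get(state, hex(state)),
        "tx_queue": int(tx_hex, 16),      # bytes waiting in the send queue
        "rx_queue": int(rx_hex, 16),      # bytes waiting in the receive queue
        "timer_active": int(timer_hex, 16),  # 1 should mean retransmit timer
        "timer_when": int(when_hex, 16),  # jiffies until the timer fires
        "retransmits": int(f[6], 16),     # unrecovered RTO timeouts
    }


info = decode_entry(ENTRY)
print(info)
```

The decoded tx_queue matches the Send-Q column netstat shows for the same
connection, and if I read the fields correctly both entries have the
retransmit timer pending with a retransmit count of 0xA.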


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

