All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: Willy Tarreau <w@1wt.eu>, "David S. Miller" <davem@davemloft.net>
Cc: David Newall <davidn@davidnewall.com>,
	Linus Torvalds <torvalds@linux-foundation.org>,
	akpm@linux-foundation.org, netdev@vger.kernel.org,
	linux-kernel@vger.kernel.org,
	Stefan Richter <stefanr@s5r6.in-berlin.de>,
	"Rafael J. Wysocki" <rjw@sisk.pl>
Subject: [TCP bug, regression] stuck distcc connections in latest -git
Date: Thu, 24 Jul 2008 08:04:48 +0200	[thread overview]
Message-ID: <20080724060448.GA10203@elte.hu> (raw)
In-Reply-To: <20080723082643.GA992@elte.hu>


* Ingo Molnar <mingo@elte.hu> wrote:

> Alas, the problem has not reoccured since then - more than a thousand 
> kernel builds down the line. [...]

the permanently hug distcc kernel build bug triggered again, twice. 
First time it happened yesterday, i left it running overnight and it 
never recovered after a 14+ hours of wait.

It shows a similar pattern, 'ESTABLISHED' state on both sides, but the 
client-side is stuck and the server (running latest kernel) is seemingly 
clueless about that fact:

 client:

  Proto Recv-Q Send-Q Local Address       Foreign Address     State
  tcp        0 375450 10.0.1.16:39201     10.0.1.19:3632      ESTABLISHED

 server:

  Proto Recv-Q Send-Q Local Address       Foreign Address     State
  tcp        0      0 10.0.1.19:3632      10.0.1.16:39201     ESTABLISHED

i waited ~30 minutes in this second case.

the client (running 2.6.24) does periodic 120 seconds retransmits:

07:40:48.255452 IP dione.39201 > phoenix.distcc: . 1608:2144(536) ack 1 win 584007:40:48.255547 IP phoenix.distcc > dione.39201: . ack 2144 win 65535
07:40:48.255564 IP dione.39201 > phoenix.distcc: . 67143:67679(536) ack 1 win 5840
07:40:48.255648 IP phoenix.distcc > dione.39201: . ack 2144 win 65535
07:42:48.255440 IP dione.39201 > phoenix.distcc: . 2144:2680(536) ack 1 win 5840
07:42:48.255559 IP phoenix.distcc > dione.39201: . ack 2680 win 65535
07:42:48.255570 IP dione.39201 > phoenix.distcc: . 67679:68215(536) ack 1 win 5840
07:42:48.255659 IP phoenix.distcc > dione.39201: . ack 2680 win 65535
07:44:48.255436 IP dione.39201 > phoenix.distcc: . 2680:3216(536) ack 1 win 584007:44:48.255570 IP phoenix.distcc > dione.39201: . ack 3216 win 65535
07:44:48.255585 IP dione.39201 > phoenix.distcc: . 68215:68751(536) ack 1 win 5840
07:44:48.255669 IP phoenix.distcc > dione.39201: . ack 3216 win 65535

the server (running the latest kernel) responds:

07:40:47.551098 IP dione.39201 > phoenix.distcc: . 1072:1608(536) ack 1 win 584007:40:47.551141 IP phoenix.distcc > dione.39201: . ack 1608 win 65535
07:40:47.551204 IP dione.39201 > phoenix.distcc: . 66607:67143(536) ack 1 win 5840
07:40:47.551213 IP phoenix.distcc > dione.39201: . ack 1608 win 65535
07:42:47.570994 IP dione.39201 > phoenix.distcc: . 1608:2144(536) ack 1 win 584007:42:47.571027 IP phoenix.distcc > dione.39201: . ack 2144 win 65535
07:42:47.571117 IP dione.39201 > phoenix.distcc: . 67143:67679(536) ack 1 win 5840
07:42:47.571127 IP phoenix.distcc > dione.39201: . ack 2144 win 65535
07:44:47.590901 IP dione.39201 > phoenix.distcc: . 2144:2680(536) ack 1 win 584007:44:47.590960 IP phoenix.distcc > dione.39201: . ack 2680 win 65535
07:44:47.591042 IP dione.39201 > phoenix.distcc: . 67679:68215(536) ack 1 win 5840
07:44:47.591054 IP phoenix.distcc > dione.39201: . ack 2680 win 65535

full client socket state:

 dione:~> grep $(printf "%X\n" 39201) /proc/net/tcp
   44: 1001000A:9921 1301000A:0E30 01 0005ABF2:00000000 01:00002B8A 
       00000000   500        0 63130083 2 ffff81000c762d00 120000 0 0 28 101

 [ a few minutes later ]

   44: 1001000A:9921 1301000A:0E30 01 0005A392:00000000 01:00002BF0 
       00000000   500        0 63130083 2 ffff81000c762d00 120000 0 0 32 101

 [ i.e. the tx queue did increase by 2144 bytes - 4x 536 bytes ]

full server socket state:

 phoenix:~> grep $(printf "%X\n" 39201) /proc/net/tcp
    6: 1301000A:0E30 1001000A:9921 01 00000000:00000000 00:00000000 
       00000000    99        0 728382 1 ffff88042d8db280 300 4 30 2 -1

 [ a few minutes later ]

    6: 1301000A:0E30 1001000A:9921 01 00000000:00000000 00:00000000 
       00000000    99        0 728382 1 ffff88042d8db280 300 4 30 2 -1

 [ i.e. no change - no pending packets ]

I've started a longer capture session as well - it seems the TCP stack 
is slowly cycling through retransmissions of 536-byte packets, with 
375450 bytes pending? At 120 seconds a pop that would be about 23 hours 
to make any progress on - but i'm not sure i interpreted that right. It 
all looks very weird.

The timestamps of the two boxes are synced up to within about 1 second:

  earth4:~> for N in dione phoenix; do ssh $N date; done
  Thu Jul 24 07:44:02 CEST 2008
  Thu Jul 24 07:44:02 CEST 2008

( but the two boxes are responding to each other fine, so ordering of 
  events is not a question here. )

Any other state you'd like to see before i continue with -tip testing? 

	Ingo

  reply	other threads:[~2008-07-24  6:05 UTC|newest]

Thread overview: 255+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-07-20 17:44 [GIT]: Networking David Miller
2008-07-20 17:59 ` Arjan van de Ven
2008-07-20 23:52   ` David Miller
2008-07-21 20:32   ` David Miller
2008-07-21  0:54 ` Linus Torvalds
2008-07-21  1:03   ` David Miller
2008-07-21  1:09     ` Alexey Dobriyan
2008-07-21  1:14       ` David Miller
2008-07-21  1:22         ` Alexey Dobriyan
2008-07-21  2:40         ` Alexey Dobriyan
2008-07-21  2:48           ` David Miller
2008-07-21  5:11             ` David Miller
2008-07-21  9:48               ` Alexander Beregalov
2008-07-21 10:16                 ` Ben Hutchings
2008-07-21 15:35                   ` David Miller
2008-07-21 16:04                     ` Alexander Beregalov
2008-07-21 11:57               ` Alexey Dobriyan
2008-07-21 15:27                 ` David Miller
2008-07-21 16:49               ` Linus Torvalds
2008-07-21 16:53                 ` David Miller
2008-07-21  1:20     ` Patrick McHardy
2008-07-21 11:28       ` Stefan Richter
2008-07-21 11:45       ` James Morris
2008-07-21 12:05         ` Patrick McHardy
2008-07-21 17:28           ` David Miller
2008-07-21 17:40             ` Linus Torvalds
2008-07-21 20:33               ` Patrick McHardy
2008-07-23 23:42                 ` David Miller
2008-07-21  1:07   ` Linus Torvalds
2008-07-21  1:17     ` David Miller
2008-07-21  1:17       ` David Miller
2008-07-21  8:36 ` iwlwifi: fix build bug in "iwlwifi: fix LED stall" Ingo Molnar
2008-07-21 10:02   ` Winkler, Tomas
2008-07-21 10:53     ` Ingo Molnar
2008-07-21 12:12   ` [PATCH] iwlwifi: RS small compile warnings without CONFIG_IWLWIFI_DEBUG Tomas Winkler
2008-07-21 12:12     ` [PATCH] iwlwifi: " Tomas Winkler
2008-07-21 12:12       ` [PATCH] iwlwifi: compilation error when CONFIG_IWLWIFI_DEBUG is not set Tomas Winkler
2008-07-21 13:30 ` [crash, bisected] Kernel BUG at ffffffff8079afb1 (__netif_schedule()) Ingo Molnar
2008-07-21 13:45   ` [crash] BUG: unable to handle kernel NULL pointer dereference at 0000000000000370 Ingo Molnar
2008-07-21 14:30     ` Ingo Molnar
2008-07-21 15:04       ` Ingo Molnar
2008-07-21 15:24         ` David Miller
2008-07-21 18:18           ` Ian Schram
2008-07-21 19:06             ` Ingo Molnar
2008-07-21 19:13               ` Larry Finger
2008-07-21 19:13                 ` Larry Finger
2008-07-21 19:34                 ` Ingo Molnar
2008-07-21 19:34                   ` Ingo Molnar
2008-07-21 19:43                   ` Larry Finger
2008-07-21 19:43                     ` Larry Finger
2008-07-21 19:47                     ` Linus Torvalds
2008-07-21 19:47                       ` Linus Torvalds
2008-07-21 20:15                       ` David Miller
2008-07-21 20:28                       ` Larry Finger
2008-07-21 20:28                         ` Larry Finger
2008-07-21 20:21                     ` David Miller
2008-07-21 20:21                       ` David Miller
2008-07-21 20:38                       ` Larry Finger
2008-07-21 20:38                         ` Larry Finger
2008-07-21 20:46                         ` David Miller
2008-07-21 20:51                           ` Patrick McHardy
2008-07-21 21:01                             ` David Miller
2008-07-21 21:06                               ` Patrick McHardy
2008-07-21 21:35                                 ` Patrick McHardy
2008-07-21 21:35                                   ` Patrick McHardy
2008-07-21 21:42                                   ` Patrick McHardy
2008-07-21 21:42                                     ` Patrick McHardy
2008-07-21 21:51                                   ` Larry Finger
2008-07-21 21:51                                     ` Larry Finger
2008-07-21 22:04                                     ` Patrick McHardy
2008-07-21 22:04                                       ` Patrick McHardy
2008-07-21 22:40                                       ` Larry Finger
2008-07-21 22:40                                         ` Larry Finger
2008-07-21 23:15                                         ` David Miller
2008-07-21 23:15                                           ` David Miller
2008-07-22  6:34                                           ` Larry Finger
2008-07-22 10:51                                             ` Jarek Poplawski
2008-07-22 10:51                                               ` Jarek Poplawski
2008-07-22 11:32                                             ` David Miller
2008-07-22 12:52                                               ` Larry Finger
2008-07-22 20:43                                                 ` David Miller
2008-07-22 13:02                                               ` Larry Finger
2008-07-22 14:53                                                 ` Patrick McHardy
2008-07-22 14:53                                                   ` Patrick McHardy
2008-07-22 21:17                                                   ` David Miller
2008-07-22 21:17                                                     ` David Miller
2008-07-22 16:39                                                 ` Kernel WARNING: at net/core/dev.c:1330 __netif_schedule+0x2c/0x98() Larry Finger
2008-07-22 16:39                                                   ` Larry Finger
2008-07-22 17:20                                                   ` Patrick McHardy
2008-07-22 17:20                                                     ` Patrick McHardy
2008-07-22 18:39                                                     ` Larry Finger
2008-07-22 18:44                                                       ` Patrick McHardy
2008-07-22 19:30                                                         ` Larry Finger
2008-07-22 19:30                                                           ` Larry Finger
2008-07-22 23:04                                                       ` David Miller
2008-07-23  6:20                                                         ` Jarek Poplawski
2008-07-23  7:59                                                           ` David Miller
2008-07-23  8:54                                                             ` Jarek Poplawski
2008-07-23  8:54                                                               ` Jarek Poplawski
2008-07-23  9:03                                                               ` Peter Zijlstra
2008-07-23  9:03                                                                 ` Peter Zijlstra
2008-07-23  9:35                                                                 ` Jarek Poplawski
2008-07-23  9:35                                                                   ` Jarek Poplawski
2008-07-23  9:50                                                                   ` Peter Zijlstra
2008-07-23  9:50                                                                     ` Peter Zijlstra
2008-07-23 10:13                                                                     ` Jarek Poplawski
2008-07-23 10:13                                                                       ` Jarek Poplawski
2008-07-23 10:58                                                                       ` Peter Zijlstra
2008-07-23 11:35                                                                         ` Jarek Poplawski
2008-07-23 11:35                                                                           ` Jarek Poplawski
2008-07-23 11:49                                                                           ` Jarek Poplawski
2008-07-23 11:49                                                                             ` Jarek Poplawski
2008-07-23 20:16                                                                             ` David Miller
2008-07-23 20:43                                                                               ` Jarek Poplawski
2008-07-23 20:55                                                                                 ` David Miller
2008-07-23 20:55                                                                                   ` David Miller
2008-07-24  9:10                                                                               ` Peter Zijlstra
2008-07-24  9:10                                                                                 ` Peter Zijlstra
2008-07-24  9:20                                                                                 ` David Miller
2008-07-24  9:20                                                                                   ` David Miller
2008-07-24  9:27                                                                                   ` Peter Zijlstra
2008-07-24  9:27                                                                                     ` Peter Zijlstra
2008-07-24  9:32                                                                                     ` David Miller
2008-07-24 10:08                                                                                       ` Peter Zijlstra
2008-07-24 10:08                                                                                         ` Peter Zijlstra
2008-07-24 10:38                                                                                         ` Nick Piggin
2008-07-24 10:55                                                                                           ` Miklos Szeredi
2008-07-24 10:55                                                                                             ` Miklos Szeredi
2008-07-24 11:06                                                                                             ` Nick Piggin
2008-07-24 11:06                                                                                               ` Nick Piggin
2008-08-01 21:10                                                                                               ` Paul E. McKenney
2008-08-01 21:10                                                                                                 ` Paul E. McKenney
2008-07-24 10:59                                                                                           ` Peter Zijlstra
2008-07-24 10:59                                                                                             ` Peter Zijlstra
2008-08-01 21:10                                                                                           ` Paul E. McKenney
2008-07-23 20:14                                                                         ` David Miller
2008-07-23 20:14                                                                           ` David Miller
2008-07-24  7:00                                                                           ` Peter Zijlstra
2008-07-24  7:00                                                                             ` Peter Zijlstra
2008-07-25 17:04                                                                           ` Ingo Oeser
2008-07-25 18:36                                                                             ` Jarek Poplawski
2008-07-25 18:36                                                                               ` Jarek Poplawski
2008-07-25 19:16                                                                               ` Johannes Berg
2008-07-25 19:34                                                                                 ` Jarek Poplawski
2008-07-25 19:34                                                                                   ` Jarek Poplawski
2008-07-25 19:36                                                                                   ` Johannes Berg
2008-07-25 20:01                                                                                     ` Jarek Poplawski
2008-07-26  9:18                                                                                       ` David Miller
2008-07-26  9:18                                                                                         ` David Miller
2008-07-26 10:53                                                                                         ` Jarek Poplawski
2008-07-26 13:18                                                                                         ` Jarek Poplawski
2008-07-26 13:18                                                                                           ` Jarek Poplawski
2008-07-27  0:34                                                                                           ` David Miller
2008-07-27  0:34                                                                                             ` David Miller
2008-07-27 20:37                                                                                             ` Jarek Poplawski
2008-07-27 20:37                                                                                               ` Jarek Poplawski
2008-07-31 12:29                                                                                               ` David Miller
2008-07-31 12:29                                                                                                 ` David Miller
2008-07-31 12:38                                                                                                 ` Nick Piggin
2008-07-31 12:38                                                                                                   ` Nick Piggin
2008-07-31 12:44                                                                                                   ` David Miller
2008-08-01  4:27                                                                                                 ` David Miller
2008-08-01  7:09                                                                                                   ` Peter Zijlstra
2008-08-01  7:09                                                                                                     ` Peter Zijlstra
2008-08-01  6:48                                                                                                 ` Jarek Poplawski
2008-08-01  7:00                                                                                                   ` David Miller
2008-08-01  7:00                                                                                                     ` David Miller
2008-08-01  7:01                                                                                                   ` Jarek Poplawski
2008-08-01  7:01                                                                                                     ` David Miller
2008-08-01  7:01                                                                                                       ` David Miller
2008-08-01  7:41                                                                                                       ` Jarek Poplawski
2008-07-25  6:20                           ` [lockdep warning] AOE / networking: aoenet_xmit: noop_qdisc.q.lock, INFO: inconsistent lock state at 0000000000000370 Ingo Molnar
2008-07-25  6:25                             ` David Miller
2008-07-25  7:26                               ` Ingo Molnar
2008-07-25  8:23                                 ` David Miller
2008-07-21 15:10       ` [crash] BUG: unable to handle kernel NULL pointer dereference " David Miller
2008-07-21 15:10         ` David Miller
2008-07-21 15:10         ` David Miller
2008-07-21 18:23     ` [crash] kernel BUG at net/core/dev.c:1328! Ingo Molnar
2008-07-21 18:35       ` Linus Torvalds
2008-07-21 18:46         ` Ingo Molnar
2008-07-21 19:30           ` Ingo Molnar
2008-07-22 11:21           ` [TCP bug] stuck distcc connections in latest -git Ingo Molnar
2008-07-22 13:45             ` David Newall
2008-07-22 13:57               ` Ingo Molnar
2008-07-22 14:54                 ` David Newall
2008-07-22 15:34                   ` Ingo Molnar
2008-07-22 21:12                     ` Willy Tarreau
2008-07-23  8:26                       ` Ingo Molnar
2008-07-24  6:04                         ` Ingo Molnar [this message]
2008-07-24  6:32                           ` [TCP bug, regression] " Ingo Molnar
2008-07-24  7:33                             ` Willy Tarreau
2008-07-24  8:35                               ` Ingo Molnar
2008-07-24  7:53                             ` Herbert Xu
2008-07-24  8:24                               ` Willy Tarreau
2008-07-24  8:27                               ` Ingo Molnar
2008-07-24  8:36                                 ` David Miller
2008-07-24  9:05                           ` Herbert Xu
2008-07-24  9:22                             ` David Miller
2008-07-24  9:34                               ` Ingo Molnar
2008-07-24 11:56                                 ` [regression] nf_iterate(), BUG: unable to handle kernel NULL pointer dereference Ingo Molnar
2008-07-24 11:59                                   ` Ingo Molnar
2008-07-24 12:03                                     ` Patrick McHardy
2008-07-24 12:22                                       ` Herbert Xu
2008-07-24 12:40                                         ` Pekka Enberg
2008-07-24 12:50                                           ` Herbert Xu
2008-07-24 12:56                                             ` Nick Piggin
2008-07-24 13:04                                               ` Herbert Xu
2008-07-24 13:13                                                 ` Nick Piggin
2008-07-24 13:32                                                   ` Pekka Enberg
2008-07-24 19:21                                                     ` Matt Mackall
2008-07-25  9:09                                                       ` Nick Piggin
2008-07-24 13:11                                             ` Matt Mackall
2008-07-24 14:37                                               ` Herbert Xu
2008-07-24 17:47                                                 ` Matt Mackall
2008-07-25  1:39                                                   ` Herbert Xu
2008-07-25  2:59                                                     ` Matt Mackall
2008-07-24 12:44                                       ` Pekka Enberg
2008-07-24 12:49                                         ` Patrick McHardy
2008-07-24 13:23                                           ` Pekka Enberg
2008-07-24 13:31                                             ` Patrick McHardy
2008-07-24 13:34                                               ` Pekka Enberg
2008-07-24 18:51                                                 ` Andrew Morton
2008-07-24 18:55                                                   ` Pekka Enberg
2008-07-24 20:58                                                     ` David Miller
2008-07-25  8:02                                                     ` Dieter Ries
2008-07-25 10:41                                                       ` Pekka Enberg
2008-07-24 19:35                                                   ` Ingo Molnar
2008-07-26 16:09                                                     ` Patrick McHardy
2008-07-26 17:34                                                       ` Ingo Molnar
2008-07-26 13:43                                                   ` Patrick McHardy
2008-07-24 21:13                                           ` Linus Torvalds
2008-07-24 22:09                                             ` David Miller
2008-07-26 13:47                                             ` Patrick McHardy
2008-08-01 21:10                                             ` Paul E. McKenney
2008-07-24 14:23                                     ` Ingo Molnar
2008-07-24 15:23                                       ` Patrick McHardy
2008-07-24 15:32                                         ` Ingo Molnar
2008-07-24 15:34                                           ` Patrick McHardy
2008-07-24 18:00                                           ` Krzysztof Oledzki
2008-07-24 13:01                               ` [TCP bug, regression] stuck distcc connections in latest -git Willy Tarreau
2008-07-24  9:25                             ` Ingo Molnar
2008-07-24  9:29                               ` David Miller
2008-07-24 11:12                               ` Herbert Xu
2008-07-24  9:36                             ` Ilpo Järvinen
2008-07-24 10:03                               ` Ilpo Järvinen
2008-07-21 19:00         ` [crash] kernel BUG at net/core/dev.c:1328! David Miller
2008-07-21 19:20           ` Stefan Richter
2008-07-21 20:11             ` David Miller
2008-07-21 21:26               ` Stefan Richter
2008-07-21 19:44           ` Ingo Molnar
2008-07-21 20:20             ` David Miller
2008-07-21 15:07   ` [crash, bisected] Kernel BUG at ffffffff8079afb1 (__netif_schedule()) David Miller
2008-07-21 13:50 ` [GIT]: Networking Ingo Molnar
2008-07-21 14:15   ` Stefan Richter

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20080724060448.GA10203@elte.hu \
    --to=mingo@elte.hu \
    --cc=akpm@linux-foundation.org \
    --cc=davem@davemloft.net \
    --cc=davidn@davidnewall.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=rjw@sisk.pl \
    --cc=stefanr@s5r6.in-berlin.de \
    --cc=torvalds@linux-foundation.org \
    --cc=w@1wt.eu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.