netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: [fixed] [patch] Re: [bug] stuck localhost TCP connections, v2.6.26-rc3+
@ 2008-06-11 15:06 Alexey Kuznetsov
  0 siblings, 0 replies; 35+ messages in thread
From: Alexey Kuznetsov @ 2008-06-11 15:06 UTC (permalink / raw)
  To: davem, mcmanus, netdev

Hello!

> On Tue, 2008-06-10 at 15:32 -0700, David Miller wrote:
> 
> I took a close look at this, it seems this patch adds
> an ABBA deadlock.  But I might be wrong.

Yes, this is not a real deadlock. But also it is not even a lock.
bh_lock_sock on listening socket is not enough to lock it.

Seems, the solution would be forcing bh_lock_sock() to protect accept_queue,
which is a mess. Or adding a separate spinlock, like it was done with
syn_wait_lock. Also not good.

Alexey

^ permalink raw reply	[flat|nested] 35+ messages in thread
* Re: [bug] stuck localhost TCP connections, v2.6.26-rc3+
@ 2008-05-29  8:45 Ingo Molnar
  2008-05-29 11:14 ` Ilpo Järvinen
  0 siblings, 1 reply; 35+ messages in thread
From: Ingo Molnar @ 2008-05-29  8:45 UTC (permalink / raw)
  To: linux-kernel; +Cc: netdev, David S. Miller, Rafael J. Wysocki, Andrew Morton


* Ingo Molnar <mingo@elte.hu> wrote:

> in an overnight -tip testruns that is based on recent -git i got two 
> stuck TCP connections:
> 
> Active Internet connections (w/o servers)
> Proto Recv-Q Send-Q Local Address               Foreign Address             State      
> tcp        0 174592 10.0.1.14:58015             10.0.1.14:3632              ESTABLISHED 
> tcp    72134      0 10.0.1.14:3632              10.0.1.14:58015             ESTABLISHED 

update: in the past 5 days of -tip testing i've gathered about 10 
randconfig kernel configs that all produced such failures.

Since the bug itself is very elusive (it takes up to 50 boot + 
kernel-rebuild-via-distccc iterations to trigger) bisection was still 
not an option - but with 10 configs statistical analysis of the configs 
is now possible.

I made a histogram of all kernel options present in those configs, and 
one networking related kernel option stood out:

      5 CONFIG_TCP_CONG_ADVANCED=y
      6 CONFIG_INET_TCP_DIAG=y
      6 CONFIG_TCP_MD5SIG=y
      9 CONFIG_TCP_CONG_CUBIC=y

that code is called in the bootlogs:

> [   13.279410] calling  cubictcp_register+0x0/0x80
> [   13.279412] TCP cubic registered

the likelyhood of CONFIG_TCP_CONG_CUBIC=y being enabled in my randconfig 
runs is 75%. The likelyhood of CONFIG_TCP_CONG_CUBIC=y being enabled in 
10 configs in a row is 0.75^10, or 5.6%. So statistical analysis can say 
it with a 95% confidence that the presence of this option correlates to 
the hung sockets.

i have started testing this theory now, via the patch below, which turns 
off TCP_CONG_CUBIC. It will take about 50 bootups on the affected 
testsystems to confirm. (it will take a couple of hours today as not all 
testsystems show these hung socket symptoms)

distributions enable TCP_CONG_CUBIC by default:

  $ grep CUBIC /boot/config-2.6.24.7-92.fc8
  CONFIG_TCP_CONG_CUBIC=y
  CONFIG_DEFAULT_CUBIC=y

which would explain why Arjan and Peter triggered similar hangs as well.

	Ingo

---------------------->
Subject: qa: no TCP_CONG_CUBIC
From: Ingo Molnar <mingo@elte.hu>
Date: Thu May 29 09:45:51 CEST 2008

---
 net/ipv4/Kconfig |    4 ++++
 1 file changed, 4 insertions(+)

Index: tip/net/ipv4/Kconfig
===================================================================
--- tip.orig/net/ipv4/Kconfig
+++ tip/net/ipv4/Kconfig
@@ -454,6 +454,8 @@ config TCP_CONG_BIC
 config TCP_CONG_CUBIC
 	tristate "CUBIC TCP"
 	default y
+	depends on BROKEN_BOOT_ALLOWED
+	select BROKEN_BOOT
 	---help---
 	This is version 2.0 of BIC-TCP which uses a cubic growth function
 	among other techniques.
@@ -608,6 +610,8 @@ endif
 config TCP_CONG_CUBIC
 	tristate
 	depends on !TCP_CONG_ADVANCED
+	depends on BROKEN_BOOT_ALLOWED
+	select BROKEN_BOOT
 	default y
 
 config DEFAULT_TCP_CONG

^ permalink raw reply	[flat|nested] 35+ messages in thread

end of thread, other threads:[~2008-06-11 15:13 UTC | newest]

Thread overview: 35+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-06-11 15:06 [fixed] [patch] Re: [bug] stuck localhost TCP connections, v2.6.26-rc3+ Alexey Kuznetsov
  -- strict thread matches above, loose matches on Subject: below --
2008-05-29  8:45 Ingo Molnar
2008-05-29 11:14 ` Ilpo Järvinen
2008-05-29 11:22   ` Ingo Molnar
2008-05-30 18:18     ` Ingo Molnar
2008-05-31  6:09       ` Ingo Molnar
2008-05-31 11:46         ` Ilpo Järvinen
2008-05-31 12:18           ` Ilpo Järvinen
2008-05-31 12:54             ` Ingo Molnar
2008-05-31 12:58               ` Ilpo Järvinen
2008-05-31 16:35                 ` Ingo Molnar
2008-06-03  9:40                   ` [fixed] [patch] " Ingo Molnar
2008-06-03 14:41                     ` Patrick McManus
2008-06-03 21:46                     ` Ilpo Järvinen
2008-06-03 22:01                       ` Ilpo Järvinen
2008-06-03 22:03                         ` David Miller
2008-06-03 22:10                           ` Ilpo Järvinen
2008-06-03 23:22                           ` Ilpo Järvinen
2008-06-03 23:54                             ` Joe Perches
2008-06-04  6:25                               ` Ilpo Järvinen
2008-06-04  2:54                             ` Patrick McManus
2008-06-04  6:42                               ` Ilpo Järvinen
2008-06-05 14:22                             ` Ingo Molnar
2008-06-05 18:00                               ` Ilpo Järvinen
2008-06-05 21:13                                 ` Ilpo Järvinen
2008-06-05 23:29                                   ` Patrick McManus
2008-06-06 10:03                                     ` Ilpo Järvinen
2008-06-06 17:11                                       ` Patrick McManus
2008-06-06 17:33                                         ` Ingo Molnar
2008-06-06 18:19                                           ` Ilpo Järvinen
2008-06-06 18:39                                             ` Ingo Molnar
2008-06-06 19:49                                               ` Ilpo Järvinen
2008-06-06 20:08                                               ` Patrick McManus
2008-06-06 21:12                                                 ` Ilpo Järvinen
2008-06-06 21:23                                                   ` Arjan van de Ven
2008-06-06 21:28                                                     ` Ilpo Järvinen
2008-06-10 22:49                                                 ` David Miller
2008-06-06 18:25                                         ` Ilpo Järvinen
2008-06-10 22:32                             ` David Miller
2008-06-11 13:10                               ` Patrick McManus
2008-06-11 15:13                               ` Ilpo Järvinen
2008-06-04  7:23                       ` Ingo Molnar
2008-06-04 18:24                         ` David Miller
2008-06-04 20:56                           ` Ilpo Järvinen
2008-06-04 21:55                             ` David Miller

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).