* Re: [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) [not found] ` <20080228205303.261D311D108@picon.linux-foundation.org> @ 2008-02-28 21:34 ` Jarek Poplawski 2008-02-29 0:58 ` Paul E. McKenney 0 siblings, 1 reply; 5+ messages in thread From: Jarek Poplawski @ 2008-02-28 21:34 UTC (permalink / raw) To: bugme-daemon; +Cc: Yannick Dirou, Paul E. McKenney, linux-kernel, netdev On Thu, Feb 28, 2008 at 12:53:03PM -0800, bugme-daemon@bugzilla.kernel.org wrote: > http://bugzilla.kernel.org/show_bug.cgi?id=10071 ... > ------- Comment #17 from yannick.dirou@axetic.com 2008-02-28 12:53 ------- > bingo with classic rcu Congratulations Yannick! So, it seems mainly RCU vs. timer(?) problem. I send CC of this message to RCU author, linux-kernel and netdev lists. I'm not sure how much time you could/will to offer for this debugging. Anyway, if it's possible I think it would be interesting to check with CONFIG_PREEMPT_RCU = y again, but CONFIG_HOTPLUG_CPU = n ("Processor type and features --> Support for suspend on SMP and ...") in the meantime. Many thanks, Jarek P. > [ 0.770854] NET: Registered protocol family 2 > [ 0.770868] 4294668327 net/ipv4/af_inet.c 1387 > [ 0.770884] 4294668327 net/ipv4/af_inet.c 1390 > [ 0.770888] 4294668327 net/ipv4/af_inet.c 1393 > [ 0.770891] 4294668327 net/ipv4/af_inet.c 1402 > [ 0.770893] 4294668327 net/ipv4/af_inet.c 1407 > [ 0.773078] 0 4294668328l > [ 0.774036] 0 4294668329l > [ 0.775037] 0 4294668330l > [ 0.775040] 4294668330 net/ipv4/af_inet.c 1419 > [ 0.775077] 4294668330 net/ipv4/af_inet.c 1426 > [ 0.775077] 4294668330 net/ipv4/route.c 2998 > [ 0.775077] 4294668330 net/ipv4/route.c 3009 > [ 0.775077] 4294668330 net/ipv4/route.c 3016 > [ 0.775077] IP route cache hash table entries: 32768 (order: 5, 131072 > bytes) > [ 0.775077] 4294668330 net/ipv4/route.c 3027 > [ 0.775492] TCP established hash table entries: 131072 (order: 8, 1048576 > bytes) > [ 0.776318] TCP bind hash table entries: 65536 (order: 9, 2359296 bytes) > [ 0.785670] TCP: Hash tables configured (established 131072 bind 65536) > [ 0.785715] TCP reno registered > > > -- > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > ------- You are receiving this mail because: ------- > You are on the CC list for the bug, or are watching someone who is. > You are the assignee for the bug, or are watching the assignee. ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) 2008-02-28 21:34 ` [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) Jarek Poplawski @ 2008-02-29 0:58 ` Paul E. McKenney 2008-02-29 4:32 ` Paul E. McKenney 0 siblings, 1 reply; 5+ messages in thread From: Paul E. McKenney @ 2008-02-29 0:58 UTC (permalink / raw) To: Jarek Poplawski; +Cc: bugme-daemon, Yannick Dirou, linux-kernel, netdev On Thu, Feb 28, 2008 at 10:34:57PM +0100, Jarek Poplawski wrote: > On Thu, Feb 28, 2008 at 12:53:03PM -0800, bugme-daemon@bugzilla.kernel.org wrote: > > http://bugzilla.kernel.org/show_bug.cgi?id=10071 > ... > > ------- Comment #17 from yannick.dirou@axetic.com 2008-02-28 12:53 ------- > > bingo with classic rcu > > Congratulations Yannick! > > So, it seems mainly RCU vs. timer(?) problem. I send CC of this message > to RCU author, linux-kernel and netdev lists. Interesting! Any chance of a pointer to the original messages? A quick Google search and a look at LKML showed me nothing. :-/ Thanx, Paul > I'm not sure how much time you could/will to offer for this debugging. > Anyway, if it's possible I think it would be interesting to check with > CONFIG_PREEMPT_RCU = y again, but CONFIG_HOTPLUG_CPU = n ("Processor > type and features --> Support for suspend on SMP and ...") in the > meantime. > > Many thanks, > Jarek P. > > > > [ 0.770854] NET: Registered protocol family 2 > > [ 0.770868] 4294668327 net/ipv4/af_inet.c 1387 > > [ 0.770884] 4294668327 net/ipv4/af_inet.c 1390 > > [ 0.770888] 4294668327 net/ipv4/af_inet.c 1393 > > [ 0.770891] 4294668327 net/ipv4/af_inet.c 1402 > > [ 0.770893] 4294668327 net/ipv4/af_inet.c 1407 > > [ 0.773078] 0 4294668328l > > [ 0.774036] 0 4294668329l > > [ 0.775037] 0 4294668330l > > [ 0.775040] 4294668330 net/ipv4/af_inet.c 1419 > > [ 0.775077] 4294668330 net/ipv4/af_inet.c 1426 > > [ 0.775077] 4294668330 net/ipv4/route.c 2998 > > [ 0.775077] 4294668330 net/ipv4/route.c 3009 > > [ 0.775077] 4294668330 net/ipv4/route.c 3016 > > [ 0.775077] IP route cache hash table entries: 32768 (order: 5, 131072 > > bytes) > > [ 0.775077] 4294668330 net/ipv4/route.c 3027 > > [ 0.775492] TCP established hash table entries: 131072 (order: 8, 1048576 > > bytes) > > [ 0.776318] TCP bind hash table entries: 65536 (order: 9, 2359296 bytes) > > [ 0.785670] TCP: Hash tables configured (established 131072 bind 65536) > > [ 0.785715] TCP reno registered > > > > > > -- > > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email > > ------- You are receiving this mail because: ------- > > You are on the CC list for the bug, or are watching someone who is. > > You are the assignee for the bug, or are watching the assignee. ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) 2008-02-29 0:58 ` Paul E. McKenney @ 2008-02-29 4:32 ` Paul E. McKenney 2008-02-29 6:53 ` Yannick 0 siblings, 1 reply; 5+ messages in thread From: Paul E. McKenney @ 2008-02-29 4:32 UTC (permalink / raw) To: Jarek Poplawski; +Cc: bugme-daemon, Yannick Dirou, linux-kernel, netdev On Thu, Feb 28, 2008 at 04:58:23PM -0800, Paul E. McKenney wrote: > On Thu, Feb 28, 2008 at 10:34:57PM +0100, Jarek Poplawski wrote: > > On Thu, Feb 28, 2008 at 12:53:03PM -0800, bugme-daemon@bugzilla.kernel.org wrote: > > > http://bugzilla.kernel.org/show_bug.cgi?id=10071 > > ... > > > ------- Comment #17 from yannick.dirou@axetic.com 2008-02-28 12:53 ------- > > > bingo with classic rcu > > > > Congratulations Yannick! > > > > So, it seems mainly RCU vs. timer(?) problem. I send CC of this message > > to RCU author, linux-kernel and netdev lists. > > Interesting! Any chance of a pointer to the original messages? > A quick Google search and a look at LKML showed me nothing. :-/ Thanks to Raphael for pointing me at the above bugzilla URL! Yannick, Would you be willing to try your test again after applying the patch in http://lkml.org/lkml/2008/2/2/11 and rebuilding your kernel? This patch does not appear to be in 2.6.25-rc3. :-( I am reposting this patch separately. Thanx, Paul ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) 2008-02-29 4:32 ` Paul E. McKenney @ 2008-02-29 6:53 ` Yannick 2008-02-29 16:58 ` Paul E. McKenney 0 siblings, 1 reply; 5+ messages in thread From: Yannick @ 2008-02-29 6:53 UTC (permalink / raw) To: paulmck; +Cc: Jarek Poplawski, linux-kernel, netdev (In reply to comment #20) > Reply-To: paulmck@linux.vnet.ibm.com > > Would you be willing to try your test again after applying the > patch in http://lkml.org/lkml/2008/2/2/11 and rebuilding your > kernel? > > This patch does not appear to be in 2.6.25-rc3. :-( > > I am reposting this patch separately. > > Thanx, Paul Congratulations Paul and thanks a lot to Jarek , It works, no more hang or slow boot : [1.061675] Calling initcall 0xc03b8e10: inet_init+0x0/0x3ba() [1.061895] NET: Registered protocol family 2 [1.339007] IP route cache hash table entries: 32768 (order: 5, 131072 bytes) [1.339470] TCP established hash table entries: 131072 (order: 8, 1048576 bytes) [1.340375] TCP bind hash table entries: 65536 (order: 9, 2359296 bytes) [1.349809] TCP: Hash tables configured (established 131072 bind 65536) [1.349906] TCP reno registered [1.098514] initcall 0xc03b8e10: inet_init+0x0/0x3ba() returned 0. [1.098604] initcall 0xc03b8e10 ran for 32 msecs: inet_init+0x0/0x3ba() Yannick ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) 2008-02-29 6:53 ` Yannick @ 2008-02-29 16:58 ` Paul E. McKenney 0 siblings, 0 replies; 5+ messages in thread From: Paul E. McKenney @ 2008-02-29 16:58 UTC (permalink / raw) To: Yannick; +Cc: Jarek Poplawski, linux-kernel, netdev On Fri, Feb 29, 2008 at 07:53:23AM +0100, Yannick wrote: > (In reply to comment #20) > > Reply-To: paulmck@linux.vnet.ibm.com > > > > Would you be willing to try your test again after applying the > > patch in http://lkml.org/lkml/2008/2/2/11 and rebuilding your > > kernel? > > > > This patch does not appear to be in 2.6.25-rc3. :-( > > > > I am reposting this patch separately. > > > > Thanx, Paul > > Congratulations Paul and thanks a lot to Jarek , It works, > no more hang or slow boot : > [1.061675] Calling initcall 0xc03b8e10: inet_init+0x0/0x3ba() > [1.061895] NET: Registered protocol family 2 > [1.339007] IP route cache hash table entries: 32768 (order: 5, 131072 bytes) > [1.339470] TCP established hash table entries: 131072 (order: 8, 1048576 > bytes) > [1.340375] TCP bind hash table entries: 65536 (order: 9, 2359296 bytes) > [1.349809] TCP: Hash tables configured (established 131072 bind 65536) > [1.349906] TCP reno registered > [1.098514] initcall 0xc03b8e10: inet_init+0x0/0x3ba() returned 0. > [1.098604] initcall 0xc03b8e10 ran for 32 msecs: inet_init+0x0/0x3ba() Thank you very much for testing this, Yannick! Andrew and Ingo have this patch on track to merge into a later -rc of 2.6.25, so we will get there. ;-) Thanx, Paul ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2008-02-29 16:58 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <bug-10071-11547@http.bugzilla.kernel.org/>
[not found] ` <20080228205303.261D311D108@picon.linux-foundation.org>
2008-02-28 21:34 ` [Bug 10071] kernel hang in inet_init (PREEMPT_RCU problem) Jarek Poplawski
2008-02-29 0:58 ` Paul E. McKenney
2008-02-29 4:32 ` Paul E. McKenney
2008-02-29 6:53 ` Yannick
2008-02-29 16:58 ` Paul E. McKenney
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).