public inbox for linux-nfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Ian Campbell <ijc-KcIKpvwj1kUDXYZnReoRVg@public.gmane.org>
To: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: linux-nfs@vger.kernel.org,
	Max Kellermann <mk-xMchvyqCc6DQT0dZR+AlfA@public.gmane.org>,
	linux-kernel@vger.kernel.org, gcosta@redhat.com,
	Grant Coady <grant_lkml-rGYn+TmxqGy6c6uEtOJ/EA@public.gmane.org>,
	"J. Bruce Fields" <bfields@fieldses.org>,
	Tom Tucker <tom@opengridcomputing.com>
Subject: Re: [PATCH] NFS regression in 2.6.26?, "task blocked for more than 120 seconds"
Date: Tue, 25 Nov 2008 13:38:59 +0000	[thread overview]
Message-ID: <1227620339.9425.99.camel@zakaz.uk.xensource.com> (raw)
In-Reply-To: <1227619696.7057.19.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>

On Tue, 2008-11-25 at 08:28 -0500, Trond Myklebust wrote:
> On Tue, 2008-11-25 at 07:09 +0000, Ian Campbell wrote:
> > On Sat, 2008-11-01 at 09:41 -0400, Trond Myklebust wrote:
> > > On Sat, 2008-11-01 at 11:45 +0000, Ian Campbell wrote:
> > > > On Mon, 2008-10-20 at 07:27 +0100, Ian Campbell wrote:
> > > > > So far I have bisected down to this range and am currently testing
> > > > > acee478 which has been up for >4days.
> > > > 
> > > > Another update. It has now bisected down to a small range 
> > > > 
> > > > 7272dcd31d56580dee7693c21e369fd167e137fe SUNRPC: xprt_autoclose() should not call xprt_disconnect()
> > > > e06799f958bf7f9f8fae15f0c6f519953fb0257c SUNRPC: Use shutdown() instead of close() when disconnecting a TCP socket
> > > > ef80367071dce7d2533e79ae8f3c84ec42708dc8 SUNRPC: TCP clear XPRT_CLOSE_WAIT when the socket is closed for writes
> > > > 3b948ae5be5e22532584113e2e02029519bbad8f SUNRPC: Allow the client to detect if the TCP connection is closed
> > > > 67a391d72ca7efb387c30ec761a487e50a3ff085 SUNRPC: Fix TCP rebinding logic
> > > > 66af1e558538137080615e7ad6d1f2f80862de01 SUNRPC: Fix a race in xs_tcp_state_change()
> > > > 
> > > > I'm currently testing 3b948ae5be5e22532584113e2e02029519bbad8f.
> > > > 
> > > > 7272dcd31d56580dee7693c21e369fd167e137fe repro'd in half a day while
> > > > ef818a28fac9bd214e676986d8301db0582b92a9 (parent of
> > > > 66af1e558538137080615e7ad6d1f2f80862de01) survived for 7 days.
> > 
> > According to bisect:
> > 
> > e06799f958bf7f9f8fae15f0c6f519953fb0257c is first bad commit
> > commit e06799f958bf7f9f8fae15f0c6f519953fb0257c
> > Author: Trond Myklebust <Trond.Myklebust@netapp.com>
> > Date:   Mon Nov 5 15:44:12 2007 -0500
> > 
> >     SUNRPC: Use shutdown() instead of close() when disconnecting a TCP socket
> >     
> >     By using shutdown() rather than close() we allow the RPC client to wait
> >     for the TCP close handshake to complete before we start trying to reconnect
> >     using the same port.
> >     We use shutdown(SHUT_WR) only instead of shutting down both directions,
> >     however we wait until the server has closed the connection on its side.
> >     
> >     Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
> > 
> > I've started testing 2.6.26 + revert. It's been a long while since I
> > started this process so I'll also have a go at an up to date version.
> > 
> > Cheers,
> 
> That would indicate that the server is failing to close the TCP
> connection when the client closes on its end.
> 
> Could you remind me what server you are using?

2.6.25-2-486 which is a Debian package from backports.org, changelog
indicates that it contains 2.6.25.7.

> Also, does 'netstat -t'
> show connections that are stuck in the CLOSE_WAIT state when you see the
> hang?

I'd have to wait for it to reproduce again to be 100% sure but according
to http://lkml.indiana.edu/hypermail/linux/kernel/0808.3/0120.html
I was seeing connections in FIN_WAIT2 but not CLOSE_WAIT.

Ian.

-- 
Ian Campbell
Current Noise: Diamond Head - It's Electric

"The only real way to look younger is not to be born so soon."
		-- Charles Schulz, "Things I've Had to Learn Over and
		   Over and Over"


  parent reply	other threads:[~2008-11-25 13:39 UTC|newest]

Thread overview: 79+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20081017123207.GA14979@rabbit.intern.cm-ag>
     [not found] ` <1224484046.23068.14.camel@localhost.localdomain>
     [not found]   ` <1225539927.2221.3.camel@localhost.localdomain>
     [not found]     ` <1225546878.4390.3.camel@heimdal.trondhjem.org>
2008-11-25  7:09       ` [PATCH] NFS regression in 2.6.26?, "task blocked for more than 120 seconds" Ian Campbell
     [not found]         ` <1227596962.16868.22.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2008-11-25 13:28           ` Trond Myklebust
     [not found]             ` <1227619696.7057.19.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-11-25 13:38               ` Ian Campbell [this message]
     [not found]                 ` <1227620339.9425.99.camel-o4Be2W7LfRlXesXXhkcM7miJhflN2719@public.gmane.org>
2008-11-25 13:57                   ` Trond Myklebust
     [not found]                     ` <1227621434.7057.33.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-11-25 14:04                       ` Ian Campbell
     [not found]                         ` <1227621877.9425.102.camel-o4Be2W7LfRlXesXXhkcM7miJhflN2719@public.gmane.org>
2008-11-26 22:12                           ` Ian Campbell
     [not found]                             ` <1227737539.31008.2.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2008-12-01  0:17                               ` [PATCH 0/3] " Trond Myklebust
     [not found]                                 ` <1228090631.7112.11.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-01  0:18                                   ` [PATCH 1/3] SUNRPC: Ensure the server closes sockets in a timely fashion Trond Myklebust
     [not found]                                     ` <1228090719.7112.13.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-17 15:27                                       ` Tom Tucker
2008-12-17 18:08                                         ` Trond Myklebust
     [not found]                                           ` <1229537296.7257.37.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-17 18:59                                             ` Tom Tucker
2008-12-01  0:20                                   ` [PATCH 3/3] SUNRPC: svc_xprt_enqueue should not refuse to enqueue 'XPT_DEAD' transports Trond Myklebust
2008-12-17 15:35                                     ` Tom Tucker
2008-12-17 19:07                                       ` Trond Myklebust
     [not found]                                         ` <1229540877.7257.97.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-23 14:49                                           ` Tom Tucker
2008-12-23 23:39                                             ` Tom Tucker
2009-01-02 21:44                                           ` Tom Tucker
2009-01-04 19:12                                             ` Trond Myklebust
     [not found]                                               ` <1231096358.7363.6.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-04 19:25                                                 ` Trond Myklebust
2009-01-05  3:33                                                 ` Tom Tucker
     [not found]                                               ` <1231097131.7 363.11.camel@heimdal.trondhjem.org>
     [not found]                                                 ` <1231097131.7363.11.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-05  3:33                                                   ` Tom Tucker
2009-01-05 17:04                                                   ` Tom Tucker
2009-01-05 17:13                                                     ` Trond Myklebust
     [not found]                                                       ` <1231175613.7127.6.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-05 19:33                                                         ` Tom Tucker
2009-01-05 19:51                                                           ` Trond Myklebust
     [not found]                                                             ` <1231185115.7127.28.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-05 20:13                                                               ` Tom Tucker
2009-01-05 20:41                                                               ` Tom Tucker
2009-01-05 20:48                                                                 ` Trond Myklebust
     [not found]                                                                   ` <1231188518.7127.30.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-05 21:10                                                                     ` Tom Tucker
2008-12-01  0:29                                   ` [PATCH 0/3] NFS regression in 2.6.26?, "task blocked for more than 120 seconds" Trond Myklebust
     [not found]                                     ` <1228091380.7112.17.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-02 15:22                                       ` Kasparek Tomas
2008-12-02 15:37                                         ` Trond Myklebust
     [not found]                                           ` <1228232222.3090.5.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-02 16:26                                             ` Kasparek Tomas
2008-12-02 18:10                                               ` Trond Myklebust
     [not found]                                                 ` <1228241407.3090.7.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-04 10:23                                                   ` Kasparek Tomas
     [not found]                                                     ` <1229284201.6463.98.camel@heimdal.trondhjem.org>
     [not found]                                                       ` <1229284201.6463.98.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-12-16 12:05                                                         ` Kasparek Tomas
2008-12-16 12:10                                                           ` Kasparek Tomas
2008-12-16 12:59                                                             ` Trond Myklebust
2008-12-23 22:34                                                           ` Trond Myklebust
     [not found]                                                             ` <1230071647.17701.27.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-05 12:18                                                               ` Kasparek Tomas
2009-01-09 14:56                                                               ` Kasparek Tomas
2009-01-09 17:59                                                                 ` Trond Myklebust
     [not found]                                                                   ` <1231523966.7179.67.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-10 10:24                                                                     ` Kasparek Tomas
2009-01-10 16:00                                                                       ` Trond Myklebust
     [not found]                                                                         ` <20090112090404.GL47559@fit.vutbr.cz>
     [not found]                                                                           ` <1231782009.7322.12.camel@heimdal.trondhjem.org>
     [not found]                                                                             ` <1231809446.7322.17.camel@heimdal.trondhjem.org>
     [not found]                                                                               ` <1231809446.7322.17.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-13 15:22                                                                                 ` Kasparek Tomas
2009-01-16 10:48                                                                                   ` Kasparek Tomas
2009-01-18 13:08                                                                                     ` Kasparek Tomas
2009-01-20 15:03                                                                                       ` Kasparek Tomas
2009-01-20 15:32                                                                                         ` Trond Myklebust
     [not found]                                                                                           ` <1232465547.7055.3.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-01-28  8:18                                                                                             ` Kasparek Tomas
2009-02-06  6:35                                                                                               ` Kasparek Tomas
2009-02-10  7:55                                                                                                 ` Kasparek Tomas
2009-03-03 12:08                                                                                           ` Kasparek Tomas
2009-03-03 14:16                                                                                             ` Trond Myklebust
     [not found]                                                                                               ` <1236089767.9631.4.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-03-25  8:46                                                                                                 ` Kasparek Tomas
2009-04-18  5:17                                                                                                 ` Kasparek Tomas
2009-04-22 17:27                                                                                                   ` NFS client packet storm on 2.6.27.x Kasparek Tomas
2009-04-29 12:12                                                                                                     ` Steve Dickson
     [not found]                                                                                                       ` <49F84436.5090007-AfCzQyP5zfLQT0dZR+AlfA@public.gmane.org>
2009-04-29 14:57                                                                                                         ` Kasparek Tomas
2009-06-25  5:55                                                                                                     ` Kasparek Tomas
2009-07-13 11:12                                                                                                       ` Kasparek Tomas
2009-07-13 17:20                                                                                                         ` [stable] " Greg KH
2009-07-13 17:40                                                                                                           ` Trond Myklebust
     [not found]                                                                                                             ` <1247506817.14524.25.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-07-24  8:54                                                                                                               ` Kasparek Tomas
2009-07-28 18:31                                                                                                               ` Greg KH
2008-12-01 22:09                                   ` [PATCH 0/3] NFS regression in 2.6.26?, "task blocked for more than 120 seconds" Ian Campbell
     [not found]                                     ` <1228169383.20370.3.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2008-12-06 12:16                                       ` Ian Campbell
     [not found]                                         ` <1228565812.10856.30.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2008-12-14 18:24                                           ` Ian Campbell
     [not found]                                             ` <1229279045.3721.1.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2008-12-16 17:55                                               ` J. Bruce Fields
2008-12-16 18:39                                                 ` Ian Campbell
     [not found]                                                   ` <1229452775.3721.25.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2009-01-07 22:21                                                     ` J. Bruce Fields
2009-01-08 18:20                                                       ` J. Bruce Fields
2009-01-08 21:22                                                         ` Ian Campbell
     [not found]                                                           ` <1231449753.21688.12.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2009-01-08 21:26                                                             ` J. Bruce Fields
2009-01-12  9:46                                                               ` Ian Campbell
2009-01-22  8:27                                                               ` Ian Campbell
     [not found]                                                                 ` <1232612860.29604.57.camel-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2009-01-22 16:44                                                                   ` J. Bruce Fields
2008-12-01  0:19                                 ` [PATCH 2/3] SUNRPC: We only need to call svc_delete_xprt() once Trond Myklebust
2008-11-26  9:16                 ` [PATCH] NFS regression in 2.6.26?, Tomas Kasparek

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1227620339.9425.99.camel@zakaz.uk.xensource.com \
    --to=ijc-kcikpvwj1kudxyznreorvg@public.gmane.org \
    --cc=bfields@fieldses.org \
    --cc=gcosta@redhat.com \
    --cc=grant_lkml-rGYn+TmxqGy6c6uEtOJ/EA@public.gmane.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=mk-xMchvyqCc6DQT0dZR+AlfA@public.gmane.org \
    --cc=tom@opengridcomputing.com \
    --cc=trond.myklebust@fys.uio.no \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox