From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1750740AbXDRIT2 (ORCPT ); Wed, 18 Apr 2007 04:19:28 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751130AbXDRIT2 (ORCPT ); Wed, 18 Apr 2007 04:19:28 -0400 Received: from amsfep19-int.chello.nl ([213.46.243.16]:52821 "EHLO amsfep11-int.chello.nl" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1750740AbXDRIT0 (ORCPT ); Wed, 18 Apr 2007 04:19:26 -0400 Subject: Re: [PATCH 0/4] 2.6.21-rc7 NFS writes: fix a series of issues From: Peter Zijlstra To: Trond Myklebust Cc: Linus Torvalds , Florin Iucha , Andrew Morton , Adrian Bunk , OGAWA Hirofumi , linux-kernel@vger.kernel.org In-Reply-To: <20070418011946.11679.34920.stgit@heimdal.trondhjem.org> References: <20070416125905.GA2769@iucha.net> <1176736734.6761.45.camel@heimdal.trondhjem.org> <1176740307.6761.56.camel@heimdal.trondhjem.org> <1176741408.6761.62.camel@heimdal.trondhjem.org> <1176792399.3035.30.camel@twins> <20070416125905.GA2769@iucha.net> <1176792399.3035.30.camel@twins> <1176796503.3035.33.camel@twins> <87hcrfaqam.fsf@duaron.myhome.or.jp> <20070418011946.11679.34920.stgit@heimdal.trondhjem.org> Content-Type: text/plain Date: Wed, 18 Apr 2007 10:19:22 +0200 Message-Id: <1176884362.7066.3.camel@twins> Mime-Version: 1.0 X-Mailer: Evolution 2.10.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2007-04-17 at 21:19 -0400, Trond Myklebust wrote: > I've split the issues introduced by the 2.6.21-rcX write code up into 4 > subproblems. > > The first patch is just a cleanup in order to ease review. > > Patch number 2 ensures that we never release the PG_writeback flag until > _after_ we've either discarded the unstable request altogether, or put it > on the nfs_inode's commit or dirty lists. > > Patch number 3 fixes the 'desynchronized value of nfs_i.ncommit' error. It > uses the PG_NEED_COMMIT flag as an indicator for whether or not the request > may be redirtied. > > Patch number 4 protects the NFS '.set_page_dirty' address_space operation > against races with nfs_inode_add_request. Ok, stuck them in, and my debug patch from yesterday, just in case... However, I can't seem to run long enough to establish whether the problem is gone. It deadlocks between 10-30 minutes due to missing IO completions, whereas yesterday it took between 45-60 minutes to trigger the 'desynchronized value of nfs_i.ncommit' messages. I will continue trying go get a good run, however if you got some (perhaps experimental .22) patches you want me to try..