From mboxrd@z Thu Jan 1 00:00:00 1970 From: Al Boldi Subject: Re: [RFC] ext3: per-process soft-syncing data=ordered mode Date: Wed, 30 Jan 2008 21:39:22 +0300 Message-ID: <200801302139.22281.a1426z@gawab.com> References: <200801242336.00340.a1426z@gawab.com> <200801300904.48299.a1426z@gawab.com> <200801300929.21778.chris.mason@oracle.com> Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Cc: Jan Kara , Chris Snook , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org To: Chris Mason Return-path: Received: from [212.12.190.71] ([212.12.190.71]:47567 "EHLO raad.intranet" rhost-flags-FAIL-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1754642AbYA3SmP (ORCPT ); Wed, 30 Jan 2008 13:42:15 -0500 In-Reply-To: <200801300929.21778.chris.mason@oracle.com> Content-Disposition: inline Sender: linux-fsdevel-owner@vger.kernel.org List-ID: Chris Mason wrote: > On Wednesday 30 January 2008, Al Boldi wrote: > > Jan Kara wrote: > > > > Chris Snook wrote: > > > > > Al Boldi wrote: > > > > > > This RFC proposes to introduce a tunable which allows to disable > > > > > > fsync and changes ordered into writeback writeout on a > > > > > > per-process basis like this: > > > > > > > > > > > > echo 1 > /proc/`pidof process`/softsync > > > > > > > > > > This is basically a kernel workaround for stupid app behavior. > > > > > > > > Exactly right to some extent, but don't forget the underlying > > > > data=ordered starvation problem, which looks like a genuinely deep > > > > problem maybe related to blockIO. > > > > > > It is a problem with the way how ext3 does fsync (at least that's > > > what we ended up with in that konqueror problem)... It has to flush > > > the current transaction which means that app doing fsync() has to wait > > > till all dirty data of all files on the filesystem are written (if we > > > are in ordered mode). And that takes quite some time... There are > > > possibilities how to avoid that but especially with freshly created > > > files, it's tough and I don't see a way how to do it without some > > > fundamental changes to JBD. > > > > Ok, but keep in mind that this starvation occurs even in the absence of > > fsync, as the benchmarks show. > > > > And, a quick test of successive 1sec delayed syncs shows no hangs until > > about 1 minute (~180mb) of db-writeout activity, when the sync abruptly > > hangs for minutes on end, and io-wait shows almost 100%. > > Do you see this on older kernels as well? The first thing we need to > understand is if this particular stall is new. 2.6.24,22,19 and 2.4.32 show the same problem. Thanks! -- Al