From: Trond Myklebust <trond.myklebust@fys.uio.no>
To: Jeff Moyer <jmoyer@redhat.com>
Cc: netdev@vger.kernel.org, Andrew Morton <akpm@linux-foundation.org>,
Jens Axboe <jens.axboe@oracle.com>,
linux-kernel@vger.kernel.org, "Rafael J. Wysocki" <rjw@sisk.pl>,
Olga Kornievskaia <aglo@citi.umich.edu>,
"J. Bruce Fields" <bfields@fieldses.org>,
Jim Rees <rees@umich.edu>,
linux-nfs@vger.kernel.org
Subject: Re: 2.6.30-rc deadline scheduler performance regression for iozone over NFS
Date: Thu, 14 May 2009 10:33:40 -0400 [thread overview]
Message-ID: <1242311620.6560.14.camel@heimdal.trondhjem.org> (raw)
In-Reply-To: <x49octv7qr8.fsf-RRHT56Q3PSP4kTEheFKJxxDDeQx5vsVwAInAS/Ez/D0@public.gmane.org>
On Thu, 2009-05-14 at 09:34 -0400, Jeff Moyer wrote:
> Trond Myklebust <trond.myklebust@fys.uio.no> writes:
>
> > On Wed, 2009-05-13 at 15:29 -0400, Jeff Moyer wrote:
> >> Hi, netdev folks. The summary here is:
> >>
> >> A patch added in the 2.6.30 development cycle caused a performance
> >> regression in my NFS iozone testing. The patch in question is the
> >> following:
> >>
> >> commit 47a14ef1af48c696b214ac168f056ddc79793d0e
> >> Author: Olga Kornievskaia <aglo@citi.umich.edu>
> >> Date: Tue Oct 21 14:13:47 2008 -0400
> >>
> >> svcrpc: take advantage of tcp autotuning
> >>
> >> which is also quoted below. Using 8 nfsd threads, a single client doing
> >> 2GB of streaming read I/O goes from 107590 KB/s under 2.6.29 to 65558
> >> KB/s under 2.6.30-rc4. I also see more run to run variation under
> >> 2.6.30-rc4 using the deadline I/O scheduler on the server. That
> >> variation disappears (as does the performance regression) when reverting
> >> the above commit.
> >
> > It looks to me as if we've got a bug in the svc_tcp_has_wspace() helper
> > function. I can see no reason why we should stop processing new incoming
> > RPC requests just because the send buffer happens to be 2/3 full. If we
> > see that we have space for another reply, then we should just go for it.
> > OTOH, we do want to ensure that the SOCK_NOSPACE flag remains set, so
> > that the TCP layer knows that we're congested, and that we'd like it to
> > increase the send window size, please.
> >
> > Could you therefore please see if the following (untested) patch helps?
>
> I'm seeing slightly better results with the patch:
>
> 71548
> 75987
> 71557
> 87432
> 83538
>
> But that's still not up to the speeds we saw under 2.6.29. The packet
> capture for one run can be found here:
> http://people.redhat.com/jmoyer/trond.pcap.bz2
>
> Cheers,
> Jeff
Yes. Something is very wrong there...
See for instance frame 1195, where the client finishes sending a whole
series of READ requests, and we go into a flurry of ACKs passing
backwards and forwards, but no data. It looks as if the NFS server isn't
processing anything, probably because the window size falls afoul of the
svc_tcp_has_wspace()...
Does something like this help?
Cheers
Trond
---------------------------------------------------------------------
>From 85e3f5860a9063d193bdb45516b3d3d347b87301 Mon Sep 17 00:00:00 2001
From: Trond Myklebust <Trond.Myklebust@netapp.com>
Date: Thu, 14 May 2009 10:33:07 -0400
Subject: [PATCH] SUNRPC: Always allow the NFS server to process at least one request
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
---
net/sunrpc/svcsock.c | 9 ++++++++-
1 files changed, 8 insertions(+), 1 deletions(-)
diff --git a/net/sunrpc/svcsock.c b/net/sunrpc/svcsock.c
index 8962355..4837442 100644
--- a/net/sunrpc/svcsock.c
+++ b/net/sunrpc/svcsock.c
@@ -972,9 +972,16 @@ static int svc_tcp_has_wspace(struct svc_xprt *xprt)
{
struct svc_sock *svsk = container_of(xprt, struct svc_sock, sk_xprt);
struct svc_serv *serv = svsk->sk_xprt.xpt_server;
+ int reserved;
int required;
- required = (atomic_read(&xprt->xpt_reserved) + serv->sv_max_mesg) * 2;
+ reserved = atomic_read(&xprt->xpt_reserved);
+ /* Always allow the server to process at least one request, whether
+ * or not the TCP window is large enough
+ */
+ if (reserved == 0)
+ return 1;
+ required = (reserved + serv->sv_max_mesg) << 1;
if (sk_stream_wspace(svsk->sk_sk) < required)
goto out_nospace;
return 1;
--
1.6.0.4
next prev parent reply other threads:[~2009-05-14 14:33 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <x49hc0f79k9.fsf@segfault.boston.devel.redhat.com>
[not found] ` <20090508120119.8c93cfd7.akpm@linux-foundation.org>
[not found] ` <20090511081415.GL4694@kernel.dk>
[not found] ` <x49skjb21b7.fsf@segfault.boston.devel.redhat.com>
[not found] ` <20090511165826.GG4694@kernel.dk>
[not found] ` <x494ovp4r51.fsf@segfault.boston.devel.redhat.com>
2009-05-13 3:44 ` 2.6.30-rc deadline scheduler performance regression for iozone over NFS Andrew Morton
2009-05-13 14:58 ` Jeff Moyer
[not found] ` <x49y6t1rqw0.fsf-RRHT56Q3PSP4kTEheFKJxxDDeQx5vsVwAInAS/Ez/D0@public.gmane.org>
2009-05-13 16:20 ` Olga Kornievskaia
[not found] ` <b4ff356f0905130920v184ab529mb52a4346d4c77c14-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-05-13 16:32 ` Andrew Morton
2009-05-13 18:16 ` Olga Kornievskaia
[not found] ` <b4ff356f0905131116o48181ccu4786578cc72c8ceb-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-05-13 19:06 ` Jeff Moyer
2009-05-13 18:25 ` Jim Rees
2009-05-13 19:45 ` Trond Myklebust
2009-05-13 19:29 ` Jeff Moyer
2009-05-13 23:45 ` Trond Myklebust
[not found] ` <1242258338.5407.244.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-05-14 13:34 ` Jeff Moyer
[not found] ` <x49octv7qr8.fsf-RRHT56Q3PSP4kTEheFKJxxDDeQx5vsVwAInAS/Ez/D0@public.gmane.org>
2009-05-14 14:33 ` Trond Myklebust [this message]
[not found] ` <1242311620.6560.14.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-05-14 14:38 ` Jeff Moyer
2009-05-14 15:00 ` Jeff Moyer
[not found] ` <x49ws8j686r.fsf-RRHT56Q3PSP4kTEheFKJxxDDeQx5vsVwAInAS/Ez/D0@public.gmane.org>
2009-05-17 19:10 ` Trond Myklebust
2009-05-17 19:12 ` Trond Myklebust
[not found] ` <1242587524.17796.3.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-05-18 14:15 ` Jeff Moyer
2009-05-22 23:45 ` J. Bruce Fields
2009-05-14 17:55 ` J. Bruce Fields
2009-05-14 18:26 ` Trond Myklebust
[not found] ` <1242325569.6560.27.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-05-15 21:37 ` J. Bruce Fields
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1242311620.6560.14.camel@heimdal.trondhjem.org \
--to=trond.myklebust@fys.uio.no \
--cc=aglo@citi.umich.edu \
--cc=akpm@linux-foundation.org \
--cc=bfields@fieldses.org \
--cc=jens.axboe@oracle.com \
--cc=jmoyer@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-nfs@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=rees@umich.edu \
--cc=rjw@sisk.pl \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox