From: NeilBrown <neilb@suse.de>
To: Ben Greear <greearb@candelatech.com>
Cc: Dean <seattleplus@gmail.com>,
"J.Bruce Fields" <bfields@citi.umich.edu>,
Olga Kornievskaia <aglo@citi.umich.edu>,
NFS <linux-nfs@vger.kernel.org>
Subject: Re: Is tcp autotuning really what NFS wants?
Date: Tue, 16 Jul 2013 14:46:51 +1000 [thread overview]
Message-ID: <20130716144651.3d93940d@notabene.brown> (raw)
In-Reply-To: <51E48670.9070901@candelatech.com>
[-- Attachment #1: Type: text/plain, Size: 3356 bytes --]
On Mon, 15 Jul 2013 16:32:00 -0700 Ben Greear <greearb@candelatech.com> wrote:
> On 07/14/2013 09:35 PM, NeilBrown wrote:
> > On Wed, 10 Jul 2013 10:39:38 -0700 Ben Greear <greearb@candelatech.com> wrote:
> >
> >> On 07/10/2013 10:33 AM, Dean wrote:
> >>> > This could significantly limit the amount of parallelism that can be achieved for a single TCP connection (and given that the
> >>> > Linux client strongly prefers a single connection now, this could become more of an issue).
> >>>
> >>> I understand the simplicity in using a single tcp connection, but performance-wise it is definitely not the way to go on WAN links. When even a miniscule amount
> >>> of packet loss is added to the link (<0.001% packet loss), the tcp buffer collapses and performance drops significantly (especially on 10GigE WAN links). I
> >>> think new TCP algorithms could help the problem somewhat, but nothing available today makes much of a difference vs. cubic.
> >>>
> >>> Using multiple tcp connections allows better saturation of the link, since when packet loss occurs on a stream, the other streams can fill the void. Today, the
> >>> only solution is to scale up the number of physical clients, which has high coordination overhead, or use a wan accelerator such as Bitspeed or Riverbed (which
> >>> comes with its own issues such as extra hardware, cost, etc).
> >>
> >> I have a set of patches that allows one to do multiple unique mounts to the same server from a single
> >> client, but the patches are for the client side, so it would not help
> >> non-Linux clients. And, the patches were rejected for upstream as not being
> >> useful. But, if you are interested in such, please let me know and I can point
> >> you to them...
> >
> > Yes please!
>
>
> I haven't ported these forward to 3.10 yet, but you can find my 3.9 tree
> here:
>
> http://dmz2.candelatech.com/git/gitweb.cgi?p=linux-3.9.dev.y/.git;a=summary
>
> There's a bunch of other patches, but the nfs related ones are all in a row
> and the patches are rebased, so you can probably pull them out w/out
> too much difficulty. The patches introduce a bug where it fails to compile
> w/out NFS 4.1 defined..I haven't bothered to fix it yet but it's probably
> simple..or just compile with NFS 4.1 as I do.
>
> Older trees found here, but we don't bother back-porting many patches,
> so I'd use the latest if you can.
>
> http://dmz2.candelatech.com/git/gitweb.cgi
>
> And, you'll need a patched mount.nfs:
>
> https://github.com/greearb/nfs-utils-ct
>
> I hope to get started on porting these to 3.10 later this week.. If
> there is any interest in this patch series or something like it going
> upstream (there wasn't in the past, I've no good reason to think that
> has changed), let me know and I will clean them up and post them to
> the mailing list again...
>
> If you search for greearb@candelatech.com and 'bind to local' you
> can find previous threads in the mailing list archives...
Thanks.
You are supporting multiple independent mounts by using different client-side
IP addresses. That has clear benefits for controlling routing, but also
admin costs if you don't care about routing. The latter case applies to me.
So I'll bookmark this, but it isn't what I need just now.
Thanks,
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
next prev parent reply other threads:[~2013-07-16 4:47 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20130710092255.0240a36d@notabene.brown>
2013-07-10 2:27 ` Is tcp autotuning really what NFS wants? J.Bruce Fields
2013-07-10 4:32 ` NeilBrown
2013-07-10 19:07 ` J.Bruce Fields
2013-07-15 4:32 ` NeilBrown
2013-07-16 1:58 ` J.Bruce Fields
2013-07-16 4:00 ` NeilBrown
2013-07-16 14:24 ` J.Bruce Fields
2013-07-18 0:03 ` Ben Myers
2013-07-24 21:07 ` J.Bruce Fields
2013-07-25 1:30 ` [PATCH] NFSD/sunrpc: avoid deadlock on TCP connection due to memory pressure NeilBrown
2013-07-25 12:35 ` Jim Rees
2013-07-25 20:18 ` J.Bruce Fields
2013-07-25 20:33 ` NeilBrown
2013-07-26 14:19 ` J.Bruce Fields
2013-07-30 2:48 ` NeilBrown
2013-08-01 2:49 ` J.Bruce Fields
2013-07-10 17:33 ` Is tcp autotuning really what NFS wants? Dean
2013-07-10 17:39 ` Ben Greear
2013-07-15 4:35 ` NeilBrown
2013-07-15 23:32 ` Ben Greear
2013-07-16 4:46 ` NeilBrown [this message]
2013-07-10 19:59 ` Michael Richardson
2013-07-15 1:26 ` Jim Rees
2013-07-15 5:02 ` NeilBrown
2013-07-15 11:57 ` Jim Rees
2013-07-15 13:42 ` Jim Rees
2013-07-16 1:10 ` NeilBrown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130716144651.3d93940d@notabene.brown \
--to=neilb@suse.de \
--cc=aglo@citi.umich.edu \
--cc=bfields@citi.umich.edu \
--cc=greearb@candelatech.com \
--cc=linux-nfs@vger.kernel.org \
--cc=seattleplus@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).