From: Felix Fietkau <nbd@openwrt.org>
To: Sujith Manoharan <sujith@msujith.org>,
Eric Dumazet <eric.dumazet@gmail.com>
Cc: netdev@vger.kernel.org, Dave Taht <dave.taht@gmail.com>
Subject: Re: TCP performance regression
Date: Mon, 11 Nov 2013 17:38:24 +0100 [thread overview]
Message-ID: <52810800.9020402@openwrt.org> (raw)
In-Reply-To: <21121.575.539384.948990@gargle.gargle.HOWL>
On 2013-11-11 17:13, Sujith Manoharan wrote:
> Eric Dumazet wrote:
>> We have many choices.
>>
>> 1) Add back a minimum of ~128 K of outstanding bytes per TCP session,
>> so that buggy drivers can sustain 'line rate'.
>>
>> Note that with 100 concurrent TCP streams, total amount of bytes
>> queued on the NIC is 12 MB.
>> And pfifo_fast qdisc will drop packets anyway.
>>
>> Thats what we call 'BufferBloat'
>>
>> 2) Try lower values like 64K. Still bufferbloat.
>>
>> 3) Fix buggy drivers, using a proper logic, or shorter timers (mvneta
>> case for example)
>>
>> 4) Add a new netdev attribute, so that well behaving NIC drivers do not
>> have to artificially force TCP stack to queue too many bytes in
>> Qdisc/NIC queues.
>
> I think the quirks of 802.11 aggregation should be taken into account.
> I am adding Felix to this thread, who would have more to say on latency/bufferbloat
> with wireless drivers.
I don't think this issue is about something as simple as timer handling
for tx completion (or even broken/buggy drivers).
There's simply no way to make 802.11 aggregation work well and have
similar tx completion latency characteristics as Ethernet devices.
802.11 aggregation reduces the per-packet airtime overhead by combining
multiple packets into one transmission (saving a lot of time getting a
tx opportunity, transmitting the PHY header, etc.), which makes the
'line rate' heavily depend on the amount of buffering.
Aggregating multiple packets into one transmission also causes extra
packet loss, which is compensated by retransmission and reordering, thus
introducing additional latency.
I don't think that TSQ can do a decent job of mitigating bufferbloat on
802.11n devices without a significant performance hit, so adding a new
netdev attribute might be a good idea.
- Felix
next prev parent reply other threads:[~2013-11-11 16:38 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-11-11 5:30 TCP performance regression Sujith Manoharan
2013-11-11 5:55 ` Eric Dumazet
2013-11-11 6:07 ` Sujith Manoharan
2013-11-11 6:54 ` Eric Dumazet
2013-11-11 8:19 ` Sujith Manoharan
2013-11-11 14:27 ` Eric Dumazet
2013-11-11 14:39 ` Eric Dumazet
2013-11-11 16:44 ` Eric Dumazet
2013-11-11 15:05 ` David Laight
2013-11-11 15:29 ` Eric Dumazet
2013-11-11 15:43 ` David Laight
2013-11-11 16:17 ` Eric Dumazet
2013-11-11 16:35 ` David Laight
2013-11-11 17:41 ` Eric Dumazet
2013-11-12 7:42 ` Willy Tarreau
2013-11-12 14:16 ` Eric Dumazet
2013-11-14 9:54 ` Dave Taht
2013-11-11 16:13 ` Sujith Manoharan
2013-11-11 16:38 ` Felix Fietkau [this message]
2013-11-11 17:38 ` Eric Dumazet
2013-11-11 17:44 ` Felix Fietkau
2013-11-11 18:03 ` Dave Taht
2013-11-11 18:29 ` Sujith Manoharan
2013-11-11 18:31 ` Dave Taht
2013-11-11 19:11 ` Ben Greear
2013-11-11 19:24 ` Dave Taht
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=52810800.9020402@openwrt.org \
--to=nbd@openwrt.org \
--cc=dave.taht@gmail.com \
--cc=eric.dumazet@gmail.com \
--cc=netdev@vger.kernel.org \
--cc=sujith@msujith.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.