From: Rick Jones <rick.jones2@hp.com>
To: Jay Vosburgh <fubar@us.ibm.com>
Cc: netdev@vger.kernel.org
Subject: Re: [PATCH] remove claim balance_rr won't reorder on many to one
Date: Tue, 30 Oct 2007 15:12:18 -0700 [thread overview]
Message-ID: <4727AC42.2060709@hp.com> (raw)
In-Reply-To: <5242.1193777750@death>
Jay Vosburgh wrote:
> Rick Jones <rick.jones2@hp.com> wrote:
> [...]
>
>>- Note that this out of order delivery occurs when both the
>>- sending and receiving systems are utilizing a multiple
>>- interface bond. Consider a configuration in which a
>>- balance-rr bond feeds into a single higher capacity network
>>- channel (e.g., multiple 100Mb/sec ethernets feeding a single
>>- gigabit ethernet via an etherchannel capable switch). In this
>>- configuration, traffic sent from the multiple 100Mb devices to
>>- a destination connected to the gigabit device will not see
>>- packets out of order. However, traffic sent from the gigabit
>>- device to the multiple 100Mb devices may or may not see
>>- traffic out of order, depending upon the balance policy of the
>>- switch. Many switches do not support any modes that stripe
>>- traffic (instead choosing a port based upon IP or MAC level
>>- addresses); for those devices, traffic flowing from the
>>- gigabit device to the many 100Mb devices will only utilize one
>>- interface.
>
>
> Rather than simply removing this entirely (because I do think
> there is value in discussion of the reordering aspects of balance-rr),
> I'd rather see something that makes the following points:
>
> 1- the worst reordering is balance-rr to balance-rr, back to
> back. The reordering rate here depends upon (a) the number of slaves
> involved and (b) packet reception scheduling behaviors (packet
> coalescing, NAPI, etc), and thus will vary signficantly, but won't be
> better than case #2.
>
> 2- next worst is "balance-rr many slow" to "single fast", with
> the reordering rate generally being substantially lower than case #1 (it
> looked like your test showed about a 1% reordering rate, if I'm reading
> your data correctly).
>
> 3- For the "single fast" to "balance-rr many" case, going
> through a switch configured for etherchannel "may or may not see traffic
> out of order, depending upon the balance policy of the switch. Many
> switches do not support any modes that stripe traffic (instead choosing
> a port based upon IP or MAC level addresses); for those devices, traffic
> flowing from the [single fast] device to the [balance-rr many] devices
> will only utilize one interface."
I have to wonder if the full description of the different versions of being a
little bit pregnant is worth it. Just saying that using balance-rr will result
in reordering seems much more simple to comprehend. Also, since balance-rr is
strictly an outbound policy, does case three even enter into it - as you say,
that will be up to the switch, which will be doing whatever it was told or felt
like doing regardless of balance-rr on the bond in the host.
>
> [...]
>
>> This mode requires the switch to have the appropriate ports
>>- configured for "etherchannel" or "trunking."
>>+ configured for "etherchannel" or "aggregation." N.B. some
>>+ switches might use the term "trunking" for something other
>>+ than link aggregation.
>
>
> If memory serves, Sun uses the term "trunking" to refer to
> "etherchannel" compatible behavior.
I'm not really all that tied to that part of the change - it is there because I
noticed in one of the HP ITRC forums someone talking about a switch (Cisco?)
where trunking meant something with vlans rather than aggregation.
>
> I'm also hearing "aggregation" used to described 802.3ad
> specifically.
>
> Perhaps text of the form:
>
> This mode requires the switch to have the appropriate ports
> configured for "Etherchannel." Some switches use different terms, so
> the configuration may be called "trunking" or "aggregation." Note that
> both of these terms also have other meanings. For example, "trunking"
> is also used to describe a type of switch port, and "aggregation" or
> "link aggregation" is often used to refer to 802.3ad link aggregation,
> which is compatible with bonding's 802.3ad mode, but not balance-rr.
>
> Thoughts?
Even better would be to be able to start to move away from "etherchannel"
towards the de jure standard's terms, whatever the heck they are :)
rick jones
next prev parent reply other threads:[~2007-10-30 22:12 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-10-30 19:48 [PATCH] remove claim balance_rr won't reorder on many to one Rick Jones
2007-10-30 20:55 ` Jay Vosburgh
2007-10-30 22:12 ` Rick Jones [this message]
2007-10-31 0:22 ` Jay Vosburgh
2007-10-31 1:02 ` Rick Jones
2007-10-31 1:08 ` Rick Jones
2007-11-06 21:40 ` Rick Jones
2007-11-06 22:49 ` Jay Vosburgh
2007-11-06 22:59 ` Rick Jones
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4727AC42.2060709@hp.com \
--to=rick.jones2@hp.com \
--cc=fubar@us.ibm.com \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).