From: David Lamparter <equinox@diac24.net>
To: Stephen Hemminger <stephen@networkplumber.org>
Cc: netdev@vger.kernel.org, roopa@cumulusnetworks.com,
bridge@lists.linux-foundation.org, amine.kherbouche@6wind.com,
David Lamparter <equinox@diac24.net>
Subject: Re: [Bridge] [RFC net-next v2] bridge lwtunnel, VPLS & NVGRE
Date: Tue, 22 Aug 2017 02:29:56 +0200 [thread overview]
Message-ID: <20170822002956.GY773745@eidolon> (raw)
In-Reply-To: <20170821170151.5b12a392@xeon-e3>
On Mon, Aug 21, 2017 at 05:01:51PM -0700, Stephen Hemminger wrote:
> On Mon, 21 Aug 2017 19:15:17 +0200 David Lamparter <equinox@diac24.net> wrote:
> > > P.S.: For a little context on the bridge FDB changes - I'm hoping to
> > > find some time to extend this to the MDB to allow aggregating dst
> > > metadata and handing down a list of dst metas on TX. This isn't
> > > specifically for VPLS but rather to give sufficient information to the
> > > 802.11 stack to allow it to optimize selecting rates (or unicasting)
> > > for multicast traffic by having the multicast subscriber list known.
> > > This is done by major commercial wifi solutions (e.g. google "dynamic
> > > multicast optimization".)
> >
> > You can find hacks at this on:
> > https://github.com/eqvinox/vpls-linux-kernel/tree/mdb-hack
> > Please note that the patches in that branch are not at an acceptable
> > quality level, but you can see the semantic relation to 802.11.
> >
> > I would, however, like to point out that this branch has pseudo-working
> > IGMP/MLD snooping for VPLS, and it'd be 20-ish lines to add it to NVGRE
> > (I'll do that as soon as I get to it, it'll pop up on that branch too.)
> >
> > This is relevant to the discussion because it's a feature which is
> > non-obvious (to me) on how to do with the VXLAN model of having an
> > entirely separate FDB. Meanwhile, with this architecture, the proof of
> > concept / hack is coming in at a measly cost of:
> > 8 files changed, 176 insertions(+), 15 deletions(-)
>
> I know the bridge is an easy target to extend L2 forwarding, but it is not
> the only option. Have you condidered building a new driver
Yes I have; I dismissed the approach because even though an fdb is
reasonable to duplicate, I did not believe replicating multicast
snooping code into both VPLS and 802.11 (and possibly VXLAN) to be a
viable option. ...is it?
> (like VXLAN does) which does the forwarding you want. Having all
> features in one driver makes for worse performance, and increased
> complexity.
Can you elaborate? I agree with that sentence as a general statement,
but a general statement needs to apply to a specific situation. As
discussed in the previous thread with Nikolay, checking skb->_refdst
against 0 should be doable without touching additional cachelines, so
the performance cost should be rather small. For complexity - it's
keeping an extra pointer around, which is semantically bound to the
existing net_bridge_fdb_entry->dst. On the other hand, it spares us
from another copy of a fdb implementation, and two copies of multicast
snooping code... I honestly believe this patchset is a good approach.
-David
WARNING: multiple messages have this Message-ID (diff)
From: David Lamparter <equinox@diac24.net>
To: Stephen Hemminger <stephen@networkplumber.org>
Cc: David Lamparter <equinox@diac24.net>,
netdev@vger.kernel.org, bridge@lists.linux-foundation.org,
amine.kherbouche@6wind.com, roopa@cumulusnetworks.com
Subject: Re: [RFC net-next v2] bridge lwtunnel, VPLS & NVGRE
Date: Tue, 22 Aug 2017 02:29:56 +0200 [thread overview]
Message-ID: <20170822002956.GY773745@eidolon> (raw)
In-Reply-To: <20170821170151.5b12a392@xeon-e3>
On Mon, Aug 21, 2017 at 05:01:51PM -0700, Stephen Hemminger wrote:
> On Mon, 21 Aug 2017 19:15:17 +0200 David Lamparter <equinox@diac24.net> wrote:
> > > P.S.: For a little context on the bridge FDB changes - I'm hoping to
> > > find some time to extend this to the MDB to allow aggregating dst
> > > metadata and handing down a list of dst metas on TX. This isn't
> > > specifically for VPLS but rather to give sufficient information to the
> > > 802.11 stack to allow it to optimize selecting rates (or unicasting)
> > > for multicast traffic by having the multicast subscriber list known.
> > > This is done by major commercial wifi solutions (e.g. google "dynamic
> > > multicast optimization".)
> >
> > You can find hacks at this on:
> > https://github.com/eqvinox/vpls-linux-kernel/tree/mdb-hack
> > Please note that the patches in that branch are not at an acceptable
> > quality level, but you can see the semantic relation to 802.11.
> >
> > I would, however, like to point out that this branch has pseudo-working
> > IGMP/MLD snooping for VPLS, and it'd be 20-ish lines to add it to NVGRE
> > (I'll do that as soon as I get to it, it'll pop up on that branch too.)
> >
> > This is relevant to the discussion because it's a feature which is
> > non-obvious (to me) on how to do with the VXLAN model of having an
> > entirely separate FDB. Meanwhile, with this architecture, the proof of
> > concept / hack is coming in at a measly cost of:
> > 8 files changed, 176 insertions(+), 15 deletions(-)
>
> I know the bridge is an easy target to extend L2 forwarding, but it is not
> the only option. Have you condidered building a new driver
Yes I have; I dismissed the approach because even though an fdb is
reasonable to duplicate, I did not believe replicating multicast
snooping code into both VPLS and 802.11 (and possibly VXLAN) to be a
viable option. ...is it?
> (like VXLAN does) which does the forwarding you want. Having all
> features in one driver makes for worse performance, and increased
> complexity.
Can you elaborate? I agree with that sentence as a general statement,
but a general statement needs to apply to a specific situation. As
discussed in the previous thread with Nikolay, checking skb->_refdst
against 0 should be doable without touching additional cachelines, so
the performance cost should be rather small. For complexity - it's
keeping an extra pointer around, which is semantically bound to the
existing net_bridge_fdb_entry->dst. On the other hand, it spares us
from another copy of a fdb implementation, and two copies of multicast
snooping code... I honestly believe this patchset is a good approach.
-David
next prev parent reply other threads:[~2017-08-22 0:29 UTC|newest]
Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-08-21 17:15 [Bridge] [RFC net-next v2] bridge lwtunnel, VPLS & NVGRE David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 1/6] bridge: lwtunnel support in FDB David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 2/6] bridge: lwtunnel netlink interface David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 3/6] gretap: support lwtunnel under bridge (NVGRE) David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 4/6] mpls: split forwarding path on rx/tx boundary David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 5/6] mpls: add VPLS entry points David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-21 17:15 ` [Bridge] [PATCH 6/6] mpls: VPLS support David Lamparter
2017-08-21 17:15 ` David Lamparter
2017-08-28 9:21 ` Amine Kherbouche
2017-08-28 9:21 ` [Bridge] " Amine Kherbouche
2017-08-22 0:01 ` [Bridge] [RFC net-next v2] bridge lwtunnel, VPLS & NVGRE Stephen Hemminger
2017-08-22 0:01 ` Stephen Hemminger
2017-08-22 0:29 ` David Lamparter [this message]
2017-08-22 0:29 ` David Lamparter
2017-08-22 11:01 ` [Bridge] " Nikolay Aleksandrov
2017-08-22 11:01 ` Nikolay Aleksandrov
2017-08-22 11:32 ` [Bridge] " David Lamparter
2017-08-22 11:32 ` David Lamparter
2017-08-22 11:55 ` [Bridge] " Nikolay Aleksandrov
2017-08-22 11:55 ` Nikolay Aleksandrov
2017-08-22 12:06 ` [Bridge] " David Lamparter
2017-08-22 12:06 ` David Lamparter
2017-08-22 4:43 ` [Bridge] " Roopa Prabhu
2017-08-22 4:43 ` Roopa Prabhu
2017-08-22 11:24 ` [Bridge] " David Lamparter
2017-08-22 11:24 ` David Lamparter
2017-09-11 8:02 ` Amine Kherbouche
2017-09-11 8:02 ` [Bridge] " Amine Kherbouche
2017-09-19 14:46 ` Amine Kherbouche
2017-09-19 14:46 ` [Bridge] " Amine Kherbouche
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170822002956.GY773745@eidolon \
--to=equinox@diac24.net \
--cc=amine.kherbouche@6wind.com \
--cc=bridge@lists.linux-foundation.org \
--cc=netdev@vger.kernel.org \
--cc=roopa@cumulusnetworks.com \
--cc=stephen@networkplumber.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.