From: Tom Herbert <tom@herbertland.com>
To: <davem@davemloft.net>, <netdev@vger.kernel.org>
Cc: <kernel-team@fb.com>
Subject: [PATCH net-next 0/6] ila: Optimization to preserve value of early demux
Date: Tue, 29 Sep 2015 15:17:17 -0700 [thread overview]
Message-ID: <1443565043-1287886-1-git-send-email-tom@herbertland.com> (raw)
In the current implementation of ILA, LWT is used to perform
translation on both the input and output paths. This is functional,
however there is a big performance hit in the receive path. Early
demux occurs before the routing lookup (a hit actually obviates the
route lookup). Therefore the stack currently performs early
demux before translation so that a local connection with ILA
addresses is never matched. Note that this issue is not just
with ILA, but pretty much any translated or encapsulated packet
handled by LWT would miss the opportunity for early demux. Solving
the general problem seems non trivial since we would need to move
the route lookup before early demx thereby mitigating the value.
This patch set addresses the issue for ILA by adding a fast locator
lookup that occurs before early demux. This is done by creating an
XFRM hook to perform address translation early in the receive path.
For the backend we implement an rhashtable that contains identifier
to locator to mappings. The table also allows more specific matches
that include original locator and interface.
This patch set:
- Add an rhashtable function to atomically replace and element.
This is useful to implement sub-trees from a table entry
without needing to use a special anchor structure as the
table entry.
- Add a start callback for starting a netlink dump.
- Creates an ila directory under net/ipv6 and moves ila.c to it.
ila.c is split into ila_common.c and ila_lwt.c.
- Implement a table to do identifier->locator mapping. This is
an rhashtable.
- Configuration for the table with netlink.
- Add XFRM xlat_addr facility. This includes a callback registeration
function and hook to call registered callbacks.
- Call xfrm6_xlat_addr from ipv6_rcv before NF_HOOK and routing.
Testing:
Running 200 netperf TCP_RR streams
No ILA, baseline
85.72% CPU utilization
1861945 tps
93/163/330 50/90/99% latencies
ILA before fix (LWT on both input and output)
83.47 CPU utilization
16583186 tps (-11% from baseline)
107/183/338 50/90/99% latencies
ILA after fix (hook for input)
84.97% CPU utilization
1833948 tps (-1.5% from baseline)
95/164/331 50/90/99% latencies
Hacked DNPT to do ILA
80.94% CPU utilization
1683315 tps (-10% from baseline)
104/179/350 50/90/99% latencies
Tom Herbert (6):
ila: Create net/ipv6/ila directory
rhashtable: add function to replace an element
netlink: add a start callback for starting a netlink dump
xfrm: Add xfrm6 address translation function
ipv6: Call xfrm6_xlat_addr from ipv6_rcv
ila: Add support for xfrm6_xlat_addr
include/linux/netlink.h | 2 +
include/linux/rhashtable.h | 82 ++++++
include/net/genetlink.h | 2 +
include/net/xfrm.h | 25 ++
include/uapi/linux/ila.h | 22 ++
net/ipv6/Kconfig | 5 +
net/ipv6/Makefile | 3 +-
net/ipv6/ila.c | 229 ----------------
net/ipv6/ila/Makefile | 7 +
net/ipv6/ila/ila.h | 48 ++++
net/ipv6/ila/ila_common.c | 103 ++++++++
net/ipv6/ila/ila_lwt.c | 152 +++++++++++
net/ipv6/ila/ila_xlat.c | 642 +++++++++++++++++++++++++++++++++++++++++++++
net/ipv6/ip6_input.c | 3 +
net/ipv6/xfrm6_policy.c | 7 +
net/ipv6/xfrm6_xlat_addr.c | 66 +++++
net/netlink/af_netlink.c | 4 +
net/netlink/genetlink.c | 16 ++
18 files changed, 1188 insertions(+), 230 deletions(-)
delete mode 100644 net/ipv6/ila.c
create mode 100644 net/ipv6/ila/Makefile
create mode 100644 net/ipv6/ila/ila.h
create mode 100644 net/ipv6/ila/ila_common.c
create mode 100644 net/ipv6/ila/ila_lwt.c
create mode 100644 net/ipv6/ila/ila_xlat.c
create mode 100644 net/ipv6/xfrm6_xlat_addr.c
--
2.4.6
next reply other threads:[~2015-09-29 22:17 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-09-29 22:17 Tom Herbert [this message]
2015-09-29 22:17 ` [PATCH net-next 1/6] ila: Create net/ipv6/ila directory Tom Herbert
2015-09-29 22:17 ` [PATCH net-next 2/6] rhashtable: add function to replace an element Tom Herbert
2015-09-29 22:17 ` [PATCH net-next 3/6] netlink: add a start callback for starting a netlink dump Tom Herbert
2015-09-29 22:17 ` [PATCH net-next 4/6] xfrm: Add xfrm6 address translation function Tom Herbert
2015-09-29 22:58 ` David Ahern
2015-09-30 8:39 ` Steffen Klassert
2015-09-29 22:17 ` [PATCH net-next 5/6] ipv6: Call xfrm6_xlat_addr from ipv6_rcv Tom Herbert
2015-09-29 23:26 ` Florian Westphal
2015-09-30 9:06 ` Steffen Klassert
2015-09-30 18:40 ` Tom Herbert
2015-09-29 22:17 ` [PATCH net-next 6/6] ila: Add support for xfrm6_xlat_addr Tom Herbert
2015-09-29 22:34 ` kbuild test robot
2015-09-29 22:49 ` kbuild test robot
2015-09-29 23:18 ` kbuild test robot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1443565043-1287886-1-git-send-email-tom@herbertland.com \
--to=tom@herbertland.com \
--cc=davem@davemloft.net \
--cc=kernel-team@fb.com \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).