From: Nicolas Dichtel <nicolas.dichtel@6wind.com>
To: Andrea Mayer <andrea.mayer@uniroma2.it>, netdev@vger.kernel.org
Cc: "David S . Miller" <davem@davemloft.net>,
David Ahern <dsahern@kernel.org>,
Eric Dumazet <edumazet@google.com>,
Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>,
Simon Horman <horms@kernel.org>,
Stefano Salsano <stefano.salsano@uniroma2.it>,
Paolo Lungaroni <paolo.lungaroni@uniroma2.it>,
Ahmed Abdelsalam <ahabdels@cisco.com>,
Justin Iurman <justin.iurman@6wind.com>,
linux-kernel@vger.kernel.org
Subject: Re: [RFC PATCH net-next 2/3] seg6: add SRv6 L2 tunnel device (srl2)
Date: Thu, 26 Mar 2026 17:44:29 +0100 [thread overview]
Message-ID: <a4a4ba6d-da13-4bb1-b6b2-1f9a5de190a7@6wind.com> (raw)
In-Reply-To: <20260322000557.12559-3-andrea.mayer@uniroma2.it>
Le 22/03/2026 à 01:05, Andrea Mayer a écrit :
> Introduce srl2, an Ethernet pseudowire device over SRv6. It
> encapsulates L2 frames in IPv6 with a Segment Routing Header for
> transmission across an SRv6 network.
>
> The encapsulation logic reuses seg6_do_srh_encap() with
> IPPROTO_ETHERNET. The transmit path uses the standard IPv6 tunnel
> infrastructure (dst_cache, ip6_route_output, ip6tunnel_xmit).
>
> The device is configured with a segment list for point-to-point
> L2 encapsulation.
>
> Usage:
>
> ip link add srl2-0 type srl2 segs fc00::a,fc00::b
>
> Co-developed-by: Stefano Salsano <stefano.salsano@uniroma2.it>
> Signed-off-by: Stefano Salsano <stefano.salsano@uniroma2.it>
> Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it>
> ---
> include/linux/srl2.h | 7 +
> include/uapi/linux/srl2.h | 20 +++
> net/ipv6/Kconfig | 16 +++
> net/ipv6/Makefile | 1 +
> net/ipv6/seg6.c | 1 +
> net/ipv6/srl2.c | 269 ++++++++++++++++++++++++++++++++++++++
> 6 files changed, 314 insertions(+)
> create mode 100644 include/linux/srl2.h
> create mode 100644 include/uapi/linux/srl2.h
> create mode 100644 net/ipv6/srl2.c
>
> diff --git a/include/linux/srl2.h b/include/linux/srl2.h
> new file mode 100644
> index 000000000000..c1342b979402
> --- /dev/null
> +++ b/include/linux/srl2.h
> @@ -0,0 +1,7 @@
> +/* SPDX-License-Identifier: GPL-2.0-or-later */
> +#ifndef _LINUX_SRL2_H
> +#define _LINUX_SRL2_H
> +
> +#include <uapi/linux/srl2.h>
> +
> +#endif
Is this really needed?
> diff --git a/include/uapi/linux/srl2.h b/include/uapi/linux/srl2.h
> new file mode 100644
> index 000000000000..e7c8f6fc0791
> --- /dev/null
> +++ b/include/uapi/linux/srl2.h
> @@ -0,0 +1,20 @@
> +/* SPDX-License-Identifier: GPL-2.0-or-later WITH Linux-syscall-note */
> +/*
> + * SRv6 L2 tunnel device
> + *
> + * Author:
> + * Andrea Mayer <andrea.mayer@uniroma2.it>
> + */
> +
> +#ifndef _UAPI_LINUX_SRL2_H
> +#define _UAPI_LINUX_SRL2_H
> +
> +enum {
> + IFLA_SRL2_UNSPEC,
> + IFLA_SRL2_SRH, /* binary: struct ipv6_sr_hdr + segments */
> + __IFLA_SRL2_MAX,
> +};
> +
> +#define IFLA_SRL2_MAX (__IFLA_SRL2_MAX - 1)
It should probably be generated automatically from specs, see
https://docs.kernel.org/userspace-api/netlink/intro-specs.html
> +
> +#endif
> diff --git a/net/ipv6/Kconfig b/net/ipv6/Kconfig
> index b8f9a8c0302e..9c8f7e254435 100644
> --- a/net/ipv6/Kconfig
> +++ b/net/ipv6/Kconfig
> @@ -318,6 +318,22 @@ config IPV6_SEG6_BPF
> depends on IPV6_SEG6_LWTUNNEL
> depends on IPV6 = y
>
> +config IPV6_SRL2
> + tristate "IPv6: SRv6 L2 tunnel device"
> + depends on IPV6_SEG6_LWTUNNEL
> + select DST_CACHE
> + help
> + SRv6 virtual Ethernet device that encapsulates L2 frames in
> + IPv6 with a Segment Routing Header (SRH) for transmission
> + over an SRv6 network.
> + Intended for use with a remote seg6local L2 decapsulation
> + behavior, such as End.DT2U or End.DX2.
> +
> + To compile this as a module, choose M here: the module will
> + be called srl2.
> +
> + If unsure, say N.
> +
> config IPV6_RPL_LWTUNNEL
> bool "IPv6: RPL Source Routing Header support"
> depends on IPV6
> diff --git a/net/ipv6/Makefile b/net/ipv6/Makefile
> index 2c9ce2ccbde1..a7e81d0293ca 100644
> --- a/net/ipv6/Makefile
> +++ b/net/ipv6/Makefile
> @@ -24,6 +24,7 @@ ipv6-$(CONFIG_SYN_COOKIES) += syncookies.o
> ipv6-$(CONFIG_NETLABEL) += calipso.o
> ipv6-$(CONFIG_IPV6_SEG6_LWTUNNEL) += seg6_iptunnel.o seg6_local.o
> ipv6-$(CONFIG_IPV6_SEG6_HMAC) += seg6_hmac.o
> +obj-$(CONFIG_IPV6_SRL2) += srl2.o
> ipv6-$(CONFIG_IPV6_RPL_LWTUNNEL) += rpl_iptunnel.o
> ipv6-$(CONFIG_IPV6_IOAM6_LWTUNNEL) += ioam6_iptunnel.o
>
> diff --git a/net/ipv6/seg6.c b/net/ipv6/seg6.c
> index 1c3ad25700c4..23213ab4fefd 100644
> --- a/net/ipv6/seg6.c
> +++ b/net/ipv6/seg6.c
> @@ -72,6 +72,7 @@ bool seg6_validate_srh(struct ipv6_sr_hdr *srh, int len, bool reduced)
>
> return true;
> }
> +EXPORT_SYMBOL_GPL(seg6_validate_srh);
>
> struct ipv6_sr_hdr *seg6_get_srh(struct sk_buff *skb, int flags)
> {
> diff --git a/net/ipv6/srl2.c b/net/ipv6/srl2.c
> new file mode 100644
> index 000000000000..66aa5375d218
> --- /dev/null
> +++ b/net/ipv6/srl2.c
> @@ -0,0 +1,269 @@
> +// SPDX-License-Identifier: GPL-2.0-or-later
> +/*
> + * SRv6 L2 tunnel device (srl2)
> + *
> + * A virtual Ethernet device that encapsulates L2 frames in IPv6 with a
> + * Segment Routing Header (SRH) for transmission over an SRv6 network.
> + * On the remote side, a seg6_local behavior such as End.DT2U or End.DX2
> + * decapsulates the inner Ethernet frame for L2 delivery.
> + *
> + * The encapsulation logic reuses seg6_do_srh_encap() from seg6_iptunnel.c
> + * with IPPROTO_ETHERNET (143). The transmit path uses the standard IPv6
> + * tunnel infrastructure (dst_cache, ip6_route_output, ip6tunnel_xmit).
> + *
> + * Authors:
> + * Andrea Mayer <andrea.mayer@uniroma2.it>
> + * Stefano Salsano <stefano.salsano@uniroma2.it>
> + */
> +
> +#include <linux/module.h>
> +#include <linux/netdevice.h>
> +#include <linux/etherdevice.h>
> +#include <net/dst_cache.h>
> +#include <net/ip6_route.h>
> +#include <net/ip_tunnels.h>
> +#include <net/ip6_tunnel.h>
> +#include <net/seg6.h>
> +#include <linux/seg6.h>
> +#include <linux/srl2.h>
> +
> +/* Conservative initial estimate for SRH size before newlink provides
> + * the actual value. 256 bytes accommodates up to 15 SIDs.
> + */
> +#define SRL2_SRH_HEADROOM_EST 256
> +
> +struct srl2_priv {
> + struct ipv6_sr_hdr *srh;
> + struct dst_cache dst_cache;
> +};
> +
> +/*
> + * srl2_xmit - encapsulate an L2 frame in IPv6+SRH and transmit
> + *
> + * When the bridge (or local stack) sends a frame through this device,
> + * skb->data points to the inner Ethernet header. We look up a route
> + * towards the first SID, prepend the outer IPv6+SRH via
> + * seg6_do_srh_encap(), and transmit via ip6tunnel_xmit().
> + *
> + * The route lookup result is cached per-cpu in dst_cache. Since the
> + * first SID is constant for the lifetime of the device, the cache
> + * avoids repeated route lookups in the common case.
> + */
> +static netdev_tx_t srl2_xmit(struct sk_buff *skb, struct net_device *dev)
> +{
> + struct srl2_priv *priv = netdev_priv(dev);
> + struct net *net = dev_net(dev);
> + struct dst_entry *dst;
> + struct flowi6 fl6;
> + int err;
> +
> + local_bh_disable();
> + dst = dst_cache_get(&priv->dst_cache);
> + local_bh_enable();
> +
> + if (unlikely(!dst)) {
> + memset(&fl6, 0, sizeof(fl6));
> + fl6.daddr = priv->srh->segments[priv->srh->first_segment];
> +
> + dst = ip6_route_output(net, NULL, &fl6);
> + if (dst->error) {
> + dst_release(dst);
> + DEV_STATS_INC(dev, tx_carrier_errors);
> + goto drop;
> + }
> +
> + if (dst_dev(dst) == dev) {
> + dst_release(dst);
> + DEV_STATS_INC(dev, collisions);
> + goto drop;
> + }
> +
> + local_bh_disable();
> + /* saddr is unused */
> + dst_cache_set_ip6(&priv->dst_cache, dst, &fl6.saddr);
> + local_bh_enable();
> + }
> +
> + skb_scrub_packet(skb, false);
> +
> + skb_dst_set(skb, dst);
> +
> + err = seg6_do_srh_encap(skb, priv->srh, IPPROTO_ETHERNET);
> + if (unlikely(err)) {
> + DEV_STATS_INC(dev, tx_errors);
> + kfree_skb(skb);
> + return NETDEV_TX_OK;
> + }
> +
> + skb->protocol = htons(ETH_P_IPV6);
> +
> + ip6tunnel_xmit(NULL, skb, dev, 0);
> +
> + return NETDEV_TX_OK;
> +
> +drop:
> + DEV_STATS_INC(dev, tx_dropped);
> + kfree_skb(skb);
> + return NETDEV_TX_OK;
> +}
> +
> +static int srl2_dev_init(struct net_device *dev)
> +{
> + struct srl2_priv *priv = netdev_priv(dev);
> +
> + return dst_cache_init(&priv->dst_cache, GFP_KERNEL);
> +}
> +
> +static void srl2_dev_uninit(struct net_device *dev)
> +{
> + struct srl2_priv *priv = netdev_priv(dev);
> +
> + dst_cache_destroy(&priv->dst_cache);
> +}
> +
> +static void srl2_dev_free(struct net_device *dev)
> +{
> + struct srl2_priv *priv = netdev_priv(dev);
> +
> + kfree(priv->srh);
> +}
> +
> +static const struct net_device_ops srl2_netdev_ops = {
> + .ndo_init = srl2_dev_init,
> + .ndo_uninit = srl2_dev_uninit,
> + .ndo_start_xmit = srl2_xmit,
> + .ndo_set_mac_address = eth_mac_addr,
> + .ndo_validate_addr = eth_validate_addr,
> +};
> +
> +static void srl2_setup(struct net_device *dev)
> +{
> + ether_setup(dev);
> +
> + dev->netdev_ops = &srl2_netdev_ops;
> + dev->needs_free_netdev = true;
> + dev->pcpu_stat_type = NETDEV_PCPU_STAT_DSTATS;
> + dev->needed_headroom = LL_MAX_HEADER + sizeof(struct ipv6hdr) +
> + SRL2_SRH_HEADROOM_EST;
> +
> + dev->priv_flags &= ~IFF_TX_SKB_SHARING;
> + dev->priv_flags |= IFF_LIVE_ADDR_CHANGE | IFF_NO_QUEUE;
> + dev->lltx = true;
> +
Maybe setting dev->netns_immutable to true ?
Regards,
Nicolas
> + eth_hw_addr_random(dev);
> +}
> +
> +static const struct nla_policy srl2_policy[IFLA_SRL2_MAX + 1] = {
> + [IFLA_SRL2_SRH] = { .type = NLA_BINARY },
> +};
> +
> +static int srl2_validate(struct nlattr *tb[], struct nlattr *data[],
> + struct netlink_ext_ack *extack)
> +{
> + if (!data || !data[IFLA_SRL2_SRH]) {
> + NL_SET_ERR_MSG(extack, "SRH with segment list is required");
> + return -EINVAL;
> + }
> +
> + return 0;
> +}
> +
> +static int srl2_newlink(struct net_device *dev,
> + struct rtnl_newlink_params *params,
> + struct netlink_ext_ack *extack)
> +{
> + struct srl2_priv *priv = netdev_priv(dev);
> + struct nlattr **data = params->data;
> + struct ipv6_sr_hdr *srh;
> + int srhlen;
> + int len;
> +
> + srh = nla_data(data[IFLA_SRL2_SRH]);
> + len = nla_len(data[IFLA_SRL2_SRH]);
> +
> + if (len < sizeof(*srh) + sizeof(struct in6_addr)) {
> + NL_SET_ERR_MSG(extack, "SRH too short");
> + return -EINVAL;
> + }
> +
> + if (!seg6_validate_srh(srh, len, false)) {
> + NL_SET_ERR_MSG(extack, "Invalid SRH");
> + return -EINVAL;
> + }
> +
> + priv->srh = kmemdup(srh, len, GFP_KERNEL);
> + if (!priv->srh)
> + return -ENOMEM;
> +
> + srhlen = ipv6_optlen(srh);
> +
> + dev->needed_headroom = LL_MAX_HEADER + sizeof(struct ipv6hdr) + srhlen;
> +
> + /* dev->mtu is the inner L3 payload size. Since SRv6 encapsulation
> + * carries the full inner Ethernet frame, subtract both the outer
> + * IPv6+SRH overhead and ETH_HLEN from ETH_DATA_LEN.
> + */
> + dev->mtu = ETH_DATA_LEN - sizeof(struct ipv6hdr) - srhlen - ETH_HLEN;
> + dev->min_mtu = ETH_MIN_MTU;
> + dev->max_mtu = IP_MAX_MTU - sizeof(struct ipv6hdr) - srhlen - ETH_HLEN;
> +
> + dev->priv_destructor = srl2_dev_free;
> +
> + return register_netdevice(dev);
> +}
> +
> +static void srl2_dellink(struct net_device *dev, struct list_head *head)
> +{
> + unregister_netdevice_queue(dev, head);
> +}
> +
> +static size_t srl2_get_size(const struct net_device *dev)
> +{
> + const struct srl2_priv *priv = netdev_priv(dev);
> + int srhlen = ipv6_optlen(priv->srh);
> +
> + return nla_total_size(srhlen);
> +}
> +
> +static int srl2_fill_info(struct sk_buff *skb, const struct net_device *dev)
> +{
> + const struct srl2_priv *priv = netdev_priv(dev);
> + int srhlen = ipv6_optlen(priv->srh);
> +
> + if (nla_put(skb, IFLA_SRL2_SRH, srhlen, priv->srh))
> + return -EMSGSIZE;
> +
> + return 0;
> +}
> +
> +static struct rtnl_link_ops srl2_link_ops __read_mostly = {
> + .kind = "srl2",
> + .maxtype = IFLA_SRL2_MAX,
> + .policy = srl2_policy,
> + .priv_size = sizeof(struct srl2_priv),
> + .setup = srl2_setup,
> + .validate = srl2_validate,
> + .newlink = srl2_newlink,
> + .dellink = srl2_dellink,
> + .get_size = srl2_get_size,
> + .fill_info = srl2_fill_info,
> +};
> +
> +static int __init srl2_init(void)
> +{
> + return rtnl_link_register(&srl2_link_ops);
> +}
> +
> +static void __exit srl2_exit(void)
> +{
> + rtnl_link_unregister(&srl2_link_ops);
> +}
> +
> +module_init(srl2_init);
> +module_exit(srl2_exit);
> +
> +MODULE_AUTHOR("Andrea Mayer <andrea.mayer@uniroma2.it>");
> +MODULE_AUTHOR("Stefano Salsano <stefano.salsano@uniroma2.it>");
> +MODULE_DESCRIPTION("SRv6 L2 tunnel device");
> +MODULE_LICENSE("GPL");
> +MODULE_ALIAS_RTNL_LINK("srl2");
next prev parent reply other threads:[~2026-03-26 16:44 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-22 0:05 [RFC PATCH net-next 0/3] seg6: SRv6 L2 VPN with End.DT2U and srl2 device Andrea Mayer
2026-03-22 0:05 ` [RFC PATCH net-next 1/3] seg6: add support for the SRv6 End.DT2U behavior Andrea Mayer
2026-03-22 0:05 ` [RFC PATCH net-next 2/3] seg6: add SRv6 L2 tunnel device (srl2) Andrea Mayer
2026-03-24 16:08 ` Justin Iurman
2026-03-24 16:24 ` Justin Iurman
2026-03-25 13:43 ` Justin Iurman
2026-03-26 17:29 ` Stefano Salsano
2026-03-26 16:44 ` Nicolas Dichtel [this message]
2026-03-22 0:05 ` [RFC PATCH net-next 3/3] selftests: seg6: add SRv6 srl2 + End.DT2U L2 VPN test Andrea Mayer
2026-03-24 16:00 ` [RFC PATCH net-next 0/3] seg6: SRv6 L2 VPN with End.DT2U and srl2 device Justin Iurman
2026-03-25 7:10 ` Stefano Salsano
2026-03-25 8:35 ` Justin Iurman
2026-03-26 16:30 ` Nicolas Dichtel
2026-03-26 17:30 ` Stefano Salsano
2026-03-26 16:32 ` Nicolas Dichtel
2026-03-27 1:09 ` Stefano Salsano
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=a4a4ba6d-da13-4bb1-b6b2-1f9a5de190a7@6wind.com \
--to=nicolas.dichtel@6wind.com \
--cc=ahabdels@cisco.com \
--cc=andrea.mayer@uniroma2.it \
--cc=davem@davemloft.net \
--cc=dsahern@kernel.org \
--cc=edumazet@google.com \
--cc=horms@kernel.org \
--cc=justin.iurman@6wind.com \
--cc=kuba@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=paolo.lungaroni@uniroma2.it \
--cc=stefano.salsano@uniroma2.it \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox