All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH net-next v5 00/16] netkit: Support for io_uring zero-copy and AF_XDP
@ 2026-01-09 21:26 Daniel Borkmann
  2026-01-09 21:26 ` [PATCH net-next v5 01/16] net: Add queue-create operation Daniel Borkmann
                   ` (15 more replies)
  0 siblings, 16 replies; 28+ messages in thread
From: Daniel Borkmann @ 2026-01-09 21:26 UTC (permalink / raw)
  To: netdev
  Cc: bpf, kuba, davem, razor, pabeni, willemb, sdf, john.fastabend,
	martin.lau, jordan, maciej.fijalkowski, magnus.karlsson, dw, toke,
	yangzhenze, wangdongdong.6

Containers use virtual netdevs to route traffic from a physical netdev
in the host namespace. They do not have access to the physical netdev
in the host and thus can't use memory providers or AF_XDP that require
reconfiguring/restarting queues in the physical netdev.

This patchset adds the concept of queue leasing to virtual netdevs that
allow containers to use memory providers and AF_XDP at native speed.
Leased queues are bound to a real queue in a physical netdev and act
as a proxy.

Memory providers and AF_XDP operations take an ifindex and queue id,
so containers would pass in an ifindex for a virtual netdev and a queue
id of a leased queue, which then gets proxied to the underlying real
queue.

We have implemented support for this concept in netkit and tested the
latter against Nvidia ConnectX-6 (mlx5) as well as Broadcom BCM957504
(bnxt_en) 100G NICs. For more details see the individual patches.

v4->v5:
 - Rework of the core API into queue-create op (Jakub)
 - Rename from queue peering to queue leasing (Jakub)
 - Add net selftests for queue leasing (Stan, Jakub)
 - Move netkit_queue_get_dma_dev into core (Jakub)
 - Dropped netkit_get_channels (Jakub)
 - Moved ndo_queue_create back to return index or error (Jakub)
 - Inline __netdev_rx_queue_{peer,unpeer} helpers (Jakub)
 - Adding helpers in patches where they are used (Jakub)
 - Undo inline for netdev_put_lock (Jakub)
 - Factoring out checks whether device can lease (Jakub)
 - Fix up return codes in netdev_nl_bind_queue_doit (Jakub)
 - Reject when AF_XDP or mp already bound (Jakub)
 - Switch some error cases to NL_SET_BAD_ATTR() (Jakub)
 - Rebase and retested everything with mlx5 + bnxt_en
v3->v4:
 - ndo_queue_create store dst queue via arg (Nikolay)
 - Small nits like a spelling issue + rev xmas (Nikolay)
 - admin-perm flag in bind-queue spec (Jakub)
 - Fix potential ABBA deadlock situation in bind (Jakub, Paolo, Stan)
 - Add a peer dev_tracker to not reuse the sysfs one (Jakub)
 - New patch (12/14) to handle the underlying device going away (Jakub)
 - Improve commit message on queue-get (Jakub)
 - Do not expose phys dev info from container on queue-get (Jakub)
 - Add netif_put_rx_queue_peer_locked to simplify code (Stan)
 - Rework xsk handling to simplify the code and drop a few patches
 - Rebase and retested everything with mlx5 + bnxt_en
v2->v3:
 - Use netdev_ops_assert_locked instead of netdev_assert_locked (syzbot)
 - Add missing netdev_lockdep_set_classes in netkit
v1->v2:
 - Removed bind sample ynl code (Stan)
 - Reworked netdev locking to have consistent order (Stan, Kuba)
 - Return 'not supported' in API patch (Stan)
 - Improved ynl documentation (Kuba)
 - Added 'max: s32-max' in ynl spec for ifindex (Kuba)
 - Added also queue type in ynl to have user specify rx to make
   it obvious (Kuba)
 - Use of netdev_hold (Kuba)
 - Avoid static inlines from another header (Kuba)
 - Squashed some commits (Kuba, Stan)
 - Removed ndo_{peer,unpeer}_queues callback and simplified
   code (Kuba)
 - Improved commit messages (Toke, Kuba, Stan, zf)
 - Got rid of locking genl_sk_priv_get (Stan)
 - Removed af_xdp cleanup churn (Maciej)
 - Added netdev locking asserts (Stan)
 - Reject ethtool ioctl path queue resizing (Kuba)
 - Added kdoc for ndo_queue_create (Stan)
 - Uninvert logic in netkit single dev mode (Jordan)
 - Added binding support for multiple queues

Daniel Borkmann (9):
  net: Add queue-create operation
  net: Implement netdev_nl_queue_create_doit
  net: Add lease info to queue-get response
  net, ethtool: Disallow leased real rxqs to be resized
  xsk: Extend xsk_rcv_check validation
  xsk: Proxy pool management for leased queues
  netkit: Add single device mode for netkit
  netkit: Add netkit notifier to check for unregistering devices
  netkit: Add xsk support for af_xdp applications

David Wei (7):
  net: Proxy net_mp_{open,close}_rxq for leased queues
  net: Proxy netdev_queue_get_dma_dev for leased queues
  netkit: Implement rtnl_link_ops->alloc and ndo_queue_create
  selftests/net: Add bpf skb forwarding program
  selftests/net: Add env for container based tests
  selftests/net: Make NetDrvContEnv support queue leasing
  selftests/net: Add netkit container tests

 Documentation/netlink/specs/netdev.yaml       |  44 +++
 drivers/net/netkit.c                          | 359 +++++++++++++++---
 include/linux/netdevice.h                     |   6 +
 include/net/netdev_queues.h                   |  19 +-
 include/net/netdev_rx_queue.h                 |  21 +-
 include/net/page_pool/memory_provider.h       |   4 +-
 include/net/xdp_sock_drv.h                    |   2 +-
 include/uapi/linux/if_link.h                  |   6 +
 include/uapi/linux/netdev.h                   |  11 +
 net/core/dev.c                                |   7 +
 net/core/dev.h                                |   2 +
 net/core/netdev-genl-gen.c                    |  20 +
 net/core/netdev-genl-gen.h                    |   2 +
 net/core/netdev-genl.c                        | 185 +++++++++
 net/core/netdev_queues.c                      |  74 +++-
 net/core/netdev_rx_queue.c                    | 173 +++++++--
 net/ethtool/channels.c                        |  12 +-
 net/ethtool/ioctl.c                           |   9 +-
 net/xdp/xsk.c                                 |  73 +++-
 tools/include/uapi/linux/netdev.h             |  11 +
 .../testing/selftests/drivers/net/README.rst  |   7 +
 .../selftests/drivers/net/hw/.gitignore       |   2 +
 .../testing/selftests/drivers/net/hw/Makefile |   2 +
 .../drivers/net/hw/lib/py/__init__.py         |   7 +-
 .../selftests/drivers/net/hw/nk_forward.bpf.c |  49 +++
 .../selftests/drivers/net/hw/nk_netns.py      |  23 ++
 .../selftests/drivers/net/hw/nk_qlease.py     |  55 +++
 .../selftests/drivers/net/lib/py/__init__.py  |   7 +-
 .../selftests/drivers/net/lib/py/env.py       | 148 +++++++-
 29 files changed, 1222 insertions(+), 118 deletions(-)
 create mode 100644 tools/testing/selftests/drivers/net/hw/nk_forward.bpf.c
 create mode 100755 tools/testing/selftests/drivers/net/hw/nk_netns.py
 create mode 100755 tools/testing/selftests/drivers/net/hw/nk_qlease.py

-- 
2.43.0


^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2026-01-13 16:58 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-01-09 21:26 [PATCH net-next v5 00/16] netkit: Support for io_uring zero-copy and AF_XDP Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 01/16] net: Add queue-create operation Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 02/16] net: Implement netdev_nl_queue_create_doit Daniel Borkmann
2026-01-11 20:55   ` Stanislav Fomichev
2026-01-12  9:15     ` Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 03/16] net: Add lease info to queue-get response Daniel Borkmann
2026-01-13  3:53   ` [net-next,v5,03/16] " Jakub Kicinski
2026-01-13 10:35     ` Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 04/16] net, ethtool: Disallow leased real rxqs to be resized Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 05/16] net: Proxy net_mp_{open,close}_rxq for leased queues Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 06/16] net: Proxy netdev_queue_get_dma_dev " Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 07/16] xsk: Extend xsk_rcv_check validation Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 08/16] xsk: Proxy pool management for leased queues Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 09/16] netkit: Add single device mode for netkit Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 10/16] netkit: Implement rtnl_link_ops->alloc and ndo_queue_create Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 11/16] netkit: Add netkit notifier to check for unregistering devices Daniel Borkmann
2026-01-13  3:53   ` [net-next,v5,11/16] " Jakub Kicinski
2026-01-09 21:26 ` [PATCH net-next v5 12/16] netkit: Add xsk support for af_xdp applications Daniel Borkmann
2026-01-09 21:26 ` [PATCH net-next v5 13/16] selftests/net: Add bpf skb forwarding program Daniel Borkmann
2026-01-11 20:59   ` Stanislav Fomichev
2026-01-13 16:57     ` David Wei
2026-01-09 21:26 ` [PATCH net-next v5 14/16] selftests/net: Add env for container based tests Daniel Borkmann
2026-01-13  3:58   ` Jakub Kicinski
2026-01-13 16:58     ` David Wei
2026-01-09 21:26 ` [PATCH net-next v5 15/16] selftests/net: Make NetDrvContEnv support queue leasing Daniel Borkmann
2026-01-13  3:59   ` Jakub Kicinski
2026-01-13 16:58     ` David Wei
2026-01-09 21:26 ` [PATCH net-next v5 16/16] selftests/net: Add netkit container tests Daniel Borkmann

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.