netdev.vger.kernel.org archive mirror
* [PATCH net-next v3 00/10] net: openvswitch: Add sample multicasting.
@ 2024-06-19 21:00 Adrian Moreno
From: Adrian Moreno @ 2024-06-19 21:00 UTC
  To: netdev
  Cc: aconole, echaudro, horms, i.maximets, dev, Adrian Moreno,
	linux-kernel, linux-kselftest

** Background **
Currently, OVS supports several packet sampling mechanisms (sFlow,
per-bridge IPFIX, per-flow IPFIX). These end up being translated into a
userspace action that needs to be handled by ovs-vswitchd's handler
threads only to be forwarded to some third party application that
will somehow process the sample and provide observability on the
datapath.

A particularly interesting use-case is controller-driven
per-flow IPFIX sampling, where the OpenFlow controller can add metadata
to samples (via two 32-bit integers) and this metadata is then available
to the sample-collecting system for correlation.

** Problem **
Sampled traffic shares netlink sockets and handler thread time with
upcalls. Apart from being a performance bottleneck in the sample
extraction itself, this can severely compromise the datapath, making
this solution unfit for highly loaded production systems.

Users are left with few options other than guessing what sampling
rate will be OK for their traffic pattern and system load, and dealing
with the lost accuracy.

Looking at available infrastructure, an obvious candidate would be
psample. However, its current state does not help with the use-case
at hand because sampled packets do not contain user-defined
metadata.

** Proposal **
This series is an attempt to fix this situation by extending the
existing psample infrastructure to carry a variable-length
user-defined cookie.

The main existing user of psample is tc's act_sample. It is also
extended to forward the action's cookie to psample.

Finally, a new OVS action (OVS_SAMPLE_ATTR_EMIT_SAMPLE) is created.
It accepts a group and an optional cookie and uses psample to
multicast the packet and the metadata.
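
As an illustration of the metadata mentioned in the background (two
32-bit integers added by the controller), the sketch below shows one way
such a pair could be encoded into, and decoded from, an opaque 8-byte
cookie. It is not taken from the series: the function names, the field
names obs_domain_id and obs_point_id, and the network byte order are all
assumptions made for the example. The actual cookie layout is whatever
the flow's creator chooses, since the cookie is treated as opaque
variable-length data.

```python
import struct

# Hypothetical layout: two unsigned 32-bit integers, network byte order.
# The byte order and field names are assumptions for this sketch only.
COOKIE_FMT = "!II"


def pack_cookie(obs_domain_id: int, obs_point_id: int) -> bytes:
    """Encode two 32-bit metadata values as an 8-byte opaque cookie."""
    return struct.pack(COOKIE_FMT, obs_domain_id, obs_point_id)


def unpack_cookie(cookie: bytes) -> tuple[int, int]:
    """Decode a cookie produced by pack_cookie() on the collector side."""
    return struct.unpack(COOKIE_FMT, cookie)


cookie = pack_cookie(0x0A, 0x1234)
domain, point = unpack_cookie(cookie)
```

A sample collector listening on the psample multicast group would apply
the same decoding to the cookie attribute of each received sample in
order to correlate it with the flow that produced it.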

--
v2 -> v3:
- Addressed comments from Simon, Aaron and Ilya.
- Dropped probability propagation in nested sample actions.
- Dropped patch v2's 7/9 in favor of a userspace implementation and
  consume skb if emit_sample is the last action, same as we do with
  userspace.
- Split ovs-dpctl.py features in independent patches.

v1 -> v2:
- Create a new action ("emit_sample") rather than reuse existing
  "sample" one.
- Add probability semantics to psample's sampling rate.
- Store sampling probability in skb's cb area and use it in emit_sample.
- Test combining "emit_sample" with "trunc"
- Drop group_id filtering and tracepoint in psample.

rfc_v2 -> v1:
- Accommodate Ilya's comments.
- Split OVS's attribute in two attributes and simplify internal
  handling of psample arguments.
- Extend psample and tc with a user-defined cookie.
- Add a tracepoint to psample to facilitate troubleshooting.

rfc_v1 -> rfc_v2:
- Use psample instead of a new OVS-only multicast group.
- Extend psample and tc with a user-defined cookie.

Adrian Moreno (10):
  net: psample: add user cookie
  net: sched: act_sample: add action cookie to sample
  net: psample: skip packet copy if no listeners
  net: psample: allow using rate as probability
  net: openvswitch: add emit_sample action
  net: openvswitch: store sampling probability in cb.
  selftests: openvswitch: add emit_sample action
  selftests: openvswitch: add userspace parsing
  selftests: openvswitch: parse trunc action
  selftests: openvswitch: add emit_sample test

 Documentation/netlink/specs/ovs_flow.yaml     |  17 ++
 include/net/psample.h                         |   5 +-
 include/uapi/linux/openvswitch.h              |  30 +-
 include/uapi/linux/psample.h                  |  11 +-
 include/uapi/linux/tc_act/tc_sample.h         |   1 +
 net/openvswitch/Kconfig                       |   1 +
 net/openvswitch/actions.c                     |  63 +++-
 net/openvswitch/datapath.h                    |   3 +
 net/openvswitch/flow_netlink.c                |  33 ++-
 net/openvswitch/vport.c                       |   1 +
 net/psample/psample.c                         |  16 +-
 net/sched/act_sample.c                        |  12 +
 .../selftests/net/openvswitch/openvswitch.sh  | 110 ++++++-
 .../selftests/net/openvswitch/ovs-dpctl.py    | 272 +++++++++++++++++-
 14 files changed, 559 insertions(+), 16 deletions(-)

-- 
2.45.1


Thread overview: 14+ messages
2024-06-19 21:00 [PATCH net-next v3 00/10] net: openvswitch: Add sample multicasting Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 01/10] net: psample: add user cookie Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 02/10] net: sched: act_sample: add action cookie to sample Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 03/10] net: psample: skip packet copy if no listeners Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 04/10] net: psample: allow using rate as probability Adrian Moreno
2024-06-24 21:40   ` Ilya Maximets
2024-06-19 21:00 ` [PATCH net-next v3 05/10] net: openvswitch: add emit_sample action Adrian Moreno
2024-06-20  1:39   ` Jakub Kicinski
2024-06-20  7:29     ` Adrián Moreno
2024-06-19 21:00 ` [PATCH net-next v3 06/10] net: openvswitch: store sampling probability in cb Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 07/10] selftests: openvswitch: add emit_sample action Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 08/10] selftests: openvswitch: add userspace parsing Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 09/10] selftests: openvswitch: parse trunc action Adrian Moreno
2024-06-19 21:00 ` [PATCH net-next v3 10/10] selftests: openvswitch: add emit_sample test Adrian Moreno
