From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by smtp.lore.kernel.org (Postfix) with ESMTP id 97E28D73EAA for ; Fri, 30 Jan 2026 01:15:38 +0000 (UTC) Received: from mails.dpdk.org (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id ABE4B40A7A; Fri, 30 Jan 2026 02:14:22 +0100 (CET) Received: from mail-wm1-f54.google.com (mail-wm1-f54.google.com [209.85.128.54]) by mails.dpdk.org (Postfix) with ESMTP id A6E794067E for ; Fri, 30 Jan 2026 02:14:12 +0100 (CET) Received: by mail-wm1-f54.google.com with SMTP id 5b1f17b1804b1-47edd6111b4so15962165e9.1 for ; Thu, 29 Jan 2026 17:14:12 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=networkplumber-org.20230601.gappssmtp.com; s=20230601; t=1769735652; x=1770340452; darn=dpdk.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=0QisYXEnWlfMJY5zbInCw9rwciXwDHwbzq0/FjUtsd8=; b=bFF0yEAToRwKXhw29CW81hvqVZZDxIOSIyv2VjqIBsUmr8QBtPbOiUWpw+9NLqGHSw OTs++2XSPZvizQP69lMYssr9Rm7vuDAaQJ5ZKzVWgp/YKwMSW7cnH8UYg0n1eN0OfuV1 TYmbpdaIoux3vpZN1GRb3u/cLhQCLj8wxbvWZMlPGG3yiRnD25mrD+/IqpRMinzfoUDN lC0DTcUXxHXCgf8Pdgxdx6yBpAR8wvTzizdGli605WHWEioeGlL3yBzLv8wpRLeXZkBv 3i1Aa/Io2LTE2TSe5TbHFofYKab75u+G/MQXbjWKtWDdJIoZtZN2IKo/CYPxK1nYJpb7 6Buw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1769735652; x=1770340452; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-gg:x-gm-message-state:from :to:cc:subject:date:message-id:reply-to; bh=0QisYXEnWlfMJY5zbInCw9rwciXwDHwbzq0/FjUtsd8=; b=ra0DVAllBL/jlBjwYdYXNcfcCGkSMQd7jSzHd3SwmM0sEQBzcrJlMX8JHqkxO2db+G EgeHQTU3FViaCh+mnVL197AysGje6uT7Ik7WVZduKawxeIEoLSPT04JqBPVf8FSVwdEN ZQmyTIIJbrphPE84hibHJ8iJw/9iGneS2gcGmqPnUodUcET8WRJzo3WrETkyG5xi3dWY SMC+PwuBR/DWkEpMl9m1bViREL+wlY3Tdrc+K0KuxKHJagdgmq4hJ3i9bhzEwRhlNSoU qw7SpipAjmFi4o0EW9FViL9xF7poahAN7uKNOvdb95cadqYdo6yX85SdG07qvIQ29w3T DZiQ== X-Gm-Message-State: AOJu0Yzecb7i6YWHcqe79w7J0/dFmhIPuT+NcgdMf9v7skyS25bGGAM1 SGVmP27HRWQKua2y9gG3Jw07FGY4SpAlMtnemb9H7yAQAHgpDd8BNCZ7jL6HZZYSbbpKcnb1TMU aRSn4 X-Gm-Gg: AZuq6aIeC6zdOxBKWU2hXnOPIIP32jz71XMpPltp6zhtEnA5qLjsEmMOeer5X+c5Rrx Y1g8lu7cXUurV3mJ9sy/W0Ti860Chs4cTV2xP0nHr4UQMU4wpyyWrnRC2w+jSmHtph+Ngnxgaxu A1VpP/hVx9fJpkycgiEmlcTSbxJ3Kub4oVcd6FKoihA8h67NnWS5vkNoo4qM/2ix1TYuxjoyV6E yIVIDwKMYcluXza8vyNjwFhcBccG3gMeuz1RIT8Ol1xwRNuhRJjh5mGI/9s485suDMKb9154Wgo kZ6Qegk6u36+RYJDOb0oWvvPBE0anPlBtMzt3r5UbU/jCDp/LCv1nnOg49dQRoK4JlVC2V1rScH eIPQaTharV+Bq20HP+E5r1SLWB+o8D86ymFObUSqGlK5kOkXqrqYgy053jgI4Lp3ln9Gdb47g2r Ag7fY7De+NLvo581QfIzLIyd7zldQqg70HyKPhfFY1HIX70xpvXg== X-Received: by 2002:a05:600c:a00e:b0:477:a54a:acba with SMTP id 5b1f17b1804b1-482db481bffmr13228765e9.17.1769735652186; Thu, 29 Jan 2026 17:14:12 -0800 (PST) Received: from phoenix.lan (204-195-96-226.wavecable.com. [204.195.96.226]) by smtp.gmail.com with ESMTPSA id 5b1f17b1804b1-481a5e1d0f9sm2934825e9.18.2026.01.29.17.14.10 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 29 Jan 2026 17:14:11 -0800 (PST) From: Stephen Hemminger To: dev@dpdk.org Cc: Stephen Hemminger , Bruce Richardson Subject: [PATCH v10 13/19] net/pcap: support nanosecond timestamp precision Date: Thu, 29 Jan 2026 17:12:29 -0800 Message-ID: <20260130011344.204233-14-stephen@networkplumber.org> X-Mailer: git-send-email 2.51.0 In-Reply-To: <20260130011344.204233-1-stephen@networkplumber.org> References: <20260106182823.192350-1-stephen@networkplumber.org> <20260130011344.204233-1-stephen@networkplumber.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Enable nanosecond-precision timestamps for both live capture and pcap file reading. Replace pcap_open_live() with the pcap_create()/pcap_activate() API, which allows setting PCAP_TSTAMP_PRECISION_NANO before activation. Similarly, use pcap_open_offline_with_tstamp_precision() for reading pcap files. The pcap_pkthdr timestamp field, despite being declared as struct timeval, actually contains nanoseconds (not microseconds) when nanosecond precision is requested. Make receive timestamp offloading conditional: timestamps are now only written to the mbuf dynamic field when RTE_ETH_RX_OFFLOAD_TIMESTAMP is enabled. Previously, timestamps were unconditionally added to every received packet. Other related changes: * Defer timestamp dynfield registration from probe to device start, and only when timestamp offloading is enabled * Add read_clock dev_op returning current UTC time for timestamp correlation * Move per-burst timestamp calculation outside the packet loop in tx_dumper * Enable immediate mode and improve error reporting in live capture setup Signed-off-by: Stephen Hemminger --- doc/guides/nics/pcap_ring.rst | 3 + doc/guides/rel_notes/release_26_03.rst | 2 + drivers/net/pcap/pcap_ethdev.c | 145 +++++++++++++++++++------ 3 files changed, 114 insertions(+), 36 deletions(-) diff --git a/doc/guides/nics/pcap_ring.rst b/doc/guides/nics/pcap_ring.rst index c005786ce3..5b9ca71b18 100644 --- a/doc/guides/nics/pcap_ring.rst +++ b/doc/guides/nics/pcap_ring.rst @@ -224,6 +224,9 @@ Features and Limitations ``RTE_ETH_TX_OFFLOAD_VLAN_INSERT`` is enabled and the mbuf has ``RTE_MBUF_F_TX_VLAN`` set. +* The PMD will insert the pcap header packet timestamp with nanoseconds resolution and + UNIX origin, i.e. time since 1-JAN-1970 UTC, if ``RTE_ETH_RX_OFFLOAD_TIMESTAMP`` is enabled. + Rings-based PMD ~~~~~~~~~~~~~~~ diff --git a/doc/guides/rel_notes/release_26_03.rst b/doc/guides/rel_notes/release_26_03.rst index 0264968567..50ba8bf109 100644 --- a/doc/guides/rel_notes/release_26_03.rst +++ b/doc/guides/rel_notes/release_26_03.rst @@ -59,6 +59,8 @@ New Features * Added support for VLAN insertion and stripping. * Added support for reporting link state and speed in ``iface`` mode. + * Receive timestamp offload is only done if offload flag set. + * Receive timestamps support nanosecond precision. Removed Items diff --git a/drivers/net/pcap/pcap_ethdev.c b/drivers/net/pcap/pcap_ethdev.c index 917a8eee36..90bedb6286 100644 --- a/drivers/net/pcap/pcap_ethdev.c +++ b/drivers/net/pcap/pcap_ethdev.c @@ -28,13 +28,11 @@ #include #include #include +#include #include "pcap_osdep.h" #define RTE_ETH_PCAP_SNAPSHOT_LEN 65535 -#define RTE_ETH_PCAP_SNAPLEN RTE_ETHER_MAX_JUMBO_FRAME_LEN -#define RTE_ETH_PCAP_PROMISC 1 -#define RTE_ETH_PCAP_TIMEOUT -1 #define ETH_PCAP_RX_PCAP_ARG "rx_pcap" #define ETH_PCAP_TX_PCAP_ARG "tx_pcap" @@ -78,6 +76,7 @@ struct pcap_rx_queue { uint16_t port_id; uint16_t queue_id; bool vlan_strip; + bool timestamp_offloading; struct rte_mempool *mb_pool; struct queue_stat rx_stat; struct queue_missed_stat missed_stat; @@ -109,6 +108,7 @@ struct pmd_internals { bool phy_mac; bool infinite_rx; bool vlan_strip; + bool timestamp_offloading; }; struct pmd_process_private { @@ -336,10 +336,21 @@ eth_pcap_rx(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts) if (pcap_q->vlan_strip) rte_vlan_strip(mbuf); - uint64_t us = (uint64_t)header->ts.tv_sec * US_PER_S + header->ts.tv_usec; + if (pcap_q->timestamp_offloading) { + /* + * The use of tv_usec as nanoseconds is not a bug here. + * Interface is always created with nanosecond precision, and + * that is how pcap API bodged in nanoseconds support. + */ + uint64_t ns = (uint64_t)header->ts.tv_sec * NSEC_PER_SEC + + header->ts.tv_usec; + + *RTE_MBUF_DYNFIELD(mbuf, timestamp_dynfield_offset, + rte_mbuf_timestamp_t *) = ns; + + mbuf->ol_flags |= timestamp_rx_dynflag; + } - *RTE_MBUF_DYNFIELD(mbuf, timestamp_dynfield_offset, rte_mbuf_timestamp_t *) = us; - mbuf->ol_flags |= timestamp_rx_dynflag; mbuf->port = pcap_q->port_id; bufs[num_rx] = mbuf; num_rx++; @@ -359,14 +370,19 @@ eth_null_rx(void *queue __rte_unused, return 0; } -#define NSEC_PER_SEC 1000000000L - /* - * This function stores nanoseconds in `tv_usec` field of `struct timeval`, - * because `ts` goes directly to nanosecond-precision dump. + * Calculate current timestamp in nanoseconds by computing + * offset from starting time value. + * + * Note: it is not a bug that this code is putting nanosecond + * value into microsecond timeval field. The pcap API is old + * and nanoseconds were bodged on as an after thought. + * As long as the pcap stream is set to nanosecond precision + * it expects nanoseconds here. */ static inline void -calculate_timestamp(struct timeval *ts) { +calculate_timestamp(struct timeval *ts) +{ uint64_t cycles; struct timespec cur_time; @@ -404,8 +420,10 @@ eth_pcap_tx_dumper(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts) if (dumper == NULL || nb_pkts == 0) return 0; - /* writes the nb_pkts packets to the previously opened pcap file - * dumper */ + /* all packets in burst have same timestamp */ + calculate_timestamp(&header.ts); + + /* writes the nb_pkts packets to the previously opened pcap file dumper */ for (i = 0; i < nb_pkts; i++) { struct rte_mbuf *mbuf = bufs[i]; uint32_t len, caplen; @@ -418,9 +436,6 @@ eth_pcap_tx_dumper(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts) } len = caplen = rte_pktmbuf_pkt_len(mbuf); - - calculate_timestamp(&header.ts); - header.len = len; header.caplen = caplen; @@ -519,22 +534,62 @@ eth_pcap_tx(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts) * pcap_open_live wrapper function */ static inline int -open_iface_live(const char *iface, pcap_t **pcap) { - *pcap = pcap_open_live(iface, RTE_ETH_PCAP_SNAPLEN, - RTE_ETH_PCAP_PROMISC, RTE_ETH_PCAP_TIMEOUT, errbuf); +open_iface_live(const char *iface, pcap_t **pcap) +{ + pcap_t *pc; + int status; - if (*pcap == NULL) { - PMD_LOG(ERR, "Couldn't open %s: %s", iface, errbuf); - return -1; + pc = pcap_create(iface, errbuf); + if (pc == NULL) { + PMD_LOG(ERR, "Couldn't create %s: %s", iface, errbuf); + goto error; + } + + status = pcap_set_tstamp_precision(pc, PCAP_TSTAMP_PRECISION_NANO); + if (status != 0) { + PMD_LOG(ERR, "%s: Could not set to ns precision: %s", + iface, pcap_statustostr(status)); + goto error; + } + + status = pcap_set_immediate_mode(pc, 1); + if (status != 0) + PMD_LOG(WARNING, "%s: Could not set to immediate mode: %s", + iface, pcap_statustostr(status)); + + status = pcap_set_promisc(pc, 1); + if (status != 0) + PMD_LOG(WARNING, "%s: Could not set to promiscuous: %s", + iface, pcap_statustostr(status)); + + status = pcap_set_snaplen(pc, RTE_ETH_PCAP_SNAPSHOT_LEN); + if (status != 0) + PMD_LOG(WARNING, "%s: Could not set snapshot length: %s", + iface, pcap_statustostr(status)); + + status = pcap_activate(pc); + if (status < 0) { + char *cp = pcap_geterr(pc); + + if (status == PCAP_ERROR) + PMD_LOG(ERR, "%s: could not activate: %s", iface, cp); + else + PMD_LOG(ERR, "%s: %s (%s)", iface, pcap_statustostr(status), cp); + goto error; } - if (pcap_setnonblock(*pcap, 1, errbuf)) { + if (pcap_setnonblock(pc, 1, errbuf)) { PMD_LOG(ERR, "Couldn't set non-blocking on %s: %s", iface, errbuf); - pcap_close(*pcap); - return -1; + goto error; } + *pcap = pc; return 0; + +error: + if (pc != NULL) + pcap_close(pc); + return -1; } static int @@ -581,7 +636,8 @@ open_single_tx_pcap(const char *pcap_filename, pcap_dumper_t **dumper) static int open_single_rx_pcap(const char *pcap_filename, pcap_t **pcap) { - *pcap = pcap_open_offline(pcap_filename, errbuf); + *pcap = pcap_open_offline_with_tstamp_precision(pcap_filename, + PCAP_TSTAMP_PRECISION_NANO, errbuf); if (*pcap == NULL) { PMD_LOG(ERR, "Couldn't open %s: %s", pcap_filename, errbuf); @@ -740,6 +796,7 @@ eth_dev_configure(struct rte_eth_dev *dev) const struct rte_eth_rxmode *rxmode = &dev_conf->rxmode; internals->vlan_strip = !!(rxmode->offloads & RTE_ETH_RX_OFFLOAD_VLAN_STRIP); + internals->timestamp_offloading = !!(rxmode->offloads & RTE_ETH_RX_OFFLOAD_TIMESTAMP); return 0; } @@ -757,7 +814,8 @@ eth_dev_info(struct rte_eth_dev *dev, dev_info->min_rx_bufsize = 0; dev_info->tx_offload_capa = RTE_ETH_TX_OFFLOAD_MULTI_SEGS | RTE_ETH_TX_OFFLOAD_VLAN_INSERT; - dev_info->rx_offload_capa = RTE_ETH_RX_OFFLOAD_VLAN_STRIP; + dev_info->rx_offload_capa = RTE_ETH_RX_OFFLOAD_VLAN_STRIP | + RTE_ETH_RX_OFFLOAD_TIMESTAMP; return 0; } @@ -979,6 +1037,7 @@ eth_rx_queue_setup(struct rte_eth_dev *dev, pcap_q->queue_id = rx_queue_id; pcap_q->vlan_strip = internals->vlan_strip; dev->data->rx_queues[rx_queue_id] = pcap_q; + pcap_q->timestamp_offloading = internals->timestamp_offloading; if (internals->infinite_rx) { struct pmd_process_private *pp; @@ -1104,12 +1163,24 @@ eth_tx_queue_stop(struct rte_eth_dev *dev, uint16_t tx_queue_id) return 0; } +/* Timestamp values in receive packets from libpcap are in nanoseconds */ +static int +eth_dev_read_clock(struct rte_eth_dev *dev __rte_unused, uint64_t *timestamp) +{ + struct timespec cur_time; + + timespec_get(&cur_time, TIME_UTC); + *timestamp = rte_timespec_to_ns(&cur_time); + return 0; +} + static const struct eth_dev_ops ops = { .dev_start = eth_dev_start, .dev_stop = eth_dev_stop, .dev_close = eth_dev_close, .dev_configure = eth_dev_configure, .dev_infos_get = eth_dev_info, + .read_clock = eth_dev_read_clock, .rx_queue_setup = eth_rx_queue_setup, .tx_queue_setup = eth_tx_queue_setup, .tx_queue_release = eth_tx_queue_release, @@ -1524,15 +1595,17 @@ pmd_pcap_probe(struct rte_vdev_device *dev) name = rte_vdev_device_name(dev); PMD_LOG(INFO, "Initializing pmd_pcap for %s", name); - timespec_get(&start_time, TIME_UTC); - start_cycles = rte_get_timer_cycles(); - hz = rte_get_timer_hz(); - - ret = rte_mbuf_dyn_rx_timestamp_register(×tamp_dynfield_offset, - ×tamp_rx_dynflag); - if (ret != 0) { - PMD_LOG(ERR, "Failed to register Rx timestamp field/flag"); - return -1; + /* Record info for timestamps on first probe */ + if (hz == 0) { + ret = rte_mbuf_dyn_rx_timestamp_register(×tamp_dynfield_offset, + ×tamp_rx_dynflag); + if (ret != 0) { + PMD_LOG(ERR, "Failed to register Rx timestamp field/flag"); + return ret; + } + timespec_get(&start_time, TIME_UTC); + start_cycles = rte_get_timer_cycles(); + hz = rte_get_timer_hz(); } if (rte_eal_process_type() == RTE_PROC_SECONDARY) { -- 2.51.0