From: Stephen Hemminger
To: dev@dpdk.org
Cc: Stephen Hemminger
Subject: [PATCH v14 10/18] net/pcap: support nanosecond timestamp precision
Date: Wed, 11 Feb 2026 13:09:43 -0800
Message-ID: <20260211222130.338797-11-stephen@networkplumber.org>
X-Mailer: git-send-email 2.51.0
In-Reply-To: <20260211222130.338797-1-stephen@networkplumber.org>
References: <20260106182823.192350-1-stephen@networkplumber.org>
 <20260211222130.338797-1-stephen@networkplumber.org>
List-Id: DPDK patches and discussions

Enable nanosecond-precision timestamps for both live capture and pcap
file reading.

Replace pcap_open_live() with the pcap_create()/pcap_activate() API,
which allows setting PCAP_TSTAMP_PRECISION_NANO before activation.
Similarly, use pcap_open_offline_with_tstamp_precision() for reading
pcap files.

The pcap_pkthdr timestamp field, despite being declared as struct
timeval, actually contains nanoseconds (not microseconds) when
nanosecond precision is requested.

Make receive timestamp offloading conditional: timestamps are now only
written to the mbuf dynamic field when RTE_ETH_RX_OFFLOAD_TIMESTAMP is
enabled. Previously, timestamps were unconditionally added to every
received packet.

Other related changes:
* Add read_clock dev_op returning current UTC time for timestamp
  correlation.
* Move per-burst timestamp calculation outside the packet loop in
  tx_dumper.
* Enable immediate mode and improve error reporting in live capture
  setup.

Signed-off-by: Stephen Hemminger
---
 doc/guides/nics/pcap.rst               |   3 +
 doc/guides/rel_notes/release_26_03.rst |   2 +
 drivers/net/pcap/pcap_ethdev.c         | 157 +++++++++++++++++++------
 3 files changed, 127 insertions(+), 35 deletions(-)

diff --git a/doc/guides/nics/pcap.rst b/doc/guides/nics/pcap.rst
index bed5006a42..2709c6d017 100644
--- a/doc/guides/nics/pcap.rst
+++ b/doc/guides/nics/pcap.rst
@@ -258,3 +258,6 @@ Features and Limitations
 * The PMD will transparently insert a VLAN tag to transmitted packets if
   ``RTE_ETH_TX_OFFLOAD_VLAN_INSERT`` is enabled and the mbuf has ``RTE_MBUF_F_TX_VLAN``
   set.
+
+* The PMD will insert the pcap header packet timestamp with nanoseconds resolution and
+  UNIX origin, i.e. time since 1-JAN-1970 UTC, if ``RTE_ETH_RX_OFFLOAD_TIMESTAMP`` is enabled.
diff --git a/doc/guides/rel_notes/release_26_03.rst b/doc/guides/rel_notes/release_26_03.rst
index 42ecba925e..5953995b77 100644
--- a/doc/guides/rel_notes/release_26_03.rst
+++ b/doc/guides/rel_notes/release_26_03.rst
@@ -86,6 +86,8 @@ New Features
 
   * Added support for VLAN insertion and stripping.
   * Added support for reporting link state and speed in ``iface`` mode.
+  * Receive timestamp offload is only done if offload flag set.
+  * Receive timestamps support nanosecond precision.

 Removed Items
diff --git a/drivers/net/pcap/pcap_ethdev.c b/drivers/net/pcap/pcap_ethdev.c
index 22d668163e..8e12f7cd0e 100644
--- a/drivers/net/pcap/pcap_ethdev.c
+++ b/drivers/net/pcap/pcap_ethdev.c
@@ -28,13 +28,11 @@
 #include
 #include
 #include
+#include

 #include "pcap_osdep.h"

 #define RTE_ETH_PCAP_SNAPSHOT_LEN 65535
-#define RTE_ETH_PCAP_SNAPLEN RTE_ETHER_MAX_JUMBO_FRAME_LEN
-#define RTE_ETH_PCAP_PROMISC 1
-#define RTE_ETH_PCAP_TIMEOUT -1

 #define ETH_PCAP_RX_PCAP_ARG "rx_pcap"
 #define ETH_PCAP_TX_PCAP_ARG "tx_pcap"
@@ -78,6 +76,7 @@ struct pcap_rx_queue {
 	uint16_t port_id;
 	uint16_t queue_id;
 	bool vlan_strip;
+	bool timestamp_offloading;
 	struct rte_mempool *mb_pool;
 	struct queue_stat rx_stat;
 	struct queue_missed_stat missed_stat;
@@ -109,6 +108,7 @@ struct pmd_internals {
 	bool phy_mac;
 	bool infinite_rx;
 	bool vlan_strip;
+	bool timestamp_offloading;
 };

 struct pmd_process_private {
@@ -270,6 +270,15 @@ eth_pcap_rx_infinite(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
 		if (pcap_q->vlan_strip)
 			rte_vlan_strip(bufs[i]);

+		if (pcap_q->timestamp_offloading) {
+			struct timespec ts;
+
+			timespec_get(&ts, TIME_UTC);
+			*RTE_MBUF_DYNFIELD(bufs[i], timestamp_dynfield_offset,
+				rte_mbuf_timestamp_t *) = rte_timespec_to_ns(&ts);
+			bufs[i]->ol_flags |= timestamp_rx_dynflag;
+		}
+
 		rx_bytes += bufs[i]->data_len;

 		/* Enqueue packet back on ring to allow infinite rx. */
@@ -340,10 +349,21 @@ eth_pcap_rx(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
 		if (pcap_q->vlan_strip)
 			rte_vlan_strip(mbuf);

-		uint64_t us = (uint64_t)header->ts.tv_sec * US_PER_S + header->ts.tv_usec;
+		if (pcap_q->timestamp_offloading) {
+			/*
+			 * The use of tv_usec as nanoseconds is not a bug here.
+			 * Interface is always created with nanosecond precision, and
+			 * that is how pcap API bodged in nanoseconds support.
+			 */
+			uint64_t ns = (uint64_t)header->ts.tv_sec * NSEC_PER_SEC
+				+ header->ts.tv_usec;
+
+			*RTE_MBUF_DYNFIELD(mbuf, timestamp_dynfield_offset,
+				rte_mbuf_timestamp_t *) = ns;
+
+			mbuf->ol_flags |= timestamp_rx_dynflag;
+		}

-		*RTE_MBUF_DYNFIELD(mbuf, timestamp_dynfield_offset, rte_mbuf_timestamp_t *) = us;
-		mbuf->ol_flags |= timestamp_rx_dynflag;
 		mbuf->port = pcap_q->port_id;
 		bufs[num_rx] = mbuf;
 		num_rx++;
@@ -363,14 +383,19 @@ eth_null_rx(void *queue __rte_unused,
 	return 0;
 }

-#define NSEC_PER_SEC 1000000000L
-
 /*
- * This function stores nanoseconds in `tv_usec` field of `struct timeval`,
- * because `ts` goes directly to nanosecond-precision dump.
+ * Calculate current timestamp in nanoseconds by computing
+ * offset from starting time value.
+ *
+ * Note: it is not a bug that this code is putting nanosecond
+ * value into microsecond timeval field. The pcap API is old
+ * and nanoseconds were bodged on as an after thought.
+ * As long as the pcap stream is set to nanosecond precision
+ * it expects nanoseconds here.
  */
 static inline void
-calculate_timestamp(struct timeval *ts) {
+calculate_timestamp(struct timeval *ts)
+{
 	uint64_t cycles;
 	struct timespec cur_time;

@@ -459,8 +484,10 @@ eth_pcap_tx_dumper(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
 	if (dumper == NULL || nb_pkts == 0)
 		return 0;

-	/* writes the nb_pkts packets to the previously opened pcap file
-	 * dumper */
+	/* all packets in burst have same timestamp */
+	calculate_timestamp(&header.ts);
+
+	/* writes the nb_pkts packets to the previously opened pcap file dumper */
 	for (i = 0; i < nb_pkts; i++) {
 		uint32_t len, caplen;
 		const uint8_t *data;
@@ -470,9 +497,6 @@ eth_pcap_tx_dumper(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
 		struct rte_mbuf *mbuf = bufs[i];

 		len = caplen = rte_pktmbuf_pkt_len(mbuf);
-
-		calculate_timestamp(&header.ts);
-
 		header.len = len;
 		header.caplen = caplen;

@@ -601,22 +625,62 @@ eth_pcap_tx(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
  * pcap_open_live wrapper function
  */
 static inline int
-open_iface_live(const char *iface, pcap_t **pcap) {
-	*pcap = pcap_open_live(iface, RTE_ETH_PCAP_SNAPLEN,
-			RTE_ETH_PCAP_PROMISC, RTE_ETH_PCAP_TIMEOUT, errbuf);
+open_iface_live(const char *iface, pcap_t **pcap)
+{
+	pcap_t *pc;
+	int status;

-	if (*pcap == NULL) {
-		PMD_LOG(ERR, "Couldn't open %s: %s", iface, errbuf);
-		return -1;
+	pc = pcap_create(iface, errbuf);
+	if (pc == NULL) {
+		PMD_LOG(ERR, "Couldn't create %s: %s", iface, errbuf);
+		goto error;
 	}

-	if (pcap_setnonblock(*pcap, 1, errbuf)) {
+	status = pcap_set_tstamp_precision(pc, PCAP_TSTAMP_PRECISION_NANO);
+	if (status != 0) {
+		PMD_LOG(ERR, "%s: Could not set to ns precision: %s",
+			iface, pcap_statustostr(status));
+		goto error;
+	}
+
+	status = pcap_set_immediate_mode(pc, 1);
+	if (status != 0)
+		PMD_LOG(WARNING, "%s: Could not set to immediate mode: %s",
+			iface, pcap_statustostr(status));
+
+	status = pcap_set_promisc(pc, 1);
+	if (status != 0)
+		PMD_LOG(WARNING, "%s: Could not set to promiscuous: %s",
+			iface, pcap_statustostr(status));
+
+	status = pcap_set_snaplen(pc, RTE_ETH_PCAP_SNAPSHOT_LEN);
+	if (status != 0)
+		PMD_LOG(WARNING, "%s: Could not set snapshot length: %s",
+			iface, pcap_statustostr(status));
+
+	status = pcap_activate(pc);
+	if (status < 0) {
+		char *cp = pcap_geterr(pc);
+
+		if (status == PCAP_ERROR)
+			PMD_LOG(ERR, "%s: could not activate: %s", iface, cp);
+		else
+			PMD_LOG(ERR, "%s: %s (%s)", iface, pcap_statustostr(status), cp);
+		goto error;
+	}
+
+	if (pcap_setnonblock(pc, 1, errbuf)) {
 		PMD_LOG(ERR, "Couldn't set non-blocking on %s: %s", iface, errbuf);
-		pcap_close(*pcap);
-		return -1;
+		goto error;
 	}

+	*pcap = pc;
 	return 0;
+
+error:
+	if (pc != NULL)
+		pcap_close(pc);
+	return -1;
 }

 static int
@@ -663,7 +727,8 @@ open_single_tx_pcap(const char *pcap_filename, pcap_dumper_t **dumper)
 static int
 open_single_rx_pcap(const char *pcap_filename, pcap_t **pcap)
 {
-	*pcap = pcap_open_offline(pcap_filename, errbuf);
+	*pcap = pcap_open_offline_with_tstamp_precision(pcap_filename,
+			PCAP_TSTAMP_PRECISION_NANO, errbuf);
 	if (*pcap == NULL) {
 		PMD_LOG(ERR, "Couldn't open %s: %s", pcap_filename,
 			errbuf);
@@ -822,6 +887,7 @@ eth_dev_configure(struct rte_eth_dev *dev)
 	const struct rte_eth_rxmode *rxmode = &dev_conf->rxmode;

 	internals->vlan_strip = !!(rxmode->offloads & RTE_ETH_RX_OFFLOAD_VLAN_STRIP);
+	internals->timestamp_offloading = !!(rxmode->offloads & RTE_ETH_RX_OFFLOAD_TIMESTAMP);
 	return 0;
 }

@@ -839,7 +905,8 @@ eth_dev_info(struct rte_eth_dev *dev,
 	dev_info->min_rx_bufsize = 0;
 	dev_info->tx_offload_capa = RTE_ETH_TX_OFFLOAD_MULTI_SEGS |
 		RTE_ETH_TX_OFFLOAD_VLAN_INSERT;
-	dev_info->rx_offload_capa = RTE_ETH_RX_OFFLOAD_VLAN_STRIP;
+	dev_info->rx_offload_capa = RTE_ETH_RX_OFFLOAD_VLAN_STRIP |
+		RTE_ETH_RX_OFFLOAD_TIMESTAMP;

 	return 0;
 }
@@ -1061,6 +1128,7 @@ eth_rx_queue_setup(struct rte_eth_dev *dev,
 	pcap_q->queue_id = rx_queue_id;
 	pcap_q->vlan_strip = internals->vlan_strip;
 	dev->data->rx_queues[rx_queue_id] = pcap_q;
+	pcap_q->timestamp_offloading = internals->timestamp_offloading;

 	if (internals->infinite_rx) {
 		struct pmd_process_private *pp;
@@ -1199,6 +1267,17 @@ eth_tx_queue_stop(struct rte_eth_dev *dev, uint16_t tx_queue_id)
 	return 0;
 }

+/* Timestamp values in receive packets from libpcap are in nanoseconds */
+static int
+eth_dev_read_clock(struct rte_eth_dev *dev __rte_unused, uint64_t *timestamp)
+{
+	struct timespec cur_time;
+
+	timespec_get(&cur_time, TIME_UTC);
+	*timestamp = rte_timespec_to_ns(&cur_time);
+	return 0;
+}
+
 static int
 eth_vlan_offload_set(struct rte_eth_dev *dev, int mask)
 {
@@ -1225,6 +1304,7 @@ static const struct eth_dev_ops ops = {
 	.dev_close = eth_dev_close,
 	.dev_configure = eth_dev_configure,
 	.dev_infos_get = eth_dev_info,
+	.read_clock = eth_dev_read_clock,
 	.rx_queue_setup = eth_rx_queue_setup,
 	.tx_queue_setup = eth_tx_queue_setup,
 	.tx_queue_release = eth_tx_queue_release,
@@ -1640,15 +1720,22 @@ pmd_pcap_probe(struct rte_vdev_device *dev)
 	name = rte_vdev_device_name(dev);
 	PMD_LOG(INFO, "Initializing pmd_pcap for %s", name);

-	timespec_get(&start_time, TIME_UTC);
-	start_cycles = rte_get_timer_cycles();
-	hz = rte_get_timer_hz();
+	/* Record info for timestamps on first probe */
+	if (hz == 0) {
+		hz = rte_get_timer_hz();
+		if (hz == 0) {
+			PMD_LOG(ERR, "Reported hz is zero!");
+			return -1;
+		}

-	ret = rte_mbuf_dyn_rx_timestamp_register(&timestamp_dynfield_offset,
-			&timestamp_rx_dynflag);
-	if (ret != 0) {
-		PMD_LOG(ERR, "Failed to register Rx timestamp field/flag");
-		return -1;
+		ret = rte_mbuf_dyn_rx_timestamp_register(&timestamp_dynfield_offset,
+				&timestamp_rx_dynflag);
+		if (ret != 0) {
+			PMD_LOG(ERR, "Failed to register Rx timestamp field/flag");
+			return ret;
+		}
+		timespec_get(&start_time, TIME_UTC);
+		start_cycles = rte_get_timer_cycles();
 	}

 	if (rte_eal_process_type() == RTE_PROC_SECONDARY) {
-- 
2.51.0