public inbox for bpf@vger.kernel.org
 help / color / mirror / Atom feed
From: sashiko-bot@kernel.org
To: "Jason Xing" <kerneljasonxing@gmail.com>
Cc: bpf@vger.kernel.org
Subject: Re: [PATCH RFC net-next v4 07/14] xsk: try to skip validating skb list in xmit path
Date: Wed, 15 Apr 2026 09:33:00 +0000	[thread overview]
Message-ID: <20260415093301.799F2C19424@smtp.kernel.org> (raw)
In-Reply-To: <20260415082654.21026-8-kerneljasonxing@gmail.com>

Sashiko AI review found 2 potential issue(s):
- [Critical] Bypassing `validate_xmit_skb_list()` based on `NETIF_F_SG` skips critical hardware capability checks, software fallbacks, and SKB initialization, leading to kernel panics and corrupted packets.
- [High] Breaking out of the transmit loop early leaves packets stuck in the TX ring indefinitely because the hardware doorbell is never flushed.
--

commit 7603ac608a36b8d39e026a60b567ea2b106617f0
Author: Jason Xing <kernelxing@tencent.com>

xsk: try to skip validating skb list in xmit path

This patch attempts to optimize the XSK transmit path by skipping
validate_xmit_skb_list() if the device supports Scatter-Gather,
aiming to improve performance for drivers like ixgbe.

> diff --git a/net/core/dev.c b/net/core/dev.c
> index 99188353abec6..12643cbab6530 100644
> --- a/net/core/dev.c
> +++ b/net/core/dev.c
[ ... ]
> @@ -4878,15 +4879,17 @@ int xsk_direct_xmit_batch(struct xdp_sock *xs, struct net_device *dev)
>  	local_bh_disable();
>  	HARD_TX_LOCK(dev, txq, smp_processor_id());
>  	while ((skb = __skb_dequeue(send_queue)) != NULL) {
> -		struct sk_buff *orig_skb = skb;
> -		bool again = false;
> -
> -		skb = validate_xmit_skb_list(skb, dev, &again);
> -		if (skb != orig_skb) {
> -			dev_core_stats_tx_dropped_inc(dev);
> -			kfree_skb_list(skb);
> -			ret = NET_XMIT_DROP;
> -			break;
> +		if (unlikely(need_validate)) {
> +			struct sk_buff *orig_skb = skb;
> +			bool again = false;
> +
> +			skb = validate_xmit_skb_list(skb, dev, &again);

Does skipping validate_xmit_skb_list() based on NETIF_F_SG bypass necessary
hardware capability checks and software fallbacks?

validate_xmit_skb() performs dynamic feature verification via
ndo_features_check(), transport header initialization via
skb_set_transport_header(), and software checksum fallbacks via
skb_csum_hwoffload_help().

AF_XDP users can request TX checksum offload, setting skb->ip_summed to
CHECKSUM_PARTIAL. By skipping validation solely because the device supports
Scatter-Gather, packets bypass skb_set_transport_header(), leaving
skb->transport_header uninitialized.

Could this cause drivers that use skb_transport_offset() to compute massive
sizes, corrupting TX descriptors and potentially hanging the NIC?

Additionally, if the device lacks NETIF_F_HW_CSUM, skipping
skb_csum_hwoffload_help() sends CHECKSUM_PARTIAL to drivers unable to handle
it. Might this result in silent data corruption on the wire or driver crashes?

> +			if (skb != orig_skb) {
> +				dev_core_stats_tx_dropped_inc(dev);
> +				kfree_skb_list(skb);
> +				ret = NET_XMIT_DROP;
> +				break;

If this loop breaks early, does it leave packets stuck in the TX ring?

While this behavior was introduced earlier in the series and not by this
specific patch, the issue is present in this modified block.

The batching logic calls netdev_start_xmit() with more = true for all packets
except the last one in the queue.

If the loop breaks early here due to a validate_xmit_skb_list() failure,
the function returns without ever calling netdev_start_xmit() with
more = false.

Will this cause the driver to defer ringing the hardware TX doorbell,
leaving previously submitted packets stranded in the TX ring and causing
latency spikes or TX watchdog timeouts?

> +			}
>  		}
>  
>  		if (netif_xmit_frozen_or_drv_stopped(txq)) {

-- 
Sashiko AI review · https://sashiko.dev/#/patchset/20260415082654.21026-1-kerneljasonxing@gmail.com?part=7

  reply	other threads:[~2026-04-15  9:33 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-15  8:26 [PATCH RFC net-next v4 00/14] xsk: batch xmit in copy mode Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 01/14] xsk: introduce XDP_GENERIC_XMIT_BATCH setsockopt Jason Xing
2026-04-15  8:51   ` sashiko-bot
2026-04-15 12:46     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 02/14] xsk: extend xsk_build_skb() to support passing an already allocated skb Jason Xing
2026-04-15  8:52   ` sashiko-bot
2026-04-15 13:19     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 03/14] xsk: add xsk_alloc_batch_skb() to build skbs in batch Jason Xing
2026-04-15  9:17   ` sashiko-bot
2026-04-16  1:18     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 04/14] xsk: cache data buffers to avoid frequently calling kmalloc_reserve Jason Xing
2026-04-15  9:38   ` sashiko-bot
2026-04-16  2:45     ` Jason Xing
2026-04-16 12:18       ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 05/14] xsk: add direct xmit in batch function Jason Xing
2026-04-15  9:11   ` sashiko-bot
2026-04-16  3:04     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 06/14] xsk: support dynamic xmit.more control for batch xmit Jason Xing
2026-04-15  9:35   ` sashiko-bot
2026-04-16  3:43     ` Jason Xing
2026-04-16  4:50       ` Dmitry Torokhov
2026-04-16  4:51         ` Dmitry Torokhov
2026-04-15  8:26 ` [PATCH RFC net-next v4 07/14] xsk: try to skip validating skb list in xmit path Jason Xing
2026-04-15  9:33   ` sashiko-bot [this message]
2026-04-16  5:55     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 08/14] xsk: rename nb_pkts to nb_descs in xsk_tx_peek_release_desc_batch Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 09/14] xsk: extend xskq_cons_read_desc_batch to count nb_pkts Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 10/14] xsk: extend xsk_cq_reserve_locked() to reserve n slots Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 11/14] xsk: support batch xmit main logic Jason Xing
2026-04-15  9:38   ` sashiko-bot
2026-04-16  9:58     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 12/14] xsk: separate read-mostly and write-heavy fields in xsk_buff_pool Jason Xing
2026-04-15  9:20   ` sashiko-bot
2026-04-16 10:09     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 13/14] xsk: retire old xmit path in copy mode Jason Xing
2026-04-15  9:18   ` sashiko-bot
2026-04-16 10:33     ` Jason Xing
2026-04-15  8:26 ` [PATCH RFC net-next v4 14/14] xsk: optimize xsk_build_skb for batch copy-mode fast path Jason Xing
2026-04-15  9:47   ` sashiko-bot
2026-04-16 13:12     ` Jason Xing

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260415093301.799F2C19424@smtp.kernel.org \
    --to=sashiko-bot@kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=kerneljasonxing@gmail.com \
    --cc=sashiko@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox