From: Justin Lai <justinlai0215@realtek.com>
To: Simon Horman <horms@kernel.org>
Cc: "kuba@kernel.org" <kuba@kernel.org>,
"davem@davemloft.net" <davem@davemloft.net>,
"edumazet@google.com" <edumazet@google.com>,
"pabeni@redhat.com" <pabeni@redhat.com>,
"andrew+netdev@lunn.ch" <andrew+netdev@lunn.ch>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
"stable@vger.kernel.org" <stable@vger.kernel.org>,
"richardcochran@gmail.com" <richardcochran@gmail.com>,
"aleksander.lobakin@intel.com" <aleksander.lobakin@intel.com>,
Ping-Ke Shih <pkshih@realtek.com>,
Larry Chiu <larry.chiu@realtek.com>
Subject: RE: [PATCH net v2] rtase: Workaround for IP fragmented UDP packet hardware bug
Date: Tue, 9 Jun 2026 08:20:10 +0000 [thread overview]
Message-ID: <1340406ab190498ab857ad9017529722@realtek.com> (raw)
In-Reply-To: <20260608190627.182292-2-horms@kernel.org>
Simon Horman <horms@kernel.org> wrote:
>
> From: 'Simon Horman' <horms@kernel.org>
>
> This is an AI-generated review of your patch. The human sending this email has
> considered the AI review valid, or at least plausible.
> Full review at: https://sashiko.dev
> ---
> rtase: Workaround for IP fragmented UDP packet hardware bug
>
> The hardware parser incorrectly interprets 319/320 in a short IP fragmented
> UDP packet payload as standard PTP destination ports and treats the fragment
> as a PTP packet for further parsing. Pad these packets so the transport data
> reaches RTASE_MIN_PAD_LEN before transmitting to avoid triggering the
> hardware issue.
>
> > diff --git a/drivers/net/ethernet/realtek/rtase/rtase_main.c
> > b/drivers/net/ethernet/realtek/rtase/rtase_main.c
> > index ef13109c49cff..b686b84e15fe2 100644
> > --- a/drivers/net/ethernet/realtek/rtase/rtase_main.c
> > +++ b/drivers/net/ethernet/realtek/rtase/rtase_main.c
>
> [ ... ]
>
> > @@ -1247,6 +1248,52 @@ static u32 rtase_tx_csum(struct sk_buff *skb,
> const struct net_device *dev)
> > return csum_cmd;
> > }
> >
> > +static bool rtase_skb_is_udp(struct sk_buff *skb) {
> > + int no = skb_network_offset(skb);
> > + struct ipv6hdr *i6h, _i6h;
> > + struct iphdr *ih, _ih;
> > +
> > + switch (vlan_get_protocol(skb)) {
> > + case htons(ETH_P_IP):
> > + ih = skb_header_pointer(skb, no, sizeof(_ih), &_ih);
> > + return ih && ih->protocol == IPPROTO_UDP;
> > + case htons(ETH_P_IPV6):
> > + i6h = skb_header_pointer(skb, no, sizeof(_i6h), &_i6h);
> > + return i6h && i6h->nexthdr == IPPROTO_UDP;
>
> [Severity: High]
> Does this check fail to traverse IPv6 extension headers?
>
> If an IPv6 packet contains extension headers, such as an IPv6 fragment
> (IPPROTO_FRAGMENT), the nexthdr field will contain the extension header's
> protocol number instead of UDP.
>
> Doesn't this bypass the workaround for the exact fragmented packets it is
> intended to fix?
>
Thanks for the review.
Unlike IPv4, IPv6 does not require extension header parsing for
this workaround. The hardware only treats IPv6 packets without
extension headers as PTP packets for further parsing.
Therefore, IPv6 packets carrying extension headers, including
fragments, do not enter this parsing path and are not affected
by the hardware issue addressed by this workaround.
Skipping such packets is intentional.
> > + default:
> > + return false;
> > + }
> > +}
> > +
> > +static bool rtase_skb_pad(struct sk_buff *skb) {
> > + u32 trans_data_len;
> > + u16 dest_port;
> > + u32 pad_len;
> > +
> > + if (!skb_transport_header_was_set(skb))
> > + return true;
>
> [Severity: High]
> Does this early return skip the workaround for forwarded (routed) IP
> fragments?
>
> The networking stack typically does not set the transport header offset for
> forwarded packets or non-first IP fragments.
>
> Could an attacker still trigger the hardware TX hang by routing short,
> fragmented UDP packets through this interface?
>
Thanks for pointing this out.
I agree that relying on skb_transport_header_was_set() is not
robust enough.
I will rework this logic to derive the required offset directly
from the packet contents using skb_header_pointer(), rather
than depending on skb_transport_header_was_set().
> > +
> > + trans_data_len = skb_tail_pointer(skb) -
> > + skb_transport_header(skb);
>
> [Severity: Critical]
> Does this calculation miss data stored in paged fragments?
>
> By only subtracting the transport header from the tail pointer, it treats the
> linear portion as the entire transport payload length.
>
> For non-linear packets, won't this incorrectly evaluate the payload as smaller
> than RTASE_MIN_PAD_LEN and trigger the padding path?
>
You're right that skb_tail_pointer() only covers the linear area
and can undercount non-linear skb.
skb->len includes both linear and paged fragments.
trans_data_len = skb->len - skb_transport_offset(skb);
> > + if (trans_data_len < offsetof(struct udphdr, len) ||
> > + trans_data_len >= RTASE_MIN_PAD_LEN)
> > + return true;
> > +
> > + if (!rtase_skb_is_udp(skb))
> > + return true;
> > +
> > + dest_port = ntohs(udp_hdr(skb)->dest);
> > +
> > + if (dest_port == PTP_EV_PORT || dest_port == PTP_GEN_PORT) {
> > + pad_len = RTASE_MIN_PAD_LEN - trans_data_len;
> > + if (__skb_put_padto(skb, skb->len + pad_len, false))
>
> [Severity: Critical]
> Will calling __skb_put_padto() with a length greater than skb->len cause a
> kernel panic on non-linear packets?
>
> Since __skb_put_padto() unconditionally invokes __skb_put() when the new
> length exceeds the current length, and __skb_put() contains
> SKB_LINEAR_ASSERT(skb) which triggers a BUG_ON for non-linear packets,
> won't this panic the kernel when processing a fragmented UDP packet?
>
Thanks for pointing this out.
Before calling __skb_put_padto(), I will check whether the skb is
non-linear. If so, I will first linearize it and then apply the
padding. This avoids calling __skb_put_padto() on a non-linear skb
while still ensuring that the workaround is applied to packets which
may trigger the hardware issue.
if (dest_port == PTP_EV_PORT || dest_port == PTP_GEN_PORT) {
pad_len = RTASE_MIN_PAD_LEN - trans_data_len;
if (skb_is_nonlinear(skb)) {
if (skb_linearize(skb))
return false;
}
if (__skb_put_padto(skb, skb->len + pad_len, false))
return false;
}
> > + return false;
> > + }
> > +
> > + return true;
> > +}
> > +
> > static int rtase_xmit_frags(struct rtase_ring *ring, struct sk_buff *skb,
> > u32 opts1, u32 opts2) { @@ -1360,6
> +1407,9
> > @@ static netdev_tx_t rtase_start_xmit(struct sk_buff *skb,
> > opts2 |= rtase_tx_csum(skb, dev);
> > }
> >
> > + if (!rtase_skb_pad(skb))
> > + goto err_dma_0;
> > +
> > frags = rtase_xmit_frags(ring, skb, opts1, opts2);
>
> [Severity: High]
> This is a pre-existing issue, but doesn't rtase_xmit_frags() crash on highmem
> pages?
>
> The driver advertises NETIF_F_HIGHDMA, but rtase_xmit_frags() attempts to
> map SKB fragments using skb_frag_address() and dma_map_single():
>
> addr = skb_frag_address(frag);
> mapping = dma_map_single(&tp->pdev->dev, addr, len, DMA_TO_DEVICE);
>
> Since skb_frag_address() returns an invalid pointer for HIGHMEM pages,
> shouldn't the driver be using skb_frag_dma_map() instead to prevent panics or
> memory corruption?
>
Thanks for pointing this out.
This appears to be a pre-existing issue and is unrelated to the change
in this patch. I will investigate whether skb_frag_dma_map() should be
used in rtase_xmit_frags() and address it separately if needed.
Thanks,
Justin
> > if (unlikely(frags < 0))
> > goto err_dma_0;
next prev parent reply other threads:[~2026-06-09 8:20 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-04 10:13 [PATCH net v2] rtase: Workaround for IP fragmented UDP packet hardware bug Justin Lai
2026-06-08 19:06 ` Simon Horman
2026-06-09 8:20 ` Justin Lai [this message]
2026-06-09 12:43 ` David Laight
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1340406ab190498ab857ad9017529722@realtek.com \
--to=justinlai0215@realtek.com \
--cc=aleksander.lobakin@intel.com \
--cc=andrew+netdev@lunn.ch \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=horms@kernel.org \
--cc=kuba@kernel.org \
--cc=larry.chiu@realtek.com \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=pkshih@realtek.com \
--cc=richardcochran@gmail.com \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox