From: Jesper Dangaard Brouer <hawk@kernel.org>
To: Jakub Kicinski <kuba@kernel.org>, davem@davemloft.net
Cc: netdev@vger.kernel.org, edumazet@google.com, pabeni@redhat.com,
andrew+netdev@lunn.ch, horms@kernel.org, daniel@iogearbox.net,
john.fastabend@gmail.com, sdf@fomichev.me,
michael.chan@broadcom.com, anthony.l.nguyen@intel.com,
marcin.s.wojtas@gmail.com, tariqt@nvidia.com, mbloch@nvidia.com,
jasowang@redhat.com, bpf@vger.kernel.org,
aleksander.lobakin@intel.com, pavan.chebbi@broadcom.com,
przemyslaw.kitszel@intel.com
Subject: Re: [PATCH net-next 1/2] net: xdp: pass full flags to xdp_update_skb_shared_info()
Date: Tue, 9 Sep 2025 15:30:00 +0200 [thread overview]
Message-ID: <669d9245-ac4b-4d43-aea3-cc30ac5836a4@kernel.org> (raw)
In-Reply-To: <20250905221539.2930285-2-kuba@kernel.org>
On 06/09/2025 00.15, Jakub Kicinski wrote:
> xdp_update_skb_shared_info() needs to update skb state which
> was maintained in xdp_buff / frame. Pass full flags into it,
> instead of breaking it out bit by bit. We will need to add
> a bit for unreadable frags (even tho XDP doesn't support
> those the driver paths may be common), at which point almost
> all call sites would become:
>
> xdp_update_skb_shared_info(skb, num_frags,
> sinfo->xdp_frags_size,
> MY_PAGE_SIZE * num_frags,
> xdp_buff_is_frag_pfmemalloc(xdp),
> xdp_buff_is_frag_unreadable(xdp));
>
> Keep a helper for accessing the flags, in case we need to
> transform them somehow in the future (e.g. to cover up xdp_buff
> vs xdp_frame differences).
>
> While we are touching call callers - rename the helper to
> xdp_update_skb_frags_info(), previous name may have implied that
> it's shinfo that's updated. We are updating flags in struct sk_buff
> based on frags that got attched.
typo ^^^^^^^
>
I'm fine with the name xdp_update_skb_frags_info().
But I want to point out that the function *does* also update shinfo.
The xdp_buff/xdp-frame have a compatible layout for shinfo, except for a
union with sinfo->destructor_arg, which we need to clear. This is a
transition point from XDP to SKB, which is why I think the function name
change is appropiate.
> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
> ---
> v1:
> - rename skb_flags arg to xdp_flags
Thanks for that. You kept the function name xdp_buff_get_skb_flags(),
indicating this is "skb_flags". I don't think it matters much, so to
avoid bikesheeting I'm just going to ACK this.
Acked-by: Jesper Dangaard Brouer <hawk@kernel.org>
--Jesper
> - rename xdp_update_skb_shared_info() -> xdp_update_skb_frags_info()
> RFC: https://lore.kernel.org/20250812161528.835855-1-kuba@kernel.org
> ---
> include/net/xdp.h | 25 +++++++++----------
> drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c | 7 +++---
> drivers/net/ethernet/intel/i40e/i40e_txrx.c | 15 ++++++-----
> drivers/net/ethernet/intel/ice/ice_txrx.c | 15 ++++++-----
> drivers/net/ethernet/marvell/mvneta.c | 7 +++---
> .../net/ethernet/mellanox/mlx5/core/en_rx.c | 23 ++++++++---------
> drivers/net/virtio_net.c | 7 +++---
> net/core/xdp.c | 21 ++++++++--------
> 8 files changed, 56 insertions(+), 64 deletions(-)
>
> diff --git a/include/net/xdp.h b/include/net/xdp.h
> index b40f1f96cb11..57189fc21168 100644
> --- a/include/net/xdp.h
> +++ b/include/net/xdp.h
> @@ -104,17 +104,16 @@ static __always_inline void xdp_buff_clear_frags_flag(struct xdp_buff *xdp)
> xdp->flags &= ~XDP_FLAGS_HAS_FRAGS;
> }
>
> -static __always_inline bool
> -xdp_buff_is_frag_pfmemalloc(const struct xdp_buff *xdp)
> -{
> - return !!(xdp->flags & XDP_FLAGS_FRAGS_PF_MEMALLOC);
> -}
> -
> static __always_inline void xdp_buff_set_frag_pfmemalloc(struct xdp_buff *xdp)
> {
> xdp->flags |= XDP_FLAGS_FRAGS_PF_MEMALLOC;
> }
>
> +static __always_inline u32 xdp_buff_get_skb_flags(const struct xdp_buff *xdp)
> +{
> + return xdp->flags;
> +}
> +
> static __always_inline void
> xdp_init_buff(struct xdp_buff *xdp, u32 frame_sz, struct xdp_rxq_info *rxq)
> {
> @@ -272,10 +271,10 @@ static __always_inline bool xdp_frame_has_frags(const struct xdp_frame *frame)
> return !!(frame->flags & XDP_FLAGS_HAS_FRAGS);
> }
>
> -static __always_inline bool
> -xdp_frame_is_frag_pfmemalloc(const struct xdp_frame *frame)
> +static __always_inline u32
> +xdp_frame_get_skb_flags(const struct xdp_frame *frame)
> {
> - return !!(frame->flags & XDP_FLAGS_FRAGS_PF_MEMALLOC);
> + return frame->flags;
> }
>
> #define XDP_BULK_QUEUE_SIZE 16
> @@ -312,9 +311,9 @@ static inline void xdp_scrub_frame(struct xdp_frame *frame)
> }
>
> static inline void
> -xdp_update_skb_shared_info(struct sk_buff *skb, u8 nr_frags,
> - unsigned int size, unsigned int truesize,
> - bool pfmemalloc)
> +xdp_update_skb_frags_info(struct sk_buff *skb, u8 nr_frags,
> + unsigned int size, unsigned int truesize,
> + u32 xdp_flags)
> {
> struct skb_shared_info *sinfo = skb_shinfo(skb);
>
> @@ -328,7 +327,7 @@ xdp_update_skb_shared_info(struct sk_buff *skb, u8 nr_frags,
> skb->len += size;
> skb->data_len += size;
> skb->truesize += truesize;
> - skb->pfmemalloc |= pfmemalloc;
> + skb->pfmemalloc |= !!(xdp_flags & XDP_FLAGS_FRAGS_PF_MEMALLOC);
> }
>
> /* Avoids inlining WARN macro in fast-path */
> diff --git a/drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c b/drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c
> index 58d579dca3f1..3e77a96e5a3e 100644
> --- a/drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c
> +++ b/drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c
> @@ -468,9 +468,8 @@ bnxt_xdp_build_skb(struct bnxt *bp, struct sk_buff *skb, u8 num_frags,
> if (!skb)
> return NULL;
>
> - xdp_update_skb_shared_info(skb, num_frags,
> - sinfo->xdp_frags_size,
> - BNXT_RX_PAGE_SIZE * num_frags,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, num_frags, sinfo->xdp_frags_size,
> + BNXT_RX_PAGE_SIZE * num_frags,
> + xdp_buff_get_skb_flags(xdp));
> return skb;
> }
> diff --git a/drivers/net/ethernet/intel/i40e/i40e_txrx.c b/drivers/net/ethernet/intel/i40e/i40e_txrx.c
> index 048c33039130..98601c62c592 100644
> --- a/drivers/net/ethernet/intel/i40e/i40e_txrx.c
> +++ b/drivers/net/ethernet/intel/i40e/i40e_txrx.c
> @@ -2151,10 +2151,10 @@ static struct sk_buff *i40e_construct_skb(struct i40e_ring *rx_ring,
> memcpy(&skinfo->frags[skinfo->nr_frags], &sinfo->frags[0],
> sizeof(skb_frag_t) * nr_frags);
>
> - xdp_update_skb_shared_info(skb, skinfo->nr_frags + nr_frags,
> - sinfo->xdp_frags_size,
> - nr_frags * xdp->frame_sz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, skinfo->nr_frags + nr_frags,
> + sinfo->xdp_frags_size,
> + nr_frags * xdp->frame_sz,
> + xdp_buff_get_skb_flags(xdp));
>
> /* First buffer has already been processed, so bump ntc */
> if (++rx_ring->next_to_clean == rx_ring->count)
> @@ -2206,10 +2206,9 @@ static struct sk_buff *i40e_build_skb(struct i40e_ring *rx_ring,
> skb_metadata_set(skb, metasize);
>
> if (unlikely(xdp_buff_has_frags(xdp))) {
> - xdp_update_skb_shared_info(skb, nr_frags,
> - sinfo->xdp_frags_size,
> - nr_frags * xdp->frame_sz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, nr_frags, sinfo->xdp_frags_size,
> + nr_frags * xdp->frame_sz,
> + xdp_buff_get_skb_flags(xdp));
>
> i40e_process_rx_buffs(rx_ring, I40E_XDP_PASS, xdp);
> } else {
> diff --git a/drivers/net/ethernet/intel/ice/ice_txrx.c b/drivers/net/ethernet/intel/ice/ice_txrx.c
> index d2871757ec94..107632a71f3c 100644
> --- a/drivers/net/ethernet/intel/ice/ice_txrx.c
> +++ b/drivers/net/ethernet/intel/ice/ice_txrx.c
> @@ -1035,10 +1035,9 @@ ice_build_skb(struct ice_rx_ring *rx_ring, struct xdp_buff *xdp)
> skb_metadata_set(skb, metasize);
>
> if (unlikely(xdp_buff_has_frags(xdp)))
> - xdp_update_skb_shared_info(skb, nr_frags,
> - sinfo->xdp_frags_size,
> - nr_frags * xdp->frame_sz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, nr_frags, sinfo->xdp_frags_size,
> + nr_frags * xdp->frame_sz,
> + xdp_buff_get_skb_flags(xdp));
>
> return skb;
> }
> @@ -1115,10 +1114,10 @@ ice_construct_skb(struct ice_rx_ring *rx_ring, struct xdp_buff *xdp)
> memcpy(&skinfo->frags[skinfo->nr_frags], &sinfo->frags[0],
> sizeof(skb_frag_t) * nr_frags);
>
> - xdp_update_skb_shared_info(skb, skinfo->nr_frags + nr_frags,
> - sinfo->xdp_frags_size,
> - nr_frags * xdp->frame_sz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, skinfo->nr_frags + nr_frags,
> + sinfo->xdp_frags_size,
> + nr_frags * xdp->frame_sz,
> + xdp_buff_get_skb_flags(xdp));
> }
>
> return skb;
> diff --git a/drivers/net/ethernet/marvell/mvneta.c b/drivers/net/ethernet/marvell/mvneta.c
> index 476e73e502fe..7351e98d73f4 100644
> --- a/drivers/net/ethernet/marvell/mvneta.c
> +++ b/drivers/net/ethernet/marvell/mvneta.c
> @@ -2416,10 +2416,9 @@ mvneta_swbm_build_skb(struct mvneta_port *pp, struct page_pool *pool,
> skb->ip_summed = mvneta_rx_csum(pp, desc_status);
>
> if (unlikely(xdp_buff_has_frags(xdp)))
> - xdp_update_skb_shared_info(skb, num_frags,
> - sinfo->xdp_frags_size,
> - num_frags * xdp->frame_sz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, num_frags, sinfo->xdp_frags_size,
> + num_frags * xdp->frame_sz,
> + xdp_buff_get_skb_flags(xdp));
>
> return skb;
> }
> diff --git a/drivers/net/ethernet/mellanox/mlx5/core/en_rx.c b/drivers/net/ethernet/mellanox/mlx5/core/en_rx.c
> index b8c609d91d11..2925ece136c4 100644
> --- a/drivers/net/ethernet/mellanox/mlx5/core/en_rx.c
> +++ b/drivers/net/ethernet/mellanox/mlx5/core/en_rx.c
> @@ -1796,10 +1796,9 @@ mlx5e_skb_from_cqe_nonlinear(struct mlx5e_rq *rq, struct mlx5e_wqe_frag_info *wi
>
> if (xdp_buff_has_frags(&mxbuf->xdp)) {
> /* sinfo->nr_frags is reset by build_skb, calculate again. */
> - xdp_update_skb_shared_info(skb, wi - head_wi - 1,
> - sinfo->xdp_frags_size, truesize,
> - xdp_buff_is_frag_pfmemalloc(
> - &mxbuf->xdp));
> + xdp_update_skb_frags_info(skb, wi - head_wi - 1,
> + sinfo->xdp_frags_size, truesize,
> + xdp_buff_get_skb_flags(&mxbuf->xdp));
>
> for (struct mlx5e_wqe_frag_info *pwi = head_wi + 1; pwi < wi; pwi++)
> pwi->frag_page->frags++;
> @@ -2105,10 +2104,10 @@ mlx5e_skb_from_cqe_mpwrq_nonlinear(struct mlx5e_rq *rq, struct mlx5e_mpw_info *w
> struct mlx5e_frag_page *pagep;
>
> /* sinfo->nr_frags is reset by build_skb, calculate again. */
> - xdp_update_skb_shared_info(skb, frag_page - head_page,
> - sinfo->xdp_frags_size, truesize,
> - xdp_buff_is_frag_pfmemalloc(
> - &mxbuf->xdp));
> + xdp_update_skb_frags_info(skb, frag_page - head_page,
> + sinfo->xdp_frags_size,
> + truesize,
> + xdp_buff_get_skb_flags(&mxbuf->xdp));
>
> pagep = head_page;
> do
> @@ -2122,10 +2121,10 @@ mlx5e_skb_from_cqe_mpwrq_nonlinear(struct mlx5e_rq *rq, struct mlx5e_mpw_info *w
> if (xdp_buff_has_frags(&mxbuf->xdp)) {
> struct mlx5e_frag_page *pagep;
>
> - xdp_update_skb_shared_info(skb, sinfo->nr_frags,
> - sinfo->xdp_frags_size, truesize,
> - xdp_buff_is_frag_pfmemalloc(
> - &mxbuf->xdp));
> + xdp_update_skb_frags_info(skb, sinfo->nr_frags,
> + sinfo->xdp_frags_size,
> + truesize,
> + xdp_buff_get_skb_flags(&mxbuf->xdp));
>
> pagep = frag_page - sinfo->nr_frags;
> do
> diff --git a/drivers/net/virtio_net.c b/drivers/net/virtio_net.c
> index 975bdc5dab84..06708c9a979e 100644
> --- a/drivers/net/virtio_net.c
> +++ b/drivers/net/virtio_net.c
> @@ -2185,10 +2185,9 @@ static struct sk_buff *build_skb_from_xdp_buff(struct net_device *dev,
> skb_metadata_set(skb, metasize);
>
> if (unlikely(xdp_buff_has_frags(xdp)))
> - xdp_update_skb_shared_info(skb, nr_frags,
> - sinfo->xdp_frags_size,
> - xdp_frags_truesz,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, nr_frags, sinfo->xdp_frags_size,
> + xdp_frags_truesz,
> + xdp_buff_get_skb_flags(xdp));
>
> return skb;
> }
> diff --git a/net/core/xdp.c b/net/core/xdp.c
> index 491334b9b8be..9100e160113a 100644
> --- a/net/core/xdp.c
> +++ b/net/core/xdp.c
> @@ -663,9 +663,8 @@ struct sk_buff *xdp_build_skb_from_buff(const struct xdp_buff *xdp)
> u32 tsize;
>
> tsize = sinfo->xdp_frags_truesize ? : nr_frags * xdp->frame_sz;
> - xdp_update_skb_shared_info(skb, nr_frags,
> - sinfo->xdp_frags_size, tsize,
> - xdp_buff_is_frag_pfmemalloc(xdp));
> + xdp_update_skb_frags_info(skb, nr_frags, sinfo->xdp_frags_size,
> + tsize, xdp_buff_get_skb_flags(xdp));
> }
>
> skb->protocol = eth_type_trans(skb, rxq->dev);
> @@ -692,7 +691,7 @@ static noinline bool xdp_copy_frags_from_zc(struct sk_buff *skb,
> struct skb_shared_info *sinfo = skb_shinfo(skb);
> const struct skb_shared_info *xinfo;
> u32 nr_frags, tsize = 0;
> - bool pfmemalloc = false;
> + u32 flags = 0;
>
> xinfo = xdp_get_shared_info_from_buff(xdp);
> nr_frags = xinfo->nr_frags;
> @@ -714,11 +713,12 @@ static noinline bool xdp_copy_frags_from_zc(struct sk_buff *skb,
> __skb_fill_page_desc_noacc(sinfo, i, page, offset, len);
>
> tsize += truesize;
> - pfmemalloc |= page_is_pfmemalloc(page);
> + if (page_is_pfmemalloc(page))
> + flags |= XDP_FLAGS_FRAGS_PF_MEMALLOC;
> }
>
> - xdp_update_skb_shared_info(skb, nr_frags, xinfo->xdp_frags_size,
> - tsize, pfmemalloc);
> + xdp_update_skb_frags_info(skb, nr_frags, xinfo->xdp_frags_size, tsize,
> + flags);
>
> return true;
> }
> @@ -823,10 +823,9 @@ struct sk_buff *__xdp_build_skb_from_frame(struct xdp_frame *xdpf,
> skb_metadata_set(skb, xdpf->metasize);
>
> if (unlikely(xdp_frame_has_frags(xdpf)))
> - xdp_update_skb_shared_info(skb, nr_frags,
> - sinfo->xdp_frags_size,
> - nr_frags * xdpf->frame_sz,
> - xdp_frame_is_frag_pfmemalloc(xdpf));
> + xdp_update_skb_frags_info(skb, nr_frags, sinfo->xdp_frags_size,
> + nr_frags * xdpf->frame_sz,
> + xdp_frame_get_skb_flags(xdpf));
>
> /* Essential SKB info: protocol and skb->dev */
> skb->protocol = eth_type_trans(skb, dev);
next prev parent reply other threads:[~2025-09-09 13:30 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-09-05 22:15 [PATCH net-next 0/2] net: xdp: handle frags with unreadable memory Jakub Kicinski
2025-09-05 22:15 ` [PATCH net-next 1/2] net: xdp: pass full flags to xdp_update_skb_shared_info() Jakub Kicinski
2025-09-09 13:30 ` Jesper Dangaard Brouer [this message]
2025-09-09 23:24 ` Jakub Kicinski
2025-09-05 22:15 ` [PATCH net-next 2/2] net: xdp: handle frags with unreadable memory Jakub Kicinski
2025-09-05 22:55 ` [PATCH net-next 0/2] " Stanislav Fomichev
2025-09-08 12:02 ` Alexander Lobakin
2025-09-11 10:10 ` patchwork-bot+netdevbpf
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=669d9245-ac4b-4d43-aea3-cc30ac5836a4@kernel.org \
--to=hawk@kernel.org \
--cc=aleksander.lobakin@intel.com \
--cc=andrew+netdev@lunn.ch \
--cc=anthony.l.nguyen@intel.com \
--cc=bpf@vger.kernel.org \
--cc=daniel@iogearbox.net \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=horms@kernel.org \
--cc=jasowang@redhat.com \
--cc=john.fastabend@gmail.com \
--cc=kuba@kernel.org \
--cc=marcin.s.wojtas@gmail.com \
--cc=mbloch@nvidia.com \
--cc=michael.chan@broadcom.com \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=pavan.chebbi@broadcom.com \
--cc=przemyslaw.kitszel@intel.com \
--cc=sdf@fomichev.me \
--cc=tariqt@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox