public inbox for dev@dpdk.org
* [PATCH] net/mana: fix CQE suppression handling on error completions
@ 2026-03-25 18:38 Long Li
  2026-03-25 18:54 ` Stephen Hemminger
  0 siblings, 1 reply; 3+ messages in thread
From: Long Li @ 2026-03-25 18:38 UTC
  To: dev; +Cc: longli, weh, stephen, stable

On error CQEs (e.g. SA_DROP), the hardware generates one CQE per WQE
regardless of the suppression flag. The previous code honored the
suppress_tx_cqe flag unconditionally, which caused it to skip reading
error CQEs and misalign the CQ consumer index.

This misalignment causes subsequent completions to be misinterpreted:
valid CQEs are read at wrong offsets, leading to spurious error
counts, NULL packet frees, and potential use-after-free of mbufs
that were already completed.

Check the CQE type before honoring suppression: only skip CQE reading
when the completion is CQE_TX_OKAY.

Fixes: cce2c9df44 ("net/mana: suppress Tx CQE generation whenever possible")
Cc: stable@dpdk.org

Signed-off-by: Long Li <longli@microsoft.com>
---
 drivers/net/mana/tx.c | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)
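
Illustration (not part of the patch): the misalignment described in the
commit message can be reproduced with a small standalone model. This is
only a sketch under stated assumptions. The names tx_desc, cqe_kind and
retire_descs are hypothetical stand-ins, not the driver's real types, and
the loop is reduced to the index bookkeeping alone (no mbuf handling, no
statistics).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for the driver's types; not the real mana structs. */
enum cqe_kind { CQE_KIND_OKAY, CQE_KIND_ERROR };

struct tx_desc {
	bool suppress_cqe; /* true: no CQE was requested for this WQE */
};

/*
 * Walk the descriptor ring against the CQEs that were polled and return how
 * many descriptors get retired. check_type=false models the old test
 * (suppression flag only); check_type=true models the patched test.
 */
static size_t
retire_descs(const struct tx_desc *ring, size_t ndesc,
	     const enum cqe_kind *cq, size_t ncqe, bool check_type)
{
	size_t d = 0; /* next descriptor to retire */
	size_t i = 0; /* current CQE */

	assert(ring != NULL && cq != NULL);

	while (i < ncqe && d < ndesc) {
		const struct tx_desc *cur = &ring[d++];

		/* Stay on the same CQE only if it really covers more WQEs. */
		if (cur->suppress_cqe &&
		    (!check_type || cq[i] == CQE_KIND_OKAY))
			continue;

		i++;
	}
	return d;
}
```

Take eight descriptors flagged suppress, suppress, suppress, no-suppress,
repeated twice, where only the first four WQEs have completed and all four
failed, so the hardware posted four error CQEs. With check_type=false the
model retires all eight descriptors, including four that have not completed
yet. With check_type=true it retires exactly four. For a single successful
CQE covering four descriptors, both variants retire four.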

diff --git a/drivers/net/mana/tx.c b/drivers/net/mana/tx.c
index 40931ac027..e5ab566e8a 100644
--- a/drivers/net/mana/tx.c
+++ b/drivers/net/mana/tx.c
@@ -228,9 +228,11 @@ mana_tx_burst(void *dpdk_txq, struct rte_mbuf **tx_pkts, uint16_t nb_pkts)
 		txq->gdma_sq.tail += desc->wqe_size_in_bu;
 
 		/* If TX CQE suppression is used, don't read more CQE but move
-		 * on to the next packet
+		 * on to the next packet. On error CQEs, HW generates one CQE
+		 * per WQE regardless of suppression, so always advance.
 		 */
-		if (desc->suppress_tx_cqe)
+		if (desc->suppress_tx_cqe &&
+		    oob->cqe_hdr.cqe_type == CQE_TX_OKAY)
 			continue;
 
 		i++;
-- 
2.43.0



* Re: [PATCH] net/mana: fix CQE suppression handling on error completions
  2026-03-25 18:38 [PATCH] net/mana: fix CQE suppression handling on error completions Long Li
@ 2026-03-25 18:54 ` Stephen Hemminger
  2026-03-25 18:56   ` [EXTERNAL] " Long Li
  0 siblings, 1 reply; 3+ messages in thread
From: Stephen Hemminger @ 2026-03-25 18:54 UTC
  To: Long Li; +Cc: dev, weh, stable

On Wed, 25 Mar 2026 11:38:36 -0700
Long Li <longli@microsoft.com> wrote:

> On error CQEs (e.g. SA_DROP), the hardware generates one CQE per WQE
> regardless of the suppression flag. The previous code honored the
> suppress_tx_cqe flag unconditionally, which caused it to skip reading
> error CQEs and misalign the CQ consumer index.
> 
> This misalignment causes subsequent completions to be misinterpreted:
> valid CQEs are read at wrong offsets, leading to spurious error
> counts, NULL packet frees, and potential use-after-free of mbufs
> that were already completed.
> 
> Check the CQE type before honoring suppression: only skip CQE reading
> when the completion is CQE_TX_OKAY.
> 
> Fixes: cce2c9df44 ("net/mana: suppress Tx CQE generation whenever possible")
> Cc: stable@dpdk.org
> 
> Signed-off-by: Long Li <longli@microsoft.com>

Applied, but I had to use a longer SHA in the Fixes tag to meet current guidelines.

Fixes: cce2c9df4481 ("net/mana: suppress Tx CQE generation whenever possible")


* RE: [EXTERNAL] Re: [PATCH] net/mana: fix CQE suppression handling on error completions
  2026-03-25 18:54 ` Stephen Hemminger
@ 2026-03-25 18:56   ` Long Li
  0 siblings, 0 replies; 3+ messages in thread
From: Long Li @ 2026-03-25 18:56 UTC
  To: Stephen Hemminger; +Cc: dev@dpdk.org, Wei Hu, stable@dpdk.org

> Subject: [EXTERNAL] Re: [PATCH] net/mana: fix CQE suppression handling on
> error completions
> 
> On Wed, 25 Mar 2026 11:38:36 -0700
> Long Li <longli@microsoft.com> wrote:
> 
> > On error CQEs (e.g. SA_DROP), the hardware generates one CQE per WQE
> > regardless of the suppression flag. The previous code honored the
> > suppress_tx_cqe flag unconditionally, which caused it to skip reading
> > error CQEs and misalign the CQ consumer index.
> >
> > This misalignment causes subsequent completions to be misinterpreted:
> > valid CQEs are read at wrong offsets, leading to spurious error
> > counts, NULL packet frees, and potential use-after-free of mbufs that
> > were already completed.
> >
> > Check the CQE type before honoring suppression: only skip CQE reading
> > when the completion is CQE_TX_OKAY.
> >
> > Fixes: cce2c9df44 ("net/mana: suppress Tx CQE generation whenever
> > possible")
> > Cc: stable@dpdk.org
> >
> > Signed-off-by: Long Li <longli@microsoft.com>
> 
> Applied but needed to use longer SHA for Fixes to meet current guidelines.
> 
> Fixes: cce2c9df4481 ("net/mana: suppress Tx CQE generation whenever
> possible")

Thank you!

Long

