From: Maxime Chevallier <maxime.chevallier@bootlin.com>
To: "Russell King (Oracle)" <rmk+kernel@armlinux.org.uk>,
Andrew Lunn <andrew@lunn.ch>,
Heiner Kallweit <hkallweit1@gmail.com>
Cc: Alexandre Torgue <alexandre.torgue@foss.st.com>,
Andrew Lunn <andrew+netdev@lunn.ch>,
"David S. Miller" <davem@davemloft.net>,
Eric Dumazet <edumazet@google.com>,
Jakub Kicinski <kuba@kernel.org>,
linux-arm-kernel@lists.infradead.org,
linux-stm32@st-md-mailman.stormreply.com,
Maxime Coquelin <mcoquelin.stm32@gmail.com>,
netdev@vger.kernel.org, Paolo Abeni <pabeni@redhat.com>
Subject: Re: [PATCH net-next] net: stmmac: fix dwmac4 transmit performance regression
Date: Fri, 16 Jan 2026 08:42:19 +0100 [thread overview]
Message-ID: <ab2d7cc9-e7d9-47fb-95ad-90ae4f5f1f67@bootlin.com> (raw)
In-Reply-To: <E1vgY1k-00000003vOC-0Z1H@rmk-PC.armlinux.org.uk>
Hi Russell,
On 16/01/2026 01:49, Russell King (Oracle) wrote:
> dwmac4's transmit performance dropped by a factor of four due to an
> incorrect assumption about which definitions are for what. This
> highlights the need for sane register macros.
>
> Commit 8409495bf6c9 ("net: stmmac: cores: remove many xxx_SHIFT
> definitions") changed the way the txpbl value is merged into the
> register:
>
> value = readl(ioaddr + DMA_CHAN_TX_CONTROL(dwmac4_addrs, chan));
> - value = value | (txpbl << DMA_BUS_MODE_PBL_SHIFT);
> + value = value | FIELD_PREP(DMA_BUS_MODE_PBL, txpbl);
>
> With the following in the header file:
>
> #define DMA_BUS_MODE_PBL BIT(16)
> -#define DMA_BUS_MODE_PBL_SHIFT 16
>
> The assumption here was that DMA_BUS_MODE_PBL was the mask for
> DMA_BUS_MODE_PBL_SHIFT, but this turns out not to be the case.
>
> The field is actually six bits wide, buts 21:16, and is called
> TXPBL.
>
> What's even more confusing is, there turns out to be a PBLX8
> single bit in the DMA_CHAN_CONTROL register (0x1100 for channel 0),
> and DMA_BUS_MODE_PBL seems to be used for that. However, this bit
> et.al. was listed under a comment "/* DMA SYS Bus Mode bitmap */"
> which is for register 0x1004.
>
> Fix this up by adding an appropriately named field definition under
> the DMA_CHAN_TX_CONTROL() register address definition.
>
> Move the RPBL mask definition under DMA_CHAN_RX_CONTROL(), correctly
> renaming it as well.
>
> Also move the PBL bit definition under DMA_CHAN_CONTROL(), correctly
> renaming it.
>
> This removes confusion over the PBL fields.
>
> Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk>
Good job finding the problem ! However you need a Fixes tag, even though
ths is is for net-next.
It would also have been nice to be in CC, I spent some time on the bisect...
Besides that, problem solved on an imx8mp setup :)
Tested-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
Reviewed-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
Maxime
> ---
> drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.c | 8 ++++----
> drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.h | 7 ++++---
> 2 files changed, 8 insertions(+), 7 deletions(-)
>
> diff --git a/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.c b/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.c
> index 7036beccfc85..aaa83e9ff4f0 100644
> --- a/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.c
> +++ b/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.c
> @@ -52,7 +52,7 @@ static void dwmac4_dma_init_rx_chan(struct stmmac_priv *priv,
> u32 rxpbl = dma_cfg->rxpbl ?: dma_cfg->pbl;
>
> value = readl(ioaddr + DMA_CHAN_RX_CONTROL(dwmac4_addrs, chan));
> - value = value | FIELD_PREP(DMA_BUS_MODE_RPBL_MASK, rxpbl);
> + value = value | FIELD_PREP(DMA_CHAN_RX_CTRL_RXPBL_MASK, rxpbl);
> writel(value, ioaddr + DMA_CHAN_RX_CONTROL(dwmac4_addrs, chan));
>
> if (IS_ENABLED(CONFIG_ARCH_DMA_ADDR_T_64BIT) && likely(dma_cfg->eame))
> @@ -73,7 +73,7 @@ static void dwmac4_dma_init_tx_chan(struct stmmac_priv *priv,
> u32 txpbl = dma_cfg->txpbl ?: dma_cfg->pbl;
>
> value = readl(ioaddr + DMA_CHAN_TX_CONTROL(dwmac4_addrs, chan));
> - value = value | FIELD_PREP(DMA_BUS_MODE_PBL, txpbl);
> + value = value | FIELD_PREP(DMA_CHAN_TX_CTRL_TXPBL_MASK, txpbl);
>
> /* Enable OSP to get best performance */
> value |= DMA_CONTROL_OSP;
> @@ -98,7 +98,7 @@ static void dwmac4_dma_init_channel(struct stmmac_priv *priv,
> /* common channel control register config */
> value = readl(ioaddr + DMA_CHAN_CONTROL(dwmac4_addrs, chan));
> if (dma_cfg->pblx8)
> - value = value | DMA_BUS_MODE_PBL;
> + value = value | DMA_CHAN_CTRL_PBLX8;
> writel(value, ioaddr + DMA_CHAN_CONTROL(dwmac4_addrs, chan));
>
> /* Mask interrupts by writing to CSR7 */
> @@ -116,7 +116,7 @@ static void dwmac410_dma_init_channel(struct stmmac_priv *priv,
> /* common channel control register config */
> value = readl(ioaddr + DMA_CHAN_CONTROL(dwmac4_addrs, chan));
> if (dma_cfg->pblx8)
> - value = value | DMA_BUS_MODE_PBL;
> + value = value | DMA_CHAN_CTRL_PBLX8;
>
> writel(value, ioaddr + DMA_CHAN_CONTROL(dwmac4_addrs, chan));
>
> diff --git a/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.h b/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.h
> index 5f1e2916f099..9d9077a4ac9f 100644
> --- a/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.h
> +++ b/drivers/net/ethernet/stmicro/stmmac/dwmac4_dma.h
> @@ -24,8 +24,6 @@
>
> #define DMA_SYS_BUS_MODE 0x00001004
>
> -#define DMA_BUS_MODE_PBL BIT(16)
> -#define DMA_BUS_MODE_RPBL_MASK GENMASK(21, 16)
> #define DMA_BUS_MODE_MB BIT(14)
> #define DMA_BUS_MODE_FB BIT(0)
>
> @@ -68,19 +66,22 @@ static inline u32 dma_chanx_base_addr(const struct dwmac4_addrs *addrs,
>
> #define DMA_CHAN_CONTROL(addrs, x) dma_chanx_base_addr(addrs, x)
>
> +#define DMA_CHAN_CTRL_PBLX8 BIT(16)
> #define DMA_CONTROL_SPH BIT(24)
>
> #define DMA_CHAN_TX_CONTROL(addrs, x) (dma_chanx_base_addr(addrs, x) + 0x4)
>
> #define DMA_CONTROL_EDSE BIT(28)
> +#define DMA_CHAN_TX_CTRL_TXPBL_MASK GENMASK(21, 16)
> #define DMA_CONTROL_TSE BIT(12)
> #define DMA_CONTROL_OSP BIT(4)
> #define DMA_CONTROL_ST BIT(0)
>
> #define DMA_CHAN_RX_CONTROL(addrs, x) (dma_chanx_base_addr(addrs, x) + 0x8)
>
> -#define DMA_CONTROL_SR BIT(0)
> +#define DMA_CHAN_RX_CTRL_RXPBL_MASK GENMASK(21, 16)
> #define DMA_RBSZ_MASK GENMASK(14, 1)
> +#define DMA_CONTROL_SR BIT(0)
>
> #define DMA_CHAN_TX_BASE_ADDR_HI(addrs, x) (dma_chanx_base_addr(addrs, x) + 0x10)
> #define DMA_CHAN_TX_BASE_ADDR(addrs, x) (dma_chanx_base_addr(addrs, x) + 0x14)
next prev parent reply other threads:[~2026-01-16 7:50 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-01-16 0:49 [PATCH net-next] net: stmmac: fix dwmac4 transmit performance regression Russell King (Oracle)
2026-01-16 7:42 ` Maxime Chevallier [this message]
2026-01-16 23:21 ` Russell King (Oracle)
2026-01-17 9:26 ` Maxime Chevallier
2026-01-19 14:19 ` patchwork-bot+netdevbpf
2026-03-13 15:03 ` Georg Gottleuber
2026-03-13 16:35 ` Russell King (Oracle)
2026-03-13 18:39 ` Russell King (Oracle)
2026-03-16 10:51 ` Georg Gottleuber
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ab2d7cc9-e7d9-47fb-95ad-90ae4f5f1f67@bootlin.com \
--to=maxime.chevallier@bootlin.com \
--cc=alexandre.torgue@foss.st.com \
--cc=andrew+netdev@lunn.ch \
--cc=andrew@lunn.ch \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=hkallweit1@gmail.com \
--cc=kuba@kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-stm32@st-md-mailman.stormreply.com \
--cc=mcoquelin.stm32@gmail.com \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=rmk+kernel@armlinux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox