* [PATCH] ks8851: Fix deadlock with the SPI chip variant
From: Ronald Wahl @ 2024-07-03 16:00 UTC
To: Ronald Wahl
Cc: Ronald Wahl, David S. Miller, Eric Dumazet, Jakub Kicinski,
Paolo Abeni, Simon Horman, netdev, stable
From: Ronald Wahl <ronald.wahl@raritan.com>
When SMP is enabled and spinlocks are actually functional, there is
a deadlock on the 'statelock' spinlock between ks8851_start_xmit_spi
and ks8851_irq:
watchdog: BUG: soft lockup - CPU#0 stuck for 27s!
call trace:
queued_spin_lock_slowpath+0x100/0x284
do_raw_spin_lock+0x34/0x44
ks8851_start_xmit_spi+0x30/0xb8
ks8851_start_xmit+0x14/0x20
netdev_start_xmit+0x40/0x6c
dev_hard_start_xmit+0x6c/0xbc
sch_direct_xmit+0xa4/0x22c
__qdisc_run+0x138/0x3fc
qdisc_run+0x24/0x3c
net_tx_action+0xf8/0x130
handle_softirqs+0x1ac/0x1f0
__do_softirq+0x14/0x20
____do_softirq+0x10/0x1c
call_on_irq_stack+0x3c/0x58
do_softirq_own_stack+0x1c/0x28
__irq_exit_rcu+0x54/0x9c
irq_exit_rcu+0x10/0x1c
el1_interrupt+0x38/0x50
el1h_64_irq_handler+0x18/0x24
el1h_64_irq+0x64/0x68
__netif_schedule+0x6c/0x80
netif_tx_wake_queue+0x38/0x48
ks8851_irq+0xb8/0x2c8
irq_thread_fn+0x2c/0x74
irq_thread+0x10c/0x1b0
kthread+0xc8/0xd8
ret_from_fork+0x10/0x20
This issue was not identified earlier because tests were done on
a device with SMP disabled, where spinlocks are effectively NOPs.
This commit moves the netif_wake_queue call outside the
spinlock-protected area.
Fixes: 3dc5d4454545 ("net: ks8851: Fix TX stall caused by TX buffer overrun")
Cc: "David S. Miller" <davem@davemloft.net>
Cc: Eric Dumazet <edumazet@google.com>
Cc: Jakub Kicinski <kuba@kernel.org>
Cc: Paolo Abeni <pabeni@redhat.com>
Cc: Simon Horman <horms@kernel.org>
Cc: netdev@vger.kernel.org
Cc: stable@vger.kernel.org # 5.10+
Signed-off-by: Ronald Wahl <ronald.wahl@raritan.com>
---
drivers/net/ethernet/micrel/ks8851_common.c | 6 ++++--
1 file changed, 4 insertions(+), 2 deletions(-)
diff --git a/drivers/net/ethernet/micrel/ks8851_common.c b/drivers/net/ethernet/micrel/ks8851_common.c
index 6453c92f0fa7..60b959126b26 100644
--- a/drivers/net/ethernet/micrel/ks8851_common.c
+++ b/drivers/net/ethernet/micrel/ks8851_common.c
@@ -348,15 +348,17 @@ static irqreturn_t ks8851_irq(int irq, void *_ks)

 	if (status & IRQ_TXI) {
 		unsigned short tx_space = ks8851_rdreg16(ks, KS_TXMIR);
+		bool need_wake_queue;

 		netif_dbg(ks, intr, ks->netdev,
 			  "%s: txspace %d\n", __func__, tx_space);

 		spin_lock(&ks->statelock);
 		ks->tx_space = tx_space;
-		if (netif_queue_stopped(ks->netdev))
-			netif_wake_queue(ks->netdev);
+		need_wake_queue = netif_queue_stopped(ks->netdev);
 		spin_unlock(&ks->statelock);
+		if (need_wake_queue)
+			netif_wake_queue(ks->netdev);
 	}

 	if (status & IRQ_SPIBEI) {
--
2.45.2
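
The reordering in this patch follows a common pattern: sample the condition while holding the lock, drop the lock, and only then call into code that may need the same lock. What follows is a minimal userspace sketch of that pattern, not driver code. All `toy_*` names are hypothetical, the toy lock reports a would-be self-deadlock instead of spinning, and the wake helper is assumed to run the transmit path immediately, which stands in for the softirq seen in the trace.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy non-recursive lock: reports re-entry instead of spinning forever. */
struct toy_lock { bool held; };

/* Returns 0 on success, -1 if the lock is already held (would deadlock). */
static int toy_lock_acquire(struct toy_lock *l)
{
	if (l->held)
		return -1;
	l->held = true;
	return 0;
}

static void toy_lock_release(struct toy_lock *l)
{
	assert(l->held);	/* releasing an unheld lock is a bug */
	l->held = false;
}

struct toy_dev {
	struct toy_lock statelock;
	unsigned int tx_space;
	bool queue_stopped;
	int deadlocks;		/* would-be deadlocks observed */
};

/* Stand-in for the transmit path, which takes statelock itself. */
static void toy_xmit(struct toy_dev *d)
{
	if (toy_lock_acquire(&d->statelock) != 0) {
		d->deadlocks++;
		return;
	}
	toy_lock_release(&d->statelock);
}

/* Stand-in for waking the queue; assumed to trigger transmit at once. */
static void toy_wake_queue(struct toy_dev *d)
{
	d->queue_stopped = false;
	toy_xmit(d);
}

/* Old ordering: the wake happens while statelock is still held. */
void toy_irq_wake_under_lock(struct toy_dev *d, unsigned int tx_space)
{
	toy_lock_acquire(&d->statelock);
	d->tx_space = tx_space;
	if (d->queue_stopped)
		toy_wake_queue(d);
	toy_lock_release(&d->statelock);
}

/* New ordering: decide under the lock, wake after releasing it. */
void toy_irq_wake_after_unlock(struct toy_dev *d, unsigned int tx_space)
{
	bool need_wake_queue;

	toy_lock_acquire(&d->statelock);
	d->tx_space = tx_space;
	need_wake_queue = d->queue_stopped;
	toy_lock_release(&d->statelock);

	if (need_wake_queue)
		toy_wake_queue(d);
}
```

With a stopped queue, the first variant records one would-be deadlock and the second records none. The model only covers the call path shown in the trace; it says nothing about other places where the transmit path could interrupt a holder of the lock.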
* Re: [PATCH] ks8851: Fix deadlock with the SPI chip variant
From: Jakub Kicinski @ 2024-07-04 14:44 UTC
To: Ronald Wahl
Cc: Ronald Wahl, David S. Miller, Eric Dumazet, Paolo Abeni,
Simon Horman, netdev, stable
On Wed, 3 Jul 2024 18:00:53 +0200 Ronald Wahl wrote:
> +		bool need_wake_queue;
>
>  		netif_dbg(ks, intr, ks->netdev,
>  			  "%s: txspace %d\n", __func__, tx_space);
>
>  		spin_lock(&ks->statelock);
>  		ks->tx_space = tx_space;
> -		if (netif_queue_stopped(ks->netdev))
> -			netif_wake_queue(ks->netdev);
> +		need_wake_queue = netif_queue_stopped(ks->netdev);
>  		spin_unlock(&ks->statelock);
> +		if (need_wake_queue)
> +			netif_wake_queue(ks->netdev);
xmit runs in BH, so this is just one way you can hit this deadlock.
A better fix would be to make sure statelock is always taken
using spin_lock_bh().
--
pw-bot: cr
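
To illustrate why taking the lock with bottom halves disabled closes the whole class of deadlock rather than one call path, here is a small userspace model. It is a sketch under stated assumptions, not kernel code: every `toy_*` name is hypothetical, a single CPU is modelled, and the only modelled rules are that pending softirqs run on interrupt exit unless BH is disabled, and that re-enabling BH runs whatever is pending.

```c
#include <assert.h>
#include <stdbool.h>

/* Single-CPU model: one lock, a BH-disable depth, one pending softirq. */
struct toy_cpu {
	bool lock_held;
	int bh_disable_depth;
	bool softirq_pending;
	int deadlocks;		/* softirq found the lock held on its own CPU */
	int xmit_runs;		/* softirq ran with the lock free */
};

/* Transmit path running as a softirq; it needs the lock. */
static void toy_softirq_xmit(struct toy_cpu *c)
{
	c->softirq_pending = false;
	if (c->lock_held) {
		c->deadlocks++;	/* would spin forever on real hardware */
		return;
	}
	c->xmit_runs++;
}

/* Interrupt exit: pending softirqs run unless BH is disabled. */
static void toy_irq_exit(struct toy_cpu *c)
{
	if (c->softirq_pending && c->bh_disable_depth == 0)
		toy_softirq_xmit(c);
}

static void toy_bh_disable(struct toy_cpu *c)
{
	c->bh_disable_depth++;
}

/* Re-enabling BH runs any softirq that was deferred meanwhile. */
static void toy_bh_enable(struct toy_cpu *c)
{
	assert(c->bh_disable_depth > 0);
	c->bh_disable_depth--;
	if (c->bh_disable_depth == 0 && c->softirq_pending)
		toy_softirq_xmit(c);
}

static void toy_lock(struct toy_cpu *c)
{
	assert(!c->lock_held);
	c->lock_held = true;
}

static void toy_unlock(struct toy_cpu *c)
{
	assert(c->lock_held);
	c->lock_held = false;
}

/* Plain lock: an interrupt returning inside the section runs the softirq. */
void toy_section_plain(struct toy_cpu *c)
{
	toy_lock(c);
	c->softirq_pending = true;	/* something raised the TX softirq */
	toy_irq_exit(c);		/* interrupt returns, lock still held */
	toy_unlock(c);
}

/* BH-disabling lock: the softirq is deferred until after the unlock. */
void toy_section_bh(struct toy_cpu *c)
{
	toy_bh_disable(c);
	toy_lock(c);
	c->softirq_pending = true;
	toy_irq_exit(c);		/* deferred: BH is disabled */
	toy_unlock(c);
	toy_bh_enable(c);		/* deferred softirq runs here */
}
```

In this model the plain section ends with one would-be deadlock, while the BH-disabling section ends with none and the transmit path run once after the lock is released. The order inside `toy_section_bh` (disable BH, lock, unlock, enable BH) mirrors what `spin_lock_bh()` and `spin_unlock_bh()` do.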
* Re: [PATCH] ks8851: Fix deadlock with the SPI chip variant
From: Ronald Wahl @ 2024-07-04 20:18 UTC
To: Jakub Kicinski, Ronald Wahl
Cc: David S. Miller, Eric Dumazet, Paolo Abeni, Simon Horman, netdev,
stable
Thanks, I made a v2.
I have now also found another potential TX stall caused by improper
locking. In ks8851_tx_work we need to move
last = skb_queue_empty(&ks->txq);
under the lock; otherwise we risk a TX stall if the queue was empty
when sampled and has meanwhile been completely filled while we were
waiting for the lock. I need to double-check this scenario first. If it
is indeed an issue, I will provide a separate patch later.
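
The scenario described above is a stale-snapshot race: the emptiness check is taken before the lock, so anything queued while waiting for the lock is never seen. The sketch below models only that ordering in userspace and is not the driver's code. All `toy_*` names are hypothetical, and waiting for the lock is simulated by a hook that lets a producer enqueue packets at exactly that point. Whether the driver can actually hit this is, as stated, still to be confirmed.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy transmit queue: only the number of queued packets matters here. */
struct toy_txq { int len; };

/* Hook run while the worker is "waiting for the lock". May be NULL. */
typedef void (*toy_hook)(struct toy_txq *q);

static void toy_lock_wait(struct toy_txq *q, toy_hook while_waiting)
{
	if (while_waiting)
		while_waiting(q);
}

/* Emptiness sampled before the lock. Returns the number of packets sent. */
int toy_tx_work_snapshot_before_lock(struct toy_txq *q, toy_hook h)
{
	int sent = 0;
	bool last = (q->len == 0);	/* may be stale once the lock is held */

	toy_lock_wait(q, h);
	while (!last) {
		assert(q->len > 0);
		q->len--;
		sent++;
		last = (q->len == 0);
	}
	return sent;
}

/* Emptiness sampled under the lock. Returns the number of packets sent. */
int toy_tx_work_snapshot_under_lock(struct toy_txq *q, toy_hook h)
{
	int sent = 0;
	bool last;

	toy_lock_wait(q, h);
	last = (q->len == 0);		/* reflects the queue as it is now */
	while (!last) {
		assert(q->len > 0);
		q->len--;
		sent++;
		last = (q->len == 0);
	}
	return sent;
}

/* Example producer: enqueues three packets while the worker waits. */
void toy_fill_three(struct toy_txq *q)
{
	q->len += 3;
}
```

Starting from an empty queue with `toy_fill_three` as the hook, the first variant sends nothing and leaves three packets stranded, while the second drains all three. Passing `NULL` as the hook models the uncontended case, where both variants behave the same.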
On 04.07.24 16:44, Jakub Kicinski wrote:
> On Wed, 3 Jul 2024 18:00:53 +0200 Ronald Wahl wrote:
>> +		bool need_wake_queue;
>>
>>  		netif_dbg(ks, intr, ks->netdev,
>>  			  "%s: txspace %d\n", __func__, tx_space);
>>
>>  		spin_lock(&ks->statelock);
>>  		ks->tx_space = tx_space;
>> -		if (netif_queue_stopped(ks->netdev))
>> -			netif_wake_queue(ks->netdev);
>> +		need_wake_queue = netif_queue_stopped(ks->netdev);
>>  		spin_unlock(&ks->statelock);
>> +		if (need_wake_queue)
>> +			netif_wake_queue(ks->netdev);
>
> xmit runs in BH, so this is just one way you can hit this deadlock.
> A better fix would be to make sure statelock is always taken
> using spin_lock_bh().