From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 59DB8CD98CE for ; Fri, 12 Jun 2026 14:30:18 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender:List-Subscribe:List-Help :List-Post:List-Archive:List-Unsubscribe:List-Id:In-Reply-To:References:Cc: Subject:From:To:Message-Id:Date:Content-Type:Content-Transfer-Encoding: Mime-Version:Reply-To:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Owner; bh=aFDFYB9La2SnSg556RWfwxCdEciJqAjpEnzTJLjJUqc=; b=uqtGl1rW/BgDVpe3erZgfn5eYB 9n1yQFTuufTJ5VTClKQII3/DV5zN15jK94/aRl7bmii8UrYq85/GhwbJSdqH5SF7EDHtEmngadp95 TwI9bW31p6B/GGjU3rOuArQWPmmsSLRMwmLONJL+Ikm/SHIhYTFkCdE5Kq86ABzee1U4KZJcnF45v 8U5tGIULNohKH+JD+ScCbYUjiYtxdZIRFAhVSBzgCZnbx86WWvMidfF1S3P3PUBBIoPNm1vp4VCYH Cmgct2XOcv59rdrFUdTGZKIZPJCzeTZaWEkWPkIAIHhK7gSYA+6klBFHeDjqBDx3mY5DEVp8E5mpO JbFStjcQ==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.99.1 #2 (Red Hat Linux)) id 1wY2tf-0000000B4Ur-2YF3; Fri, 12 Jun 2026 14:30:11 +0000 Received: from smtpout-02.galae.net ([185.246.84.56]) by bombadil.infradead.org with esmtps (Exim 4.99.1 #2 (Red Hat Linux)) id 1wY2tc-0000000B4T5-0K4H for linux-arm-kernel@lists.infradead.org; Fri, 12 Jun 2026 14:30:10 +0000 Received: from smtpout-01.galae.net (smtpout-01.galae.net [212.83.139.233]) by smtpout-02.galae.net (Postfix) with ESMTPS id C3DA61A38E6; Fri, 12 Jun 2026 14:30:05 +0000 (UTC) Received: from mail.galae.net (mail.galae.net [212.83.136.155]) by smtpout-01.galae.net (Postfix) with ESMTPS id 8FE0560012; Fri, 12 Jun 2026 14:30:05 +0000 (UTC) Received: from [127.0.0.1] (localhost [127.0.0.1]) by localhost (Mailerdaemon) with ESMTPSA id 4D4D4106C86A1; Fri, 12 Jun 2026 16:30:01 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bootlin.com; s=dkim; t=1781274604; h=from:subject:date:message-id:to:cc:mime-version:content-type: content-transfer-encoding:in-reply-to:references; bh=aFDFYB9La2SnSg556RWfwxCdEciJqAjpEnzTJLjJUqc=; b=k9eZw9xmP3UuCmLaPgcuWj/ncboMzMbxIg9q5pyT5HyxUZs/rUf2v2ZjsSYH41uy3LP5cV /ke4C0U1ROIkDOu6H9/TrX29Fbseuw3wFAO/CQfUhqMb8W+yQcvNr7bcwpxV/JpvgBC2O6 mz9lX52MDZuIgqhFMHyvlnDTbbjYIhGQXB9R7cEN6mM0CCOrbjtWJyoPpCNZd2HU3E5xCZ umOh8iOqDqr426Lj2+6Z4fpH88nQKKfteS4Oo2lXTK7yC88oDQ1DDP/HMf3pngIPynbAWy 9sHzNxFEy2TkQbu1zWmdVouLRVH3qRRR85Df2lTtoBsB8vsEgM5xoJ1e3OxbNQ== Mime-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=UTF-8 Date: Fri, 12 Jun 2026 16:30:00 +0200 Message-Id: To: "Andrea della Porta" , "Nicolai Buchwitz" From: =?utf-8?q?Th=C3=A9o_Lebrun?= Subject: Re: [PATCH] net: macb: add TX stall timeout callback to recover from lost TSTART write Cc: , "Nicolas Ferre" , "Claudiu Beznea" , "Andrew Lunn" , "David S . Miller" , "Eric Dumazet" , "Jakub Kicinski" , "Paolo Abeni" , , , , "Lukasz Raczylo" , "Steffen Jaeckel" X-Mailer: aerc 0.21.0-0-g5549850facc2 References: <771b8faeaee1fce4a84a5ba2661d60b35a65a6d5.1781253818.git.andrea.porta@suse.com> <85507fd0fb42fca280aca1ee02178ca9@tipi-net.de> In-Reply-To: X-Last-TLS-Session-Version: TLSv1.3 X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.9.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20260612_073008_250430_02E5B113 X-CRM114-Status: GOOD ( 16.79 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org On Fri Jun 12, 2026 at 4:28 PM CEST, Th=C3=A9o Lebrun wrote: > On Fri Jun 12, 2026 at 3:03 PM CEST, Andrea della Porta wrote: >> On 14:53 Fri 12 Jun , Nicolai Buchwitz wrote: >>> On 12.6.2026 14:51, Andrea della Porta wrote: >>> > > The commit message describes it as RP1 specific, but it gets applie= d >>> > > to all >>> > > other variants? >>> >=20 >>> > I've seen this issue happening only on RaspberryPi 5, but AFAIK it >>> > could affect also other MACB blocks connected through PCIe, so it >>> > may be widespread (even though it should have probably already been >>> > noticed in the past). In the orginal driver there's no timeout callba= ck >>> > defined and this is much like pretgending the issue causing the timeo= ut >>> > to happen to go away without doing anything (whatever the cause ot th= e >>> > specific hw are). So in my opinion we can just extend that to all MAC= B. >>> > Or maybe we should execute the restart conditionally on >>> > .compatible =3D "raspberrypi,rp1-gem"? >>>=20 >>> I just observed the issue once, but other people reported it to be happ= en >>> more >>> frequently. If we can narrow down a reproducer, it would be good to tes= t on >>> other >>> blocks too (like EyeQ at Th=C3=A9o's).| >>>=20 >>> So maybe you can imagine a good repro for this issue? >> >> Sure, it's happening quite often during bulk dataflow, at least >> on my RPi5. >> It can be reproduced with the following, issued from the DUT: >> >> iperf -c -P 10 -t 3000 -w 4M -i 1 >> >> plus, of course, the related command on server side: iperf -s. >> >> It usually happens a couple of times withing a few hours. > > Thanks for the reproducer command; I'll run it next week. > I'd be surprised if it reproduced on hardware that isn't the Pi 5. Sorry for the two-step message. I forgot to mention I'd prefer to have the timeout callback on all platforms: don't reserve it for Pi 5. Thanks, -- Th=C3=A9o Lebrun, Bootlin Embedded Linux and Kernel engineering https://bootlin.com