From: Robert Hancock <robert.hancock@calian.com>
To: "regressions@lists.linux.dev" <regressions@lists.linux.dev>
Cc: "linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
"nbowler@draconx.ca" <nbowler@draconx.ca>,
"edumazet@google.com" <edumazet@google.com>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"pabeni@redhat.com" <pabeni@redhat.com>,
"davem@davemloft.net" <davem@davemloft.net>
Subject: Re: PROBLEM: Broken or delayed ethernet on Xilinx ZCU104 since 5.18 (regression)
Date: Tue, 8 Aug 2023 16:00:22 +0000 [thread overview]
Message-ID: <efeee3765f8177bfde3c845face67757d383e4ef.camel@calian.com> (raw)
In-Reply-To: <3a3a53ec-81c0-c5f7-3c37-2c07212a6330@leemhuis.info>
On Fri, 2023-08-04 at 17:45 +0200, Linux regression tracking (Thorsten
Leemhuis) wrote:
> CAUTION: This email originated from outside of the organization. Do
> not click links or open attachments unless you recognize the sender
> and know the content is safe.
>
> [adding Robert Hancock (the author of the likely culprit) to the list
> of
> recipients as well as the network maintainers]
>
> [TLDR: I'm adding this report to the list of tracked Linux kernel
> regressions; the text you find below is based on a few templates
> paragraphs you might have encountered already in similar form.
> See link in footer if these mails annoy you.]
>
> On 04.08.23 17:26, Nick Bowler wrote:
> > Hi,
> >
> > With recent kernels (5.18 and newer) the ethernet is all wonky on
> > my
> > ZCU104 board.
> >
> > There is some behaviour inconsistency between kernel versions
> > identified
> > during bisection, so maybe there is more than one issue with the
> > ethernet?
> >
> > 6.5-rc4: after 10 seconds, the following message is printed:
> >
> > [ 10.761808] platform ff0e0000.ethernet: deferred probe
> > pending
> >
> > but the network device seemingly never appears (I waited about a
> > minute).
> >
> > 6.1 and 6.4: after 10 seconds, the device suddenly appears and
> > starts
> > working (but this is way too late).
> >
> > 5.18: the device never appears and no unusual messages are
> > printed
> > (I waited ten minutes).
> >
> > With 5.17 and earlier versions, the eth0 device appears without any
> > delay.
> >
> > Unfortunately, as bisection closed on the problematic section, all
> > the
> > built kernels became untestable as they appear to crash during
> > early
> > boot. Nevertheless, I manually selected a commit that sounded
> > relevant:
> >
> > commit e461bd6f43f4e568f7436a8b6bc21c4ce6914c36
> > Author: Robert Hancock <robert.hancock@calian.com>
> > Date: Thu Jan 27 10:37:36 2022 -0600
> >
> > arm64: dts: zynqmp: Added GEM reset definitions
> >
> > Reverting this fixes the problem on 5.18. Reverting this fixes the
> > problem on 6.1. Reverting this fixes the problem on 6.4. In all
> > of
> > these versions, with this change reverted, the network device
> > appears
> > without delay.
Hi Nick,
If this change triggered the problem, then I would suspect there's an
issue with either the zynqmp_reset driver itself, or something it's
dependent on, somehow not being enabled in your kernel configuration.
If the device tree has a reference to that reset device node as
providing a reset but the driver for that device hasn't loaded yet,
then the probe will keep being deferred in the hope that eventually
that driver will be ready, but if it's somehow disabled or has
unsatisfied dependencies, it never will be.
I'm not sure why you're seeing it start working after 10 seconds in
some versions and not at all in others, however.
Can you provide the kernel .config you are using?
> >
> > Unfortunately, it seems this is not sufficient to correct the
> > problem on
> > 6.5-rc4 -- there is no apparent change in behaviour, so maybe there
> > is
> > a new, different problem?
> >
> > I guess I can kick off another bisection to find out when this
> > revert
> > stops fixing things...
> >
> > Let me know if you need any more info!
>
> Thanks for the report. To be sure the issue doesn't fall through the
> cracks unnoticed, I'm adding it to regzbot, the Linux kernel
> regression
> tracking bot:
>
> #regzbot ^introduced e461bd6f43f4e5
> #regzbot title net/arm64: dts: Broken or delayed ethernet on Xilinx
> ZCU104
> #regzbot ignore-activity
>
> This isn't a regression? This issue or a fix for it are already
> discussed somewhere else? It was fixed already? You want to clarify
> when
> the regression started to happen? Or point out I got the title or
> something else totally wrong? Then just reply and tell me -- ideally
> while also telling regzbot about it, as explained by the page listed
> in
> the footer of this mail.
>
> Developers: When fixing the issue, remember to add 'Link:' tags
> pointing
> to the report (the parent of this mail). See page linked in footer
> for
> details.
>
> Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker'
> hat)
> --
> Everything you wanna know about Linux kernel regression tracking:
> https://urldefense.com/v3/__https://linux-regtracking.leemhuis.info/about/*tldr__;Iw!!IOGos0k!lgEIjRg7ZtC0V68jiOVImg2yVuq1jEMnpbmTwHXR7xNZuBcSftX0P9hScWM3r9afZJfPr3EYJuD_MNzCp8paD1EGHe7akA$
> That page also explains what to do if mails like this annoy you.
--
Robert Hancock <robert.hancock@calian.com>
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2023-08-08 16:01 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-08-04 15:26 PROBLEM: Broken or delayed ethernet on Xilinx ZCU104 since 5.18 (regression) Nick Bowler
2023-08-04 15:45 ` Linux regression tracking (Thorsten Leemhuis)
2023-08-08 16:00 ` Robert Hancock [this message]
2023-08-04 15:52 ` Rob Herring
2023-08-04 16:24 ` Nick Bowler
2023-08-04 16:28 ` Nick Bowler
2023-08-04 16:47 ` Russell King (Oracle)
2023-08-04 16:54 ` Nick Bowler
2023-08-04 17:02 ` Rob Herring
2023-08-04 17:52 ` Nick Bowler
2023-08-04 20:22 ` Rob Herring
2023-08-04 21:31 ` Nick Bowler
2023-08-04 22:27 ` Russell King (Oracle)
2023-08-05 6:57 ` Nick Bowler
2023-08-05 7:03 ` Andrew Lunn
2023-08-05 6:58 ` Andrew Lunn
2023-08-05 7:10 ` Nick Bowler
2023-08-05 7:25 ` Andrew Lunn
2023-08-05 7:34 ` Nick Bowler
2023-08-29 13:30 ` Linux regression tracking #update (Thorsten Leemhuis)
2023-08-05 1:03 ` Saravana Kannan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=efeee3765f8177bfde3c845face67757d383e4ef.camel@calian.com \
--to=robert.hancock@calian.com \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=nbowler@draconx.ca \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=regressions@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox