From: Lukas Wunner <lukas@wunner.de>
To: Andrew Lunn <andrew@lunn.ch>
Cc: Oleksij Rempel <o.rempel@pengutronix.de>,
Oliver Neukum <oneukum@suse.com>,
Oleksij Rempel <linux@rempel-privat.de>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
Heiner Kallweit <hkallweit1@gmail.com>
Subject: Re: ordering of call to unbind() in usbnet_disconnect
Date: Sat, 26 Mar 2022 14:01:48 +0100 [thread overview]
Message-ID: <20220326130148.GC31022@wunner.de> (raw)
In-Reply-To: <Yj8KnE5BeEK1SXDP@lunn.ch>
On Sat, Mar 26, 2022 at 01:44:12PM +0100, Andrew Lunn wrote:
> On Sat, Mar 26, 2022 at 01:25:52PM +0100, Lukas Wunner wrote:
> > Oleksij, I cannot reproduce your stacktrace (included in full length below).
> > I've tested with kernel 5.13 (since the stacktrace was with 5.13-rc3)
> > with all your (and other people's) asix patches applied on top,
> > except for 2c9d6c2b871d. Tried unplugging an AX88772A multiple times,
> > never got a stacktrace.
> >
> > I've also walked down the code paths from usbnet_disconnect() and cannot
> > see how the stacktrace could occur.
> >
> > Normally an unregistering netdev is removed from the linkwatch event list
> > (lweventlist) via this call stack:
> >
> > usbnet_disconnect()
> > unregister_netdev()
> > rtnl_unlock()
> > netdev_run_todo()
> > netdev_wait_allrefs()
> > linkwatch_forget_dev()
> > linkwatch_do_dev()
> >
> > For the stacktrace to occur, the netdev would have to be subsequently
> > re-added to the linkwatch event list via linkwatch_fire_event().
>
> What you might be missing is a call to phy_error()
But phy_error() has a WARN_ON(1) right at its top. So it produces
a stacktrace itself. That stacktrace is nowhere to be seen in the
dmesg output Oleksij posted. Hence it can't be caused by phy_error().
Also, recall that unregister_netdev() stops the netdev before
unregistering it. That in turn causes an invocation of phy_stop()
via ax88772_stop(). phy_stop() already puts the PHY into PHY_HALTED
state and resets phydev->link = 0. So a subsequent phy_error() cannot
result in a call to phy_link_down() (which would indeed trigger a
dangerous linkwatch_fire_event()).
> > That is called, among other places, from netif_carrier_off(). However,
> > netif_carrier_off() is already called *before* linkwatch_forget_dev()
> > when unregister_netdev() stops the netdev before unregistering it:
> >
> > usbnet_disconnect()
> > unregister_netdev()
> > unregister_netdevice()
> > unregister_netdevice_queue(dev, NULL)
> > unregister_netdevice_many()
> > dev_close_many()
> > __dev_close_many()
> > usbnet_stop()
> > ax88772_stop()
> > phy_stop() # state = PHY_HALTED
> > phy_state_machine()
>
> I'm guessing somewhere around here:
>
> If it calls into the PHY driver, and the PHY calls for an MDIO bus
> transaction, and that returns an error, -ENODEV or -EIO for example,
> because the USB device has gone away, and that results in a call to
> phy_error().
Oleksij amended phy_state_machine() to bail out if err == -ENODEV
with commit 06edf1a940be ("net: phy: do not print dump stack if device
was removed"). The commit skips the phy_error() on -ENODEV, which
makes a lot of sense.
Thanks,
Lukas
prev parent reply other threads:[~2022-03-26 13:01 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-03-10 11:25 ordering of call to unbind() in usbnet_disconnect Oliver Neukum
2022-03-10 11:38 ` Oleksij Rempel
2022-03-14 18:42 ` Lukas Wunner
2022-03-14 19:14 ` Andrew Lunn
2022-03-15 5:44 ` Oleksij Rempel
2022-03-15 8:32 ` Lukas Wunner
2022-03-15 11:38 ` Oleksij Rempel
2022-03-15 13:28 ` Andrew Lunn
2022-03-17 15:53 ` Oliver Neukum
2022-03-17 21:03 ` Lukas Wunner
2022-03-21 10:17 ` Lukas Wunner
2022-03-21 10:43 ` Oleksij Rempel
2022-03-31 9:35 ` Oliver Neukum
2022-03-21 10:02 ` Lukas Wunner
2022-03-21 13:10 ` Andrew Lunn
2022-03-26 12:39 ` Lukas Wunner
2022-03-26 12:49 ` Andrew Lunn
2022-03-26 13:04 ` Lukas Wunner
2022-03-27 8:37 ` Oleksij Rempel
2022-03-31 9:20 ` Oliver Neukum
2022-03-31 9:30 ` Lukas Wunner
2022-03-31 9:59 ` Oliver Neukum
2022-03-31 11:22 ` Lukas Wunner
2022-03-26 12:25 ` Lukas Wunner
2022-03-26 12:44 ` Andrew Lunn
2022-03-26 13:01 ` Lukas Wunner [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220326130148.GC31022@wunner.de \
--to=lukas@wunner.de \
--cc=andrew@lunn.ch \
--cc=hkallweit1@gmail.com \
--cc=linux@rempel-privat.de \
--cc=netdev@vger.kernel.org \
--cc=o.rempel@pengutronix.de \
--cc=oneukum@suse.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).