From: Vladimir Oltean <olteanv@gmail.com>
To: Christian Eggers <ceggers@arri.de>
Cc: Woojung Huh <woojung.huh@microchip.com>,
Microchip Linux Driver Support <UNGLinuxDriver@microchip.com>,
Andrew Lunn <andrew@lunn.ch>,
Vivien Didelot <vivien.didelot@gmail.com>,
Florian Fainelli <f.fainelli@gmail.com>,
"David S . Miller" <davem@davemloft.net>,
Jakub Kicinski <kuba@kernel.org>,
netdev@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [net v2] net: dsa: microchip: fix race condition
Date: Tue, 6 Oct 2020 19:21:25 +0300 [thread overview]
Message-ID: <20201006162125.ulftqdiufdxjesn7@skbuf> (raw)
In-Reply-To: <20201006155651.21473-1-ceggers@arri.de>
On Tue, Oct 06, 2020 at 05:56:51PM +0200, Christian Eggers wrote:
> Between queuing the delayed work and finishing the setup of the dsa
> ports, the process may sleep in request_module() (via
> phy_device_create()) and the queued work may be executed prior to the
> switch net devices being registered. In ksz_mib_read_work(), a NULL
> dereference will happen within netof_carrier_ok(dp->slave).
>
> Not queuing the delayed work in ksz_init_mib_timer() makes things even
> worse because the work will now be queued for immediate execution
> (instead of 2000 ms) in ksz_mac_link_down() via
> dsa_port_link_register_of().
>
> Call tree:
> ksz9477_i2c_probe()
> \--ksz9477_switch_register()
> \--ksz_switch_register()
> +--dsa_register_switch()
> | \--dsa_switch_probe()
> | \--dsa_tree_setup()
> | \--dsa_tree_setup_switches()
> | +--dsa_switch_setup()
> | | +--ksz9477_setup()
> | | | \--ksz_init_mib_timer()
> | | | |--/* Start the timer 2 seconds later. */
> | | | \--schedule_delayed_work(&dev->mib_read, msecs_to_jiffies(2000));
> | | \--__mdiobus_register()
> | | \--mdiobus_scan()
> | | \--get_phy_device()
> | | +--get_phy_id()
> | | \--phy_device_create()
> | | |--/* sleeping, ksz_mib_read_work() can be called meanwhile */
> | | \--request_module()
> | |
> | \--dsa_port_setup()
> | +--/* Called for non-CPU ports */
> | +--dsa_slave_create()
> | | +--/* Too late, ksz_mib_read_work() may be called beforehand */
> | | \--port->slave = ...
> | ...
> | +--Called for CPU port */
> | \--dsa_port_link_register_of()
> | \--ksz_mac_link_down()
> | +--/* mib_read must be initialized here */
> | +--/* work is already scheduled, so it will be executed after 2000 ms */
> | \--schedule_delayed_work(&dev->mib_read, 0);
> \-- /* here port->slave is setup properly, scheduling the delayed work should be safe */
>
> Solution:
> 1. Do not queue (only initialize) delayed work in ksz_init_mib_timer().
> 2. Only queue delayed work in ksz_mac_link_down() if init is completed.
> 3. Queue work once in ksz_switch_register(), after dsa_register_switch()
> has completed.
>
> Fixes: 7c6ff470aa ("net: dsa: microchip: add MIB counter reading support")
> Signed-off-by: Christian Eggers <ceggers@arri.de>
> ---
Reviewed-by: Vladimir Oltean <olteanv@gmail.com>
You forgot to copy Florian's review tag from v1.
> v2:
> ---------
> - no changes in the patch itself
> - use correct subject-prefix
> - changed wording of commit description
> - added call tree to commit description
> - added "Fixes:" tag
>
[...]
> /* Only read MIB counters when the port is told to do.
> * If not, read only dropped counters when link is not up.
> */
> port_r_cnt() is called independently of p->read and netif_carrier_ok()... What
> is correct here (comment or code)?
port_r_cnt() iterates with mib->cnt_ptr through 2 loops.
Check how mib->cnt_ptr is set before port_r_ctr is called.
> I needed some amount of time to understand the segfault and to draw the
> call stack...
I'm sure you did.
> I am definitely not an expert for this driver. For starting/stopping the
> delayed work on demand, a separate work struct for each port could be useful.
> In this case, struct ksz_port would need a pointer to the ksz_device struct,
> as the ports are allocated seperately and container_of() cannot be used.
Me neither, I'm just a spectator.
> Using a bool variable has the property, that reading the MIB will not be
> performed "immediately" after phylink_mac_down(). But if I am correct, this
> is also not the case today as the work is typically already queued when
> ksz_mac_link_down() is executed.
>
> - First call of ksz_mac_link_down:
> Work is already queued (prior this patch) or will not be queued (after this
> patch).
>
> - Further calls:
> Work is already queued (it requeues itself).
>
> Result (please verify):
I can't verify this. Please ask the Microchip people. But the fix makes
sense.
> - Not scheduling the work in ksz_mac_link_down() won't change anything.
> - Checking for mib_read_interval in ksz_switch_remove() can be obmitted,
> as the condition is always true when ksz_switch_remove() is called.
If there's an error in the probe path, I expect that the
mib_read_interval will not get set, and the delayed workqueue will not
be scheduled, will it? So I think the check is ok there.
Thanks,
-Vladimir
next prev parent reply other threads:[~2020-10-06 16:21 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-06 15:56 [net v2] net: dsa: microchip: fix race condition Christian Eggers
2020-10-06 16:21 ` Vladimir Oltean [this message]
2020-10-06 16:24 ` Vladimir Oltean
2020-10-06 16:30 ` Christian Eggers
2020-10-06 16:57 ` Vladimir Oltean
2020-10-06 20:36 ` Jakub Kicinski
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20201006162125.ulftqdiufdxjesn7@skbuf \
--to=olteanv@gmail.com \
--cc=UNGLinuxDriver@microchip.com \
--cc=andrew@lunn.ch \
--cc=ceggers@arri.de \
--cc=davem@davemloft.net \
--cc=f.fainelli@gmail.com \
--cc=kuba@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=vivien.didelot@gmail.com \
--cc=woojung.huh@microchip.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox