From: Guenter Roeck <linux@roeck-us.net>
To: Jonas Johansson <jonasj76@gmail.com>
Cc: netdev@vger.kernel.org, stephen@networkplumber.org,
f.fainelli@gmail.com, jiri@resnulli.us, sfeldma@gmail.com
Subject: Re: Using a waiting MDIO does not go well with a spinlocked bridge
Date: Fri, 20 Mar 2015 23:32:05 -0700 [thread overview]
Message-ID: <20150321063205.GA29867@roeck-us.net> (raw)
In-Reply-To: <alpine.LNX.2.03.1503201321050.27020@hellgate.skynet>
On Fri, Mar 20, 2015 at 01:22:46PM +0100, Jonas Johansson wrote:
> The bridge code will sometimes hold a spinlock and the code following must
> therefore be atomic. If using a MDIO call which uses a wait/sleep in this
> contex, the kernel will not be very happy.
>
> I'm using a switch device and wants to flush its FDB when the linux bridge
> FDB is flushed. I've implemented some hooks for this task.
> In short:
> bridge - br_fdb_flush() & br_fdb_delete_by_port
> -> switchdev - switch_flush()
> -> dsa - slave_flush()
> -> mv88e6xxx - mv88_flush()
>
> So, when a bridge port is flushed via e.g. sysfs, the mv88_flush() function
> will at the end be called. The mv88_flush() will use MDIO calls to set the
> proper registers and flush the device. But, due to that the MDIO on my
> platform uses wait_for_completion() and a spinlock is held (in this case in
> brport_store()) the process will not go very well.
>
I happen to have similar code, though not (yet) submitted upstream.
> The only possible solutions that came into my mind is:
> 1) Let mv88_flush() schedule a work queue to take care of the flush
> later on.
That is my implementation.
> 2) Change the MDIO implementation to use polling.
> 3) Dont use spinlock in bridge code.
>
> 1) Using this approach the the atomic part is missed, i.e. the switch device
> isn't guaranteed to be flushed after the command has been issued. And, if a
> FDB entry is added (atomic) to the switch device immediately after the flush
> command, there will not be defined if the entry will be added before or
> after the flush occurs. To solve this, all (FDB) operations must be added to
> a work queue to assure that they are executed in the right order.
>
In my code I did not bother about this. Which may be why I didn't submit it
upstream ;-). One possible simplification might be to reject fdb operations
while a flush operation is pending (EAGAIN or EBUSY ?), though I don't know
if that is feasible.
> 2) This will result in unsued CPU cycles.
>
> 3) Havent looked into this, but probably a lot of work.
>
I don't think 2) or 3) are good solutions. Both want to change other
subsystems due to problems in the dsa code. If we start doing that we
would mess up the kernel all over the place. I think the solution has
to come from within DSA.
Thanks,
Guenter
prev parent reply other threads:[~2015-03-21 6:32 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-03-20 12:22 Using a waiting MDIO does not go well with a spinlocked bridge Jonas Johansson
2015-03-20 13:16 ` Andrew Lunn
2015-03-21 6:37 ` Guenter Roeck
2015-03-20 18:46 ` Scott Feldman
2015-03-23 6:45 ` Jiri Pirko
2015-03-23 15:42 ` Jonas Johansson
2015-03-23 18:37 ` Scott Feldman
2015-03-21 6:32 ` Guenter Roeck [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150321063205.GA29867@roeck-us.net \
--to=linux@roeck-us.net \
--cc=f.fainelli@gmail.com \
--cc=jiri@resnulli.us \
--cc=jonasj76@gmail.com \
--cc=netdev@vger.kernel.org \
--cc=sfeldma@gmail.com \
--cc=stephen@networkplumber.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).