From: Stephen Hemminger <shemminger@linux-foundation.org>
To: Jay Vosburgh <fubar@us.ibm.com>
Cc: Cong Wang <amwang@redhat.com>,
Neil Horman <nhorman@tuxdriver.com>,
Jeff, netdev@vger.kernel.org, Matt Mackall <mpm@selenic.com>,
bridge@lists.linux-foundation.org, linux-kernel@vger.kernel.org,
David Miller <davem@davemloft.net>, Moyer <jmoyer@redhat.com>,
Andy, Gospodarek <gospo@redhat.com>,
Eric Dumazet <eric.dumazet@gmail.com>,
bonding-devel@lists.sourceforge.net
Subject: Re: [Bridge] [Bonding-devel] [v3 Patch 2/3] bridge: make bridge support netpoll
Date: Tue, 13 Apr 2010 10:33:20 -0700 [thread overview]
Message-ID: <20100413103320.11a2a4f7@nehalam> (raw)
In-Reply-To: <8304.1271177567@death.nxdomain.ibm.com>
On Tue, 13 Apr 2010 09:52:47 -0700
Jay Vosburgh <fubar@us.ibm.com> wrote:
> Cong Wang <amwang@redhat.com> wrote:
>
> >Stephen Hemminger wrote:
> >> On Mon, 12 Apr 2010 12:38:57 +0200
> >> Eric Dumazet <eric.dumazet@gmail.com> wrote:
> >>
> >>> Le lundi 12 avril 2010 à 18:37 +0800, Cong Wang a écrit :
> >>>> Stephen Hemminger wrote:
> >>>>> There is no protection on dev->priv_flags for SMP access.
> >>>>> It would better bit value in dev->state if you are using it as control flag.
> >>>>>
> >>>>> Then you could use
> >>>>> if (unlikely(test_and_clear_bit(__IN_NETPOLL, &skb->dev->state)))
> >>>>> netpoll_send_skb(...)
> >>>>>
> >>>>>
> >>>> Hmm, I think we can't use ->state here, it is not for this kind of purpose,
> >>>> according to its comments.
> >>>>
> >>>> Also, I find other usages of IFF_XXX flags of ->priv_flags are also using
> >>>> &, | to set or clear the flags. So there must be some other things preventing
> >>>> the race...
> >>> Yes, its RTNL that protects priv_flags changes, hopefully...
> >>>
> >>
> >> The patch was not protecting priv_flags with RTNL.
> >> For example..
> >>
> >>
> >> @@ -308,7 +312,9 @@ static void netpoll_send_skb(struct netp
> >> tries > 0; --tries) {
> >> if (__netif_tx_trylock(txq)) {
> >> if (!netif_tx_queue_stopped(txq)) {
> >> + dev->priv_flags |= IFF_IN_NETPOLL;
> >> status = ops->ndo_start_xmit(skb, dev);
> >> + dev->priv_flags &= ~IFF_IN_NETPOLL;
> >> if (status == NETDEV_TX_OK)
> >> txq_trans_update(txq);
> >
> >Hmm, but I checked the bonding case (IFF_BONDING), it doesn't
> >hold rtnl_lock. Strange.
>
> I looked, and there are a couple of cases in bonding that don't
> have RTNL for adjusting priv_flags (in bond_ab_arp_probe when no slaves
> are up, and a couple of cases in 802.3ad). I think the solution there
> is to move bonding away from priv_flags for some of this (e.g., convert
> bonding to use a frame hook like bridge and macvlan, and greatly
> simplify skb_bond_should_drop), but that's a separate topic.
>
> The majority of the cases, however, do hold RTNL. Bonding
> generally doesn't have to acquire RTNL itself, since whatever called
> into bonding is holding it already. For example, the slave add and
> remove paths (bond_enslave, bond_release) are called either via sysfs or
> ioctl, both of which acquire RTNL. All of the set and clear operations
> for IFF_BONDING fall into this category; look at bonding_store_slaves
> for an example.
>
> Bonding does acquire RTNL itself when performing failovers,
> e.g., bond_mii_monitor holds RTNL prior to calling bond_miimon_commit,
> which will change priv_flags.
>
All this was related to netpoll. And netpoll processing often needs to occur
in hard IRQ context. Therefor netpoll stuff and RTNL (which is a mutex),
really don't mix well. Keep RTNL for what it was meant for network
reconfiguration. Don't turn it into a network special BKL.
--
WARNING: multiple messages have this Message-ID (diff)
From: Stephen Hemminger <shemminger@linux-foundation.org>
To: Jay Vosburgh <fubar@us.ibm.com>
Cc: Cong Wang <amwang@redhat.com>,
Eric Dumazet <eric.dumazet@gmail.com>,
Neil Horman <nhorman@tuxdriver.com>,
netdev@vger.kernel.org, Andy Gospodarek <gospo@redhat.com>,
bridge@lists.linux-foundation.org, linux-kernel@vger.kernel.org,
bonding-devel@lists.sourceforge.net,
Jeff Moyer <jmoyer@redhat.com>, Matt Mackall <mpm@selenic.com>,
David Miller <davem@davemloft.net>
Subject: Re: [Bonding-devel] [v3 Patch 2/3] bridge: make bridge support netpoll
Date: Tue, 13 Apr 2010 10:33:20 -0700 [thread overview]
Message-ID: <20100413103320.11a2a4f7@nehalam> (raw)
In-Reply-To: <8304.1271177567@death.nxdomain.ibm.com>
On Tue, 13 Apr 2010 09:52:47 -0700
Jay Vosburgh <fubar@us.ibm.com> wrote:
> Cong Wang <amwang@redhat.com> wrote:
>
> >Stephen Hemminger wrote:
> >> On Mon, 12 Apr 2010 12:38:57 +0200
> >> Eric Dumazet <eric.dumazet@gmail.com> wrote:
> >>
> >>> Le lundi 12 avril 2010 à 18:37 +0800, Cong Wang a écrit :
> >>>> Stephen Hemminger wrote:
> >>>>> There is no protection on dev->priv_flags for SMP access.
> >>>>> It would better bit value in dev->state if you are using it as control flag.
> >>>>>
> >>>>> Then you could use
> >>>>> if (unlikely(test_and_clear_bit(__IN_NETPOLL, &skb->dev->state)))
> >>>>> netpoll_send_skb(...)
> >>>>>
> >>>>>
> >>>> Hmm, I think we can't use ->state here, it is not for this kind of purpose,
> >>>> according to its comments.
> >>>>
> >>>> Also, I find other usages of IFF_XXX flags of ->priv_flags are also using
> >>>> &, | to set or clear the flags. So there must be some other things preventing
> >>>> the race...
> >>> Yes, its RTNL that protects priv_flags changes, hopefully...
> >>>
> >>
> >> The patch was not protecting priv_flags with RTNL.
> >> For example..
> >>
> >>
> >> @@ -308,7 +312,9 @@ static void netpoll_send_skb(struct netp
> >> tries > 0; --tries) {
> >> if (__netif_tx_trylock(txq)) {
> >> if (!netif_tx_queue_stopped(txq)) {
> >> + dev->priv_flags |= IFF_IN_NETPOLL;
> >> status = ops->ndo_start_xmit(skb, dev);
> >> + dev->priv_flags &= ~IFF_IN_NETPOLL;
> >> if (status == NETDEV_TX_OK)
> >> txq_trans_update(txq);
> >
> >Hmm, but I checked the bonding case (IFF_BONDING), it doesn't
> >hold rtnl_lock. Strange.
>
> I looked, and there are a couple of cases in bonding that don't
> have RTNL for adjusting priv_flags (in bond_ab_arp_probe when no slaves
> are up, and a couple of cases in 802.3ad). I think the solution there
> is to move bonding away from priv_flags for some of this (e.g., convert
> bonding to use a frame hook like bridge and macvlan, and greatly
> simplify skb_bond_should_drop), but that's a separate topic.
>
> The majority of the cases, however, do hold RTNL. Bonding
> generally doesn't have to acquire RTNL itself, since whatever called
> into bonding is holding it already. For example, the slave add and
> remove paths (bond_enslave, bond_release) are called either via sysfs or
> ioctl, both of which acquire RTNL. All of the set and clear operations
> for IFF_BONDING fall into this category; look at bonding_store_slaves
> for an example.
>
> Bonding does acquire RTNL itself when performing failovers,
> e.g., bond_mii_monitor holds RTNL prior to calling bond_miimon_commit,
> which will change priv_flags.
>
All this was related to netpoll. And netpoll processing often needs to occur
in hard IRQ context. Therefor netpoll stuff and RTNL (which is a mutex),
really don't mix well. Keep RTNL for what it was meant for network
reconfiguration. Don't turn it into a network special BKL.
--
next prev parent reply other threads:[~2010-04-13 17:33 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-08 6:18 [Bridge] [v3 Patch 1/3] netpoll: add generic support for bridge and bonding devices Amerigo Wang
2010-04-08 6:18 ` Amerigo Wang
2010-04-08 6:18 ` [Bridge] [v3 Patch 2/3] bridge: make bridge support netpoll Amerigo Wang
2010-04-08 6:18 ` Amerigo Wang
2010-04-08 15:37 ` [Bridge] " Stephen Hemminger
2010-04-08 15:37 ` Stephen Hemminger
2010-04-09 5:43 ` [Bridge] " Cong Wang
2010-04-09 5:43 ` Cong Wang
2010-04-12 10:37 ` [Bridge] " Cong Wang
2010-04-12 10:37 ` Cong Wang
2010-04-12 10:38 ` [Bridge] " Eric Dumazet
2010-04-12 10:38 ` Eric Dumazet
2010-04-12 15:38 ` [Bridge] [Bonding-devel] " Stephen Hemminger
2010-04-12 15:38 ` Stephen Hemminger
2010-04-13 8:57 ` [Bridge] " Cong Wang
2010-04-13 8:57 ` Cong Wang
2010-04-13 16:52 ` [Bridge] " Jay Vosburgh
2010-04-13 16:52 ` Jay Vosburgh
2010-04-13 17:33 ` Stephen Hemminger [this message]
2010-04-13 17:33 ` Stephen Hemminger
2010-04-14 8:16 ` [Bridge] " Cong Wang
2010-04-14 8:16 ` Cong Wang
2010-04-14 8:11 ` [Bridge] " Cong Wang
2010-04-14 8:11 ` Cong Wang
2010-04-08 6:19 ` [Bridge] [v3 Patch 3/3] bonding: make bonding " Amerigo Wang
2010-04-08 6:19 ` Amerigo Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100413103320.11a2a4f7@nehalam \
--to=shemminger@linux-foundation.org \
--cc=amwang@redhat.com \
--cc=bonding-devel@lists.sourceforge.net \
--cc=bridge@lists.linux-foundation.org \
--cc=davem@davemloft.net \
--cc=eric.dumazet@gmail.com \
--cc=fubar@us.ibm.com \
--cc=gospo@redhat.com \
--cc=jmoyer@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mpm@selenic.com \
--cc=netdev@vger.kernel.org \
--cc=nhorman@tuxdriver.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.