From: Wengang Wang <wen.gang.wang@oracle.com>
To: zhuyj <zyjzyj2000@gmail.com>, netdev@vger.kernel.org
Subject: Re: [PATCH] net: take care of bonding in build_skb_flow_key
Date: Wed, 6 Jan 2016 15:45:11 +0800 [thread overview]
Message-ID: <568CC607.2040305@oracle.com> (raw)
In-Reply-To: <568CC3B2.9020106@gmail.com>
在 2016年01月06日 15:35, zhuyj 写道:
> IMHO, "The path MTU is set to the active slave device, not the bonding
> master."
> Can we set PMTU to bonding master when path MTU is set to the active
> slave device?
>
Actually the route is set on bonding master, not on any slave, the
trying to set PMTU to the active slave failed(it didn't found a route on
slave).
So "set PMTU to bonding master when path MTU is set to the active slave
device" is lacking a base.
thanks,
wengang
> If not appropriate, you can ignore it.
>
> Best Regards!
> Zhu Yanjun
>
> On 01/06/2016 03:06 PM, Wengang Wang wrote:
>> Hi Yanjun,
>>
>> Thanks for your review.
>> Master MTU is same as that for slaves.
>> Maybe fixing in bonding driver is a good idea, but I don't find a
>> good place to do that.
>> Let's go through the simplified follow:
>>
>> ...
>> 1) Fragmentation.
>> --This is is done is against the bonding master device(device MTU
>> and path MTU)
>> 2) bond_start_xmit
>> 3) ipoib_start_xmit(slaves are IPoIB interfaces)
>>
>> For the first send
>> 1) fragment size is 7000(in my case)
>> 2) bond_start_xmit its self is fine
>> 3) ipoib_start_xmit sees the packet size 7000 is larger than the
>> internal limit 2044, drops the packet and try to update PMTU.
>> without the patch, it tried update PMTU on slave device(no
>> changes to master).
>>
>> the seconds send comes, since no changes happen on bonding
>> master(PMTU), the fragment size is still 7000 and the behavior is
>> just the same as the first send.
>>
>> With the patch, the bonding master PMTU is changed to 2044 after the
>> first send(hopefully), for the seconds send the fragment size is set
>> to 2044.
>>
>> To fix in bonding code, I don't find where we can.
>>
>> thanks,
>> wengang
>>
>> 在 2016年01月06日 14:19, zhuyj 写道:
>>>
>>> IMHO, this should fix in bonding driver because the active slave mtu
>>> should be the same with the master.
>>> bonding master's mtu is changed to path MTU, then slave dev's MTU
>>> should be changed, too.
>>>
>>> Zhu Yanjun
>>> On 01/06/2016 01:49 PM, Wengang Wang wrote:
>>>> A problem is found that we are looking for route basing a bonding
>>>> device and
>>>> deal with path MTU there: The path MTU is set to the active slave
>>>> device, not
>>>> the bonding master.
>>>>
>>>> The patch tries to fix the issue by letting build_skb_flow_key()
>>>> take care
>>>> of the transition of device index from bonding slave to the master.
>>>>
>>>> Signed-off-by: Wengang Wang <wen.gang.wang@oracle.com>
>>>> ---
>>>> net/ipv4/route.c | 11 ++++++++++-
>>>> 1 file changed, 10 insertions(+), 1 deletion(-)
>>>>
>>>> diff --git a/net/ipv4/route.c b/net/ipv4/route.c
>>>> index 85f184e..3053f10 100644
>>>> --- a/net/ipv4/route.c
>>>> +++ b/net/ipv4/route.c
>>>> @@ -523,11 +523,20 @@ static void build_skb_flow_key(struct flowi4
>>>> *fl4, const struct sk_buff *skb,
>>>> const struct sock *sk)
>>>> {
>>>> const struct iphdr *iph = ip_hdr(skb);
>>>> - int oif = skb->dev->ifindex;
>>>> + int oif;
>>>> + struct net_device *master = NULL;
>>>> +
>>>> u8 tos = RT_TOS(iph->tos);
>>>> u8 prot = iph->protocol;
>>>> u32 mark = skb->mark;
>>>> + if (skb->dev->flags & IFF_SLAVE)
>>>> + master = netdev_master_upper_dev_get(skb->dev);
>>>> + if (master)
>>>> + oif = master->ifindex;
>>>> + else
>>>> + oif = skb->dev->ifindex;
>>>> +
>>>> __build_flow_key(fl4, sk, iph, oif, tos, prot, mark, 0);
>>>> }
>>>
>>
>
next prev parent reply other threads:[~2016-01-06 7:41 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-01-06 5:49 [PATCH] net: take care of bonding in build_skb_flow_key Wengang Wang
2016-01-06 6:18 ` David Miller
2016-01-06 6:32 ` Wengang Wang
2016-01-06 6:44 ` zhuyj
2016-01-06 7:11 ` Wengang Wang
2016-01-06 6:19 ` zhuyj
2016-01-06 7:06 ` Wengang Wang
2016-01-06 7:35 ` zhuyj
2016-01-06 7:45 ` Wengang Wang [this message]
2016-01-06 8:00 ` zhuyj
2016-01-06 8:14 ` Wengang Wang
2016-01-06 8:18 ` zhuyj
2016-01-06 9:04 ` Wengang Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=568CC607.2040305@oracle.com \
--to=wen.gang.wang@oracle.com \
--cc=netdev@vger.kernel.org \
--cc=zyjzyj2000@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).