netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Wengang Wang <wen.gang.wang@oracle.com>
To: zhuyj <zyjzyj2000@gmail.com>, netdev@vger.kernel.org
Subject: Re: [PATCH] net: take care of bonding in build_skb_flow_key
Date: Wed, 6 Jan 2016 15:06:18 +0800	[thread overview]
Message-ID: <568CBCEA.20702@oracle.com> (raw)
In-Reply-To: <568CB1FD.4030907@gmail.com>

Hi Yanjun,

Thanks for your review.
Master MTU is same as that for slaves.
Maybe fixing in bonding driver is a good idea, but I don't find a good 
place to do that.
Let's go through the simplified follow:

...
1) Fragmentation.
    --This is is done is against the bonding master device(device MTU 
and path MTU)
2) bond_start_xmit
3) ipoib_start_xmit(slaves are IPoIB interfaces)

For the first send
1) fragment size is 7000(in my case)
2) bond_start_xmit its self is fine
3) ipoib_start_xmit sees the packet size 7000 is larger than the 
internal limit 2044, drops the packet and try to update PMTU.
     without the patch, it tried update PMTU on slave device(no changes 
to master).

the seconds send comes, since no changes happen on bonding master(PMTU), 
the fragment size is still 7000 and the behavior is just the same as the 
first send.

With the patch, the bonding master PMTU is changed to 2044 after the 
first send(hopefully), for the seconds send the fragment size is set to 
2044.

To fix in bonding code, I don't find where we can.

thanks,
wengang

在 2016年01月06日 14:19, zhuyj 写道:
>
> IMHO, this should fix in bonding driver because the active slave mtu 
> should be the same with the master.
> bonding master's mtu is changed to path MTU, then slave dev's MTU 
> should be changed, too.
>
> Zhu Yanjun
> On 01/06/2016 01:49 PM, Wengang Wang wrote:
>> A problem is found that we are looking for route basing a bonding 
>> device and
>> deal with path MTU there: The path MTU is set to the active slave 
>> device, not
>> the bonding master.
>>
>> The patch tries to fix the issue by letting build_skb_flow_key() take 
>> care
>> of the transition of device index from bonding slave to the master.
>>
>> Signed-off-by: Wengang Wang <wen.gang.wang@oracle.com>
>> ---
>>   net/ipv4/route.c | 11 ++++++++++-
>>   1 file changed, 10 insertions(+), 1 deletion(-)
>>
>> diff --git a/net/ipv4/route.c b/net/ipv4/route.c
>> index 85f184e..3053f10 100644
>> --- a/net/ipv4/route.c
>> +++ b/net/ipv4/route.c
>> @@ -523,11 +523,20 @@ static void build_skb_flow_key(struct flowi4 
>> *fl4, const struct sk_buff *skb,
>>                      const struct sock *sk)
>>   {
>>       const struct iphdr *iph = ip_hdr(skb);
>> -    int oif = skb->dev->ifindex;
>> +    int oif;
>> +    struct net_device *master = NULL;
>> +
>>       u8 tos = RT_TOS(iph->tos);
>>       u8 prot = iph->protocol;
>>       u32 mark = skb->mark;
>>   +    if (skb->dev->flags & IFF_SLAVE)
>> +        master = netdev_master_upper_dev_get(skb->dev);
>> +    if (master)
>> +        oif = master->ifindex;
>> +    else
>> +        oif = skb->dev->ifindex;
>> +
>>       __build_flow_key(fl4, sk, iph, oif, tos, prot, mark, 0);
>>   }
>

  reply	other threads:[~2016-01-06  7:02 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-01-06  5:49 [PATCH] net: take care of bonding in build_skb_flow_key Wengang Wang
2016-01-06  6:18 ` David Miller
2016-01-06  6:32   ` Wengang Wang
2016-01-06  6:44     ` zhuyj
2016-01-06  7:11       ` Wengang Wang
2016-01-06  6:19 ` zhuyj
2016-01-06  7:06   ` Wengang Wang [this message]
2016-01-06  7:35     ` zhuyj
2016-01-06  7:45       ` Wengang Wang
2016-01-06  8:00         ` zhuyj
2016-01-06  8:14           ` Wengang Wang
2016-01-06  8:18             ` zhuyj
2016-01-06  9:04               ` Wengang Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=568CBCEA.20702@oracle.com \
    --to=wen.gang.wang@oracle.com \
    --cc=netdev@vger.kernel.org \
    --cc=zyjzyj2000@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).