From: Alexey Kodanev <alexey.kodanev@oracle.com>
To: Steffen Klassert <steffen.klassert@secunet.com>
Cc: netdev@vger.kernel.org, David Miller <davem@davemloft.net>,
Petr Vorel <pvorel@suse.cz>
Subject: Re: [PATCH net-next] ip6_vti: adjust vti mtu according to mtu of output device
Date: Fri, 8 Dec 2017 14:54:46 +0300 [thread overview]
Message-ID: <1d086bbe-8d90-628f-4622-2fdb5fb1405e@oracle.com> (raw)
In-Reply-To: <20171208070215.nycywbhkd4u7twh7@gauss3.secunet.de>
On 12/08/2017 10:02 AM, Steffen Klassert wrote:
> On Wed, Dec 06, 2017 at 07:38:19PM +0300, Alexey Kodanev wrote:
>> LTP/udp6_ipsec_vti tests fail when sending large UDP datagrams
>> that require fragmentation and underlying device MTU <= 1500.
>> This happens because ip6_vti sets mtu to ETH_DATA_LEN and not
>> updating it depending on a destiantion address.
>>
>> Futhure attempts to send UDP packets may succeed because pmtu
>> get updated on ICMPV6_PKT_TOOBIG in vti6_err().
>>
>> Here is the example when output device MTU set to 9000:
>>
>> # ip a sh ltp_ns_veth2
>> ltp_ns_veth2@if7: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9000 ...
>> inet 10.0.0.2/24 scope global ltp_ns_veth2
>> inet6 fd00::2/64 scope global
>> ...
>> # ip li add vti6 type vti6 local fd00::2 remote fd00::1
>> # ip li show vti6
>> vti6@NONE: <POINTOPOINT,NOARP> mtu 1500 ...
>> link/tunnel6 fd00::2 peer fd00::1
>>
>> After the patch:
>>
>> # ip li add vti6 type vti6 local fd00::2 remote fd00::1
>> # ip li show vti6
>> vti6@NONE: <POINTOPOINT,NOARP> mtu 8832 ...
>> link/tunnel6 fd00::2 peer fd00::1
>>
>> Regarding ip_vti, it already tunes mtu with ip_tunnel_bind_dev():
>>
>> # ip li add vti4 type vti local 10.0.0.2 remote 10.0.0.1
>> # ip li sh vti4
>> vti4@NONE: <POINTOPOINT,NOARP> mtu 8832 ...
>> link/ipip 10.0.0.2 peer 10.0.0.1
>>
>> Reported-by: Petr Vorel <pvorel@suse.cz>
>> Signed-off-by: Alexey Kodanev <alexey.kodanev@oracle.com>
>> ---
>>
>> ip6_vti mtu offset is the same (168) as in ip_vti because ip_vti
>> offset includes two sizes of struct iphdr: in dev->hard_header_len
>> and in t_hlen in ip_tunnel_bind_dev(). I'm not sure if it's correct.
>>
>> net/ipv6/ip6_vti.c | 18 ++++++++++++++++++
>> 1 files changed, 18 insertions(+), 0 deletions(-)
>>
>> diff --git a/net/ipv6/ip6_vti.c b/net/ipv6/ip6_vti.c
>> index dbb74f3..47e6464 100644
>> --- a/net/ipv6/ip6_vti.c
>> +++ b/net/ipv6/ip6_vti.c
>> @@ -638,6 +638,24 @@ static void vti6_link_config(struct ip6_tnl *t)
>> dev->flags |= IFF_POINTOPOINT;
>> else
>> dev->flags &= ~IFF_POINTOPOINT;
>> +
>> + if (p->flags & IP6_TNL_F_CAP_XMIT) {
>> + int strict = (ipv6_addr_type(&p->raddr) &
>> + (IPV6_ADDR_MULTICAST | IPV6_ADDR_LINKLOCAL));
>> +
>> + struct rt6_info *rt = rt6_lookup(t->net,
>> + &p->raddr, &p->laddr,
>> + p->link, strict);
>> +
>> + if (!rt)
>> + return;
>> +
>> + if (rt->dst.dev) {
>> + dev->mtu = max(rt->dst.dev->mtu - dev->hard_header_len,
>> + IPV6_MIN_MTU);
>
> Hm, I'm gettting this when compiling with your patch:
>
> In file included from /home/klassert/git/ipsec-next/include/linux/list.h:9:0,
> from /home/klassert/git/ipsec-next/include/linux/module.h:9,
> from /home/klassert/git/ipsec-next/net/ipv6/ip6_vti.c:18:
> /home/klassert/git/ipsec-next/net/ipv6/ip6_vti.c: In function ‘vti6_link_config’:
> /home/klassert/git/ipsec-next/include/linux/kernel.h:808:16: warning: comparison of distinct pointer types lacks a cast
> (void) (&max1 == &max2); \
> ^
> /home/klassert/git/ipsec-next/include/linux/kernel.h:817:2: note: in expansion of macro ‘__max’
> __max(typeof(x), typeof(y), \
> ^~~~~
> /home/klassert/git/ipsec-next/net/ipv6/ip6_vti.c:654:15: note: in expansion of macro ‘max’
> dev->mtu = max(rt->dst.dev->mtu - dev->hard_header_len,
>
rt->dst.dev->mtu and dev->hard_header_len are both unsigned and
IPV6_MIN_MTU considered as int, I guess IPV6_MIN_MTU can be changed
to dev->min_mtu as it is set to the same value in setup, but checking
in the way it is done in ip6_tnl_link_config() looks better.
I'll send 2nd version.
Thanks,
Alexey
next prev parent reply other threads:[~2017-12-08 11:45 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-06 16:38 [PATCH net-next] ip6_vti: adjust vti mtu according to mtu of output device Alexey Kodanev
2017-12-08 7:02 ` Steffen Klassert
2017-12-08 11:54 ` Alexey Kodanev [this message]
2017-12-08 23:25 ` Shannon Nelson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1d086bbe-8d90-628f-4622-2fdb5fb1405e@oracle.com \
--to=alexey.kodanev@oracle.com \
--cc=davem@davemloft.net \
--cc=netdev@vger.kernel.org \
--cc=pvorel@suse.cz \
--cc=steffen.klassert@secunet.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).