From: Fan Du <fan.du@windriver.com>
To: Vlad Yasevich <vyasevich@gmail.com>
Cc: <nhorman@tuxdriver.com>, <nicolas.dichtel@6wind.com>,
<davem@davemloft.net>, <netdev@vger.kernel.org>
Subject: Re: [RFC PATCH] sctp: Don't lookup dst if transport dst is still valid
Date: Thu, 4 Jul 2013 10:33:35 +0800 [thread overview]
Message-ID: <51D4DEFF.8030305@windriver.com> (raw)
In-Reply-To: <51D425B7.20204@gmail.com>
On 2013年07月03日 21:23, Vlad Yasevich wrote:
> On 07/02/2013 10:18 PM, Fan Du wrote:
>>
>>
>> On 2013年07月02日 22:29, Vlad Yasevich wrote:
>>> On 07/02/2013 02:39 AM, Fan Du wrote:
>>>> When sctp sits on IPv6, sctp_transport_dst_check pass cookie as ZERO,
>>>> as a result ip6_dst_check always fail out. This behaviour makes
>>>> transport->dst useless, because every sctp_packet_transmit must look
>>>> for valid dst(Is this what supposed to be?)
>>>>
>>>> One aggressive way is to call rt_genid_bump which invalid all dst to
>>>> make new dst for transport, apparently it also hurts others.
>>>> I'm sure this may not be the best for all, so any commnets?
>>>>
>>>> Signed-off-by: Fan Du <fan.du@windriver.com>
>>>> ---
>>>> include/net/sctp/sctp.h | 18 ++++++++++++------
>>>> net/sctp/ipv6.c | 2 ++
>>>> 2 files changed, 14 insertions(+), 6 deletions(-)
>>>>
>>>> diff --git a/include/net/sctp/sctp.h b/include/net/sctp/sctp.h
>>>> index cd89510..f05af01 100644
>>>> --- a/include/net/sctp/sctp.h
>>>> +++ b/include/net/sctp/sctp.h
>>>> @@ -719,14 +719,20 @@ static inline void sctp_v4_map_v6(union
>>>> sctp_addr *addr)
>>>> addr->v6.sin6_addr.s6_addr32[2] = htonl(0x0000ffff);
>>>> }
>>>>
>>>> -/* The cookie is always 0 since this is how it's used in the
>>>> - * pmtu code.
>>>> - */
>>>> +/* Set cookie with the right one for IPv6 and zero for others */
>>>> static inline struct dst_entry *sctp_transport_dst_check(struct
>>>> sctp_transport *t)
>>>> {
>>>> - if (t->dst && !dst_check(t->dst, 0)) {
>>>> - dst_release(t->dst);
>>>> - t->dst = NULL;
>>>> +
>>>> + if (t->dst) {
>>>> + struct rt6_info *rt = (struct rt6_info *)t->dst;
>>>> + u32 cookie = 0;
>>>> +
>>>> + if ((t->af_specific->sa_family == AF_INET6) && rt->rt6i_node)
>>>> + cookie = rt->rt6i_node->fn_sernum;
>>>> + if (!dst_check(t->dst, cookie)) {
>>>> + dst_release(t->dst);
>>>> + t->dst = NULL;
>>>> + }
>>>> }
>>>
>>> I think it would be better if we stored the dst_cookie in the
>>> transport structure and initialized it at lookup time. If you do that,
>>> then if the route table changes, we'd correctly detect it without
>>> artificially bumping rt_genid (and hurting ipv4).
>>
>> Hi Vlad/Neil
>>
>> Is this what you mean?
>
> Yes, exactly.
>
Hi Vlad
I thinks twice about below patch, this is actually a chicken-egg issue.
Look below scenario:
(1) The first time we push packet through a transport, dst_cookie is 0,
so sctp_transport_dst_check also pass cookie as 0, then return dst as NULL.
Then we lookup dst by sctp_transport_route, and in there we initiate dst_cookie
with rt->rt6i_node->fn_sernum
(2) Then the next time we push packet through this transport again,
we pass dst_cookie(rt->rt6i_node->fn_sernum) to ip6_dst_check, and
return valid dst without bothering to lookup dst again.
BUT, suppose when deleting the source address of this dst after transport->dst_cookie
has been well initialized. transport->dst_cookie still holds rt->rt6i_node->fn_sernum,
meaning ip6_dst_check will return valid dst, which it shouldn't in this case, the
result will be association ABORT.
Other way is invalid all transport->dst which using the deleting address as source address
without bumping gen_id, problem is the traverse times depends heavily on transport number,
and also need to take account locking issue it will introduce.
>
> No, you are not missing anything. IPv4 doesn't use the cookie and always seems to pass it as 0.
>
> Yes, ipv4 will bump the gen_id thus invalidating all routes (there has been disagreement about it).
> IPv6 doesn't do that. In ipv6, when the addresses are added or removed, routes are also added or removed and
> any time the route is added it will have a new serial number. So, you don't have to disturb ipv4 cache when ipv6 routing info changes.
Thank you very much for your explanation!
IPv6 don't bump gen_id, when adding/deleting address, and tag an serial number with each route.
Doing this way loose the semantic of dst_check, because SCTP depends no dst_check fulfill its
duty to actually check whether the holding dst is still valid, well most other Layer 4 protocol
simply rely on ip6_route_output/ip6_dst_lookup_flow to grab dst every time sending data out.
So please pronounce a final judgment.
> -vlad
>
>>
>> diff --git a/include/net/sctp/sctp.h b/include/net/sctp/sctp.h
>> index cd89510..0a646a5 100644
>> --- a/include/net/sctp/sctp.h
>> +++ b/include/net/sctp/sctp.h
>> @@ -724,7 +724,7 @@ static inline void sctp_v4_map_v6(union sctp_addr
>> *addr)
>> */
>> static inline struct dst_entry *sctp_transport_dst_check(struct
>> sctp_transport *t)
>> {
>> - if (t->dst && !dst_check(t->dst, 0)) {
>> + if (t->dst && !dst_check(t->dst, t->dst_cookie)) {
>> dst_release(t->dst);
>> t->dst = NULL;
>> }
>> diff --git a/include/net/sctp/structs.h b/include/net/sctp/structs.h
>> index 1bd4c41..cafdd19 100644
>> --- a/include/net/sctp/structs.h
>> +++ b/include/net/sctp/structs.h
>> @@ -946,6 +946,8 @@ struct sctp_transport {
>> __u64 hb_nonce;
>>
>> struct rcu_head rcu;
>> +
>> + u32 dst_cookie;
>> };
>>
>> struct sctp_transport *sctp_transport_new(struct net *, const union
>> sctp_addr *,
>> diff --git a/net/sctp/ipv6.c b/net/sctp/ipv6.c
>> index 8ee553b..82a420f 100644
>> --- a/net/sctp/ipv6.c
>> +++ b/net/sctp/ipv6.c
>> @@ -350,6 +350,7 @@ out:
>> struct rt6_info *rt;
>> rt = (struct rt6_info *)dst;
>> t->dst = dst;
>> + t->dst_cookie = rt->rt6i_node ? rt->rt6i_node->fn_sernum
>> : 0;
>> SCTP_DEBUG_PRINTK("rt6_dst:%pI6 rt6_src:%pI6\n",
>> &rt->rt6i_dst.addr, &fl6->saddr);
>> } else {
>>
>>
>>
>>
>>
>>> -vlad
>>>
>>>>
>>>> return t->dst;
>>>> diff --git a/net/sctp/ipv6.c b/net/sctp/ipv6.c
>>>> index 8ee553b..cfae77e 100644
>>>> --- a/net/sctp/ipv6.c
>>>> +++ b/net/sctp/ipv6.c
>>>> @@ -137,6 +137,8 @@ static int sctp_inet6addr_event(struct
>>>> notifier_block *this, unsigned long ev,
>>>> break;
>>>> }
>>>>
>>>> + /* invalid all transport dst forcing to look up new dst */
>>>> + rt_genid_bump(net);
>>>> return NOTIFY_DONE;
>>>> }
>>>>
>>>>
>>>
>>>
>>
>
>
--
浮沉随浪只记今朝笑
--fan
next prev parent reply other threads:[~2013-07-04 2:35 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-07-02 6:39 [RFC PATCH] sctp: Don't lookup dst if transport dst is still valid Fan Du
2013-07-02 14:29 ` Vlad Yasevich
2013-07-02 15:55 ` Neil Horman
2013-07-03 2:39 ` Fan Du
2013-07-03 13:48 ` Vlad Yasevich
2013-07-02 19:47 ` David Miller
2013-07-03 2:18 ` Fan Du
2013-07-03 13:23 ` Vlad Yasevich
2013-07-03 14:11 ` Vlad Yasevich
2013-07-04 2:33 ` Fan Du [this message]
2013-07-05 13:03 ` Neil Horman
2013-07-09 7:11 ` Fan Du
2013-07-09 11:38 ` Neil Horman
2013-07-05 14:09 ` Vlad Yasevich
2013-07-09 15:11 ` Vlad Yasevich
2013-07-10 5:26 ` Fan Du
2013-07-12 11:19 ` Neil Horman
2013-07-16 9:13 ` Fan Du
2013-07-13 12:21 ` Vlad Yasevich
2013-07-17 7:04 ` Fan Du
2013-07-12 3:15 ` Fan Du
2013-07-12 22:58 ` David Miller
2013-07-13 12:18 ` Vlad Yasevich
2013-07-16 9:58 ` Fan Du
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51D4DEFF.8030305@windriver.com \
--to=fan.du@windriver.com \
--cc=davem@davemloft.net \
--cc=netdev@vger.kernel.org \
--cc=nhorman@tuxdriver.com \
--cc=nicolas.dichtel@6wind.com \
--cc=vyasevich@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).