netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
@ 2022-02-01 18:46 Eric Dumazet
  2022-02-01 18:47 ` Soheil Hassas Yeganeh
  2022-02-03  0:40 ` patchwork-bot+netdevbpf
  0 siblings, 2 replies; 5+ messages in thread
From: Eric Dumazet @ 2022-02-01 18:46 UTC (permalink / raw)
  To: David S . Miller, Jakub Kicinski
  Cc: netdev, Eric Dumazet, Eric Dumazet, Paolo Abeni, Mat Martineau,
	Talal Ahmad, Arjun Roy, Soheil Hassas Yeganeh, Willem de Bruijn

From: Eric Dumazet <edumazet@google.com>

tcp_shift_skb_data() might collapse three packets into a larger one.

P_A, P_B, P_C  -> P_ABC

Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
because it was enough.

In commit 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions"),
this call was replaced by a call to tcp_skb_can_collapse(P_A, P_B)

But the now needed test over P_C has been missed.

This probably broke MPTCP.

Then later, commit 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
added an extra condition to tcp_skb_can_collapse(), but the missing call
from tcp_shift_skb_data() is also breaking TCP zerocopy, because P_A and P_C
might have different skb_zcopy_pure() status.

Fixes: 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions")
Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Paolo Abeni <pabeni@redhat.com>
Cc: Mat Martineau <mathew.j.martineau@linux.intel.com>
Cc: Talal Ahmad <talalahmad@google.com>
Cc: Arjun Roy <arjunroy@google.com>
Cc: Soheil Hassas Yeganeh <soheil@google.com>
Cc: Willem de Bruijn <willemb@google.com>
---
 net/ipv4/tcp_input.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/net/ipv4/tcp_input.c b/net/ipv4/tcp_input.c
index dc49a3d551eb919baf5ad812ef21698c5c7b9679..bfe4112e000c09ba9d7d8b64392f52337b9053e9 100644
--- a/net/ipv4/tcp_input.c
+++ b/net/ipv4/tcp_input.c
@@ -1660,6 +1660,8 @@ static struct sk_buff *tcp_shift_skb_data(struct sock *sk, struct sk_buff *skb,
 	    (mss != tcp_skb_seglen(skb)))
 		goto out;
 
+	if (!tcp_skb_can_collapse(prev, skb))
+		goto out;
 	len = skb->len;
 	pcount = tcp_skb_pcount(skb);
 	if (tcp_skb_shift(prev, skb, pcount, len))
-- 
2.35.0.rc2.247.g8bbb082509-goog


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
  2022-02-01 18:46 [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data() Eric Dumazet
@ 2022-02-01 18:47 ` Soheil Hassas Yeganeh
  2022-02-01 20:01   ` Mat Martineau
  2022-02-03  0:40 ` patchwork-bot+netdevbpf
  1 sibling, 1 reply; 5+ messages in thread
From: Soheil Hassas Yeganeh @ 2022-02-01 18:47 UTC (permalink / raw)
  To: Eric Dumazet
  Cc: David S . Miller, Jakub Kicinski, netdev, Eric Dumazet,
	Paolo Abeni, Mat Martineau, Talal Ahmad, Arjun Roy,
	Willem de Bruijn

On Tue, Feb 1, 2022 at 1:46 PM Eric Dumazet <eric.dumazet@gmail.com> wrote:
>
> From: Eric Dumazet <edumazet@google.com>
>
> tcp_shift_skb_data() might collapse three packets into a larger one.
>
> P_A, P_B, P_C  -> P_ABC
>
> Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
> because it was enough.
>
> In commit 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions"),
> this call was replaced by a call to tcp_skb_can_collapse(P_A, P_B)
>
> But the now needed test over P_C has been missed.
>
> This probably broke MPTCP.
>
> Then later, commit 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
> added an extra condition to tcp_skb_can_collapse(), but the missing call
> from tcp_shift_skb_data() is also breaking TCP zerocopy, because P_A and P_C
> might have different skb_zcopy_pure() status.
>
> Fixes: 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions")
> Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Paolo Abeni <pabeni@redhat.com>
> Cc: Mat Martineau <mathew.j.martineau@linux.intel.com>
> Cc: Talal Ahmad <talalahmad@google.com>
> Cc: Arjun Roy <arjunroy@google.com>
> Cc: Soheil Hassas Yeganeh <soheil@google.com>
> Cc: Willem de Bruijn <willemb@google.com>

Acked-by: Soheil Hassas Yeganeh <soheil@google.com>

I wish there were some packetdrill tests for MPTCP. Thank you for the fix!

> ---
>  net/ipv4/tcp_input.c | 2 ++
>  1 file changed, 2 insertions(+)
>
> diff --git a/net/ipv4/tcp_input.c b/net/ipv4/tcp_input.c
> index dc49a3d551eb919baf5ad812ef21698c5c7b9679..bfe4112e000c09ba9d7d8b64392f52337b9053e9 100644
> --- a/net/ipv4/tcp_input.c
> +++ b/net/ipv4/tcp_input.c
> @@ -1660,6 +1660,8 @@ static struct sk_buff *tcp_shift_skb_data(struct sock *sk, struct sk_buff *skb,
>             (mss != tcp_skb_seglen(skb)))
>                 goto out;
>
> +       if (!tcp_skb_can_collapse(prev, skb))
> +               goto out;
>         len = skb->len;
>         pcount = tcp_skb_pcount(skb);
>         if (tcp_skb_shift(prev, skb, pcount, len))
> --
> 2.35.0.rc2.247.g8bbb082509-goog
>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
  2022-02-01 18:47 ` Soheil Hassas Yeganeh
@ 2022-02-01 20:01   ` Mat Martineau
  2022-02-02  9:38     ` Paolo Abeni
  0 siblings, 1 reply; 5+ messages in thread
From: Mat Martineau @ 2022-02-01 20:01 UTC (permalink / raw)
  To: Soheil Hassas Yeganeh
  Cc: Eric Dumazet, David S . Miller, Jakub Kicinski, netdev,
	Eric Dumazet, Paolo Abeni, Talal Ahmad, Arjun Roy,
	Willem de Bruijn, Davide Caratti

On Tue, 1 Feb 2022, Soheil Hassas Yeganeh wrote:

> On Tue, Feb 1, 2022 at 1:46 PM Eric Dumazet <eric.dumazet@gmail.com> wrote:
>>
>> From: Eric Dumazet <edumazet@google.com>
>>
>> tcp_shift_skb_data() might collapse three packets into a larger one.
>>
>> P_A, P_B, P_C  -> P_ABC
>>
>> Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
>> because it was enough.
>>
>> In commit 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions"),
>> this call was replaced by a call to tcp_skb_can_collapse(P_A, P_B)
>>
>> But the now needed test over P_C has been missed.
>>
>> This probably broke MPTCP.
>>
>> Then later, commit 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
>> added an extra condition to tcp_skb_can_collapse(), but the missing call
>> from tcp_shift_skb_data() is also breaking TCP zerocopy, because P_A and P_C
>> might have different skb_zcopy_pure() status.
>>
>> Fixes: 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions")
>> Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
>> Signed-off-by: Eric Dumazet <edumazet@google.com>
>> Cc: Paolo Abeni <pabeni@redhat.com>
>> Cc: Mat Martineau <mathew.j.martineau@linux.intel.com>
>> Cc: Talal Ahmad <talalahmad@google.com>
>> Cc: Arjun Roy <arjunroy@google.com>
>> Cc: Soheil Hassas Yeganeh <soheil@google.com>
>> Cc: Willem de Bruijn <willemb@google.com>
>
> Acked-by: Soheil Hassas Yeganeh <soheil@google.com>
>
> I wish there were some packetdrill tests for MPTCP. Thank you for the fix!
>

Soheil -

I have good news, there are packetdrill tests for MPTCP:

https://github.com/multipath-tcp/packetdrill

This is still in a fork. I think Davide has talked to Neal about 
upstreaming the MPTCP changes before but there may be some code that needs 
refactoring before that could happen.


>> ---
>>  net/ipv4/tcp_input.c | 2 ++
>>  1 file changed, 2 insertions(+)
>>
>> diff --git a/net/ipv4/tcp_input.c b/net/ipv4/tcp_input.c
>> index dc49a3d551eb919baf5ad812ef21698c5c7b9679..bfe4112e000c09ba9d7d8b64392f52337b9053e9 100644
>> --- a/net/ipv4/tcp_input.c
>> +++ b/net/ipv4/tcp_input.c
>> @@ -1660,6 +1660,8 @@ static struct sk_buff *tcp_shift_skb_data(struct sock *sk, struct sk_buff *skb,
>>             (mss != tcp_skb_seglen(skb)))
>>                 goto out;
>>
>> +       if (!tcp_skb_can_collapse(prev, skb))
>> +               goto out;
>>         len = skb->len;
>>         pcount = tcp_skb_pcount(skb);
>>         if (tcp_skb_shift(prev, skb, pcount, len))
>> --
>> 2.35.0.rc2.247.g8bbb082509-goog
>>
>

--
Mat Martineau
Intel

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
  2022-02-01 20:01   ` Mat Martineau
@ 2022-02-02  9:38     ` Paolo Abeni
  0 siblings, 0 replies; 5+ messages in thread
From: Paolo Abeni @ 2022-02-02  9:38 UTC (permalink / raw)
  To: Mat Martineau, Soheil Hassas Yeganeh
  Cc: Eric Dumazet, David S . Miller, Jakub Kicinski, netdev,
	Eric Dumazet, Talal Ahmad, Arjun Roy, Willem de Bruijn,
	Davide Caratti

On Tue, 2022-02-01 at 12:01 -0800, Mat Martineau wrote:
> On Tue, 1 Feb 2022, Soheil Hassas Yeganeh wrote:
> 
> > On Tue, Feb 1, 2022 at 1:46 PM Eric Dumazet <eric.dumazet@gmail.com> wrote:
> > > 
> > > From: Eric Dumazet <edumazet@google.com>
> > > 
> > > tcp_shift_skb_data() might collapse three packets into a larger one.
> > > 
> > > P_A, P_B, P_C  -> P_ABC
> > > 
> > > Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
> > > because it was enough.
> > > 
> > > In commit 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions"),
> > > this call was replaced by a call to tcp_skb_can_collapse(P_A, P_B)
> > > 
> > > But the now needed test over P_C has been missed.
> > > 
> > > This probably broke MPTCP.

Indeed it looks like it could cause MPTCP data stream corruption, in
case of multiple substreams, if we hit this code-path. Thanks for
catching and fixing it!

> > > Then later, commit 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
> > > added an extra condition to tcp_skb_can_collapse(), but the missing call
> > > from tcp_shift_skb_data() is also breaking TCP zerocopy, because P_A and P_C
> > > might have different skb_zcopy_pure() status.
> > > 
> > > Fixes: 85712484110d ("tcp: coalesce/collapse must respect MPTCP extensions")
> > > Fixes: 9b65b17db723 ("net: avoid double accounting for pure zerocopy skbs")
> > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > Cc: Paolo Abeni <pabeni@redhat.com>
> > > Cc: Mat Martineau <mathew.j.martineau@linux.intel.com>
> > > Cc: Talal Ahmad <talalahmad@google.com>
> > > Cc: Arjun Roy <arjunroy@google.com>
> > > Cc: Soheil Hassas Yeganeh <soheil@google.com>
> > > Cc: Willem de Bruijn <willemb@google.com>
> > 
> > Acked-by: Soheil Hassas Yeganeh <soheil@google.com>

Acked-by: Paolo Abeni <pabeni@redhat.com>
> > 
> > I wish there were some packetdrill tests for MPTCP. Thank you for the fix!

Do you have by chance a drill for the zero-copy case? it may help
creating the MPTCP one, too.

Thanks!

Paolo


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
  2022-02-01 18:46 [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data() Eric Dumazet
  2022-02-01 18:47 ` Soheil Hassas Yeganeh
@ 2022-02-03  0:40 ` patchwork-bot+netdevbpf
  1 sibling, 0 replies; 5+ messages in thread
From: patchwork-bot+netdevbpf @ 2022-02-03  0:40 UTC (permalink / raw)
  To: Eric Dumazet
  Cc: davem, kuba, netdev, edumazet, pabeni, mathew.j.martineau,
	talalahmad, arjunroy, soheil, willemb

Hello:

This patch was applied to netdev/net.git (master)
by Jakub Kicinski <kuba@kernel.org>:

On Tue,  1 Feb 2022 10:46:40 -0800 you wrote:
> From: Eric Dumazet <edumazet@google.com>
> 
> tcp_shift_skb_data() might collapse three packets into a larger one.
> 
> P_A, P_B, P_C  -> P_ABC
> 
> Historically, it used a single tcp_skb_can_collapse_to(P_A) call,
> because it was enough.
> 
> [...]

Here is the summary with links:
  - [net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data()
    https://git.kernel.org/netdev/net/c/b67985be4009

You are awesome, thank you!
-- 
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/patchwork/pwbot.html



^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2022-02-03  0:40 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-02-01 18:46 [PATCH net] tcp: add missing tcp_skb_can_collapse() test in tcp_shift_skb_data() Eric Dumazet
2022-02-01 18:47 ` Soheil Hassas Yeganeh
2022-02-01 20:01   ` Mat Martineau
2022-02-02  9:38     ` Paolo Abeni
2022-02-03  0:40 ` patchwork-bot+netdevbpf

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).