* [PATCH] net: Fix sk reference counting in ip_push_pending_frames and ip6_push_pending_frames
From: John Dykstra @ 2009-07-11 17:46 UTC
To: netdev, eric.dumazet
Commit 2b85a34e911bf483c27cfdd124aeb1605145dc80 "net: No more expensive
sock_hold()/sock_put() on each tx" used sk_wmem_alloc rather than the struct sock reference
count to track in-flight transmit-path packets. However, it missed the __sock_put() calls
in ip_push_pending_frames() and ip6_push_pending_frames(). This results in too-small
reference counts when UDP or RAW sockets are used to send more than one MTU of data. This
in turn could lead to struct sock being freed and reused while it is still part of an
active socket.
A wide variety of socket symptoms may be fixed by this patch. It also fixes one cause
of WARN_ON's in sk_del_node_init() and sk_nulls_del_node_init_rcu().
Signed-off-by: John Dykstra <john.dykstra1@gmail.com>
---
net/ipv4/ip_output.c | 1 -
net/ipv6/ip6_output.c | 1 -
2 files changed, 0 insertions(+), 2 deletions(-)
diff --git a/net/ipv4/ip_output.c b/net/ipv4/ip_output.c
index 2470262..7d08210 100644
--- a/net/ipv4/ip_output.c
+++ b/net/ipv4/ip_output.c
@@ -1243,7 +1243,6 @@ int ip_push_pending_frames(struct sock *sk)
 		skb->len += tmp_skb->len;
 		skb->data_len += tmp_skb->len;
 		skb->truesize += tmp_skb->truesize;
-		__sock_put(tmp_skb->sk);
 		tmp_skb->destructor = NULL;
 		tmp_skb->sk = NULL;
 	}
diff --git a/net/ipv6/ip6_output.c b/net/ipv6/ip6_output.c
index 7c76e3d..87f8419 100644
--- a/net/ipv6/ip6_output.c
+++ b/net/ipv6/ip6_output.c
@@ -1484,7 +1484,6 @@ int ip6_push_pending_frames(struct sock *sk)
 		skb->len += tmp_skb->len;
 		skb->data_len += tmp_skb->len;
 		skb->truesize += tmp_skb->truesize;
-		__sock_put(tmp_skb->sk);
 		tmp_skb->destructor = NULL;
 		tmp_skb->sk = NULL;
 	}
--
1.5.4.3
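
For readers following the changelog above, here is a rough user-space sketch of why the leftover __sock_put() calls produce a too-small reference count. It is only a toy model, not kernel code: struct toy_sock, struct toy_skb and every toy_* function are hypothetical names invented for illustration, and the model assumes (as the changelog describes) that after commit 2b85a34e a transmit packet is charged to sk_wmem_alloc without taking a socket reference.

```c
#include <stddef.h>

/* Hypothetical stand-in for struct sock; only the two counters matter here. */
struct toy_sock {
    int refcnt;     /* models the struct sock reference count */
    int wmem_alloc; /* models sk_wmem_alloc: bytes of in-flight tx data */
};

/* Hypothetical stand-in for struct sk_buff. */
struct toy_skb {
    struct toy_sock *sk;
    int truesize;
};

/*
 * Models charging a packet to its socket after commit 2b85a34e (assumed
 * behaviour): only wmem_alloc grows, no reference is taken.
 */
static void toy_set_owner_w(struct toy_skb *skb, struct toy_sock *sk)
{
    skb->sk = sk;
    sk->wmem_alloc += skb->truesize;
}

/*
 * Models the fragment-merging loop touched by the patch. With stale_put != 0
 * it still drops one reference per merged fragment, as the unpatched code did.
 */
static void toy_push_pending_frames(struct toy_skb *head,
                                    struct toy_skb *frags, int nfrags,
                                    int stale_put)
{
    for (int i = 0; i < nfrags; i++) {
        struct toy_skb *tmp = &frags[i];

        head->truesize += tmp->truesize;
        if (stale_put)
            tmp->sk->refcnt--; /* stands in for the stale __sock_put() */
        tmp->sk = NULL;
    }
}

/* Sends one head packet plus two fragments and returns the final refcnt. */
static int toy_final_refcnt(int stale_put)
{
    struct toy_sock sk = { 1, 0 }; /* one reference held by the socket owner */
    struct toy_skb head = { NULL, 512 };
    struct toy_skb frags[2] = { { NULL, 512 }, { NULL, 512 } };

    toy_set_owner_w(&head, &sk);
    toy_set_owner_w(&frags[0], &sk);
    toy_set_owner_w(&frags[1], &sk);
    toy_push_pending_frames(&head, frags, 2, stale_put);
    return sk.refcnt;
}
```

In this model, toy_final_refcnt(1) (the unpatched loop) ends two references lower than toy_final_refcnt(0), one for each merged fragment, even though no reference was ever taken on their behalf. That matches the case of sending more than one MTU of data described above.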
* Re: [PATCH] net: Fix sk reference counting in ip_push_pending_frames and ip6_push_pending_frames
From: Eric Dumazet @ 2009-07-11 19:39 UTC
To: John Dykstra; +Cc: netdev, eric.dumazet
John Dykstra wrote:
> Commit 2b85a34e911bf483c27cfdd124aeb1605145dc80 "net: No more expensive
> sock_hold()/sock_put() on each tx" used sk_wmem_alloc rather than the struct sock reference
> count to track in-flight transmit-path packets. However, it missed the __sock_put() calls
> in ip_push_pending_frames() and ip6_push_pending_frames(). This results in too-small
> reference counts when UDP or RAW sockets are used to send more than one MTU of data. This
> in turn could lead to struct sock being freed and reused while it is still part of an
> active socket.
>
> A wide variety of socket symptoms may be fixed by this patch. It also fixes one cause
> of WARN_ON's in sk_del_node_init() and sk_nulls_del_node_init_rcu().
>
> Signed-off-by: John Dykstra <john.dykstra1@gmail.com>
Nice, but are you aware that the same patch was already posted and is waiting for David's
approval?
http://patchwork.ozlabs.org/patch/29618/
* Re: [PATCH] net: Fix sk reference counting in ip_push_pending_frames and ip6_push_pending_frames
From: John Dykstra @ 2009-07-11 20:00 UTC
To: Eric Dumazet; +Cc: netdev
On Sat, 2009-07-11 at 21:39 +0200, Eric Dumazet wrote:
> John Dykstra wrote:
> > Commit 2b85a34e911bf483c27cfdd124aeb1605145dc80 "net: No more expensive
> > sock_hold()/sock_put() on each tx" used sk_wmem_alloc rather than the struct sock reference
> > count to track in-flight transmit-path packets. However, it missed the __sock_put() calls
> > in ip_push_pending_frames() and ip6_push_pending_frames(). This results in too-small
> > reference counts when UDP or RAW sockets are used to send more than one MTU of data. This
> > in turn could lead to struct sock being freed and reused while it is still part of an
> > active socket.
> >
> > A wide variety of socket symptoms may be fixed by this patch. It also fixes one cause
> > of WARN_ON's in sk_del_node_init() and sk_nulls_del_node_init_rcu().
> >
> > Signed-off-by: John Dykstra <john.dykstra1@gmail.com>
>
> Nice, but are you aware that the same patch was already posted and is waiting for David's
> approval?
>
> http://patchwork.ozlabs.org/patch/29618/
<sigh> No, I wasn't. It took me a while to track down where the
reference counts were going wrong, and during that time I wasn't
tracking netdev traffic.
At least it's fixed.
-- John