From: David Miller <davem@davemloft.net>
To: eric.dumazet@gmail.com
Cc: dwilder@us.ibm.com, netdev@vger.kernel.org, predeep@us.ibm.com,
mjtarsel@us.ibm.com
Subject: Re: [PATCH net] net: fix IP early demux races
Date: Mon, 14 Dec 2015 23:52:27 -0500 (EST) [thread overview]
Message-ID: <20151214.235227.679144571237513640.davem@davemloft.net> (raw)
In-Reply-To: <1450130933.8474.27.camel@edumazet-glaptop2.roam.corp.google.com>
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Mon, 14 Dec 2015 14:08:53 -0800
> From: Eric Dumazet <edumazet@google.com>
>
> David Wilder reported crashes caused by dst reuse.
>
> <quote David>
> I am seeing a crash on a distro V4.2.3 kernel caused by a double
> release of a dst_entry. In ipv4_dst_destroy() the call to
> list_empty() finds a poisoned next pointer, indicating the dst_entry
> has already been removed from the list and freed. The crash occurs
> 18 to 24 hours into a run of a network stress exerciser.
> </quote>
>
> Thanks to his detailed report and analysis, we were able to understand
> the core issue.
>
> IP early demux can associate a dst to skb, after a lookup in TCP/UDP
> sockets.
>
> When socket cache is not properly set, we want to store into
> sk->sk_dst_cache the dst for future IP early demux lookups,
> by acquiring a stable refcount on the dst.
>
> Problem is this acquisition is simply using an atomic_inc(),
> which works well, unless the dst was queued for destruction from
> dst_release() noticing dst refcount went to zero, if DST_NOCACHE
> was set on dst.
>
> We need to make sure current refcount is not zero before incrementing
> it, or risk double free as David reported.
>
> This patch, being a stable candidate, adds two new helpers, and use
> them only from IP early demux problematic paths.
>
> It might be possible to merge in net-next skb_dst_force() and
> skb_dst_force_safe(), but I prefer having the smallest patch for stable
> kernels : Maybe some skb_dst_force() callers do not expect skb->dst
> can suddenly be cleared.
>
> Can probably be backported back to linux-3.6 kernels
>
> Reported-by: David J. Wilder <dwilder@us.ibm.com>
> Tested-by: David J. Wilder <dwilder@us.ibm.com>
> Signed-off-by: Eric Dumazet <edumazet@google.com>
Applied and queued up for -stable, thanks Eric.
prev parent reply other threads:[~2015-12-15 4:52 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-12-06 18:26 Double free of dst_entry in ipv4_dst_destroy() dwilder
2015-12-06 21:03 ` Eric Dumazet
2015-12-07 1:58 ` Eric Dumazet
2015-12-11 14:23 ` Eric Dumazet
2015-12-11 15:48 ` Eric Dumazet
2015-12-11 16:47 ` Eric Dumazet
2015-12-14 16:28 ` dwilder
2015-12-14 16:36 ` Eric Dumazet
2015-12-14 22:08 ` [PATCH net] net: fix IP early demux races Eric Dumazet
2015-12-15 4:52 ` David Miller [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20151214.235227.679144571237513640.davem@davemloft.net \
--to=davem@davemloft.net \
--cc=dwilder@us.ibm.com \
--cc=eric.dumazet@gmail.com \
--cc=mjtarsel@us.ibm.com \
--cc=netdev@vger.kernel.org \
--cc=predeep@us.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).