From: Sabrina Dubroca <sd@queasysnail.net>
To: Eric Dumazet <eric.dumazet@gmail.com>
Cc: netdev@vger.kernel.org, Jianlin Shi <jishi@redhat.com>,
Stefano Brivio <sbrivio@redhat.com>
Subject: Re: [PATCH net] net: enforce xmit_recursion for devices with a queue
Date: Wed, 10 Apr 2019 11:45:07 +0200 [thread overview]
Message-ID: <20190410094507.GA16112@bistromath.localdomain> (raw)
In-Reply-To: <dea7ebea-075d-ea18-9d38-f96202929dd1@gmail.com>
Hi Eric,
2019-03-14, 10:51:49 -0700, Eric Dumazet wrote:
> On 03/14/2019 10:40 AM, Sabrina Dubroca wrote:
> > 2019-03-14, 07:56:10 -0700, Eric Dumazet wrote:
> >> On 03/14/2019 07:15 AM, Sabrina Dubroca wrote:
> >>> 2019-03-14, 05:58:03 -0700, Eric Dumazet wrote:
> >>>> On 03/14/2019 03:15 AM, Sabrina Dubroca wrote:
> >>>>> Commit 745e20f1b626 ("net: add a recursion limit in xmit path")
> >>>>> introduced a recursion limit, but it only applies to devices without a
> >>>>> queue. Virtual devices with a queue (either because they don't have
> >>>>> the IFF_NO_QUEUE flag, or because the administrator added one) can
> >>>>> still cause an unbounded recursion, via __dev_queue_xmit ->
> >>>>> __dev_xmit_skb -> qdisc_run -> __qdisc_run -> qdisc_restart ->
> >>>>> sch_direct_xmit -> dev_hard_start_xmit . Jianlin reported this in a
> >>>>> setup with 16 gretap devices stacked on top of one another.
> >>>>>
> >>>>> This patch prevents the stack overflow by incrementing xmit_recursion in
> >>>>> code paths that can call dev_hard_start_xmit() (like commit 745e20f1b626
> >>>>> did). If the recursion limit is exceeded, the packet is enqueued and the
> >>>>> qdisc is scheduled.
> >>>>>
> >>>>> Reported-by: Jianlin Shi <jishi@redhat.com>
> >>>>> Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
> >>>>> Reviewed-by: Stefano Brivio <sbrivio@redhat.com>
> >>>>
> >>>> Hi Sabrina, thanks for the patch.
> >>>>
> >>>> Can't we detect this in the control path instead ?
> >>>
> >>> I don't see how. You could have a perfectly reasonable set of gretap
> >>> devices that trigger this situation from simply reshuffling the IP
> >>> addresses:
> >>>
> >>> gretap$x remote 1.1.$((x-1)).{1,2}
> >>> (all those addresses set on a single veth device)
> >>>
> >>> Then you move those addresses to the corresponding device
> >>> (1.1.${x}.{1,2} on gretap$x), and your machine crashes.
> >>>
> >>
> >> If this only can be done with gretap, why gretap cant implement the protection,
> >> outside of the fast path ?
> >
> > It's not just gretap. VXLAN will do the same as long as you add a
> > qdisc. I expect other types of tunnels to behave like that.
> >
>
> It might make sense to add a helper using dev_queue_xmit()
> for tunnel users.
>
> Then remove the xmit recursion stuff out of the dev_queue_xmit()
>
> Lets make the fast path fast again.
I've been looking into this. Ignoring devices that link themselves to
their upper/lower (which would be easy to handle at setup time), we're
left with IP/UDP tunnels (also easy to handle, they all xmit via a few
output functions), and then a couple of weird cases:
- vti/xfrmi call dst_output
- ppp can end up calling ip_local_out, ip_queue_xmit, or ip6_xmit
- some devices call dev_queue_xmit. we'd need a recursion-limited
wrapper for those, without putting the penalty for users that
respect the rules (core, linked devices). I don't see a way to
enforce that (ipv6 needs dev_queue_xmit to be EXPORT'ed), and human
review is not perfect, so some devices could slip in using the
wrong dev_queue_xmit.
I'm not convinced this change is worth the risk.
--
Sabrina
prev parent reply other threads:[~2019-04-10 9:45 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-03-14 10:15 [PATCH net] net: enforce xmit_recursion for devices with a queue Sabrina Dubroca
2019-03-14 12:58 ` Eric Dumazet
2019-03-14 14:15 ` Sabrina Dubroca
2019-03-14 14:56 ` Eric Dumazet
2019-03-14 17:40 ` Sabrina Dubroca
2019-03-14 17:51 ` Eric Dumazet
2019-03-14 17:58 ` Sabrina Dubroca
2019-04-10 9:45 ` Sabrina Dubroca [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190410094507.GA16112@bistromath.localdomain \
--to=sd@queasysnail.net \
--cc=eric.dumazet@gmail.com \
--cc=jishi@redhat.com \
--cc=netdev@vger.kernel.org \
--cc=sbrivio@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).