From: Guillaume Nault <gnault@redhat.com>
To: Tom Parkin <tparkin@katalix.com>
Cc: Ridge Kennedy <ridgek@alliedtelesis.co.nz>, netdev@vger.kernel.org
Subject: Re: [PATCH net] l2tp: Allow duplicate session creation with UDP
Date: Sat, 18 Jan 2020 20:13:36 +0100 [thread overview]
Message-ID: <20200118191336.GC12036@linux.home> (raw)
In-Reply-To: <20200117191939.GB3405@jackdaw>
On Fri, Jan 17, 2020 at 07:19:39PM +0000, Tom Parkin wrote:
> On Fri, Jan 17, 2020 at 15:25:58 +0100, Guillaume Nault wrote:
> > On Fri, Jan 17, 2020 at 01:18:49PM +0000, Tom Parkin wrote:
> > > More generally, for v3 having the session ID be unique to the LCCE is
> > > required to make IP-encap work at all. We can't reliably obtain the
> > > tunnel context from the socket because we've only got a 3-tuple
> > > address to direct an incoming frame to a given socket; and the L2TPv3
> > > IP-encap data packet header only contains the session ID, so that's
> > > literally all there is to work with.
> > >
> > I don't see how that differs from the UDP case. We should still be able
> > to get the corresponding socket and lookup the session ID in that
> > context. Or did I miss something? Sure, that means that the socket is
> > the tunnel, but is there anything wrong with that?
>
> It doesn't fundamentally differ from the UDP case.
>
> The issue is that if you're stashing tunnel context with the socket
> (as UDP currently does), then you're relying on the kernel's ability
> to deliver packets for a given tunnel on that tunnel's socket.
>
> In the UDP case this is normally easily done, assuming each UDP tunnel
> socket has a unique 5-tuple address. So if peers allow the use of
> ports other than port 1701, it's normally not an issue.
>
> However, if you do get a 5-tuple clash, then packets may start
> arriving on the "wrong" socket. In general this is a corner case
> assuming peers allow ports other than 1701 to be used, and so we don't
> see it terribly often.
>
> Contrast this with IP-encap. Because we don't have ports, the 5-tuple
> address now becomes a 3-tuple address. Suddenly it's quite easy to
> get a clash: two IP-encap tunnels between the same two peers would do
> it.
>
Well, the situation is the same with UDP, when the peer always uses
source port 1701, which is a pretty common case as you noted
previously.
I've never seen that as a problem in practice since establishing more
than one tunnel between two LCCE or LAC/LNS doesn't bring any
advantage.
> Since we don't want to arbitrarily limit IP-encap tunnels to on per
> pair of peers, it's not practical to stash tunnel context with the
> socket in the IP-encap data path.
>
Even though l2tp_ip doesn't lookup the session in the context of the
socket, it is limitted to one tunnel for a pair of peers, because it
doesn't support SO_REUSEADDR and SO_REUSEPORT.
> > > If we relax the restriction for UDP-encap then it fixes your (Ridge's)
> > > use case; but it does impose some restrictions:
> > >
> > > 1. The l2tp subsystem has an existing bug for UDP encap where
> > > SO_REUSEADDR is used, as I've mentioned. Where the 5-tuple address of
> > > two sockets clashes, frames may be directed to either socket. So
> > > determining the tunnel context from the socket isn't valid in this
> > > situation.
> > >
> > > For L2TPv2 we could fix this by looking the tunnel context up using
> > > the tunnel ID in the header.
> > >
> > > For L2TPv3 there is no tunnel ID in the header. If we allow
> > > duplicated session IDs for L2TPv3/UDP, there's no way to fix the
> > > problem.
> > >
> > > This sounds like a bit of a corner case, although its surprising how
> > > many implementations expect all traffic over port 1701, making
> > > 5-tuple clashes more likely.
> > >
> > Hum, I think I understand your scenario better. I just wonder why one
> > would establish several tunnels over the same UDP or IP connection (and
> > I've also been surprised by all those implementations forcing 1701 as
> > source port).
> >
>
> Indeed, it's not ideal :-(
>
> > > 2. Part of the rationale for L2TPv3's approach to IDs is that it
> > > allows the data plane to potentially be more efficient since a
> > > session can be identified by session ID alone.
> > >
> > > The kernel hasn't really exploited that fact fully (UDP encap
> > > still uses the socket to get the tunnel context), but if we make
> > > this change we'll be restricting the optimisations we might make
> > > in the future.
> > >
> > > Ultimately it comes down to a judgement call. Being unable to fix
> > > the SO_REUSEADDR bug would be the biggest practical headache I
> > > think.
> > And it would be good to have a consistent behaviour between IP and UDP
> > encapsulation. If one does a global session lookup, the other should
> > too.
>
> That would also be my preference.
>
Thinking more about the original issue, I think we could restrict the
scope of session IDs to the 3-tuple (for IP encap) or 5-tuple (for UDP
encap) of its parent tunnel. We could do that by adding the IP addresses,
protocol and ports to the hash key in the netns session hash-table.
This way:
* Sessions would be only accessible from the peer with whom we
established the tunnel.
* We could use multiple sockets bound and connected to the same
address pair, and lookup the right session no matter on which
socket L2TP messages are received.
* We would solve Ridge's problem because we could reuse session IDs
as long as the 3 or 5-tuple of the parent tunnel is different.
That would be something for net-next though. For -net, we could get
something like Ridge's patch, which is simpler, since we've never
supported multiple tunnels per session anyway.
next prev parent reply other threads:[~2020-01-18 19:14 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-01-15 22:34 [PATCH net] l2tp: Allow duplicate session creation with UDP Ridge Kennedy
2020-01-16 12:31 ` Tom Parkin
2020-01-16 19:28 ` Guillaume Nault
2020-01-16 21:05 ` Tom Parkin
2020-01-17 13:43 ` Guillaume Nault
2020-01-17 18:59 ` Tom Parkin
2020-01-18 17:18 ` Guillaume Nault
2020-01-16 12:38 ` Guillaume Nault
2020-01-16 13:12 ` Tom Parkin
2020-01-16 19:05 ` Guillaume Nault
2020-01-16 21:23 ` Tom Parkin
2020-01-16 21:50 ` Ridge Kennedy
2020-01-17 13:18 ` Tom Parkin
2020-01-17 14:25 ` Guillaume Nault
2020-01-17 19:19 ` Tom Parkin
2020-01-18 19:13 ` Guillaume Nault [this message]
2020-01-20 15:09 ` Tom Parkin
2020-01-21 16:35 ` Guillaume Nault
2020-01-22 11:55 ` James Chapman
2020-01-25 11:57 ` Guillaume Nault
2020-01-27 9:25 ` James Chapman
2020-01-29 11:44 ` Guillaume Nault
2020-01-30 10:28 ` James Chapman
2020-01-30 22:34 ` Guillaume Nault
2020-01-31 8:12 ` James Chapman
2020-01-31 12:49 ` Guillaume Nault
2020-01-31 9:55 ` Tom Parkin
2020-01-31 12:50 ` Guillaume Nault
2020-01-17 16:36 ` Guillaume Nault
2020-01-17 19:29 ` Tom Parkin
2020-01-18 17:52 ` Guillaume Nault
2020-01-20 11:47 ` Tom Parkin
2020-01-16 21:26 ` Ridge Kennedy
2020-01-31 12:58 ` Guillaume Nault
2020-02-03 23:29 ` Ridge Kennedy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200118191336.GC12036@linux.home \
--to=gnault@redhat.com \
--cc=netdev@vger.kernel.org \
--cc=ridgek@alliedtelesis.co.nz \
--cc=tparkin@katalix.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.