From: Florian Westphal <fw@strlen.de>
To: Eugene Crosser <crosser@average.org>
Cc: netdev@vger.kernel.org, Florian Westphal <fw@strlen.de>,
Yi-Hung Wei <yihung.wei@gmail.com>,
Martin Bene <martin.bene@icomedias.com>
Subject: Re: conntrack: TCP CLOSE and TIME_WAIT are not counted towards per-zone limit, and can overflow global table
Date: Wed, 20 Sep 2023 23:47:37 +0200 [thread overview]
Message-ID: <20230920214737.GB25778@breakpoint.cc> (raw)
In-Reply-To: <8c7e44d2-e78f-4f8d-9016-2a4b8429e14d@average.org>
Eugene Crosser <crosser@average.org> wrote:
> we are running a virtualization platform, and assign different conntrack
> zones, with per-zone limits, to different users. The goal is to prevent
> situation when one user exhaust the whole conntrack table on the host,
> e.g. if the user is under some DDoS scenario.
>
> We noticed that under some flooding scenarios, the number of entries in
> the zone assigned to the user goes way above the per-zone limit, and
> reaches the global host limit. In our test, almost all of those entries
> were in "CLOSE" state.
>
> It looks like this function in net/filter/nf_conncount.c:71
>
> static inline bool already_closed(const struct nf_conn *conn)
> {
> if (nf_ct_protonum(conn) == IPPROTO_TCP)
> return conn->proto.tcp.state == TCP_CONNTRACK_TIME_WAIT ||
> conn->proto.tcp.state == TCP_CONNTRACK_CLOSE;
> else
> return false;
> }
>
> is used to explicitly exclude such entries from counting.
>
> As I understand, this creates a situation when an attacker can inflict a
> DoS situation on the host, by opening _and immediately closing_ a large
> number of TCP connections. That is to say, per-zone limits, as currently
> implemented, _do not_ allow to prevent overflow of the host-wide
> conntrack table.
>
> What was the reason to exclude such entries from counting?
I'd wager only intent was to limit *active* connections, not conntrack
entries.
This code originates from a time when zones did not exist, hence
conntrack upperlimit was sufficient, no partitioning needed.
> Should this exception be removed, and _all_ entries in the zone counted
> towards the limit?
I suppose so.
prev parent reply other threads:[~2023-09-20 21:47 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-09-20 16:28 conntrack: TCP CLOSE and TIME_WAIT are not counted towards per-zone limit, and can overflow global table Eugene Crosser
2023-09-20 21:47 ` Florian Westphal [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20230920214737.GB25778@breakpoint.cc \
--to=fw@strlen.de \
--cc=crosser@average.org \
--cc=martin.bene@icomedias.com \
--cc=netdev@vger.kernel.org \
--cc=yihung.wei@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).