From: Hans Schillstrom <hans.schillstrom@ericsson.com>
To: Jozsef Kadlecsik <kadlec@blackhole.kfki.hu>
Cc: Hans Schillstrom <hans@schillstrom.com>,
Pablo Neira Ayuso <pablo@netfilter.org>,
Jan Engelhardt <jengelh@medozas.de>,
Patrick McHardy <kaber@trash.net>,
"netfilter-devel@vger.kernel.org"
<netfilter-devel@vger.kernel.org>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>
Subject: Re: [PATCH 1/1] netfilter: Add possibility to turn off netfilters defrag per netns
Date: Thu, 5 Jan 2012 08:19:19 +0100 [thread overview]
Message-ID: <201201050819.21363.hans.schillstrom@ericsson.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1201042232510.3934@blackhole.kfki.hu>
On Wednesday 04 January 2012 22:40:09 Jozsef Kadlecsik wrote:
> On Wed, 4 Jan 2012, Hans Schillstrom wrote:
>
> > On Wednesday, January 04, 2012 19:05:10 Jozsef Kadlecsik wrote:
> > > On Wed, 4 Jan 2012, Pablo Neira Ayuso wrote:
> > >
> > > > On Wed, Jan 04, 2012 at 12:48:35PM +0100, Hans Schillstrom wrote:
> > > > > I like that idea, an "early" table at prio -500 with PREROUTING.
> > > > > There is also a need for a new flag "--allfrags"
> > > > > i.e. all fragments needs to be sorted out and sent to same dest for defrag.
> > > > >
> > > > > ex.
> > > > > iptables -t early -A PREROUTING -i eth0 --allfrags -j NOTRACK
> > > >
> > > > New tables add too much overhead. We have discussed this before with
> > > > Patrick.
> > > >
> > > > Since this still remains specific to your needs, I think you can
> > > > remove nf_conntrack module in your setup.
> > > >
> > > > I don't come with one sane setup that may want selectively defragment
> > > > some traffic yes and other not.
> > > >
> > > > Am I missing anything else?
> > >
> > > I agree. If you don't want defragmentation at all, then make sure you
> > > don't load the nf_conntrack module directly/indirectly. Conntrack doesn't
> > > work without defragmentation anyway.
> >
> > We are using LXC and it's only in the container that holds the external
> > interface that can't have defragmentation.
> > The problem is if it's loaded you have it in all namespaces :-(
>
> Conntrack is per net namespaces. You may have one container with conntrack
> enabled and another one without conntrack.
How do you disable conntrack per netns ?
I can't see how to do it except for NOTRACK
Then the nf_defrag issue is still there...
>
> Moreover, if you may receive fragments of the same packet at different
> interfaces in different blades, then you may receive different whole
> packets of the same flow at different interfaces/blades. But stateful
> firewalling relies on the assumption that all packets goes through of the
> firewall.
True you can't have stateful fw in that stage because of fragments.
> Because it's not assured, conntrack may not run in the
> containers you denoted as FW LXC.
Thats why I want to disable defrag and conntrack in them
A single flow, with Containers in any Blade.
+---------------------------+ / +--------------------+
--> | FW (no CT)frag HMARK sel |--->--- | Conntrack and IPVS |---->
+---------------------------+ \ +--------------------+
\ (fragments) ..
v
+---------------------------+ / ..
| de-frag HMARK sel |----->
+---------------------------+ \ +--------------------+
| Conntrack and IPVS |---->
+--------------------+
Note that HMARK makes a preselection of which IPVS to use, and directs the flow
to the same IPVS independent of which Blade/interface it arrives on.
i.e. the defrag:ed packed will reach the same IPVS as the others.
--
Regards
Hans Schillstrom <hans.schillstrom@ericsson.com>
next prev parent reply other threads:[~2012-01-05 7:19 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-01-04 8:07 [PATCH 1/1] netfilter: Add possibility to turn off netfilters defrag per netns Hans Schillstrom
2012-01-04 8:28 ` Jozsef Kadlecsik
2012-01-04 8:49 ` Hans Schillstrom
2012-01-04 9:03 ` Jozsef Kadlecsik
2012-01-04 9:32 ` Jan Engelhardt
2012-01-04 9:47 ` Hans Schillstrom
2012-01-04 17:23 ` Pablo Neira Ayuso
2012-01-04 9:49 ` Jozsef Kadlecsik
2012-01-04 10:18 ` Hans Schillstrom
2012-01-04 11:17 ` Jan Engelhardt
2012-01-04 11:48 ` Hans Schillstrom
2012-01-04 17:40 ` Pablo Neira Ayuso
2012-01-04 18:05 ` Jozsef Kadlecsik
2012-01-04 20:56 ` Hans Schillstrom
2012-01-04 21:40 ` Jozsef Kadlecsik
2012-01-05 7:19 ` Hans Schillstrom [this message]
2012-01-05 9:11 ` Jozsef Kadlecsik
2012-01-05 14:18 ` Pablo Neira Ayuso
2012-01-09 8:58 ` Hans Schillstrom
2012-01-10 3:17 ` Pablo Neira Ayuso
2012-01-04 20:45 ` Hans Schillstrom
2012-01-04 21:15 ` Hans Schillstrom
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=201201050819.21363.hans.schillstrom@ericsson.com \
--to=hans.schillstrom@ericsson.com \
--cc=hans@schillstrom.com \
--cc=jengelh@medozas.de \
--cc=kaber@trash.net \
--cc=kadlec@blackhole.kfki.hu \
--cc=netdev@vger.kernel.org \
--cc=netfilter-devel@vger.kernel.org \
--cc=pablo@netfilter.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.