From: Aaron Conole <aconole@redhat.com>
To: Florian Westphal <fw@strlen.de>
Cc: dev@openvswitch.org, Dumitru Ceara <dceara@redhat.com>,
netdev@vger.kernel.org
Subject: Re: [ovs-dev] [PATCH net-next] openvswitch: prepare for stolen verdict coming from conntrack and nat engine
Date: Wed, 03 Jul 2024 19:21:53 -0400 [thread overview]
Message-ID: <f7ttth6njse.fsf@redhat.com> (raw)
In-Reply-To: <20240703151900.GC29258@breakpoint.cc> (Florian Westphal's message of "Wed, 3 Jul 2024 17:19:00 +0200")
Florian Westphal <fw@strlen.de> writes:
> Aaron Conole <aconole@redhat.com> wrote:
>> > verdict with NF_DROP_REASON() helper,
>> >
>> > This helper releases the skb instantly (so drop_monitor can pinpoint
>> > precise location) and returns NF_STOLEN.
>> >
>> > Prepare call sites to deal with this before introducing such changes
>> > in conntrack and nat core.
>> >
>> > Signed-off-by: Florian Westphal <fw@strlen.de>
>> > ---
>>
>> AFAIU, these changes are only impacting the existing NF_DROP cases, and
>> won't impact how ovs + netfilter communicate about invalid packets. One
>> important thing to note is that we rely on:
>>
>> * Note that if the packet is deemed invalid by conntrack, skb->_nfct will be
>> * set to NULL and 0 will be returned.
>
> Right, this is about how to communicate 'packet dropped'.
>
> NF_DROP means 'please call kfree_skb for me'. Problem from introspection point
> of view is that drop monitor will blame nf_hook_slow() (for netfilter)
> and ovs resp. act_ct for the drop.
>
> Plan is to allow conntrack/nat engine to return STOLEN verdict ("skb
> might have been free'd already").
>
> Example change:
> @@ -52,10 +53,8 @@ nf_nat_masquerade_ipv4(struct sk_buff *skb,
> unsigned int hooknum,
> rt = skb_rtable(skb);
> nh = rt_nexthop(rt, ip_hdr(skb)->daddr);
> newsrc = inet_select_addr(out, nh, RT_SCOPE_UNIVERSE);
> - if (!newsrc) {
> - pr_info("%s ate my IP address\n", out->name);
> - return NF_DROP;
> - }
> + if (!newsrc)
> + return NF_DROP_REASON(skb, SKB_DROP_REASON_NETFILTER_DROP,
> EADDRNOTAVAIL);
>
>
> Where NF_DROP_REASON() is:
>
> static __always_inline int
> NF_DROP_REASON(struct sk_buff *skb, enum skb_drop_reason reason, u32 err)
> {
> BUILD_BUG_ON(err > 0xffff);
>
> kfree_skb_reason(skb, reason);
>
> return ((err << 16) | NF_STOLEN);
> }
>
> So drop monitoring tools will blame
> nf_nat_masquerade.c:nf_nat_masquerade_ipv4 and not
> the consumer of the NF_DROP verdict.
>
> I can't make such changes ATM because ovs and act_ct assume conntrack
> returns only ACCEPT and DROP, so we'd get double-free. Hope that makes
> sense.
>
> Thanks!
Makes sense to me, thanks!
next prev parent reply other threads:[~2024-07-03 23:22 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-07-03 10:46 [PATCH net-next] openvswitch: prepare for stolen verdict coming from conntrack and nat engine Florian Westphal
2024-07-03 14:59 ` Aaron Conole
2024-07-03 15:19 ` Florian Westphal
2024-07-03 23:21 ` Aaron Conole [this message]
2024-07-03 23:22 ` [ovs-dev] " Aaron Conole
2024-07-05 10:10 ` patchwork-bot+netdevbpf
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f7ttth6njse.fsf@redhat.com \
--to=aconole@redhat.com \
--cc=dceara@redhat.com \
--cc=dev@openvswitch.org \
--cc=fw@strlen.de \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).