[PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables

netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
       [not found] <20230227121720.3775652-1-alexander.atanasov@virtuozzo.com>
@ 2023-02-27 12:21 ` Alexander Atanasov
  2023-02-27 12:44   ` Florian Westphal
  0 siblings, 1 reply; 7+ messages in thread
From: Alexander Atanasov @ 2023-02-27 12:21 UTC (permalink / raw)
  To: netdev

general protection fault, probably for non-canonical
address 0xdead000000000115: 0000 [#1] PREEMPT SMP NOPTI
RIP: 0010:__nf_tables_dump_rules+0x10d/0x170 [nf_tables]

__nf_tables_dump_rules runs under rcu_read_lock while __nft_release_table
is called from nf_tables_exit_net. commit_mutex is held inside
nf_tables_exit_net but this is not enough to guard against
lockless readers. When __nft_release_table does list_del(&rule->list)
next ptr is poisoned and it crashes while walking the list.

Before calling __nft_release_tables all lockless readers must be done -
to ensure this a call to synchronize_rcu() is required.

nf_tables_exit_net does this in case there is something to abort
inside __nf_tables_abort but it does not do so otherwise.
Fix this by add the missing synchronize_rcu() call before calling
__nft_release_table in the nothing to abort case.

Fixes: 6001a930ce03 ("netfilter: nftables: introduce table ownership")
Signed-off-by: Alexander Atanasov <alexander.atanasov@virtuozzo.com>
---
  net/netfilter/nf_tables_api.c | 6 ++++++
  1 file changed, 6 insertions(+)

diff --git a/net/netfilter/nf_tables_api.c b/net/netfilter/nf_tables_api.c
index d73edbd4eec4..849523ece109 100644
--- a/net/netfilter/nf_tables_api.c
+++ b/net/netfilter/nf_tables_api.c
@@ -10333,9 +10333,15 @@ static void __net_exit 
nf_tables_exit_net(struct net *net)
  	struct nftables_pernet *nft_net = nft_pernet(net);
   	mutex_lock(&nft_net->commit_mutex);
+	/* Need to call synchronize_rcu() to let any active rcu lockless
+	 * readers to finish. __nf_tables_abort does this internaly so
+	 * only call it here if there is nothing to abort.
+	 */
  	if (!list_empty(&nft_net->commit_list) ||
  	    !list_empty(&nft_net->module_list))
  		__nf_tables_abort(net, NFNL_ABORT_NONE);
+	else
+		synchronize_rcu();
  	__nft_release_tables(net);
  	mutex_unlock(&nft_net->commit_mutex);
  	WARN_ON_ONCE(!list_empty(&nft_net->tables));
-- 
2.31.1


^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
  2023-02-27 12:21 ` [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables Alexander Atanasov
@ 2023-02-27 12:44   ` Florian Westphal
  2023-02-27 13:43     ` Alexander Atanasov
  0 siblings, 1 reply; 7+ messages in thread
From: Florian Westphal @ 2023-02-27 12:44 UTC (permalink / raw)
  To: Alexander Atanasov; +Cc: netdev

Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
> general protection fault, probably for non-canonical
> address 0xdead000000000115: 0000 [#1] PREEMPT SMP NOPTI
> RIP: 0010:__nf_tables_dump_rules+0x10d/0x170 [nf_tables]
> 
> __nf_tables_dump_rules runs under rcu_read_lock while __nft_release_table
> is called from nf_tables_exit_net. commit_mutex is held inside
> nf_tables_exit_net but this is not enough to guard against
> lockless readers. When __nft_release_table does list_del(&rule->list)
> next ptr is poisoned and it crashes while walking the list.
> 
> Before calling __nft_release_tables all lockless readers must be done -
> to ensure this a call to synchronize_rcu() is required.
> 
> nf_tables_exit_net does this in case there is something to abort
> inside __nf_tables_abort but it does not do so otherwise.
> Fix this by add the missing synchronize_rcu() call before calling
> __nft_release_table in the nothing to abort case.
> 
> Fixes: 6001a930ce03 ("netfilter: nftables: introduce table ownership")
> Signed-off-by: Alexander Atanasov <alexander.atanasov@virtuozzo.com>
> ---
>  net/netfilter/nf_tables_api.c | 6 ++++++
>  1 file changed, 6 insertions(+)
> 
> diff --git a/net/netfilter/nf_tables_api.c b/net/netfilter/nf_tables_api.c
> index d73edbd4eec4..849523ece109 100644
> --- a/net/netfilter/nf_tables_api.c
> +++ b/net/netfilter/nf_tables_api.c
> @@ -10333,9 +10333,15 @@ static void __net_exit nf_tables_exit_net(struct
> net *net)
>  	struct nftables_pernet *nft_net = nft_pernet(net);
>   	mutex_lock(&nft_net->commit_mutex);
> +	/* Need to call synchronize_rcu() to let any active rcu lockless
> +	 * readers to finish. __nf_tables_abort does this internaly so
> +	 * only call it here if there is nothing to abort.
> +	 */
>  	if (!list_empty(&nft_net->commit_list) ||
>  	    !list_empty(&nft_net->module_list))
>  		__nf_tables_abort(net, NFNL_ABORT_NONE);
> +	else
> +		synchronize_rcu();

Wouldn't it be better to just drop those list_empty() checks?
AFAICS __nf_tables_abort will DTRT in that case.

You can still add a comment like the one you added above to make
it clear that we also need to wait for those readers to finish.

Lastly, that list_del() in __nft_release_basechain should probably
be list_del_rcu()?

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
  2023-02-27 12:44   ` Florian Westphal
@ 2023-02-27 13:43     ` Alexander Atanasov
       [not found]       ` <20230227161140.GA31439@breakpoint.cc>
  0 siblings, 1 reply; 7+ messages in thread
From: Alexander Atanasov @ 2023-02-27 13:43 UTC (permalink / raw)
  To: Florian Westphal
  Cc: netdev, ,Pablo Neira Ayuso, ,Jozsef Kadlecsik, ,Eric Dumazet,
	,David S. Miller, ,Jakub Kicinski, ,Paolo Abeni,
	",kernel"

Hello,

On 27.02.23 14:44, Florian Westphal wrote:
> Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
>> general protection fault, probably for non-canonical
>> address 0xdead000000000115: 0000 [#1] PREEMPT SMP NOPTI
>> RIP: 0010:__nf_tables_dump_rules+0x10d/0x170 [nf_tables]
>>
>> __nf_tables_dump_rules runs under rcu_read_lock while __nft_release_table
>> is called from nf_tables_exit_net. commit_mutex is held inside
>> nf_tables_exit_net but this is not enough to guard against
>> lockless readers. When __nft_release_table does list_del(&rule->list)
>> next ptr is poisoned and it crashes while walking the list.
>>
>> Before calling __nft_release_tables all lockless readers must be done -
>> to ensure this a call to synchronize_rcu() is required.
>>
>> nf_tables_exit_net does this in case there is something to abort
>> inside __nf_tables_abort but it does not do so otherwise.
>> Fix this by add the missing synchronize_rcu() call before calling
>> __nft_release_table in the nothing to abort case.
>>
>> Fixes: 6001a930ce03 ("netfilter: nftables: introduce table ownership")
>> Signed-off-by: Alexander Atanasov <alexander.atanasov@virtuozzo.com>
>> ---
>>   net/netfilter/nf_tables_api.c | 6 ++++++
>>   1 file changed, 6 insertions(+)
>>
>> diff --git a/net/netfilter/nf_tables_api.c b/net/netfilter/nf_tables_api.c
>> index d73edbd4eec4..849523ece109 100644
>> --- a/net/netfilter/nf_tables_api.c
>> +++ b/net/netfilter/nf_tables_api.c
>> @@ -10333,9 +10333,15 @@ static void __net_exit nf_tables_exit_net(struct
>> net *net)
>>   	struct nftables_pernet *nft_net = nft_pernet(net);
>>    	mutex_lock(&nft_net->commit_mutex);
>> +	/* Need to call synchronize_rcu() to let any active rcu lockless
>> +	 * readers to finish. __nf_tables_abort does this internaly so
>> +	 * only call it here if there is nothing to abort.
>> +	 */
>>   	if (!list_empty(&nft_net->commit_list) ||
>>   	    !list_empty(&nft_net->module_list))
>>   		__nf_tables_abort(net, NFNL_ABORT_NONE);
>> +	else
>> +		synchronize_rcu();
> 
> Wouldn't it be better to just drop those list_empty() checks?
> AFAICS __nf_tables_abort will DTRT in that case.

Ok, i will drop the checks.

> You can still add a comment like the one you added above to make
> it clear that we also need to wait for those readers to finish.

Ok.

> Lastly, that list_del() in __nft_release_basechain should probably
> be list_del_rcu()?

I am still in process of untwisting that place but so far.
Simple change to list_del_rcu wouldn't help as it wouldn't in 
__nft_release_table:

list_del(&rule->list);
ctx->chain->use--;
nf_tables_rule_release(ctx, rule) {
	nft_rule_expr_deactivate(ctx, rule, NFT_TRANS_RELEASE);
	nf_tables_rule_destroy(ctx, rule) {
		kfree(rule); <-- freed here
	}
}

List traversal would work but instead of crash it would become use after 
free.
Adding synchronize_rcu() before list iterattion there will probably do, 
it is already under commit_mutex when called from nf_tables_netdev_event.


-- 
Regards,
Alexander Atanasov


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
       [not found]       ` <20230227161140.GA31439@breakpoint.cc>
@ 2023-02-27 18:50         ` Alexander Atanasov
  2023-02-27 23:31           ` Florian Westphal
  0 siblings, 1 reply; 7+ messages in thread
From: Alexander Atanasov @ 2023-02-27 18:50 UTC (permalink / raw)
  To: Florian Westphal
  Cc: netdev, Pablo Neira Ayuso, Jozsef Kadlecsik, Eric Dumazet,
	David S. Miller, Jakub Kicinski, Paolo Abeni

On 27.02.23 18:11, Florian Westphal wrote:
> Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
>>> Lastly, that list_del() in __nft_release_basechain should probably
>>> be list_del_rcu()?
>>
>> I am still in process of untwisting that place but so far.
>> Simple change to list_del_rcu wouldn't help as it wouldn't in
>> __nft_release_table:
>>
>> list_del(&rule->list);
>> ctx->chain->use--;
>> nf_tables_rule_release(ctx, rule) {
>> 	nft_rule_expr_deactivate(ctx, rule, NFT_TRANS_RELEASE);
>> 	nf_tables_rule_destroy(ctx, rule) {
>> 		kfree(rule); <-- freed here
>> 	}
>> }
>>
>> List traversal would work but instead of crash it would become use after
>> free.
>> Adding synchronize_rcu() before list iterattion there will probably do, it
>> is already under commit_mutex when called from nf_tables_netdev_event.
> 
> Hmm, please wait.  I have to look at this in more detail.
> I don't see a race conditon in the first place.
>
> netns dismantling already does synchronize_rcu(), so I don't see how we > can have this uaf in the first place.

As i said i am still trying to figure out the basechain place,
where is that synchronize_rcu() call done?

> Do you see this with current kernels or did the splat happen with
> an older version?

It's with a bit older kernel but there is no significant difference
wrt nf_tables_api code.
I will prepare a more detailed report for you. Unfortunately there is
no reproducer.

-- 
Regards,
Alexander Atanasov


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
  2023-02-27 18:50         ` Alexander Atanasov
@ 2023-02-27 23:31           ` Florian Westphal
  2023-02-28  9:54             ` Alexander Atanasov
  0 siblings, 1 reply; 7+ messages in thread
From: Florian Westphal @ 2023-02-27 23:31 UTC (permalink / raw)
  To: Alexander Atanasov
  Cc: Florian Westphal, netdev, Pablo Neira Ayuso, Jozsef Kadlecsik,
	Eric Dumazet, David S. Miller, Jakub Kicinski, Paolo Abeni

Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
> As i said i am still trying to figure out the basechain place,
> where is that synchronize_rcu() call done?

cleanup_net() in net/core/net_namespace.c.

pre_exit handlers run, then synchronize_rcu, then the
normal exit handlers, then exit_batch.

> > Do you see this with current kernels or did the splat happen with
> > an older version?
> 
> It's with a bit older kernel but there is no significant difference
> wrt nf_tables_api code.
> I will prepare a more detailed report for you.

Thanks.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
  2023-02-27 23:31           ` Florian Westphal
@ 2023-02-28  9:54             ` Alexander Atanasov
  2023-02-28 10:59               ` Florian Westphal
  0 siblings, 1 reply; 7+ messages in thread
From: Alexander Atanasov @ 2023-02-28  9:54 UTC (permalink / raw)
  To: Florian Westphal
  Cc: netdev, Pablo Neira Ayuso, Jozsef Kadlecsik, Eric Dumazet,
	David S. Miller, Jakub Kicinski, Paolo Abeni

On 28.02.23 1:31, Florian Westphal wrote:
> Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
>> As i said i am still trying to figure out the basechain place,
>> where is that synchronize_rcu() call done?
> 
> cleanup_net() in net/core/net_namespace.c.
> 
> pre_exit handlers run, then synchronize_rcu, then the
> normal exit handlers, then exit_batch.

It prevents anyone new to find the namespace but it does not guard 
against the ones that have already found it.
What stops them to enter a rcu_read_lock() section after the synchronize 
call in cleanup_net() is done and race with the exit handler?

synchronize_rcu() must be called with the commit_mutex held to be safe 
against lock less readers using data protected with commit_mutext.

-- 
Regards,
Alexander Atanasov


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables
  2023-02-28  9:54             ` Alexander Atanasov
@ 2023-02-28 10:59               ` Florian Westphal
  0 siblings, 0 replies; 7+ messages in thread
From: Florian Westphal @ 2023-02-28 10:59 UTC (permalink / raw)
  To: Alexander Atanasov
  Cc: Florian Westphal, netdev, Pablo Neira Ayuso, Jozsef Kadlecsik,
	Eric Dumazet, David S. Miller, Jakub Kicinski, Paolo Abeni

Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
> On 28.02.23 1:31, Florian Westphal wrote:
> > Alexander Atanasov <alexander.atanasov@virtuozzo.com> wrote:
> > > As i said i am still trying to figure out the basechain place,
> > > where is that synchronize_rcu() call done?
> > 
> > cleanup_net() in net/core/net_namespace.c.
> > 
> > pre_exit handlers run, then synchronize_rcu, then the
> > normal exit handlers, then exit_batch.
> 
> It prevents anyone new to find the namespace but it does not guard against
> the ones that have already found it.

The netns is being dismantled, how can there be any process left?

> What stops them to enter a rcu_read_lock() section after the synchronize
> call in cleanup_net() is done and race with the exit handler?

There should be no task in the first place.

> synchronize_rcu() must be called with the commit_mutex held to be safe
> against lock less readers using data protected with commit_mutext.

Sorry, I do not understand this bug nor the fix.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2023-02-28 10:59 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20230227121720.3775652-1-alexander.atanasov@virtuozzo.com>
2023-02-27 12:21 ` [PATCH] netfilter: nf_tables: always synchronize with readers before releasing tables Alexander Atanasov
2023-02-27 12:44   ` Florian Westphal
2023-02-27 13:43     ` Alexander Atanasov
     [not found]       ` <20230227161140.GA31439@breakpoint.cc>
2023-02-27 18:50         ` Alexander Atanasov
2023-02-27 23:31           ` Florian Westphal
2023-02-28  9:54             ` Alexander Atanasov
2023-02-28 10:59               ` Florian Westphal

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).