Netdev List
 help / color / mirror / Atom feed
* [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
@ 2018-02-14  3:00 Yonghong Song
  2018-02-14  3:17 ` Alexei Starovoitov
  0 siblings, 1 reply; 4+ messages in thread
From: Yonghong Song @ 2018-02-14  3:00 UTC (permalink / raw)
  To: ast, daniel, netdev; +Cc: kernel-team

There is a memory leak happening in lpm_trie map_free callback
function trie_free. The trie structure itself does not get freed.

Also, trie_free function did not do synchronize_rcu before freeing
various data structures. This is incorrect as some rcu_read_lock
region(s) for lookup, update, delete or get_next_key may not complete yet.
The fix is to add synchronize_rcu in the beginning of trie_free.
The useless spin_lock is removed from this function as well.

Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
Reported-by: Mathieu Malaterre <malat@debian.org>
Reported-by: Alexei Starovoitov <ast@kernel.org>
Tested-by: Mathieu Malaterre <malat@debian.org>
Signed-off-by: Yonghong Song <yhs@fb.com>
---
 kernel/bpf/lpm_trie.c | 11 +++++++----
 1 file changed, 7 insertions(+), 4 deletions(-)

v1->v2:
  Make comments more precise and make label name more appropriate,
  as suggested by Daniel

diff --git a/kernel/bpf/lpm_trie.c b/kernel/bpf/lpm_trie.c
index 7b469d1..a75e02c 100644
--- a/kernel/bpf/lpm_trie.c
+++ b/kernel/bpf/lpm_trie.c
@@ -555,7 +555,10 @@ static void trie_free(struct bpf_map *map)
 	struct lpm_trie_node __rcu **slot;
 	struct lpm_trie_node *node;
 
-	raw_spin_lock(&trie->lock);
+	/* Wait for outstanding programs to complete
+	 * update/lookup/delete/get_next_key and free the trie.
+	 */
+	synchronize_rcu();
 
 	/* Always start at the root and walk down to a node that has no
 	 * children. Then free that node, nullify its reference in the parent
@@ -569,7 +572,7 @@ static void trie_free(struct bpf_map *map)
 			node = rcu_dereference_protected(*slot,
 					lockdep_is_held(&trie->lock));
 			if (!node)
-				goto unlock;
+				goto out;
 
 			if (rcu_access_pointer(node->child[0])) {
 				slot = &node->child[0];
@@ -587,8 +590,8 @@ static void trie_free(struct bpf_map *map)
 		}
 	}
 
-unlock:
-	raw_spin_unlock(&trie->lock);
+out:
+	kfree(trie);
 }
 
 static int trie_get_next_key(struct bpf_map *map, void *_key, void *_next_key)
-- 
2.9.5

^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
  2018-02-14  3:00 [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function Yonghong Song
@ 2018-02-14  3:17 ` Alexei Starovoitov
  2018-02-22  3:40   ` Eric Dumazet
  0 siblings, 1 reply; 4+ messages in thread
From: Alexei Starovoitov @ 2018-02-14  3:17 UTC (permalink / raw)
  To: Yonghong Song; +Cc: ast, daniel, netdev, kernel-team

On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
> There is a memory leak happening in lpm_trie map_free callback
> function trie_free. The trie structure itself does not get freed.
> 
> Also, trie_free function did not do synchronize_rcu before freeing
> various data structures. This is incorrect as some rcu_read_lock
> region(s) for lookup, update, delete or get_next_key may not complete yet.
> The fix is to add synchronize_rcu in the beginning of trie_free.
> The useless spin_lock is removed from this function as well.
> 
> Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
> Reported-by: Mathieu Malaterre <malat@debian.org>
> Reported-by: Alexei Starovoitov <ast@kernel.org>
> Tested-by: Mathieu Malaterre <malat@debian.org>
> Signed-off-by: Yonghong Song <yhs@fb.com>
> ---
>  kernel/bpf/lpm_trie.c | 11 +++++++----
>  1 file changed, 7 insertions(+), 4 deletions(-)
> 
> v1->v2:
>   Make comments more precise and make label name more appropriate,
>   as suggested by Daniel

Applied to bpf tree, Thanks Yonghong.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
  2018-02-14  3:17 ` Alexei Starovoitov
@ 2018-02-22  3:40   ` Eric Dumazet
  2018-02-22  3:55     ` Yonghong Song
  0 siblings, 1 reply; 4+ messages in thread
From: Eric Dumazet @ 2018-02-22  3:40 UTC (permalink / raw)
  To: Alexei Starovoitov, Yonghong Song; +Cc: ast, daniel, netdev, kernel-team

On Tue, 2018-02-13 at 19:17 -0800, Alexei Starovoitov wrote:
> On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
> > There is a memory leak happening in lpm_trie map_free callback
> > function trie_free. The trie structure itself does not get freed.
> > 
> > Also, trie_free function did not do synchronize_rcu before freeing
> > various data structures. This is incorrect as some rcu_read_lock
> > region(s) for lookup, update, delete or get_next_key may not complete yet.
> > The fix is to add synchronize_rcu in the beginning of trie_free.
> > The useless spin_lock is removed from this function as well.
> > 
> > Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
> > Reported-by: Mathieu Malaterre <malat@debian.org>
> > Reported-by: Alexei Starovoitov <ast@kernel.org>
> > Tested-by: Mathieu Malaterre <malat@debian.org>
> > Signed-off-by: Yonghong Song <yhs@fb.com>
> > ---
> >  kernel/bpf/lpm_trie.c | 11 +++++++----
> >  1 file changed, 7 insertions(+), 4 deletions(-)
> > 
> > v1->v2:
> >   Make comments more precise and make label name more appropriate,
> >   as suggested by Daniel
> 
> Applied to bpf tree, Thanks Yonghong.


This does not look good.

LOCKDEP surely should complain to

node = rcu_dereference_protected(*slot, lockdep_is_held(&trie->lock));

Since we no longer hold trie->lock

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
  2018-02-22  3:40   ` Eric Dumazet
@ 2018-02-22  3:55     ` Yonghong Song
  0 siblings, 0 replies; 4+ messages in thread
From: Yonghong Song @ 2018-02-22  3:55 UTC (permalink / raw)
  To: Eric Dumazet, Alexei Starovoitov; +Cc: ast, daniel, netdev, kernel-team



On 2/21/18 7:40 PM, Eric Dumazet wrote:
> On Tue, 2018-02-13 at 19:17 -0800, Alexei Starovoitov wrote:
>> On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
>>> There is a memory leak happening in lpm_trie map_free callback
>>> function trie_free. The trie structure itself does not get freed.
>>>
>>> Also, trie_free function did not do synchronize_rcu before freeing
>>> various data structures. This is incorrect as some rcu_read_lock
>>> region(s) for lookup, update, delete or get_next_key may not complete yet.
>>> The fix is to add synchronize_rcu in the beginning of trie_free.
>>> The useless spin_lock is removed from this function as well.
>>>
>>> Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
>>> Reported-by: Mathieu Malaterre <malat@debian.org>
>>> Reported-by: Alexei Starovoitov <ast@kernel.org>
>>> Tested-by: Mathieu Malaterre <malat@debian.org>
>>> Signed-off-by: Yonghong Song <yhs@fb.com>
>>> ---
>>>   kernel/bpf/lpm_trie.c | 11 +++++++----
>>>   1 file changed, 7 insertions(+), 4 deletions(-)
>>>
>>> v1->v2:
>>>    Make comments more precise and make label name more appropriate,
>>>    as suggested by Daniel
>>
>> Applied to bpf tree, Thanks Yonghong.
> 
> 
> This does not look good.
> 
> LOCKDEP surely should complain to
> 
> node = rcu_dereference_protected(*slot, lockdep_is_held(&trie->lock));
> 
> Since we no longer hold trie->lock

Eric,

Thanks for spotting this issue. Will fix this issue soon.

Yonghong

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2018-02-22  3:59 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-02-14  3:00 [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function Yonghong Song
2018-02-14  3:17 ` Alexei Starovoitov
2018-02-22  3:40   ` Eric Dumazet
2018-02-22  3:55     ` Yonghong Song

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox