* [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
@ 2018-02-14 3:00 Yonghong Song
2018-02-14 3:17 ` Alexei Starovoitov
0 siblings, 1 reply; 4+ messages in thread
From: Yonghong Song @ 2018-02-14 3:00 UTC (permalink / raw)
To: ast, daniel, netdev; +Cc: kernel-team
There is a memory leak happening in lpm_trie map_free callback
function trie_free. The trie structure itself does not get freed.
Also, trie_free function did not do synchronize_rcu before freeing
various data structures. This is incorrect as some rcu_read_lock
region(s) for lookup, update, delete or get_next_key may not complete yet.
The fix is to add synchronize_rcu in the beginning of trie_free.
The useless spin_lock is removed from this function as well.
Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
Reported-by: Mathieu Malaterre <malat@debian.org>
Reported-by: Alexei Starovoitov <ast@kernel.org>
Tested-by: Mathieu Malaterre <malat@debian.org>
Signed-off-by: Yonghong Song <yhs@fb.com>
---
kernel/bpf/lpm_trie.c | 11 +++++++----
1 file changed, 7 insertions(+), 4 deletions(-)
v1->v2:
Make comments more precise and make label name more appropriate,
as suggested by Daniel
diff --git a/kernel/bpf/lpm_trie.c b/kernel/bpf/lpm_trie.c
index 7b469d1..a75e02c 100644
--- a/kernel/bpf/lpm_trie.c
+++ b/kernel/bpf/lpm_trie.c
@@ -555,7 +555,10 @@ static void trie_free(struct bpf_map *map)
struct lpm_trie_node __rcu **slot;
struct lpm_trie_node *node;
- raw_spin_lock(&trie->lock);
+ /* Wait for outstanding programs to complete
+ * update/lookup/delete/get_next_key and free the trie.
+ */
+ synchronize_rcu();
/* Always start at the root and walk down to a node that has no
* children. Then free that node, nullify its reference in the parent
@@ -569,7 +572,7 @@ static void trie_free(struct bpf_map *map)
node = rcu_dereference_protected(*slot,
lockdep_is_held(&trie->lock));
if (!node)
- goto unlock;
+ goto out;
if (rcu_access_pointer(node->child[0])) {
slot = &node->child[0];
@@ -587,8 +590,8 @@ static void trie_free(struct bpf_map *map)
}
}
-unlock:
- raw_spin_unlock(&trie->lock);
+out:
+ kfree(trie);
}
static int trie_get_next_key(struct bpf_map *map, void *_key, void *_next_key)
--
2.9.5
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
2018-02-14 3:00 [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function Yonghong Song
@ 2018-02-14 3:17 ` Alexei Starovoitov
2018-02-22 3:40 ` Eric Dumazet
0 siblings, 1 reply; 4+ messages in thread
From: Alexei Starovoitov @ 2018-02-14 3:17 UTC (permalink / raw)
To: Yonghong Song; +Cc: ast, daniel, netdev, kernel-team
On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
> There is a memory leak happening in lpm_trie map_free callback
> function trie_free. The trie structure itself does not get freed.
>
> Also, trie_free function did not do synchronize_rcu before freeing
> various data structures. This is incorrect as some rcu_read_lock
> region(s) for lookup, update, delete or get_next_key may not complete yet.
> The fix is to add synchronize_rcu in the beginning of trie_free.
> The useless spin_lock is removed from this function as well.
>
> Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
> Reported-by: Mathieu Malaterre <malat@debian.org>
> Reported-by: Alexei Starovoitov <ast@kernel.org>
> Tested-by: Mathieu Malaterre <malat@debian.org>
> Signed-off-by: Yonghong Song <yhs@fb.com>
> ---
> kernel/bpf/lpm_trie.c | 11 +++++++----
> 1 file changed, 7 insertions(+), 4 deletions(-)
>
> v1->v2:
> Make comments more precise and make label name more appropriate,
> as suggested by Daniel
Applied to bpf tree, Thanks Yonghong.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
2018-02-14 3:17 ` Alexei Starovoitov
@ 2018-02-22 3:40 ` Eric Dumazet
2018-02-22 3:55 ` Yonghong Song
0 siblings, 1 reply; 4+ messages in thread
From: Eric Dumazet @ 2018-02-22 3:40 UTC (permalink / raw)
To: Alexei Starovoitov, Yonghong Song; +Cc: ast, daniel, netdev, kernel-team
On Tue, 2018-02-13 at 19:17 -0800, Alexei Starovoitov wrote:
> On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
> > There is a memory leak happening in lpm_trie map_free callback
> > function trie_free. The trie structure itself does not get freed.
> >
> > Also, trie_free function did not do synchronize_rcu before freeing
> > various data structures. This is incorrect as some rcu_read_lock
> > region(s) for lookup, update, delete or get_next_key may not complete yet.
> > The fix is to add synchronize_rcu in the beginning of trie_free.
> > The useless spin_lock is removed from this function as well.
> >
> > Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
> > Reported-by: Mathieu Malaterre <malat@debian.org>
> > Reported-by: Alexei Starovoitov <ast@kernel.org>
> > Tested-by: Mathieu Malaterre <malat@debian.org>
> > Signed-off-by: Yonghong Song <yhs@fb.com>
> > ---
> > kernel/bpf/lpm_trie.c | 11 +++++++----
> > 1 file changed, 7 insertions(+), 4 deletions(-)
> >
> > v1->v2:
> > Make comments more precise and make label name more appropriate,
> > as suggested by Daniel
>
> Applied to bpf tree, Thanks Yonghong.
This does not look good.
LOCKDEP surely should complain to
node = rcu_dereference_protected(*slot, lockdep_is_held(&trie->lock));
Since we no longer hold trie->lock
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function
2018-02-22 3:40 ` Eric Dumazet
@ 2018-02-22 3:55 ` Yonghong Song
0 siblings, 0 replies; 4+ messages in thread
From: Yonghong Song @ 2018-02-22 3:55 UTC (permalink / raw)
To: Eric Dumazet, Alexei Starovoitov; +Cc: ast, daniel, netdev, kernel-team
On 2/21/18 7:40 PM, Eric Dumazet wrote:
> On Tue, 2018-02-13 at 19:17 -0800, Alexei Starovoitov wrote:
>> On Tue, Feb 13, 2018 at 07:00:21PM -0800, Yonghong Song wrote:
>>> There is a memory leak happening in lpm_trie map_free callback
>>> function trie_free. The trie structure itself does not get freed.
>>>
>>> Also, trie_free function did not do synchronize_rcu before freeing
>>> various data structures. This is incorrect as some rcu_read_lock
>>> region(s) for lookup, update, delete or get_next_key may not complete yet.
>>> The fix is to add synchronize_rcu in the beginning of trie_free.
>>> The useless spin_lock is removed from this function as well.
>>>
>>> Fixes: b95a5c4db09b ("bpf: add a longest prefix match trie map implementation")
>>> Reported-by: Mathieu Malaterre <malat@debian.org>
>>> Reported-by: Alexei Starovoitov <ast@kernel.org>
>>> Tested-by: Mathieu Malaterre <malat@debian.org>
>>> Signed-off-by: Yonghong Song <yhs@fb.com>
>>> ---
>>> kernel/bpf/lpm_trie.c | 11 +++++++----
>>> 1 file changed, 7 insertions(+), 4 deletions(-)
>>>
>>> v1->v2:
>>> Make comments more precise and make label name more appropriate,
>>> as suggested by Daniel
>>
>> Applied to bpf tree, Thanks Yonghong.
>
>
> This does not look good.
>
> LOCKDEP surely should complain to
>
> node = rcu_dereference_protected(*slot, lockdep_is_held(&trie->lock));
>
> Since we no longer hold trie->lock
Eric,
Thanks for spotting this issue. Will fix this issue soon.
Yonghong
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2018-02-22 3:59 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-02-14 3:00 [PATCH bpf v2] bpf: fix memory leak in lpm_trie map_free callback function Yonghong Song
2018-02-14 3:17 ` Alexei Starovoitov
2018-02-22 3:40 ` Eric Dumazet
2018-02-22 3:55 ` Yonghong Song
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.