netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH net-next] tun: fix memory leak in error path
@ 2019-10-07 19:21 Eric Dumazet
  2019-10-08 19:31 ` Jakub Kicinski
  0 siblings, 1 reply; 4+ messages in thread
From: Eric Dumazet @ 2019-10-07 19:21 UTC (permalink / raw)
  To: David S . Miller; +Cc: netdev, Eric Dumazet, Eric Dumazet, Jiri Pirko, syzbot

syzbot reported a warning [1] that triggered after recent Jiri patch.

This exposes a bug that we hit already in the past (see commit
ff244c6b29b1 ("tun: handle register_netdevice() failures properly")
for details)

tun uses priv->destructor without an ndo_init() method.

register_netdevice() can return an error, but will
not call priv->destructor() in some cases. Jiri recent
patch added one more.

A long term fix would be to transfer the initialization
of what we destroy in ->destructor() in the ndo_init()

This looks a bit risky given the complexity of tun driver.

A simpler fix is to detect after the failed register_netdevice()
if the tun_free_netdev() function was called already.

[1]
ODEBUG: free active (active state 0) object type: timer_list hint: tun_flow_cleanup+0x0/0x280 drivers/net/tun.c:457
WARNING: CPU: 0 PID: 8653 at lib/debugobjects.c:481 debug_print_object+0x168/0x250 lib/debugobjects.c:481
Kernel panic - not syncing: panic_on_warn set ...
CPU: 0 PID: 8653 Comm: syz-executor976 Not tainted 5.4.0-rc1-next-20191004 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 __dump_stack lib/dump_stack.c:77 [inline]
 dump_stack+0x172/0x1f0 lib/dump_stack.c:113
 panic+0x2dc/0x755 kernel/panic.c:220
 __warn.cold+0x2f/0x3c kernel/panic.c:581
 report_bug+0x289/0x300 lib/bug.c:195
 fixup_bug arch/x86/kernel/traps.c:174 [inline]
 fixup_bug arch/x86/kernel/traps.c:169 [inline]
 do_error_trap+0x11b/0x200 arch/x86/kernel/traps.c:267
 do_invalid_op+0x37/0x50 arch/x86/kernel/traps.c:286
 invalid_op+0x23/0x30 arch/x86/entry/entry_64.S:1028
RIP: 0010:debug_print_object+0x168/0x250 lib/debugobjects.c:481
Code: dd 80 b9 e6 87 48 89 fa 48 c1 ea 03 80 3c 02 00 0f 85 b5 00 00 00 48 8b 14 dd 80 b9 e6 87 48 c7 c7 e0 ae e6 87 e8 80 84 ff fd <0f> 0b 83 05 e3 ee 80 06 01 48 83 c4 20 5b 41 5c 41 5d 41 5e 5d c3
RSP: 0018:ffff888095997a28 EFLAGS: 00010082
RAX: 0000000000000000 RBX: 0000000000000003 RCX: 0000000000000000
RDX: 0000000000000000 RSI: ffffffff815cb526 RDI: ffffed1012b32f37
RBP: ffff888095997a68 R08: ffff8880a92ac580 R09: ffffed1015d04101
R10: ffffed1015d04100 R11: ffff8880ae820807 R12: 0000000000000001
R13: ffffffff88fb5340 R14: ffffffff81627110 R15: ffff8880aa41eab8
 __debug_check_no_obj_freed lib/debugobjects.c:963 [inline]
 debug_check_no_obj_freed+0x2d4/0x43f lib/debugobjects.c:994
 kfree+0xf8/0x2c0 mm/slab.c:3755
 kvfree+0x61/0x70 mm/util.c:593
 netdev_freemem net/core/dev.c:9384 [inline]
 free_netdev+0x39d/0x450 net/core/dev.c:9533
 tun_set_iff drivers/net/tun.c:2871 [inline]
 __tun_chr_ioctl+0x317b/0x3f30 drivers/net/tun.c:3075
 tun_chr_ioctl+0x2b/0x40 drivers/net/tun.c:3355
 vfs_ioctl fs/ioctl.c:47 [inline]
 file_ioctl fs/ioctl.c:539 [inline]
 do_vfs_ioctl+0xdb6/0x13e0 fs/ioctl.c:726
 ksys_ioctl+0xab/0xd0 fs/ioctl.c:743
 __do_sys_ioctl fs/ioctl.c:750 [inline]
 __se_sys_ioctl fs/ioctl.c:748 [inline]
 __x64_sys_ioctl+0x73/0xb0 fs/ioctl.c:748
 do_syscall_64+0xfa/0x760 arch/x86/entry/common.c:290
 entry_SYSCALL_64_after_hwframe+0x49/0xbe
RIP: 0033:0x441439
Code: e8 9c ae 02 00 48 83 c4 18 c3 0f 1f 80 00 00 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 0f 83 3b 0a fc ff c3 66 2e 0f 1f 84 00 00 00 00
RSP: 002b:00007fff61c37438 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 0000000000441439
RDX: 0000000020000400 RSI: 00000000400454ca RDI: 0000000000000004
RBP: 00007fff61c37470 R08: 0000000000000001 R09: 0000000100000000
R10: 0000000000000000 R11: 0000000000000246 R12: ffffffffffffffff
R13: 0000000000000005 R14: 0000000000000000 R15: 0000000000000000
Kernel Offset: disabled
Rebooting in 86400 seconds..

Fixes: ff92741270bf ("net: introduce name_node struct to be used in hashlist")
Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Jiri Pirko <jiri@mellanox.com>
Reported-by: syzbot <syzkaller@googlegroups.com>
---
 drivers/net/tun.c | 14 ++++++++++++--
 1 file changed, 12 insertions(+), 2 deletions(-)

diff --git a/drivers/net/tun.c b/drivers/net/tun.c
index 812dc3a65efbb9d1ee2724e73978dbc4803ec171..1e541b08b136364302aa524e31efb62062c43faa 100644
--- a/drivers/net/tun.c
+++ b/drivers/net/tun.c
@@ -2290,7 +2290,13 @@ static void tun_free_netdev(struct net_device *dev)
 	struct tun_struct *tun = netdev_priv(dev);
 
 	BUG_ON(!(list_empty(&tun->disabled)));
+
 	free_percpu(tun->pcpu_stats);
+	/* We clear pcpu_stats so that tun_set_iff() can tell if
+	 * tun_free_netdev() has been called from register_netdevice().
+	 */
+	tun->pcpu_stats = NULL;
+
 	tun_flow_uninit(tun);
 	security_tun_dev_free_security(tun->security);
 	__tun_set_ebpf(tun, &tun->steering_prog, NULL);
@@ -2859,8 +2865,12 @@ static int tun_set_iff(struct net *net, struct file *file, struct ifreq *ifr)
 
 err_detach:
 	tun_detach_all(dev);
-	/* register_netdevice() already called tun_free_netdev() */
-	goto err_free_dev;
+	/* We are here because register_netdevice() has failed.
+	 * If register_netdevice() already called tun_free_netdev()
+	 * while dealing with the error, tun->pcpu_stats has been cleared.
+	 */
+	if (!tun->pcpu_stats)
+		goto err_free_dev;
 
 err_free_flow:
 	tun_flow_uninit(tun);
-- 
2.23.0.581.g78d2f28ef7-goog


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH net-next] tun: fix memory leak in error path
  2019-10-07 19:21 [PATCH net-next] tun: fix memory leak in error path Eric Dumazet
@ 2019-10-08 19:31 ` Jakub Kicinski
  2019-10-08 19:47   ` Eric Dumazet
  0 siblings, 1 reply; 4+ messages in thread
From: Jakub Kicinski @ 2019-10-08 19:31 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David S . Miller, netdev, Eric Dumazet, Jiri Pirko, syzbot

On Mon,  7 Oct 2019 12:21:05 -0700, Eric Dumazet wrote:
> syzbot reported a warning [1] that triggered after recent Jiri patch.
> 
> This exposes a bug that we hit already in the past (see commit
> ff244c6b29b1 ("tun: handle register_netdevice() failures properly")
> for details)
> 
> tun uses priv->destructor without an ndo_init() method.
> 
> register_netdevice() can return an error, but will
> not call priv->destructor() in some cases. Jiri recent
> patch added one more.
> 
> A long term fix would be to transfer the initialization
> of what we destroy in ->destructor() in the ndo_init()
> 
> This looks a bit risky given the complexity of tun driver.
> 
> A simpler fix is to detect after the failed register_netdevice()
> if the tun_free_netdev() function was called already.
> 
> [...]
> 
> Fixes: ff92741270bf ("net: introduce name_node struct to be used in hashlist")
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Jiri Pirko <jiri@mellanox.com>
> Reported-by: syzbot <syzkaller@googlegroups.com>

Looks good, obviously. Presumably we could remove the workaround added
by commit 0ad646c81b21 ("tun: call dev_get_valid_name() before
register_netdevice()") at this point? What are your thoughts on that?

> diff --git a/drivers/net/tun.c b/drivers/net/tun.c
> index 812dc3a65efbb9d1ee2724e73978dbc4803ec171..1e541b08b136364302aa524e31efb62062c43faa 100644
> --- a/drivers/net/tun.c
> +++ b/drivers/net/tun.c
> @@ -2290,7 +2290,13 @@ static void tun_free_netdev(struct net_device *dev)
>  	struct tun_struct *tun = netdev_priv(dev);
>  
>  	BUG_ON(!(list_empty(&tun->disabled)));
> +
>  	free_percpu(tun->pcpu_stats);
> +	/* We clear pcpu_stats so that tun_set_iff() can tell if
> +	 * tun_free_netdev() has been called from register_netdevice().
> +	 */
> +	tun->pcpu_stats = NULL;
> +
>  	tun_flow_uninit(tun);
>  	security_tun_dev_free_security(tun->security);
>  	__tun_set_ebpf(tun, &tun->steering_prog, NULL);
> @@ -2859,8 +2865,12 @@ static int tun_set_iff(struct net *net, struct file *file, struct ifreq *ifr)
>  
>  err_detach:
>  	tun_detach_all(dev);
> -	/* register_netdevice() already called tun_free_netdev() */
> -	goto err_free_dev;
> +	/* We are here because register_netdevice() has failed.
> +	 * If register_netdevice() already called tun_free_netdev()
> +	 * while dealing with the error, tun->pcpu_stats has been cleared.
> +	 */
> +	if (!tun->pcpu_stats)
> +		goto err_free_dev;
>  
>  err_free_flow:
>  	tun_flow_uninit(tun);


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH net-next] tun: fix memory leak in error path
  2019-10-08 19:31 ` Jakub Kicinski
@ 2019-10-08 19:47   ` Eric Dumazet
  2019-10-08 20:08     ` Jakub Kicinski
  0 siblings, 1 reply; 4+ messages in thread
From: Eric Dumazet @ 2019-10-08 19:47 UTC (permalink / raw)
  To: Jakub Kicinski; +Cc: David S . Miller, netdev, Eric Dumazet, Jiri Pirko, syzbot

On Tue, Oct 8, 2019 at 12:31 PM Jakub Kicinski
<jakub.kicinski@netronome.com> wrote:
>
> On Mon,  7 Oct 2019 12:21:05 -0700, Eric Dumazet wrote:
> > syzbot reported a warning [1] that triggered after recent Jiri patch.
> >
> > This exposes a bug that we hit already in the past (see commit
> > ff244c6b29b1 ("tun: handle register_netdevice() failures properly")
> > for details)
> >
> > tun uses priv->destructor without an ndo_init() method.
> >
> > register_netdevice() can return an error, but will
> > not call priv->destructor() in some cases. Jiri recent
> > patch added one more.
> >
> > A long term fix would be to transfer the initialization
> > of what we destroy in ->destructor() in the ndo_init()
> >
> > This looks a bit risky given the complexity of tun driver.
> >
> > A simpler fix is to detect after the failed register_netdevice()
> > if the tun_free_netdev() function was called already.
> >
> > [...]
> >
> > Fixes: ff92741270bf ("net: introduce name_node struct to be used in hashlist")
> > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > Cc: Jiri Pirko <jiri@mellanox.com>
> > Reported-by: syzbot <syzkaller@googlegroups.com>
>
> Looks good, obviously. Presumably we could remove the workaround added
> by commit 0ad646c81b21 ("tun: call dev_get_valid_name() before
> register_netdevice()") at this point? What are your thoughts on that?

This is indeed something that could be done now, maybe by an independent revert.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH net-next] tun: fix memory leak in error path
  2019-10-08 19:47   ` Eric Dumazet
@ 2019-10-08 20:08     ` Jakub Kicinski
  0 siblings, 0 replies; 4+ messages in thread
From: Jakub Kicinski @ 2019-10-08 20:08 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: David S . Miller, netdev, Eric Dumazet, Jiri Pirko, syzbot

On Tue, 8 Oct 2019 12:47:56 -0700, Eric Dumazet wrote:
> On Tue, Oct 8, 2019 at 12:31 PM Jakub Kicinski wrote:
> > On Mon,  7 Oct 2019 12:21:05 -0700, Eric Dumazet wrote:  
> > > syzbot reported a warning [1] that triggered after recent Jiri patch.
> > >
> > > This exposes a bug that we hit already in the past (see commit
> > > ff244c6b29b1 ("tun: handle register_netdevice() failures properly")
> > > for details)
> > >
> > > tun uses priv->destructor without an ndo_init() method.
> > >
> > > register_netdevice() can return an error, but will
> > > not call priv->destructor() in some cases. Jiri recent
> > > patch added one more.
> > >
> > > A long term fix would be to transfer the initialization
> > > of what we destroy in ->destructor() in the ndo_init()
> > >
> > > This looks a bit risky given the complexity of tun driver.
> > >
> > > A simpler fix is to detect after the failed register_netdevice()
> > > if the tun_free_netdev() function was called already.
> > >
> > > [...]
> > >
> > > Fixes: ff92741270bf ("net: introduce name_node struct to be used in hashlist")
> > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > Cc: Jiri Pirko <jiri@mellanox.com>
> > > Reported-by: syzbot <syzkaller@googlegroups.com>  
> >
> > Looks good, obviously. Presumably we could remove the workaround added
> > by commit 0ad646c81b21 ("tun: call dev_get_valid_name() before
> > register_netdevice()") at this point? What are your thoughts on that?  
> 
> This is indeed something that could be done now, maybe by an independent revert.

Independent revert seems like the best idea.

Applied this one, thanks!

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2019-10-08 20:08 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2019-10-07 19:21 [PATCH net-next] tun: fix memory leak in error path Eric Dumazet
2019-10-08 19:31 ` Jakub Kicinski
2019-10-08 19:47   ` Eric Dumazet
2019-10-08 20:08     ` Jakub Kicinski

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).