From: Stanislav Fomichev <sdf.kernel@gmail.com>
To: Jakub Kicinski <kuba@kernel.org>
Cc: Stanislav Fomichev <sdf.kernel@gmail.com>,
netdev@vger.kernel.org, davem@davemloft.net,
edumazet@google.com, pabeni@redhat.com
Subject: Re: [PATCH net-next v7 00/15] net: sleepable ndo_set_rx_mode
Date: Tue, 14 Apr 2026 12:24:39 -0700 [thread overview]
Message-ID: <ad6UEB5ripPKBfUD@devvm17672.vll0.facebook.com> (raw)
In-Reply-To: <20260413190831.1f3c53cb@kernel.org>
On 04/13, Jakub Kicinski wrote:
> On Mon, 13 Apr 2026 10:11:16 -0700 Stanislav Fomichev wrote:
> > This series adds a new ndo_set_rx_mode_async callback that enables
> > drivers to handle address list updates in a sleepable context. The
> > current ndo_set_rx_mode is called under the netif_addr_lock spinlock
> > with BHs disabled, which prevents drivers from sleeping. This is
> > problematic for ops-locked drivers that need to sleep.
>
> Hi hi, hit this on the new(ish) queue leasing test with bnxt (debug
> kernel):
>
> | [ 1148.733157] kselftest: Running tests in drivers/net/hw
> [ 1151.485032] ref_tracker: reference already released.
> [ 1151.491522] ref_tracker: allocated in:
> [ 1151.496526] __dev_set_rx_mode+0x398/0x4f0
> [ 1151.501923] dev_mc_add+0xe7/0x100
> [ 1151.506537] igmp6_group_added+0x31c/0x400
> [ 1151.511930] __ipv6_dev_mc_inc+0x282/0x590
> [ 1151.517320] __ipv6_sock_mc_join+0x40d/0x7c0
> [ 1151.522908] do_ipv6_setsockopt+0x3504/0x3700
> [ 1151.528594] ipv6_setsockopt+0x7e/0xf0
> [ 1151.533596] do_sock_setsockopt+0x164/0x3b0
> [ 1151.539089] __sys_setsockopt+0xe4/0x150
> [ 1151.544276] __x64_sys_setsockopt+0xbd/0x180
> [ 1151.549866] do_syscall_64+0xf3/0x5e0
> [ 1151.554763] entry_SYSCALL_64_after_hwframe+0x4b/0x53
> [ 1151.561229] ref_tracker: freed in:
> [ 1151.565840] netdev_rx_mode_work+0x205/0x410
> [ 1151.571429] process_one_work+0xdf5/0x1410
> [ 1151.576824] worker_thread+0x4f1/0xd60
> [ 1151.581827] kthread+0x364/0x460
> [ 1151.586246] ret_from_fork+0x4a4/0x720
> [ 1151.591251] ret_from_fork_asm+0x11/0x20
> [ 1151.596704] ------------[ cut here ]------------
> [ 1151.602747] WARNING: lib/ref_tracker.c:322 at ref_tracker_free.cold+0x10a/0x1f9, CPU#7: kworker/7:2/5262
> [ 1151.614292] Modules linked in:
> [ 1151.618626] CPU: 7 UID: 0 PID: 5262 Comm: kworker/7:2 Not tainted 7.0.0-rc7-hmtc-g3cb8f09d448a #1 PREEMPT(full)
> [ 1151.630928] Hardware name: Giga Computing E163-Z34-AAH1-000/MZ33-DC1-000, BIOS R30_F44 12/24/2025
> [ 1151.641762] Workqueue: events netdev_rx_mode_work
> [ 1151.647920] RIP: 0010:ref_tracker_free.cold+0x10a/0x1f9
> [ 1151.654646] Code: e0 2a 0f b6 04 02 84 c0 74 04 3c 03 7e 78 8b 7b 18 4c 89 04 24 e8 71 b2 e4 01 4c 8b 04 24 4c 89 c6 48 89 ef e8 32 92 05 04 90 <0f> 0b 90 ba ea ff ff ff e9 06 de e4 01 4c 89 ea 48 89 df 4c 89 04
> [ 1151.676612] RSP: 0018:ffa0000035f6fb18 EFLAGS: 00010296
> [ 1151.683343] RAX: 0000000000000000 RBX: ff110001f7ae9460 RCX: 0000000000000001
> [ 1151.692224] RDX: 00000000ffffffff RSI: 1ffffffff1841334 RDI: 0000000000000001
> [ 1151.701102] RBP: ff110001a3c12618 R08: 0000000000000000 R09: 0000000000000000
> [ 1151.709973] R10: 0000000000000007 R11: 0000000000000001 R12: 1ff4000006bedf67
> [ 1151.718835] R13: ff110001f7ae9478 R14: ff110001fd1949fc R15: ffffffff8a80b9a0
> [ 1151.727814] FS: 0000000000000000(0000) GS:ff11001882899000(0000) knlGS:0000000000000000
> [ 1151.737696] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 1151.744947] CR2: 00007f83287c3574 CR3: 00000017e26b0002 CR4: 0000000000771ef0
> [ 1151.753758] PKRU: 55555554
> [ 1151.757588] Call Trace:
> [ 1151.761130] <TASK>
> [ 1151.764282] ? ref_tracker_alloc+0x460/0x460
> [ 1151.769872] ? netdev_rx_mode_work+0x205/0x410
> [ 1151.775659] ? process_one_work+0xdf5/0x1410
> [ 1151.781249] ? worker_thread+0x4f1/0xd60
> [ 1151.786450] ? kthread+0x364/0x460
> [ 1151.791066] ? ret_from_fork+0x4a4/0x720
> [ 1151.796267] ? ret_from_fork_asm+0x11/0x20
> [ 1151.801666] ? mark_held_locks+0x40/0x70
> [ 1151.806870] netdev_rx_mode_work+0x205/0x410
> [ 1151.812461] ? trace_workqueue_execute_start+0x9b/0x180
> [ 1151.819124] ? process_one_work+0xdb4/0x1410
> [ 1151.824712] process_one_work+0xdf5/0x1410
> [ 1151.830111] ? pwq_dec_nr_in_flight+0x710/0x710
> [ 1151.835992] ? lock_acquire.part.0+0xbc/0x260
> [ 1151.841686] worker_thread+0x4f1/0xd60
> [ 1151.846694] ? rescuer_thread+0x1320/0x1320
> [ 1151.852188] kthread+0x364/0x460
> [ 1151.856609] ? trace_irq_enable.constprop.0+0x9b/0x180
> [ 1151.863180] ? kthread_affine_node+0x330/0x330
> [ 1151.868965] ret_from_fork+0x4a4/0x720
> [ 1151.873965] ? arch_exit_to_user_mode_prepare.isra.0+0xb0/0xb0
> [ 1151.881315] ? __switch_to+0x540/0xd00
> [ 1151.886320] ? kthread_affine_node+0x330/0x330
> [ 1151.892108] ret_from_fork_asm+0x11/0x20
> | [ 1151.905076] hardirqs last enabled at (7665): [<ffffffff847c61ea>] __up_console_sem+0x5a/0x70
> | [ 1151.915450] hardirqs last disabled at (7676): [<ffffffff847c61cf>] __up_console_sem+0x3f/0x70
> | [ 1151.925823] softirqs last enabled at (7600): [<ffffffff8462d50e>] handle_softirqs+0x60e/0x920
> | [ 1151.936291] softirqs last disabled at (7595): [<ffffffff8462dca2>] irq_exit_rcu+0xa2/0xf0
> | [ 1151.946275] ---[ end trace 0000000000000000 ]---
> [ 1152.247817] ref_tracker: netdev@ff110001a3c12618 has 1/1 users at\x0a __dev_set_rx_mode+0x398/0x4f0\x0a dev_mc_add+0xe7/0x100\x0a igmp6_group_added+0x31c/0x400\x0a __ipv6_dev_mc_inc+0x282/0x590\x0a addrconf_dad_begin+0x13c/0x540\x0a addrconf_dad_work+0x170/0x930\x0a process_one_work+0xdf5/0x1410\x0a worker_thread+0x4f1/0xd60\x0a kthread+0x364/0x460\x0a ret_from_fork+0x4a4/0x720\x0a ret_from_fork_asm+0x11/0x20\x0a
> [ 1152.318767] ------------[ cut here ]------------
> [ 1152.325512] WARNING: lib/ref_tracker.c:246 at ref_tracker_dir_exit+0x466/0x7e0, CPU#27: ip/15056
> [ 1152.336246] Modules linked in:
> [ 1152.340578] CPU: 27 UID: 0 PID: 15056 Comm: ip Tainted: G W 7.0.0-rc7-hmtc-g3cb8f09d448a #1 PREEMPT(full)
> [ 1152.353871] Tainted: [W]=WARN
> [ 1152.357997] Hardware name: Giga Computing E163-Z34-AAH1-000/MZ33-DC1-000, BIOS R30_F44 12/24/2025
> [ 1152.368759] RIP: 0010:ref_tracker_dir_exit+0x466/0x7e0
> [ 1152.375326] Code: e8 03 42 80 3c 38 00 0f 85 57 02 00 00 48 8b 03 49 89 de 49 39 dc 0f 85 2e ff ff ff 48 8b 74 24 10 48 89 ef e8 cb aa 20 02 90 <0f> 0b 90 48 8d 5d 44 be 04 00 00 00 48 89 df e8 a6 ee ec fe 48 89
> [ 1152.397212] RSP: 0018:ffa00000380470d0 EFLAGS: 00010286
> [ 1152.403876] RAX: 0000000000000000 RBX: ff110001a3c12668 RCX: 0000000000000001
> [ 1152.412685] RDX: 00000000ffffffff RSI: 1ffffffff1841334 RDI: 0000000000000001
> [ 1152.421496] RBP: ff110001a3c12618 R08: 0000000000000000 R09: 0000000000000000
> [ 1152.430303] R10: 000000000000000b R11: 0000000000000001 R12: ff110001a3c12668
> [ 1152.439103] R13: dead000000000100 R14: ff110001a3c12668 R15: dffffc0000000000
> [ 1152.447911] FS: 00007f05f6331840(0000) GS:ff11001883299000(0000) knlGS:0000000000000000
> [ 1152.457811] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 1152.465057] CR2: 00007f68f567f440 CR3: 00000001a9d12003 CR4: 0000000000771ef0
> [ 1152.473863] PKRU: 55555554
> [ 1152.477694] Call Trace:
> [ 1152.481231] <TASK>
> [ 1152.484384] ? rcu_is_watching+0x15/0xd0
> [ 1152.489585] ? ref_tracker_free+0x870/0x870
> [ 1152.495077] ? lockdep_hardirqs_on+0x8c/0x130
> [ 1152.500764] ? __stack_depot_get_stack_record+0x10/0x10
> [ 1152.507427] ? kfree+0x151/0x610
> [ 1152.511847] free_netdev+0x402/0x870
> [ 1152.516648] ? do_raw_spin_unlock+0x59/0x250
> [ 1152.522241] ? _raw_spin_unlock_irqrestore+0x40/0x80
> [ 1152.528613] netdev_run_todo+0x83b/0xbf0
> [ 1152.533804] ? unregister_netdevice_queued+0x80/0x80
> [ 1152.540172] ? mutex_is_locked+0x1c/0x50
> [ 1152.545373] ? generic_xdp_install+0x400/0x400
> [ 1152.551158] ? mutex_is_locked+0x1c/0x50
> [ 1152.556349] ? rtnl_is_locked+0x15/0x20
> [ 1152.561450] ? unregister_netdevice_queued+0x16/0x80
> [ 1152.567826] rtnl_dellink+0x4a5/0xae0
> [ 1152.572737] ? rtnl_mdb_del+0x580/0x580
> [ 1152.577839] ? __lock_acquire+0x508/0xc10
> [ 1152.583148] ? avc_has_perm_noaudit+0xf6/0x300
> [ 1152.588935] ? sched_init_numa+0x7f7/0xc30
> [ 1152.594331] ? mark_usage+0x61/0x170
> [ 1152.599142] ? __lock_acquire+0x508/0xc10
> [ 1152.604443] ? lock_acquire.part.0+0xbc/0x260
> [ 1152.610130] ? find_held_lock+0x2b/0x80
> [ 1152.615237] ? rtnl_mdb_del+0x580/0x580
> [ 1152.620339] ? __lock_release.isra.0+0x6b/0x1a0
> [ 1152.626215] ? rtnl_mdb_del+0x580/0x580
> [ 1152.631319] rtnetlink_rcv_msg+0x6fd/0xbd0
> [ 1152.636718] ? rtnl_fdb_dump+0x690/0x690
> [ 1152.641917] ? __lock_acquire+0x508/0xc10
> [ 1152.647218] ? lock_acquire.part.0+0xbc/0x260
> [ 1152.652905] ? find_held_lock+0x2b/0x80
> [ 1152.658008] netlink_rcv_skb+0x14e/0x3a0
> [ 1152.663210] ? rtnl_fdb_dump+0x690/0x690
> [ 1152.668412] ? netlink_ack+0xcd0/0xcd0
> [ 1152.673422] ? netlink_deliver_tap+0xc5/0x330
> [ 1152.679110] ? netlink_deliver_tap+0x13c/0x330
> [ 1152.684894] netlink_unicast+0x47c/0x740
> [ 1152.690098] ? netlink_attachskb+0x800/0x800
> [ 1152.695687] ? sock_has_perm+0x283/0x3f0
> [ 1152.700879] netlink_sendmsg+0x75b/0xc90
> [ 1152.706081] ? netlink_unicast+0x740/0x740
> [ 1152.711476] ? __lock_release.isra.0+0x6b/0x1a0
> [ 1152.717357] ? __import_iovec+0x36c/0x620
> [ 1152.722655] ? __might_fault+0x97/0x140
> [ 1152.727749] __sock_sendmsg+0xca/0x180
> [ 1152.732747] ? move_addr_to_kernel+0x36/0xf0
> [ 1152.738328] ____sys_sendmsg+0x609/0x830
> [ 1152.743529] ? copy_msghdr_from_user+0x2a0/0x460
> [ 1152.749510] ? kernel_sendmsg+0x30/0x30
> [ 1152.754611] ? move_addr_to_kernel+0xf0/0xf0
> [ 1152.760204] ? kasan_save_stack+0x3d/0x50
> [ 1152.765503] ? kasan_save_stack+0x2f/0x50
> [ 1152.770799] ? kasan_record_aux_stack+0x9b/0xc0
> [ 1152.776681] ? __call_rcu_common.constprop.0+0xb2/0xa10
> [ 1152.783337] ? kmem_cache_free+0x3d0/0x5f0
> [ 1152.788733] ? fput_close_sync+0xde/0x1b0
> [ 1152.794032] ? __x64_sys_close+0x8b/0xf0
> [ 1152.799235] ___sys_sendmsg+0x14e/0x1d0
> [ 1152.804337] ? copy_msghdr_from_user+0x460/0x460
> [ 1152.810330] ? rcu_is_watching+0x15/0xd0
> [ 1152.815532] ? trace_irq_enable.constprop.0+0x9b/0x180
> [ 1152.822106] __sys_sendmsg+0x145/0x1f0
> [ 1152.827110] ? __sys_sendmsg_sock+0x20/0x20
> [ 1152.832608] ? do_raw_spin_unlock+0x59/0x250
> [ 1152.838196] ? rcu_is_watching+0x15/0xd0
> [ 1152.843398] do_syscall_64+0xf3/0x5e0
> [ 1152.848307] ? trace_hardirqs_off+0xd/0x30
> [ 1152.853703] ? exc_page_fault+0xda/0xf0
> [ 1152.858805] entry_SYSCALL_64_after_hwframe+0x4b/0x53
> [ 1152.865267] RIP: 0033:0x7f05f656b22e
> [ 1152.870077] Code: 4d 89 d8 e8 94 bd 00 00 4c 8b 5d f8 41 8b 93 08 03 00 00 59 5e 48 83 f8 fc 74 11 c9 c3 0f 1f 80 00 00 00 00 48 8b 45 10 0f 05 <c9> c3 83 e2 39 83 fa 08 75 e7 e8 03 ff ff ff 0f 1f 00 f3 0f 1e fa
> [ 1152.891961] RSP: 002b:00007fff388be520 EFLAGS: 00000202 ORIG_RAX: 000000000000002e
> [ 1152.901262] RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007f05f656b22e
> [ 1152.910070] RDX: 0000000000000000 RSI: 00007fff388be5d0 RDI: 0000000000000003
> [ 1152.918880] RBP: 00007fff388be530 R08: 0000000000000000 R09: 0000000000000000
> [ 1152.927690] R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000002
> [ 1152.936499] R13: 0000000069dd8ee8 R14: 000056113dabf040 R15: 0000000000000000
>
>
> Either decoding failed or I forgot where NIPA-HW puts the output :S
> But I think it's clear enough..
Looks like I need to juggle the tracker a bit, similar to how linkwatch_clean_dev
does it. Nice to see HW NIPA coming together!
prev parent reply other threads:[~2026-04-14 19:24 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-13 17:11 [PATCH net-next v7 00/15] net: sleepable ndo_set_rx_mode Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 01/15] net: add address list snapshot and reconciliation infrastructure Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 02/15] net: introduce ndo_set_rx_mode_async and netdev_rx_mode_work Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 03/15] net: cache snapshot entries for ndo_set_rx_mode_async Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 04/15] net: move promiscuity handling into netdev_rx_mode_work Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 05/15] fbnic: convert to ndo_set_rx_mode_async Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 06/15] mlx5: " Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 07/15] bnxt: " Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 08/15] bnxt: use snapshot in bnxt_cfg_rx_mode Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 09/15] iavf: convert to ndo_set_rx_mode_async Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 10/15] netdevsim: " Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 11/15] dummy: " Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 12/15] netkit: " Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 13/15] net: warn ops-locked drivers still using ndo_set_rx_mode Stanislav Fomichev
2026-04-13 17:11 ` [PATCH net-next v7 14/15] selftests: net: add team_bridge_macvlan rx_mode test Stanislav Fomichev
2026-04-13 18:09 ` Breno Leitao
2026-04-13 17:11 ` [PATCH net-next v7 15/15] selftests: net: use ip commands instead of teamd in team " Stanislav Fomichev
2026-04-13 18:45 ` [PATCH net-next v7 00/15] net: sleepable ndo_set_rx_mode Jakub Kicinski
2026-04-14 2:08 ` Jakub Kicinski
2026-04-14 19:24 ` Stanislav Fomichev [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ad6UEB5ripPKBfUD@devvm17672.vll0.facebook.com \
--to=sdf.kernel@gmail.com \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=kuba@kernel.org \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox