* [PATCH net] net/mana: Fix auxiliary device double-delete race
@ 2026-03-09 17:24 Konstantin Taranov
2026-03-11 17:03 ` Simon Horman
2026-03-11 23:38 ` Long Li
0 siblings, 2 replies; 3+ messages in thread
From: Konstantin Taranov @ 2026-03-09 17:24 UTC (permalink / raw)
To: shirazsaleem, kotaranov, pabeni, haiyangz, kys, edumazet, kuba,
davem, decui, wei.liu, longli, jgg, leon
Cc: linux-rdma, linux-kernel, netdev
From: Shiraz Saleem <shirazsaleem@microsoft.com>
Make remove_adev() safe to call concurrently from the service reset
and PCI eject paths by using xchg() to atomically claim the adev
pointer. This prevents double auxiliary_device_delete/uninit when
hv_eject_device_work races with the service reset workqueue.
Fixes: 505cc26bcae0 ("net: mana: Add support for auxiliary device servicing events")
Signed-off-by: Shiraz Saleem <shirazsaleem@microsoft.com>
Signed-off-by: Konstantin Taranov <kotaranov@microsoft.com>
---
drivers/net/ethernet/microsoft/mana/mana_en.c | 18 ++++++++++--------
1 file changed, 10 insertions(+), 8 deletions(-)
diff --git a/drivers/net/ethernet/microsoft/mana/mana_en.c b/drivers/net/ethernet/microsoft/mana/mana_en.c
index 9b5a72a..c45a66e 100644
--- a/drivers/net/ethernet/microsoft/mana/mana_en.c
+++ b/drivers/net/ethernet/microsoft/mana/mana_en.c
@@ -3402,14 +3402,18 @@ static void adev_release(struct device *dev)
static void remove_adev(struct gdma_dev *gd)
{
- struct auxiliary_device *adev = gd->adev;
- int id = adev->id;
+ struct auxiliary_device *adev = xchg(&gd->adev, NULL);
+ int id;
+
+ if (!adev)
+ return;
+
+ id = adev->id;
auxiliary_device_delete(adev);
auxiliary_device_uninit(adev);
mana_adev_idx_free(id);
- gd->adev = NULL;
}
static int add_adev(struct gdma_dev *gd, const char *name)
@@ -3473,7 +3477,7 @@ static void mana_rdma_service_handle(struct work_struct *work)
switch (serv_work->event) {
case GDMA_SERVICE_TYPE_RDMA_SUSPEND:
- if (!gd->adev || gd->is_suspended)
+ if (gd->is_suspended)
break;
remove_adev(gd);
@@ -3676,8 +3680,7 @@ void mana_remove(struct gdma_dev *gd, bool suspending)
cancel_delayed_work_sync(&ac->gf_stats_work);
/* adev currently doesn't support suspending, always remove it */
- if (gd->adev)
- remove_adev(gd);
+ remove_adev(gd);
for (i = 0; i < ac->num_ports; i++) {
ndev = ac->ports[i];
@@ -3764,8 +3767,7 @@ void mana_rdma_remove(struct gdma_dev *gd)
WRITE_ONCE(gd->rdma_teardown, true);
flush_workqueue(gc->service_wq);
- if (gd->adev)
- remove_adev(gd);
+ remove_adev(gd);
mana_gd_deregister_device(gd);
}
--
2.43.0
^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [PATCH net] net/mana: Fix auxiliary device double-delete race
2026-03-09 17:24 [PATCH net] net/mana: Fix auxiliary device double-delete race Konstantin Taranov
@ 2026-03-11 17:03 ` Simon Horman
2026-03-11 23:38 ` Long Li
1 sibling, 0 replies; 3+ messages in thread
From: Simon Horman @ 2026-03-11 17:03 UTC (permalink / raw)
To: Konstantin Taranov
Cc: shirazsaleem, kotaranov, pabeni, haiyangz, kys, edumazet, kuba,
davem, decui, wei.liu, longli, jgg, leon, linux-rdma,
linux-kernel, netdev
On Mon, Mar 09, 2026 at 10:24:15AM -0700, Konstantin Taranov wrote:
> From: Shiraz Saleem <shirazsaleem@microsoft.com>
>
> Make remove_adev() safe to call concurrently from the service reset
> and PCI eject paths by using xchg() to atomically claim the adev
> pointer. This prevents double auxiliary_device_delete/uninit when
> hv_eject_device_work races with the service reset workqueue.
>
> Fixes: 505cc26bcae0 ("net: mana: Add support for auxiliary device servicing events")
> Signed-off-by: Shiraz Saleem <shirazsaleem@microsoft.com>
> Signed-off-by: Konstantin Taranov <kotaranov@microsoft.com>
Unfortunately the CI was not able to apply this net.
Please rebase and repost.
--
pw-bot: changes-requested
^ permalink raw reply [flat|nested] 3+ messages in thread
* RE: [PATCH net] net/mana: Fix auxiliary device double-delete race
2026-03-09 17:24 [PATCH net] net/mana: Fix auxiliary device double-delete race Konstantin Taranov
2026-03-11 17:03 ` Simon Horman
@ 2026-03-11 23:38 ` Long Li
1 sibling, 0 replies; 3+ messages in thread
From: Long Li @ 2026-03-11 23:38 UTC (permalink / raw)
To: Konstantin Taranov, Shiraz Saleem, Konstantin Taranov,
pabeni@redhat.com, Haiyang Zhang, KY Srinivasan,
edumazet@google.com, kuba@kernel.org, davem@davemloft.net,
Dexuan Cui, wei.liu@kernel.org, jgg@ziepe.ca, leon@kernel.org
Cc: linux-rdma@vger.kernel.org, linux-kernel@vger.kernel.org,
netdev@vger.kernel.org
> Subject: [PATCH net] net/mana: Fix auxiliary device double-delete race
>
> From: Shiraz Saleem <shirazsaleem@microsoft.com>
>
> Make remove_adev() safe to call concurrently from the service reset and PCI
> eject paths by using xchg() to atomically claim the adev pointer. This prevents
> double auxiliary_device_delete/uninit when hv_eject_device_work races with
> the service reset workqueue.
>
> Fixes: 505cc26bcae0 ("net: mana: Add support for auxiliary device servicing
> events")
> Signed-off-by: Shiraz Saleem <shirazsaleem@microsoft.com>
> Signed-off-by: Konstantin Taranov <kotaranov@microsoft.com>
Reviewed-by: Long Li <longli@microsoft.com>
> ---
> drivers/net/ethernet/microsoft/mana/mana_en.c | 18 ++++++++++--------
> 1 file changed, 10 insertions(+), 8 deletions(-)
>
> diff --git a/drivers/net/ethernet/microsoft/mana/mana_en.c
> b/drivers/net/ethernet/microsoft/mana/mana_en.c
> index 9b5a72a..c45a66e 100644
> --- a/drivers/net/ethernet/microsoft/mana/mana_en.c
> +++ b/drivers/net/ethernet/microsoft/mana/mana_en.c
> @@ -3402,14 +3402,18 @@ static void adev_release(struct device *dev)
>
> static void remove_adev(struct gdma_dev *gd) {
> - struct auxiliary_device *adev = gd->adev;
> - int id = adev->id;
> + struct auxiliary_device *adev = xchg(&gd->adev, NULL);
> + int id;
> +
> + if (!adev)
> + return;
> +
> + id = adev->id;
>
> auxiliary_device_delete(adev);
> auxiliary_device_uninit(adev);
>
> mana_adev_idx_free(id);
> - gd->adev = NULL;
> }
>
> static int add_adev(struct gdma_dev *gd, const char *name) @@ -3473,7
> +3477,7 @@ static void mana_rdma_service_handle(struct work_struct *work)
>
> switch (serv_work->event) {
> case GDMA_SERVICE_TYPE_RDMA_SUSPEND:
> - if (!gd->adev || gd->is_suspended)
> + if (gd->is_suspended)
> break;
>
> remove_adev(gd);
> @@ -3676,8 +3680,7 @@ void mana_remove(struct gdma_dev *gd, bool
> suspending)
> cancel_delayed_work_sync(&ac->gf_stats_work);
>
> /* adev currently doesn't support suspending, always remove it */
> - if (gd->adev)
> - remove_adev(gd);
> + remove_adev(gd);
>
> for (i = 0; i < ac->num_ports; i++) {
> ndev = ac->ports[i];
> @@ -3764,8 +3767,7 @@ void mana_rdma_remove(struct gdma_dev *gd)
> WRITE_ONCE(gd->rdma_teardown, true);
> flush_workqueue(gc->service_wq);
>
> - if (gd->adev)
> - remove_adev(gd);
> + remove_adev(gd);
>
> mana_gd_deregister_device(gd);
> }
> --
> 2.43.0
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2026-03-11 23:38 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-03-09 17:24 [PATCH net] net/mana: Fix auxiliary device double-delete race Konstantin Taranov
2026-03-11 17:03 ` Simon Horman
2026-03-11 23:38 ` Long Li
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox