* RE: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
[not found] ` <20240626121118.GP29266@unreal>
@ 2024-11-21 0:03 ` Long Li
2024-11-25 15:56 ` Parav Pandit
0 siblings, 1 reply; 7+ messages in thread
From: Long Li @ 2024-11-21 0:03 UTC (permalink / raw)
To: Leon Romanovsky, Konstantin Taranov
Cc: Konstantin Taranov, Wei Hu, sharmaajay@microsoft.com,
jgg@ziepe.ca, linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
> >
> > Actually, another alternative solution for mana_ib is always set the
> > slave device, but in the GID mgmt code we need the following patch.
> > The problem is that it may require testing/confirmation from other ib providers
> as in the worst case some GIDs will not be listed.
>
> is_eth_active_slave_of_bonding_rcu() is for bonding.
Sorry, need to bring this issue up again.
This patch has broken user-space programs (e.g DPDK) that requires to export a kernel device to user-mode.
With this patch, the RDMA driver grabbed a reference from the master device, it's impossible to move the master device to user-mode.
I think the root cause is that the individual driver should not decide on which (master or slave) address should be used for GID. roce_gid_mgmt.c should handle this situation.
I think Konstantin's suggestion makes sense, how about we do this (don't need to define netdev_is_slave(dev)):
--- a/drivers/infiniband/core/roce_gid_mgmt.c
+++ b/drivers/infiniband/core/roce_gid_mgmt.c
@@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device *ib_dev, u32 port,
res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
(is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
REQUIRED_BOND_STATES)) ||
- real_dev == rdma_ndev);
+ (real_dev == rdma_ndev && !netif_is_bond_slave(rdma_ndev)));
rcu_read_unlock();
return res;
is_eth_port_of_netdev_filter() should not return true if this netdev is a bonded slave. In this case, only use the address of its bonded master.
Thanks,
Long
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-21 0:03 ` [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib Long Li
@ 2024-11-25 15:56 ` Parav Pandit
2024-11-25 20:10 ` Leon Romanovsky
0 siblings, 1 reply; 7+ messages in thread
From: Parav Pandit @ 2024-11-25 15:56 UTC (permalink / raw)
To: NBU-Contact-longli (EXTERNAL), Leon Romanovsky,
Konstantin Taranov
Cc: Konstantin Taranov, Wei Hu, sharmaajay@microsoft.com,
jgg@ziepe.ca, linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
> From: Long Li <longli@microsoft.com>
> Sent: Thursday, November 21, 2024 5:34 AM
>
> > >
> > > Actually, another alternative solution for mana_ib is always set the
> > > slave device, but in the GID mgmt code we need the following patch.
> > > The problem is that it may require testing/confirmation from other
> > > ib providers
> > as in the worst case some GIDs will not be listed.
> >
> > is_eth_active_slave_of_bonding_rcu() is for bonding.
>
> Sorry, need to bring this issue up again.
>
> This patch has broken user-space programs (e.g DPDK) that requires to
> export a kernel device to user-mode.
>
> With this patch, the RDMA driver grabbed a reference from the master
> device, it's impossible to move the master device to user-mode.
>
> I think the root cause is that the individual driver should not decide on which
> (master or slave) address should be used for GID. roce_gid_mgmt.c should
> handle this situation.
>
> I think Konstantin's suggestion makes sense, how about we do this (don't
> need to define netdev_is_slave(dev)):
>
> --- a/drivers/infiniband/core/roce_gid_mgmt.c
> +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device
> *ib_dev, u32 port,
> res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> REQUIRED_BOND_STATES)) ||
> - real_dev == rdma_ndev);
> + (real_dev == rdma_ndev &&
> + !netif_is_bond_slave(rdma_ndev)));
>
> rcu_read_unlock();
> return res;
>
>
> is_eth_port_of_netdev_filter() should not return true if this netdev is a
> bonded slave. In this case, only use the address of its bonded master.
>
Right. This change makes sense to me.
I don't have a setup presently to verify it to ensure I didn't miss a corner case.
Leon,
Can you or others please test the regression once with the formal patch?
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-25 15:56 ` Parav Pandit
@ 2024-11-25 20:10 ` Leon Romanovsky
2024-11-27 19:46 ` [EXTERNAL] " Long Li
0 siblings, 1 reply; 7+ messages in thread
From: Leon Romanovsky @ 2024-11-25 20:10 UTC (permalink / raw)
To: Parav Pandit
Cc: NBU-Contact-longli (EXTERNAL), Konstantin Taranov,
Konstantin Taranov, Wei Hu, sharmaajay@microsoft.com,
jgg@ziepe.ca, linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
On Mon, Nov 25, 2024 at 03:56:01PM +0000, Parav Pandit wrote:
>
>
> > From: Long Li <longli@microsoft.com>
> > Sent: Thursday, November 21, 2024 5:34 AM
> >
> > > >
> > > > Actually, another alternative solution for mana_ib is always set the
> > > > slave device, but in the GID mgmt code we need the following patch.
> > > > The problem is that it may require testing/confirmation from other
> > > > ib providers
> > > as in the worst case some GIDs will not be listed.
> > >
> > > is_eth_active_slave_of_bonding_rcu() is for bonding.
> >
> > Sorry, need to bring this issue up again.
> >
> > This patch has broken user-space programs (e.g DPDK) that requires to
> > export a kernel device to user-mode.
> >
> > With this patch, the RDMA driver grabbed a reference from the master
> > device, it's impossible to move the master device to user-mode.
> >
> > I think the root cause is that the individual driver should not decide on which
> > (master or slave) address should be used for GID. roce_gid_mgmt.c should
> > handle this situation.
> >
> > I think Konstantin's suggestion makes sense, how about we do this (don't
> > need to define netdev_is_slave(dev)):
> >
> > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device
> > *ib_dev, u32 port,
> > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > REQUIRED_BOND_STATES)) ||
> > - real_dev == rdma_ndev);
> > + (real_dev == rdma_ndev &&
> > + !netif_is_bond_slave(rdma_ndev)));
> >
> > rcu_read_unlock();
> > return res;
> >
> >
> > is_eth_port_of_netdev_filter() should not return true if this netdev is a
> > bonded slave. In this case, only use the address of its bonded master.
> >
> Right. This change makes sense to me.
> I don't have a setup presently to verify it to ensure I didn't miss a corner case.
> Leon,
> Can you or others please test the regression once with the formal patch?
Sure, once Long will send the patch, I'll make sure that it is tested.
Thanks
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: [EXTERNAL] Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-25 20:10 ` Leon Romanovsky
@ 2024-11-27 19:46 ` Long Li
2024-11-28 9:39 ` Leon Romanovsky
0 siblings, 1 reply; 7+ messages in thread
From: Long Li @ 2024-11-27 19:46 UTC (permalink / raw)
To: Leon Romanovsky, Parav Pandit
Cc: Konstantin Taranov, Konstantin Taranov, Wei Hu,
sharmaajay@microsoft.com, jgg@ziepe.ca,
linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
> > > I think Konstantin's suggestion makes sense, how about we do this
> > > (don't need to define netdev_is_slave(dev)):
> > >
> > > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device
> > > *ib_dev, u32 port,
> > > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > > REQUIRED_BOND_STATES)) ||
> > > - real_dev == rdma_ndev);
> > > + (real_dev == rdma_ndev &&
> > > + !netif_is_bond_slave(rdma_ndev)));
> > >
> > > rcu_read_unlock();
> > > return res;
> > >
> > >
> > > is_eth_port_of_netdev_filter() should not return true if this netdev
> > > is a bonded slave. In this case, only use the address of its bonded master.
> > >
> > Right. This change makes sense to me.
> > I don't have a setup presently to verify it to ensure I didn't miss a corner case.
> > Leon,
> > Can you or others please test the regression once with the formal patch?
>
> Sure, once Long will send the patch, I'll make sure that it is tested.
>
> Thanks
>
I posted patches for discussion.
https://lore.kernel.org/linux-rdma/1732736619-19941-1-git-send-email-longli@linuxonhyperv.com/T/#t
Thank you,
Long
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [EXTERNAL] Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-27 19:46 ` [EXTERNAL] " Long Li
@ 2024-11-28 9:39 ` Leon Romanovsky
2024-12-03 18:32 ` Long Li
2025-02-07 21:39 ` Long Li
0 siblings, 2 replies; 7+ messages in thread
From: Leon Romanovsky @ 2024-11-28 9:39 UTC (permalink / raw)
To: Long Li
Cc: Parav Pandit, Konstantin Taranov, Konstantin Taranov, Wei Hu,
sharmaajay@microsoft.com, jgg@ziepe.ca,
linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
On Wed, Nov 27, 2024 at 07:46:39PM +0000, Long Li wrote:
>
> > > > I think Konstantin's suggestion makes sense, how about we do this
> > > > (don't need to define netdev_is_slave(dev)):
> > > >
> > > > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > > > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > > > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct ib_device
> > > > *ib_dev, u32 port,
> > > > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > > > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > > > REQUIRED_BOND_STATES)) ||
> > > > - real_dev == rdma_ndev);
> > > > + (real_dev == rdma_ndev &&
> > > > + !netif_is_bond_slave(rdma_ndev)));
> > > >
> > > > rcu_read_unlock();
> > > > return res;
> > > >
> > > >
> > > > is_eth_port_of_netdev_filter() should not return true if this netdev
> > > > is a bonded slave. In this case, only use the address of its bonded master.
> > > >
> > > Right. This change makes sense to me.
> > > I don't have a setup presently to verify it to ensure I didn't miss a corner case.
> > > Leon,
> > > Can you or others please test the regression once with the formal patch?
> >
> > Sure, once Long will send the patch, I'll make sure that it is tested.
> >
> > Thanks
> >
>
> I posted patches for discussion.
> https://lore.kernel.org/linux-rdma/1732736619-19941-1-git-send-email-longli@linuxonhyperv.com/T/#t
Please resend these patches as series with cover letter and don't embed
extra patch (the one which is not numbered) into the series.
Thanks
>
> Thank you,
> Long
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: [EXTERNAL] Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-28 9:39 ` Leon Romanovsky
@ 2024-12-03 18:32 ` Long Li
2025-02-07 21:39 ` Long Li
1 sibling, 0 replies; 7+ messages in thread
From: Long Li @ 2024-12-03 18:32 UTC (permalink / raw)
To: Leon Romanovsky
Cc: Parav Pandit, Konstantin Taranov, Konstantin Taranov, Wei Hu,
sharmaajay@microsoft.com, jgg@ziepe.ca,
linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
> Subject: Re: [EXTERNAL] Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct
> device into ib
>
> On Wed, Nov 27, 2024 at 07:46:39PM +0000, Long Li wrote:
> >
> > > > > I think Konstantin's suggestion makes sense, how about we do
> > > > > this (don't need to define netdev_is_slave(dev)):
> > > > >
> > > > > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > > > > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > > > > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct
> > > > > ib_device *ib_dev, u32 port,
> > > > > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > > > > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > > > > REQUIRED_BOND_STATES)) ||
> > > > > - real_dev == rdma_ndev);
> > > > > + (real_dev == rdma_ndev &&
> > > > > + !netif_is_bond_slave(rdma_ndev)));
> > > > >
> > > > > rcu_read_unlock();
> > > > > return res;
> > > > >
> > > > >
> > > > > is_eth_port_of_netdev_filter() should not return true if this
> > > > > netdev is a bonded slave. In this case, only use the address of its bonded
> master.
> > > > >
> > > > Right. This change makes sense to me.
> > > > I don't have a setup presently to verify it to ensure I didn't miss a corner
> case.
> > > > Leon,
> > > > Can you or others please test the regression once with the formal patch?
> > >
> > > Sure, once Long will send the patch, I'll make sure that it is tested.
> > >
> > > Thanks
> > >
> >
> > I posted patches for discussion.
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Flore
> > .kernel.org%2Flinux-rdma%2F1732736619-19941-1-git-send-email-longli%40
> >
> linuxonhyperv.com%2FT%2F%23t&data=05%7C02%7Clongli%40microsoft.com%7
> C4
> >
> 20bac91521e414ff34c08dd0f909cf6%7C72f988bf86f141af91ab2d7cd011db47%7
> C1
> > %7C0%7C638683835975667120%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU
> 1hcGkiOnRy
> >
> dWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D
> %
> >
> 3D%7C0%7C%7C%7C&sdata=7vTTi%2FilkYdEKNG1qwpgYYDriOPPUF%2Bp8Zh91
> 60CEVE%
> > 3D&reserved=0
>
> Please resend these patches as series with cover letter and don't embed extra
> patch (the one which is not numbered) into the series.
>
> Thanks
I will resend those as a series after addressing the other comments on bonding.
Thanks
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: [EXTERNAL] Re: [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib
2024-11-28 9:39 ` Leon Romanovsky
2024-12-03 18:32 ` Long Li
@ 2025-02-07 21:39 ` Long Li
1 sibling, 0 replies; 7+ messages in thread
From: Long Li @ 2025-02-07 21:39 UTC (permalink / raw)
To: Leon Romanovsky
Cc: Parav Pandit, Konstantin Taranov, Konstantin Taranov, Wei Hu,
sharmaajay@microsoft.com, jgg@ziepe.ca,
linux-rdma@vger.kernel.org, linux-netdev,
open list:Hyper-V/Azure CORE AND DRIVERS
> On Wed, Nov 27, 2024 at 07:46:39PM +0000, Long Li wrote:
> >
> > > > > I think Konstantin's suggestion makes sense, how about we do
> > > > > this (don't need to define netdev_is_slave(dev)):
> > > > >
> > > > > --- a/drivers/infiniband/core/roce_gid_mgmt.c
> > > > > +++ b/drivers/infiniband/core/roce_gid_mgmt.c
> > > > > @@ -161,7 +161,7 @@ is_eth_port_of_netdev_filter(struct
> > > > > ib_device *ib_dev, u32 port,
> > > > > res = ((rdma_is_upper_dev_rcu(rdma_ndev, cookie) &&
> > > > > (is_eth_active_slave_of_bonding_rcu(rdma_ndev, real_dev) &
> > > > > REQUIRED_BOND_STATES)) ||
> > > > > - real_dev == rdma_ndev);
> > > > > + (real_dev == rdma_ndev &&
> > > > > + !netif_is_bond_slave(rdma_ndev)));
> > > > >
> > > > > rcu_read_unlock();
> > > > > return res;
> > > > >
> > > > >
> > > > > is_eth_port_of_netdev_filter() should not return true if this
> > > > > netdev is a bonded slave. In this case, only use the address of its bonded
> master.
> > > > >
> > > > Right. This change makes sense to me.
> > > > I don't have a setup presently to verify it to ensure I didn't miss a corner
> case.
> > > > Leon,
> > > > Can you or others please test the regression once with the formal patch?
> > >
> > > Sure, once Long will send the patch, I'll make sure that it is tested.
> > >
> > > Thanks
> > >
> >
> > I posted patches for discussion.
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Flore
> > .kernel.org%2Flinux-rdma%2F1732736619-19941-1-git-send-email-longli%40
> >
> linuxonhyperv.com%2FT%2F%23t&data=05%7C02%7Clongli%40microsoft.com%7
> C4
> >
> 20bac91521e414ff34c08dd0f909cf6%7C72f988bf86f141af91ab2d7cd011db47%7
> C1
> >
> %7C0%7C638683835975667120%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1h
> cGkiOnRy
> >
> dWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D
> %
> >
> 3D%7C0%7C%7C%7C&sdata=7vTTi%2FilkYdEKNG1qwpgYYDriOPPUF%2Bp8Zh91
> 60CEVE%
> > 3D&reserved=0
>
> Please resend these patches as series with cover letter and don't embed extra
> patch (the one which is not numbered) into the series.
>
> Thanks
Sorry for the late relay. I have done some more testing and sent those patches in a series with a cover letter.
Please review the series.
Thanks,
Long
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2025-02-07 21:39 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <1719311307-7920-1-git-send-email-kotaranov@linux.microsoft.com>
[not found] ` <20240626054748.GN29266@unreal>
[not found] ` <PAXPR83MB0559F4678E73B0091A8ADFBBB4D62@PAXPR83MB0559.EURPRD83.prod.outlook.com>
[not found] ` <20240626121118.GP29266@unreal>
2024-11-21 0:03 ` [PATCH rdma-next 1/1] RDMA/mana_ib: Set correct device into ib Long Li
2024-11-25 15:56 ` Parav Pandit
2024-11-25 20:10 ` Leon Romanovsky
2024-11-27 19:46 ` [EXTERNAL] " Long Li
2024-11-28 9:39 ` Leon Romanovsky
2024-12-03 18:32 ` Long Li
2025-02-07 21:39 ` Long Li
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).