From: Jason Gunthorpe <jgg@ziepe.ca>
To: "Wei Hu (Xavier)" <xavier.huwei@huawei.com>
Cc: dledford@redhat.com, linux-rdma@vger.kernel.org,
lijun_nudt@163.com, oulijun@huawei.com,
charles.chenxin@huawei.com, liuyixian@huawei.com,
zhangxiping3@huawei.com, linuxarm@huawei.com,
linux-kernel@vger.kernel.org, xavier_huwei@163.com
Subject: Re: [PATCH rdma-next 3/3] RDMA/hns: Modify hns RoCE device's name
Date: Sun, 25 Nov 2018 20:13:54 -0700 [thread overview]
Message-ID: <20181126031354.GA15463@ziepe.ca> (raw)
In-Reply-To: <5BF94B9F.5030409@huawei.com>
On Sat, Nov 24, 2018 at 09:01:19PM +0800, Wei Hu (Xavier) wrote:
>
>
> On 2018/11/24 4:39, Jason Gunthorpe wrote:
> > On Fri, Nov 23, 2018 at 11:14:25PM +0800, Wei Hu (Xavier) wrote:
> >> This patch modifies the name of hns RoCE device's name in order
> >> to ensure that the name is consistent before and after reset.
> >>
> >> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>
> >> drivers/infiniband/hw/hns/hns_roce_device.h | 1 +
> >> drivers/infiniband/hw/hns/hns_roce_hw_v2.c | 3 +++
> >> drivers/infiniband/hw/hns/hns_roce_main.c | 4 +++-
> >> 3 files changed, 7 insertions(+), 1 deletion(-)
> >>
> >> diff --git a/drivers/infiniband/hw/hns/hns_roce_device.h b/drivers/infiniband/hw/hns/hns_roce_device.h
> >> index 259977b..a8cfe76 100644
> >> +++ b/drivers/infiniband/hw/hns/hns_roce_device.h
> >> @@ -954,6 +954,7 @@ struct hns_roce_dev {
> >> struct pci_dev *pci_dev;
> >> struct device *dev;
> >> struct hns_roce_uar priv_uar;
> >> + char name[IB_DEVICE_NAME_MAX];
> >> const char *irq_names[HNS_ROCE_MAX_IRQ_NUM];
> >> spinlock_t sm_lock;
> >> spinlock_t bt_cmd_lock;
> >> diff --git a/drivers/infiniband/hw/hns/hns_roce_hw_v2.c b/drivers/infiniband/hw/hns/hns_roce_hw_v2.c
> >> index 1d639a0..678c7ec 100644
> >> +++ b/drivers/infiniband/hw/hns/hns_roce_hw_v2.c
> >> @@ -6110,6 +6110,9 @@ static int hns_roce_hw_v2_get_cfg(struct hns_roce_dev *hr_dev,
> >> hr_dev->irq[i] = pci_irq_vector(handle->pdev,
> >> i + handle->rinfo.base_vector);
> >>
> >> + snprintf(hr_dev->name, IB_DEVICE_NAME_MAX, "hns%s",
> >> + handle->rinfo.netdev->name);
> > Why is this making up its own driver name? How is this avoiding
> > colliding with an existing name?
> >
> > This is very dangerous since we now have device renaming, the driver
> > could fail to load with no recovery.
> Hi, Jason
>
> The NIC driver notifies the RoCE driver to perform reset related
> processing by calling the .reset_notify() interface registered by the
> RoCE driver. If the RoCE reset processing fails, .reset_notify()
> returns non-zero, and then hns NIC driver will reschedule the
> reset task again.
>
> The current hardware version in hip08 SoC cannot support
> after reset process the application still communicates with the
> resources like QP requested before reset. In RoCE reset process,
> we will release the resources through ib_unregister_device, after
> the hardware reset is completed, driver will re-execute
> ib_register_device.
>
> Currently, we find that the ib_device's name after reset
> and the one before reset may be different. We can specify the
> device name to solve this problem.
No, now you just have unsolved races.
If you want to reset like this then you will need to do some kind of
revision to the IB core code to not loose the name assigned to the
device and not hacks like this.
Jason
next prev parent reply other threads:[~2018-11-26 3:13 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-23 15:14 [PATCH rdma-next 0/3] RDMA/hns: Updates for reset process of roce device in hip08 Wei Hu (Xavier)
2018-11-23 15:14 ` [PATCH rdma-next 1/3] RDMA/hns: Add support for reset and loading or unloading driver occur simultaneously Wei Hu (Xavier)
2018-11-23 15:14 ` [PATCH rdma-next 2/3] RDMA/hns: Stop sending mailbox&cmq&doorbell when reset occured or is occuring Wei Hu (Xavier)
2018-11-23 15:14 ` [PATCH rdma-next 3/3] RDMA/hns: Modify hns RoCE device's name Wei Hu (Xavier)
2018-11-23 20:39 ` Jason Gunthorpe
2018-11-24 13:01 ` Wei Hu (Xavier)
2018-11-26 3:13 ` Jason Gunthorpe [this message]
2018-11-26 8:34 ` Wei Hu (Xavier)
2018-11-26 17:44 ` Jason Gunthorpe
2018-11-27 1:07 ` Wei Hu (Xavier)
2018-11-23 20:42 ` [PATCH rdma-next 0/3] RDMA/hns: Updates for reset process of roce device in hip08 Jason Gunthorpe
2018-11-24 13:14 ` Wei Hu (Xavier)
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20181126031354.GA15463@ziepe.ca \
--to=jgg@ziepe.ca \
--cc=charles.chenxin@huawei.com \
--cc=dledford@redhat.com \
--cc=lijun_nudt@163.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-rdma@vger.kernel.org \
--cc=linuxarm@huawei.com \
--cc=liuyixian@huawei.com \
--cc=oulijun@huawei.com \
--cc=xavier.huwei@huawei.com \
--cc=xavier_huwei@163.com \
--cc=zhangxiping3@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).