From: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
To: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
Jason Gunthorpe <jgg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Cc: RDMA mailing list
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
Mark Bloch <markb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
Steve Wise
<swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
Subject: Re: [PATCH rdma-next v4 1/7] RDMA/restrack: Add general infrastructure to track RDMA resources
Date: Tue, 16 Jan 2018 23:19:07 +0200 [thread overview]
Message-ID: <20180116211907.GD13639@mtr-leonro.local> (raw)
In-Reply-To: <20180115151255.30167-2-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
[-- Attachment #1: Type: text/plain, Size: 9647 bytes --]
On Mon, Jan 15, 2018 at 05:12:49PM +0200, Leon Romanovsky wrote:
> From: Leon Romanovsky <leonro-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>
> The RDMA subsystem has very strict set of objects to work on it,
> but it completely lacks tracking facilities and no visibility of
> resource utilization.
>
> The following patch adds such infrastructure to keep track of RDMA
> resources to help with debugging of user space applications. The primary
> user of this infrastructure is RDMA nldev netlink (following patches),
> but it is not limited too.
>
> At this stage, the main three objects (PD, CQ and QP) are added,
> and more will be added later.
>
> There are four new functions in use by RDMA/core:
> * rdma_restrack_init(...) - initializes restrack database
> * rdma_restrack_clean(...) - cleans restrack database
> * rdma_restrack_add(...) - adds object to be tracked
> * rdma_restrack_del(...) - removes object from tracking
>
> 3 functions and one iterator visible to kernel users:
> * rdma_restrack_count(...) - returns number of allocated objects of
> specific type
> * rdma_restrack_lock(...) - Lock primitive to protect access to list
> of resources
> * rdma_restrack_unlock(...)- Unlock primitive to protect access to list
> of resources
> * for_each_res_safe(...) - iterates over all relevant objects in
> the restrack database.
>
> Reviewed-by: Mark Bloch <markb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Signed-off-by: Leon Romanovsky <leonro-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Reviewed-by: Steve Wise <swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
> ---
> drivers/infiniband/core/Makefile | 2 +-
> drivers/infiniband/core/core_priv.h | 1 +
> drivers/infiniband/core/device.c | 7 ++
> drivers/infiniband/core/restrack.c | 182 +++++++++++++++++++++++++++++++++
> include/rdma/ib_verbs.h | 17 +++-
> include/rdma/restrack.h | 197 ++++++++++++++++++++++++++++++++++++
> 6 files changed, 404 insertions(+), 2 deletions(-)
> create mode 100644 drivers/infiniband/core/restrack.c
> create mode 100644 include/rdma/restrack.h
>
> diff --git a/drivers/infiniband/core/Makefile b/drivers/infiniband/core/Makefile
> index 504b926552c6..f69833db0a32 100644
> --- a/drivers/infiniband/core/Makefile
> +++ b/drivers/infiniband/core/Makefile
> @@ -12,7 +12,7 @@ ib_core-y := packer.o ud_header.o verbs.o cq.o rw.o sysfs.o \
> device.o fmr_pool.o cache.o netlink.o \
> roce_gid_mgmt.o mr_pool.o addr.o sa_query.o \
> multicast.o mad.o smi.o agent.o mad_rmpp.o \
> - security.o nldev.o
> + security.o nldev.o restrack.o
>
> ib_core-$(CONFIG_INFINIBAND_USER_MEM) += umem.o
> ib_core-$(CONFIG_INFINIBAND_ON_DEMAND_PAGING) += umem_odp.o
> diff --git a/drivers/infiniband/core/core_priv.h b/drivers/infiniband/core/core_priv.h
> index aef9aa0ac0e6..2b1372da708a 100644
> --- a/drivers/infiniband/core/core_priv.h
> +++ b/drivers/infiniband/core/core_priv.h
> @@ -40,6 +40,7 @@
> #include <rdma/ib_verbs.h>
> #include <rdma/opa_addr.h>
> #include <rdma/ib_mad.h>
> +#include <rdma/restrack.h>
> #include "mad_priv.h"
>
> /* Total number of ports combined across all struct ib_devices's */
> diff --git a/drivers/infiniband/core/device.c b/drivers/infiniband/core/device.c
> index 2826e06311a5..c3e389f8c99a 100644
> --- a/drivers/infiniband/core/device.c
> +++ b/drivers/infiniband/core/device.c
> @@ -263,6 +263,11 @@ struct ib_device *ib_alloc_device(size_t size)
> if (!device)
> return NULL;
>
> + if (rdma_restrack_init(&device->res)) {
> + kfree(device);
> + return NULL;
> + }
> +
> device->dev.class = &ib_class;
> device_initialize(&device->dev);
>
> @@ -596,6 +601,8 @@ void ib_unregister_device(struct ib_device *device)
> }
> up_read(&lists_rwsem);
>
> + rdma_restrack_clean(&device->res);
> +
> ib_device_unregister_rdmacg(device);
> ib_device_unregister_sysfs(device);
>
> diff --git a/drivers/infiniband/core/restrack.c b/drivers/infiniband/core/restrack.c
> new file mode 100644
> index 000000000000..879b79ea31da
> --- /dev/null
> +++ b/drivers/infiniband/core/restrack.c
> @@ -0,0 +1,182 @@
> +/*
> + * Copyright (c) 2017 Mellanox Technologies. All rights reserved.
> + *
> + * Redistribution and use in source and binary forms, with or without
> + * modification, are permitted provided that the following conditions are met:
> + *
> + * 1. Redistributions of source code must retain the above copyright
> + * notice, this list of conditions and the following disclaimer.
> + * 2. Redistributions in binary form must reproduce the above copyright
> + * notice, this list of conditions and the following disclaimer in the
> + * documentation and/or other materials provided with the distribution.
> + * 3. Neither the names of the copyright holders nor the names of its
> + * contributors may be used to endorse or promote products derived from
> + * this software without specific prior written permission.
> + *
> + * Alternatively, this software may be distributed under the terms of the
> + * GNU General Public License ("GPL") version 2 as published by the Free
> + * Software Foundation.
> + *
> + * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
> + * AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
> + * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
> + * ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE
> + * LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
> + * CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF
> + * SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
> + * INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
> + * CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
> + * ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
> + * POSSIBILITY OF SUCH DAMAGE.
> + */
> +
> +#include <rdma/ib_verbs.h>
> +#include <rdma/restrack.h>
> +#include <linux/rculist.h>
> +#include <linux/sched/task.h>
> +
> +int rdma_restrack_init(struct rdma_restrack_root *res)
> +{
> + int i = 0;
> +
> + for (; i < _RDMA_RESTRACK_MAX; i++) {
> + refcount_set(&res->cnt[i], 1);
> + INIT_LIST_HEAD_RCU(&res->list[i]);
> + init_rwsem(&res->rwsem[i]);
> + }
> +
> + return 0;
> +}
> +
> +void rdma_restrack_clean(struct rdma_restrack_root *res)
> +{
> + int i = 0;
> +
> + for (; i < _RDMA_RESTRACK_MAX; i++) {
> + WARN_ON_ONCE(!refcount_dec_and_test(&res->cnt[i]));
> + WARN_ON_ONCE(!list_empty(&res->list[i]));
> + }
> +}
> +
> +static bool is_restrack_valid(enum rdma_restrack_obj type)
> +{
> + return !(type >= _RDMA_RESTRACK_MAX);
> +}
> +
> +int rdma_restrack_count(struct rdma_restrack_root *res,
> + enum rdma_restrack_obj type)
> +{
> + if (!is_restrack_valid(type))
> + return 0;
> +
> + /*
> + * The counter was initialized to 1 at the beginning.
> + */
> + return refcount_read(&res->cnt[type]) - 1;
> +}
> +EXPORT_SYMBOL(rdma_restrack_count);
> +
> +void rdma_restrack_add(struct rdma_restrack_entry *res,
> + enum rdma_restrack_obj type, const char *comm)
> +{
> + struct ib_device *dev;
> + struct ib_pd *pd;
> + struct ib_cq *cq;
> + struct ib_qp *qp;
> +
> + if (!is_restrack_valid(type))
> + return;
> +
> + switch (type) {
> + case RDMA_RESTRACK_PD:
> + pd = container_of(res, struct ib_pd, res);
> + dev = pd->device;
> + break;
> + case RDMA_RESTRACK_CQ:
> + cq = container_of(res, struct ib_cq, res);
> + dev = cq->device;
> + break;
> + case RDMA_RESTRACK_QP:
> + qp = container_of(res, struct ib_qp, res);
> + dev = qp->device;
> + break;
> + default:
> + /* unreachable */
> + return;
> + }
> +
> + if (init_srcu_struct(&res->srcu))
> + /*
> + * We are not returning error, because there is nothing
> + * we can do it in such case, it is already too late to
> + * crash the driver just of failure in resource tracking.
> + *
> + * Simply leave this resource as not valid.
> + */
> + return;
> +
> + if (!comm || !strlen(comm)) {
> + res->kern_name = NULL;
> + get_task_struct(current);
> + res->task = current;
> + } else {
> + res->task = NULL;
> + res->kern_name = kstrdup_const(comm, GFP_KERNEL);
> + if (!res->kern_name)
> + goto out;
> + }
> +
> + refcount_inc(&dev->res.cnt[type]);
> +
> + down_write(&dev->res.rwsem[type]);
> + list_add(&res->list, &dev->res.list[type]);
> + res->valid = true;
> + up_write(&dev->res.rwsem[type]);
> + return;
> +
> +out:
> + res->valid = false;
> + cleanup_srcu_struct(&res->srcu);
> +}
> +EXPORT_SYMBOL(rdma_restrack_add);
> +
> +void rdma_restrack_del(struct rdma_restrack_entry *res,
> + enum rdma_restrack_obj type)
> +{
> + struct ib_device *dev;
> + struct ib_pd *pd;
> + struct ib_cq *cq;
> + struct ib_qp *qp;
> +
> + if (!is_restrack_valid(type) || !res->valid)
> + return;
> +
> + switch (type) {
> + case RDMA_RESTRACK_PD:
> + pd = container_of(res, struct ib_pd, res);
> + dev = pd->device;
> + break;
> + case RDMA_RESTRACK_CQ:
> + cq = container_of(res, struct ib_cq, res);
> + dev = cq->device;
> + break;
> + case RDMA_RESTRACK_QP:
> + qp = container_of(res, struct ib_qp, res);
> + dev = qp->device;
> + break;
> + default:
> + /* unreachable */
> + return;
> + }
> +
> + refcount_dec(&dev->res.cnt[type]);
> + down_write(&dev->res.rwsem[type]);
> + list_del(&res->list);
> + res->valid = false;
> + kfree_const(res->kern_name);
> + put_task_struct(res->task);
There is an error here, it should be
if(res->task)
put_task_struct(res->task);
Resend?
Thanks
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2018-01-16 21:19 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-01-15 15:12 [PATCH rdma-next v4 0/7] RDMA resource tracking Leon Romanovsky
[not found] ` <20180115151255.30167-1-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
2018-01-15 15:12 ` [PATCH rdma-next v4 1/7] RDMA/restrack: Add general infrastructure to track RDMA resources Leon Romanovsky
[not found] ` <20180115151255.30167-2-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
2018-01-16 21:19 ` Leon Romanovsky [this message]
2018-01-16 21:33 ` Bart Van Assche
[not found] ` <1516138397.2844.34.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-17 5:47 ` Leon Romanovsky
[not found] ` <20180117054721.GE13639-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2018-01-18 3:32 ` Bart Van Assche
[not found] ` <1516246321.3665.7.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-18 5:20 ` Leon Romanovsky
[not found] ` <20180118052020.GS13639-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2018-01-18 5:33 ` Bart Van Assche
[not found] ` <1516253594.3665.10.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-18 5:42 ` Leon Romanovsky
2018-01-15 15:12 ` [PATCH rdma-next v4 2/7] RDMA/core: Add helper function to create named QPs Leon Romanovsky
[not found] ` <20180115151255.30167-3-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
2018-01-16 21:35 ` Bart Van Assche
[not found] ` <1516138540.2844.36.camel-Sjgp3cTcYWE@public.gmane.org>
2018-01-17 5:59 ` Leon Romanovsky
[not found] ` <20180117055942.GF13639-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2018-01-18 3:22 ` Bart Van Assche
2018-01-15 15:12 ` [PATCH rdma-next v4 3/7] RDMA: Annotate create QP callers Leon Romanovsky
[not found] ` <20180115151255.30167-4-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
2018-01-16 23:03 ` Steve Wise
2018-01-18 5:22 ` Leon Romanovsky
2018-01-15 15:12 ` [PATCH rdma-next v4 4/7] RDMA/core: Add resource tracking for create and destroy CQs Leon Romanovsky
2018-01-15 15:12 ` [PATCH rdma-next v4 5/7] RDMA/core: Add resource tracking for create and destroy PDs Leon Romanovsky
2018-01-15 15:12 ` [PATCH rdma-next v4 6/7] RDMA/nldev: Provide global resource utilization Leon Romanovsky
2018-01-15 15:12 ` [PATCH rdma-next v4 7/7] RDMA/nldev: Provide detailed QP information Leon Romanovsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180116211907.GD13639@mtr-leonro.local \
--to=leon-dgejt+ai2ygdnm+yrofe0a@public.gmane.org \
--cc=dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org \
--cc=jgg-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=markb-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
--cc=swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox