From: Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
To: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
Erez Shitrit <erezsh-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Subject: Re: [rdma-next v3 19/24] IB/ipoib: Sync between remove_one to sysfs calls that use rtnl_lock
Date: Thu, 24 Aug 2017 16:49:58 -0400
Message-ID: <1503607798.78641.52.camel@redhat.com>
In-Reply-To: <20170817125055.31424-20-leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>

On Thu, 2017-08-17 at 15:50 +0300, Leon Romanovsky wrote:
> From: Erez Shitrit <erezsh-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
>
> In order to avoid deadlock between sysfs functions (like create/delete
> child) and remove_one (both of them are using the sysfs lock and
> rtnl_lock) the driver will use a state mutex for sync.
>
> That will fix traces as the following:
> schedule+0x3e/0x90
> kernfs_drain+0x75/0xf0
> ? wait_woken+0x90/0x90
> __kernfs_remove+0x12e/0x1c0
> kernfs_remove+0x25/0x40
> sysfs_remove_dir+0x57/0x90
> kobject_del+0x22/0x60
> device_del+0x195/0x230
> pm_runtime_set_memalloc_noio+0xac/0xf0
> netdev_unregister_kobject+0x71/0x80
> rollback_registered_many+0x205/0x2f0
> rollback_registered+0x31/0x40
> unregister_netdevice_queue+0x58/0xb0
> unregister_netdev+0x20/0x30
> ipoib_remove_one+0xb7/0x240 [ib_ipoib]
> ib_unregister_device+0xbc/0x1b0 [ib_core]
> ib_unregister_mad_agent+0x29/0x30 [ib_core]
> mlx4_ib_remove+0x67/0x280 [mlx4_ib]
> INFO: task echo:24082 blocked for more than 120 seconds.
> Tainted: G OE 4.1.12-37.5.1.el6uek.x86_64 #2
> "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Call Trace:
> schedule+0x3e/0x90
> schedule_preempt_disabled+0xe/0x10
> __mutex_lock_slowpath+0x95/0x110
> ? _rcu_barrier+0x177/0x220
> mutex_lock+0x23/0x40
> rtnl_lock+0x15/0x20
> netdev_run_todo+0x81/0x1f0
> rtnl_unlock+0xe/0x10
> ipoib_vlan_delete+0x12f/0x1c0 [ib_ipoib]
> delete_child+0x69/0x80 [ib_ipoib]
> dev_attr_store+0x20/0x30
> sysfs_kf_write+0x41/0x50
>
> Signed-off-by: Erez Shitrit <erezsh-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Reviewed-by: Alex Vesker <valex-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
> Signed-off-by: Leon Romanovsky <leon-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
> ---
> drivers/infiniband/ulp/ipoib/ipoib.h | 1 +
> drivers/infiniband/ulp/ipoib/ipoib_cm.c | 8 +++++++-
> drivers/infiniband/ulp/ipoib/ipoib_main.c | 5 +++++
> drivers/infiniband/ulp/ipoib/ipoib_vlan.c | 22 ++++++++++++++++++----
> 4 files changed, 31 insertions(+), 5 deletions(-)
>
> diff --git a/drivers/infiniband/ulp/ipoib/ipoib.h b/drivers/infiniband/ulp/ipoib/ipoib.h
> index 9e738104c2a1..7aa114cf53e3 100644
> --- a/drivers/infiniband/ulp/ipoib/ipoib.h
> +++ b/drivers/infiniband/ulp/ipoib/ipoib.h
> @@ -336,6 +336,7 @@ struct ipoib_dev_priv {
> unsigned long flags;
>
> struct rw_semaphore vlan_rwsem;
> + struct mutex sysfs_lock; /* Protect sysfs running*/
I detest calling a mutex a lock and vice versa, so I renamed this
sysfs_mutex, and then took the patch.
--
Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
GPG KeyID: B826A3330E572FDD
Key fingerprint = AE6B 1BDA 122B 23B4 265B 1274 B826 A333 0E57 2FDD
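
Editorial note: the quoted hunk shows only the new mutex field, not how the rest of the patch uses it. As a rough illustration of the idea in the commit message (sysfs handlers and remove_one synchronizing on a per-device mutex instead of blocking on each other), here is a userspace sketch with pthreads. Everything in it is an assumption made for illustration: struct fake_priv, the fake_* functions, and the choice to have the handler back off with -EAGAIN are hypothetical and are not taken from the patch. In kernel code the same back-off step would typically be built from mutex_trylock() and restart_syscall(), but whether this patch does exactly that is not visible in the quoted text.

```c
#include <errno.h>
#include <pthread.h>

/* Hypothetical userspace stand-in for struct ipoib_dev_priv. */
struct fake_priv {
	pthread_mutex_t sysfs_mutex; /* serializes sysfs handlers against removal */
	pthread_mutex_t rtnl;        /* stand-in for the kernel's global rtnl_lock */
	int children;                /* number of child interfaces created so far */
};

static void fake_priv_init(struct fake_priv *p)
{
	pthread_mutex_init(&p->sysfs_mutex, NULL);
	pthread_mutex_init(&p->rtnl, NULL);
	p->children = 0;
}

/*
 * sysfs-style handler (think create_child). It never sleeps on either
 * lock: if removal is in progress, or rtnl is busy, it releases whatever
 * it already holds and asks the caller to retry. A handler that cannot
 * block here cannot be the second half of the deadlock cycle.
 */
static int fake_add_child(struct fake_priv *p)
{
	if (pthread_mutex_trylock(&p->sysfs_mutex) != 0)
		return -EAGAIN;

	if (pthread_mutex_trylock(&p->rtnl) != 0) {
		pthread_mutex_unlock(&p->sysfs_mutex);
		return -EAGAIN;
	}

	p->children++;

	pthread_mutex_unlock(&p->rtnl);
	pthread_mutex_unlock(&p->sysfs_mutex);
	return 0;
}

/* Removal path: takes the mutex unconditionally and holds it during teardown. */
static void fake_remove_begin(struct fake_priv *p)
{
	pthread_mutex_lock(&p->sysfs_mutex);
}

static void fake_remove_end(struct fake_priv *p)
{
	pthread_mutex_unlock(&p->sysfs_mutex);
}
```

In this sketch, a call to fake_add_child() made between fake_remove_begin() and fake_remove_end() returns -EAGAIN immediately rather than sleeping, so the removal path never ends up waiting on a handler that is itself waiting on removal.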