From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1756051AbdC2LaB (ORCPT ); Wed, 29 Mar 2017 07:30:01 -0400
Received: from mx2.suse.de ([195.135.220.15]:37833 "EHLO mx2.suse.de"
	rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
	id S1756001AbdC2L30 (ORCPT ); Wed, 29 Mar 2017 07:29:26 -0400
Date: Wed, 29 Mar 2017 13:29:22 +0200
From: Johannes Thumshirn
To: John Garry
Cc: "Martin K . Petersen" , Tejun Heo , James Bottomley , Dan Williams ,
	Jack Wang , Hannes Reinecke , Linux SCSI Mailinglist ,
	Linux Kernel Mailinglist
Subject: Re: [PATCH 1/2] scsi: sas: flush destruct workqueue on device unregister
Message-ID: <20170329112922.GB9183@linux-x5ow.site>
References: <9580eaf323f5da17dcace9e32b22a1df4099961d.1490775958.git.jthumshirn@suse.de>
	<02778435-6c67-0ac9-2faa-03ebb7934477@huawei.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <02778435-6c67-0ac9-2faa-03ebb7934477@huawei.com>
User-Agent: Mutt/1.5.24 (2015-08-30)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Mar 29, 2017 at 12:15:44PM +0100, John Garry wrote:
> On 29/03/2017 10:41, Johannes Thumshirn wrote:
> >In the advent of an SAS device unregister we have to wait for all destruct
> >works to be done to not accidently delay deletion of a SAS rphy or it's
> >children to the point when we're removing the SCSI or SAS hosts.
> >
> >Signed-off-by: Johannes Thumshirn
> >---
> > drivers/scsi/libsas/sas_discover.c | 4 ++++
> > 1 file changed, 4 insertions(+)
> >
> >diff --git a/drivers/scsi/libsas/sas_discover.c b/drivers/scsi/libsas/sas_discover.c
> >index 60de662..75b18f1 100644
> >--- a/drivers/scsi/libsas/sas_discover.c
> >+++ b/drivers/scsi/libsas/sas_discover.c
> >@@ -382,9 +382,13 @@ void sas_unregister_dev(struct asd_sas_port *port, struct domain_device *dev)
> > 	}
> >
> > 	if (!test_and_set_bit(SAS_DEV_DESTROY, &dev->state)) {
> >+		struct sas_discovery *disc = &dev->port->disc;
> >+		struct sas_work *sw = &disc->disc_work[DISCE_DESTRUCT].work;
> >+
> > 		sas_rphy_unlink(dev->rphy);
> > 		list_move_tail(&dev->disco_list_node, &port->destroy_list);
> > 		sas_discover_event(dev->port, DISCE_DESTRUCT);
> >+		flush_work(&sw->work);
>
> I quickly tested plugging out the expander and we never get past this call
> to flush - a hang results:

Can you activate lockdep so we can see which lock it is that we're blocking on?

It's most likely in sas_unregister_common_dev() but this function takes two
spin locks, port->dev_list_lock and ha->lock.

Thanks a lot,
	Johannes
-- 
Johannes Thumshirn                                          Storage
jthumshirn@suse.de                                +49 911 74053 689
SUSE LINUX GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: Felix Imendörffer, Jane Smithard, Graham Norton
HRB 21284 (AG Nürnberg)
Key fingerprint = EC38 9CAB C2C4 F25D 8600 D0D0 0393 969D 2D76 0850