From: Thadeu Lima de Souza Cascardo <cascardo@linux.vnet.ibm.com>
To: Jun'ichi Nomura <j-nomura@ce.jp.nec.com>
Cc: James Bottomley <James.Bottomley@HansenPartnership.com>,
Tejun Heo <tj@kernel.org>, Alan Stern <stern@rowland.harvard.edu>,
jaxboe@fusionio.com, roland@purestorage.com,
linux-scsi@vger.kernel.org,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
device-mapper development <dm-devel@redhat.com>,
Kiyoshi Ueda <k-ueda@ct.jp.ne>
Subject: Re: [BUG] Oops when SCSI device under multipath is removed
Date: Wed, 31 Aug 2011 16:50:22 -0300 [thread overview]
Message-ID: <20110831195022.GE4004@oc1711230544.ibm.com> (raw)
In-Reply-To: <4E4CD737.4020402@ce.jp.nec.com>
On Thu, Aug 18, 2011 at 06:11:19PM +0900, Jun'ichi Nomura wrote:
> Hi James,
>
> On 08/16/11 20:26, Jun'ichi Nomura wrote:
> > The commit log of 86cbfb5607d4b81b1a993ff689bbd2addd5d3a9b
> > ("[SCSI] put stricter guards on queue dead checks") does not
> > explain about the move of scsi_free_queue().
> >
> > But according to the discussion below, it seems
> > the move was motivated to solve the following self-deadlock:
> > https://lkml.org/lkml/2011/4/12/9
> >
> > [in the context of kblockd_workqueue]
> > blk_delay_work
> > __blk_run_queue
> > scsi_request_fn
> > put_device
> > (puts final sdev refcount)
> > scsi_device_dev_release
> > execute_in_process_context(scsi_device_dev_release_usercontext)
> > [execute immediately because it's in process context]
> > scsi_device_dev_release_usercontext
> > scsi_free_queue
> > blk_cleanup_queue
> > blk_sync_queue
> > (wait for blk_delay_work to complete...)
> >
> > James, is my understanding correct?
> >
> > If so, isn't it possible to move the scsi_free_queue back to
> > the original place and solve the deadlock instead by
> > avoiding the wait in the same context?
>
> Actually, Tejun has posted a patch to replace
> execute_in_process_context() with queue_work()
> and asking your review:
>
> [PATCH RESEND] scsi: don't use execute_in_process_context()
> https://lkml.org/lkml/2011/4/30/87
>
> Do you think you can take the patch and revert the move
> of scsi_free_queue()?
>
> Thanks,
> --
> Jun'ichi Nomura, NEC Corporation
> --
I've tested with your suggestion (reverting the move of scsi_free_queue)
and it works like a charm. I did not get any oops after that. I tested
with a multipath setup on top of two iscsi targets. Using dd after
logging out of some of one of the iscsi targets would trigger the oops.
With this patch, it could not be triggered anymore.
Best regards,
Cascardo.
--
diff --git a/drivers/scsi/scsi_sysfs.c b/drivers/scsi/scsi_sysfs.c
index e0bd3f7..a6eb6f1 100644
--- a/drivers/scsi/scsi_sysfs.c
+++ b/drivers/scsi/scsi_sysfs.c
@@ -322,7 +322,11 @@ static void scsi_device_dev_release_usercontext(struct work_struct *work)
kfree(evt);
}
+ /* Freeing the queue signals to block that we're done */
+ scsi_free_queue(sdev->request_queue);
+
blk_put_queue(sdev->request_queue);
+
/* NULL queue means the device can't be used */
sdev->request_queue = NULL;
@@ -936,8 +940,6 @@ void __scsi_remove_device(struct scsi_device *sdev)
/* cause the request function to reject all I/O requests */
sdev->request_queue->queuedata = NULL;
- /* Freeing the queue signals to block that we're done */
- scsi_free_queue(sdev->request_queue);
put_device(dev);
}
---
next prev parent reply other threads:[~2011-08-31 19:50 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-08-10 4:29 [BUG] Oops when SCSI device under multipath is removed Jun'ichi Nomura
2011-08-10 19:52 ` James Bottomley
2011-08-11 0:24 ` Jun'ichi Nomura
2011-08-11 3:01 ` Jun'ichi Nomura
2011-08-11 14:33 ` James Bottomley
2011-08-11 14:59 ` Alan Stern
2011-08-11 15:05 ` James Bottomley
2011-08-11 15:16 ` Alan Stern
2011-08-16 11:26 ` Jun'ichi Nomura
2011-08-18 9:11 ` Jun'ichi Nomura
2011-08-31 19:50 ` Thadeu Lima de Souza Cascardo [this message]
2011-09-08 0:00 ` Jun'ichi Nomura
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110831195022.GE4004@oc1711230544.ibm.com \
--to=cascardo@linux.vnet.ibm.com \
--cc=James.Bottomley@HansenPartnership.com \
--cc=dm-devel@redhat.com \
--cc=j-nomura@ce.jp.nec.com \
--cc=jaxboe@fusionio.com \
--cc=k-ueda@ct.jp.ne \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=roland@purestorage.com \
--cc=stern@rowland.harvard.edu \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox