From: FUJITA Tomonori <tomof@acm.org>
To: James.Bottomley@HansenPartnership.com
Cc: michaelc@cs.wisc.edu, pw@osc.edu, fujita.tomonori@lab.ntt.co.jp,
linux-scsi@vger.kernel.org, erezz@voltaire.com,
Jens.Axboe@oracle.com
Subject: Re: Serious regression caused by fix for [BUG 1/3] bsg queue oops with iscsi logout
Date: Wed, 26 Mar 2008 23:22:47 +0900 [thread overview]
Message-ID: <20080326232226X.tomof@acm.org> (raw)
In-Reply-To: <1206201960.4393.8.camel@localhost.localdomain>
On Sat, 22 Mar 2008 11:06:00 -0500
James Bottomley <James.Bottomley@HansenPartnership.com> wrote:
> On Tue, 2008-03-11 at 00:36 -0500, Mike Christie wrote:
> > Mike Christie wrote:
> > > Pete Wyckoff wrote:
> > >> I think this used not to happen; not sure. But I changed two things
> > >
> > > This most likely did not happen before 2.6.25-rc* or it broke in
> > > slightly different ways, because iscsi used to try and do
> > >
> > > echo 1 > /sys/block/sdX/device/delete
> > >
> > > from userspace instead of calling scsi_remove_target from the kernel.
> > >
> > > As you know around 2.6.21, the behavior of doing the echo to the delete
> > > file changed due to a driver model and scsi change and that broke the
> > > iscsi tools. The iscsi tools userspace removal was sort of hack in the
> > > first place and was racey, so we switched to removing devices/target
> > > like the FC class.
> > >
> > >
> > >> lately. 2.6.25-rc1 to -rc4 and fedora 8 iscsi-initiator-utils (865) to
> > >> fedora devel (868). Bidi and varlen patches always too.
> > >>
> > >> I'll follow with some more variations on this theme. Looks like bsg
> > >> needs to protect more carefully against the device going away. Any
> > >> ideas how best to do this? What was the approach in sg?
> > >>
> > >
> > > I think sg is broken in similar ways. The iser guys have some tests
> > > cases that have broken sg while IO is outstanding. I am ccing Erez.
> >
> > Actually one of the problems looks a little different than some of the
> > problems hit with sg and are caused because we remove the bsg device too
> > soon. I think we want to wait until all the references from the
> > commands/requests are released. The attached patch (untested) moves the
> > bsg unreg call to the scsi device release fn.
>
> Well, this fix is now upstream. However, it's causing all our
> scsi_devices never to get released, which is a serious regression.
> We're also doing spurious bsg_unregister_queue() for things that never
> actually registered one (all scan devices that return DID_NO_CONNECT),
> but bsg doesn't seem to be complaining about this.
>
> The essence of the problem is that bsg_register_queue() takes a ref to
> the sdev_gendev, so you can't move bsg_unregister_queue() into the
> release function because nothing ever puts bsg's device ref and so
> release is never called.
>
> Options for fixing this before 2.6.25 are
>
> 1. revert the patch
> 2. Do an additional put for the bsg reference in
> __scsi_remove_device (patch below). It's nasty but it preserves
> the semantics and does what you want
After some investigation, this patch doesn't fix the bug that Pete
reported (I'll send a new patch shortly).
Can you revert the commit 4b6f5b3a993cbe34b4280f252bccc76967c185c8
instead of merging this?
next prev parent reply other threads:[~2008-03-26 14:24 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-03-09 16:53 [BUG 1/3] bsg queue oops with iscsi logout Pete Wyckoff
2008-03-09 16:54 ` [BUG 2/3] bsg null sdev " Pete Wyckoff
2008-03-09 16:55 ` [BUG 3/3] bsg mutex hang " Pete Wyckoff
2008-03-10 17:57 ` [BUG 1/3] bsg queue oops " Mike Christie
2008-03-11 5:36 ` Mike Christie
2008-03-11 22:46 ` FUJITA Tomonori
2008-03-15 0:45 ` Pete Wyckoff
2008-03-22 16:06 ` Serious regression caused by fix for " James Bottomley
2008-03-24 9:23 ` FUJITA Tomonori
2008-03-26 14:22 ` FUJITA Tomonori [this message]
2008-03-26 14:36 ` James Bottomley
2008-03-26 14:59 ` FUJITA Tomonori
2008-03-27 1:32 ` Mike Christie
2008-03-27 11:11 ` FUJITA Tomonori
2008-03-27 20:46 ` Mike Christie
2008-03-27 1:51 ` Mike Christie
2008-03-27 2:18 ` Mike Christie
2008-03-27 11:11 ` FUJITA Tomonori
2008-03-27 11:11 ` FUJITA Tomonori
2008-03-27 12:18 ` FUJITA Tomonori
2008-03-30 17:39 ` James Bottomley
2008-03-31 0:20 ` FUJITA Tomonori
2008-04-02 18:41 ` Boaz Harrosh
2008-04-02 21:00 ` FUJITA Tomonori
2008-04-03 7:58 ` Boaz Harrosh
2008-03-27 1:59 ` Mike Christie
2008-03-27 0:25 ` Mike Christie
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080326232226X.tomof@acm.org \
--to=tomof@acm.org \
--cc=James.Bottomley@HansenPartnership.com \
--cc=Jens.Axboe@oracle.com \
--cc=erezz@voltaire.com \
--cc=fujita.tomonori@lab.ntt.co.jp \
--cc=linux-scsi@vger.kernel.org \
--cc=michaelc@cs.wisc.edu \
--cc=pw@osc.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).