From: Hannes Reinecke <hare@suse.de>
To: Petr Tesarik <ptesarik@suse.cz>
Cc: linux-scsi@vger.kernel.org
Subject: Re: Locking scheme of /proc/scsi/scsi
Date: Tue, 22 Nov 2011 10:16:23 +0100 [thread overview]
Message-ID: <4ECB6867.8020509@suse.de> (raw)
In-Reply-To: <4ECB6477.4050703@suse.de>
On 11/22/2011 09:59 AM, Hannes Reinecke wrote:
> On 11/21/2011 06:32 PM, Petr Tesarik wrote:
>> Hi folks,
>>
>> I've been working on a kernel crash dump of an ancient kernel recently, and I
>> have come to the conculsion that walking the scsi devices via
>> bus_find_device() is completely flawed. While looking for an upstream fix, I
>> didn't find any, so the same flaw is probably still there. However, let me ask
>> here to check how this is supposed to work.
>>
>> First, this is how I understand the issue. The "/proc/scsi/scsi" file is
>> handled as a pretty standard seqfile, iterating over the devices with the
>> following function:
>>
>> static inline struct device *next_scsi_device(struct device *start)
>> {
>> struct device *next = bus_find_device(&scsi_bus_type, start, NULL,
>> always_match);
>> put_device(start);
>> return next;
>> }
>>
>> The returned value is used for the next iteration. Now, bus_find_device()
>> assumes that the device is still attached to the knode_bus klist, because
>> that's how it initializes the klist iterator. When it finds the next device,
>> it increments the reference count on the device with get_device(), but it
>> doesn't do anything about the knode_bus field. So, when somebody calls
>> scsi_remove_device() on the current device between two calls to
>> next_scsi_device, then it does:
>>
>> if (sdev->is_visible) {
>> [...]
>> device_del(dev);
>>
>> which in turn calls:
>>
>> bus_remove_device(dev);
>>
>> which does:
>>
>> if (klist_node_attached(&dev->p->knode_bus))
>> klist_del(&dev->p->knode_bus);
>>
>> So, even though the struct device has a non-zero refcount, the code in
>> next_scsi_device cannot continue, because it only has a stale pointer to an
>> already detached klist, right?
>>
>> At least that's what I saw in 2.6.16, and I can still see the same thing
>> possible in 3.1.
>>
> Hmm. Looks like we need to increase the refcount to knode_bus when
> we iterate devices.
> Let me check ...
>
No, this seems to be okay. klists are protected by their own
refcounting in ->n_ref (via klist_dec_and_del()), so the list
processing can continue.
However, seeing that you're working with 2.6.16 there has been a
rather serious issue with scsi device scanning, which has been fixed
by 32aeef605aa01e1fee45e052eceffb00e72ba2b0.
Please to check whether that patch is included.
Cheers,
Hannes
--
Dr. Hannes Reinecke zSeries & Storage
hare@suse.de +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2011-11-22 9:16 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-11-21 17:32 Locking scheme of /proc/scsi/scsi Petr Tesarik
2011-11-22 8:59 ` Hannes Reinecke
2011-11-22 9:16 ` Hannes Reinecke [this message]
2011-11-22 10:57 ` Petr Tesarik
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4ECB6867.8020509@suse.de \
--to=hare@suse.de \
--cc=linux-scsi@vger.kernel.org \
--cc=ptesarik@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox