From: Chaitanya Kulkarni <chaitanyak@nvidia.com>
To: Shinichiro Kawasaki <shinichiro.kawasaki@wdc.com>
Cc: Yi Zhang <yi.zhang@redhat.com>,
"linux-block@vger.kernel.org" <linux-block@vger.kernel.org>,
Keith Busch <kbusch@kernel.org>,
"linux-nvme@lists.infradead.org" <linux-nvme@lists.infradead.org>,
"linux-scsi@vger.kernel.org" <linux-scsi@vger.kernel.org>,
"mstowe@redhat.com" <mstowe@redhat.com>,
"linux-pci@vger.kernel.org" <linux-pci@vger.kernel.org>,
Bjorn Helgaas <helgaas@kernel.org>
Subject: Re: blktests failures with v5.19-rc1
Date: Wed, 15 Jun 2022 22:01:39 +0000 [thread overview]
Message-ID: <cfaee02b-0390-6e1c-e26c-fa0ba3689704@nvidia.com> (raw)
In-Reply-To: <20220615194727.GA1022614@bhelgaas>
On 6/15/22 12:47, Bjorn Helgaas wrote:
> On Tue, Jun 14, 2022 at 04:00:45AM +0000, Shinichiro Kawasaki wrote:
>> On Jun 14, 2022 / 02:38, Chaitanya Kulkarni wrote:
>>> Shinichiro,
>>>
>>> On 6/13/22 19:23, Keith Busch wrote:
>>>> On Tue, Jun 14, 2022 at 01:09:07AM +0000, Shinichiro Kawasaki wrote:
>>>>> (CC+: linux-pci)
>>>>> On Jun 11, 2022 / 16:34, Yi Zhang wrote:
>>>>>> On Fri, Jun 10, 2022 at 10:49 PM Keith Busch <kbusch@kernel.org> wrote:
>>>>>>>
>>>>>>> And I am not even sure this is real. I don't know yet why
>>>>>>> this is showing up only now, but this should fix it:
>>>>>>
>>>>>> Hi Keith
>>>>>>
>>>>>> Confirmed the WARNING issue was fixed with the change, here is
>>>>>> the log:
>>>>>
>>>>> Thanks. I also confirmed that Keith's change to add
>>>>> __ATTR_IGNORE_LOCKDEP to dev_attr_dev_rescan avoids the fix, on
>>>>> v5.19-rc2.
>>>>>
>>>>> I took a closer look into this issue and found The deadlock
>>>>> WARN can be recreated with following two commands:
>>>>>
>>>>> # echo 1 > /sys/bus/pci/devices/0000\:00\:09.0/rescan
>>>>> # echo 1 > /sys/bus/pci/devices/0000\:00\:09.0/remove
>>>>>
>>>>> And it can be recreated with PCI devices other than NVME
>>>>> controller, such as SCSI controller or VGA controller. Then
>>>>> this is not a storage sub-system issue.
>>>>>
>>>>> I checked function call stacks of the two commands above. As
>>>>> shown below, it looks like ABBA deadlock possibility is
>>>>> detected and warned.
>>>>
>>>> Yeah, I was mistaken on this report, so my proposal to suppress
>>>> the warning is definitely not right. If I run both 'echo'
>>>> commands in parallel, I see it deadlock frequently. I'm not
>>>> familiar enough with this code to any good ideas on how to fix,
>>>> but I agree this is a generic pci issue.
>>>
>>> I think it is worth adding a testcase to blktests to make sure
>>> these future releases will test this.
>>
>> Yeah, this WARN is confusing for us then it would be valuable to
>> test by blktests not to repeat it. One point I wonder is: which test
>> group the test case will it fall in? The nvme group could be the
>> group to add, probably.
>>
since this issue been discovered with nvme rescan and revmoe,
it should be added to the nvme category.
>> Another point I wonder is other kernel test suite than blktests.
>> Don't we have more appropriate test suite to check PCI device
>> rescan/remove race ? Such a test sounds more like a PCI bus
>> sub-system test than block/storage test.
I don't think so we could have caught it long time back,
but we clearly did not.
>
> I'm not aware of such a test, but it would be nice to have one.
>
> Can you share your qemu config so I can reproduce this locally?
>
> Thanks for finding and reporting this!
>
> Bjorn
-ck
next prev parent reply other threads:[~2022-06-15 22:02 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-06-09 23:53 blktests failures with v5.19-rc1 Shinichiro Kawasaki
2022-06-10 8:07 ` Christoph Hellwig
2022-06-10 9:22 ` Chaitanya Kulkarni
2022-06-10 9:32 ` Chaitanya Kulkarni
2022-06-10 12:25 ` Shinichiro Kawasaki
2022-06-10 13:15 ` Yi Zhang
2022-06-10 14:47 ` Keith Busch
2022-06-11 8:34 ` Yi Zhang
2022-06-14 1:09 ` Shinichiro Kawasaki
2022-06-14 2:23 ` Keith Busch
2022-06-14 2:38 ` Chaitanya Kulkarni
2022-06-14 4:00 ` Shinichiro Kawasaki
2022-06-15 19:47 ` Bjorn Helgaas
2022-06-15 22:01 ` Chaitanya Kulkarni [this message]
2022-06-15 23:13 ` Yi Zhang
2022-06-16 4:42 ` Shinichiro Kawasaki
2022-06-16 17:55 ` Chaitanya Kulkarni
2022-06-15 23:16 ` Keith Busch
2022-07-19 4:50 ` Shinichiro Kawasaki
2022-07-19 22:31 ` Bjorn Helgaas
2022-07-20 2:27 ` Shinichiro Kawasaki
2022-12-19 11:27 ` Shinichiro Kawasaki
2022-12-29 18:13 ` Bjorn Helgaas
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=cfaee02b-0390-6e1c-e26c-fa0ba3689704@nvidia.com \
--to=chaitanyak@nvidia.com \
--cc=helgaas@kernel.org \
--cc=kbusch@kernel.org \
--cc=linux-block@vger.kernel.org \
--cc=linux-nvme@lists.infradead.org \
--cc=linux-pci@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=mstowe@redhat.com \
--cc=shinichiro.kawasaki@wdc.com \
--cc=yi.zhang@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox