public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Guenter Roeck <linux@roeck-us.net>
To: Bjorn Helgaas <bhelgaas@google.com>
Cc: Rajat Jain <rajatjain@juniper.net>,
	Yinghai Lu <yinghai@kernel.org>,
	Rajat Jain <rajatjain.linux@gmail.com>,
	"linux-pci@vger.kernel.org" <linux-pci@vger.kernel.org>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Kenji Kaneshige <kaneshige.kenji@jp.fujitsu.com>,
	Alex Williamson <alex.williamson@redhat.com>,
	Yijing Wang <wangyijing@huawei.com>,
	Paul Bolle <pebolle@tiscali.nl>,
	Rajat Jain <rajatxjain@gmail.com>,
	Guenter Roeck <groeck@juniper.net>
Subject: Re: [PATCH v2 2/4] pciehp: Use link change notifications for hot-plug and removal
Date: Mon, 16 Dec 2013 18:36:01 -0800	[thread overview]
Message-ID: <52AFB891.5020700@roeck-us.net> (raw)
In-Reply-To: <CAErSpo6pBk6HR79jLfxd2sZPQ3Xq=Z-u1t__ODzDqa31_TLPSA@mail.gmail.com>

On 12/16/2013 05:14 PM, Bjorn Helgaas wrote:
> On Mon, Dec 16, 2013 at 10:39 AM, Guenter Roeck <linux@roeck-us.net> wrote:
>> On Sun, Dec 15, 2013 at 05:18:26PM -0700, Bjorn Helgaas wrote:
>>> On Sun, Dec 15, 2013 at 4:24 PM, Rajat Jain <rajatjain@juniper.net> wrote:
>>>>>>>
>>>>>>>> Once again: the way I interpret this is: * Always enable Link events.
>>>>>>>> * Disable presence events if attention button is present.
>>>>>>>
>>>>>>> That sounds like a good plan to me.
>>>>>>
>>>>>> How about Diag_Reset from MPT2SAS and others?  link could up and down
>>>>>>
>>>>
>>>> I am assuming you are referring to
>>>>
>>>> static int _base_diag_reset(struct MPT2SAS_ADAPTER *ioc, int sleep_flag)
>>>>
>>>> Which as far as I could understand would cause link to go down and come up
>>>> again without the kernel knowing anything about it?  ...
>>>
>>>> In general, I guess the question is when a link goes down and back up,
>>>> whether or not we want to treat it as a hot unplug followed by a hotplug. I
>>>> think there may be cases such as AER (or the one Yinghai mentions) where we
>>>> don't want to treat it as a hotplug (see note below). And there may be cases
>>>> that we definitely want to treat it as hotplug (need link events!).
>>>> Situation gets more complex since there may be pciehp slots downstream of a
>>>> link getting reset.
>>>>
>>>> It seems to me that we are saying that a mechanism is needed so that a
>>>> voluntary Link flap is NOT treated like a hotplug temporarily?  ...
>>>
>>>> Notes: * it may not OK, if the kernel thinks the device is accessible when
>>>> it is really not.  What if during this downtime, someone tries to access the
>>>> device (say userspace)?  * How do we know after the link up, that the device
>>>> is really the same.  If during this reset, the device changed its
>>>> "character", say a different class?  I think a rescan should be mandated
>>>> after every such event.  * Do we need to unload and reload the driver after
>>>> the link reset, since it can be a different device?
>>>
>>> I am quite dubious about the idea of a voluntary link flap.  If the link goes
>>> down and comes back up, I don't see how we can make any assumptions about what
>>> device is there.
>>>
>>> I let Alex talk me into pciehp_reset_slot(), which disables presence detect
>>> interrupts while resetting a device, so we already have a bit of precedent for
>>> the idea.  But even in that case, the device could easily come out of reset as
>>> a different device, e.g., if the reset caused it to load updated firmware.
>>>
>>> I would feel much better if we treated link down as a remove and did a rescan
>>> on the link up.
>>>
>> Agreed. Question is if we might need some means for a driver to tell the PCIe
>> core about an upcoming "planned" link flap. pciehp_reset_slot() doesn't seem
>> to address the condition where a driver resets a connected chip by other means
>> than by calling pciehp_reset_slot(). Still not sure what happens when the
>> mpt2sas driver issues its diagnostic reset, to take Yinghai's example (or if
>> there would be a cleaner way to implement such a reset).
>
> In my opinion we should not add the concept of a planned link flap.
> We already have pci_reset_function(), and we can probably make that
> deal with link up/down events internally, so I think we should try to
> use that if we can.  I think it's too much of a mess to try to support
> link flaps for random driver-initiated resets that don't go through
> the PCI core.
>
Perfectly fine with me.

> That probably means going through and identifying all the drivers we
> can find that do their own internal resets, and converting them.
> We'll likely miss some, since the mechanisms are driver-specific.  And

Also might be difficult - that kind of work really asks for having
the hardware available for testing.

Guenter

> maybe there are some driver resets that think they add value over the
> core's pci_reset_function() (I'm not sure what that would be, but I'm
> open to discussion about it).
>
> Bjorn
>
>


  reply	other threads:[~2013-12-17  2:36 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-12-03 22:32 [PATCH v2 2/4] pciehp: Use link change notifications for hot-plug and removal Rajat Jain
2013-12-05  9:07 ` Yijing Wang
2013-12-06  3:19   ` Rajat Jain
2013-12-12 22:44 ` Bjorn Helgaas
2013-12-13  6:26   ` Yinghai Lu
2013-12-13 13:21     ` Bjorn Helgaas
2013-12-13 19:04       ` Rajat Jain
2013-12-13 21:14         ` Bjorn Helgaas
2013-12-14  1:58           ` Yinghai Lu
2013-12-14  3:39             ` Guenter Roeck
2013-12-15 23:24               ` Rajat Jain
2013-12-16  0:18                 ` Bjorn Helgaas
2013-12-16 17:39                   ` Guenter Roeck
2013-12-17  1:14                     ` Bjorn Helgaas
2013-12-17  2:36                       ` Guenter Roeck [this message]
2013-12-15 22:23           ` Rajat Jain

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=52AFB891.5020700@roeck-us.net \
    --to=linux@roeck-us.net \
    --cc=alex.williamson@redhat.com \
    --cc=bhelgaas@google.com \
    --cc=groeck@juniper.net \
    --cc=kaneshige.kenji@jp.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-pci@vger.kernel.org \
    --cc=pebolle@tiscali.nl \
    --cc=rajatjain.linux@gmail.com \
    --cc=rajatjain@juniper.net \
    --cc=rajatxjain@gmail.com \
    --cc=wangyijing@huawei.com \
    --cc=yinghai@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox