From: Mathias Nyman <mathias.nyman@linux.intel.com>
To: Ladislav Michl <oss-lists@triops.cz>, linux-usb@vger.kernel.org
Subject: Re: xHCI host dies on device unplug
Date: Fri, 16 Dec 2022 12:13:23 +0200 [thread overview]
Message-ID: <abfec817-0b32-ece3-4965-7503aa5a77fa@linux.intel.com> (raw)
In-Reply-To: <Y5tHWwHctY6wr+CJ@lenoch>
On 15.12.2022 18.12, Ladislav Michl wrote:
> +Cc Mathias as he last touched this code path and may know more :)
>
> On Tue, Dec 06, 2022 at 02:17:08PM +0100, Ladislav Michl wrote:
>> On Mon, Dec 05, 2022 at 10:27:57PM +0100, Ladislav Michl wrote:
>>> I'm running current linux.git on custom Marvell OCTEON III CN7020
>>> based board. USB devices like FTDI (idVendor=0403, idProduct=6001,
>>> bcdDevice= 6.00) Realtek WiFi dongle (idVendor=0bda, idProduct=8179,
>>> bcdDevice= 0.00) works without issues, while Ralink WiFi dongle
>>> (idVendor=148f, idProduct=5370, bcdDevice= 1.01) kills the host on
>>> disconnect:
>>> xhci-hcd xhci-hcd.0.auto: xHCI host not responding to stop endpoint command
>>> xhci-hcd xhci-hcd.0.auto: xHCI host controller not responding, assume dead
>>> xhci-hcd xhci-hcd.0.auto: HC died; cleaning up
>>>
>>> Unfortunately I do not have a datasheet for CN7020 SoC, so it is hard
>>> to tell if there is any errata :/ In case anyone see a clue in debug
>>> logs bellow, I'll happily give it a try.
>>
>> So I do have datasheet now. As a wild guess I tried to use dlmc_ref_clk0
>> instead of dlmc_ref_clk1 as a refclk-type-ss and it fixed unplug death.
>> I have no clue why, but anyway - sorry for the noise :) Perhaps Octeon's
>> clock init is worth to be verified...
>
> After all whenever xhci dies with "xHCI host not responding to stop endpoint
> command" depends also on temperature, so there seems to be race somewhere.
>
> As a quick and dirty verification, whenever xhci really died, following patch
> was tested and it fixed issue. It just treats ep as if stop endpoint command
> succeeded. Any clues? I'll happily provide more traces.
It's possible the controller did complete the stop endpoint command but driver
didn't get the interrupt for the event for some reason.
I wrote some patches that checks the event ring for this event during
timeout.
code is in a stop_endpoint_fixes branch in my tree:
git://git.kernel.org/pub/scm/linux/kernel/git/mnyman/xhci.git stop_endpoint_fixes
https://git.kernel.org/pub/scm/linux/kernel/git/mnyman/xhci.git/log/?h=stop_endpoint_fixes
Another thing would be checking command and event rings for this stop endpoint command.
So Instead of killing host at timeout, do nothing, and check sysfs after the disconnect:
cat /sys/kernel/debug/usb/xhci/<address>/event-ring/trbs
cat /sys/kernel/debug/usb/xhci/<address>/command-ring/trbs
-Mathias
next prev parent reply other threads:[~2022-12-16 10:12 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-12-05 21:27 xHCI host dies on device unplug Ladislav Michl
2022-12-06 13:17 ` Ladislav Michl
2022-12-15 16:12 ` Ladislav Michl
2022-12-16 10:13 ` Mathias Nyman [this message]
2022-12-16 21:32 ` Ladislav Michl
2022-12-19 12:25 ` Mathias Nyman
2022-12-19 18:31 ` Ladislav Michl
2022-12-19 21:45 ` Ladislav Michl
2022-12-20 7:58 ` Ladislav Michl
2022-12-21 9:46 ` Mathias Nyman
2022-12-21 7:14 ` Ladislav Michl
2022-12-21 9:58 ` Mathias Nyman
2022-12-21 10:11 ` Ladislav Michl
2022-12-21 12:05 ` Ladislav Michl
2022-12-21 12:12 ` Mathias Nyman
2022-12-21 12:21 ` Ladislav Michl
2022-12-19 7:11 ` Ladislav Michl
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=abfec817-0b32-ece3-4965-7503aa5a77fa@linux.intel.com \
--to=mathias.nyman@linux.intel.com \
--cc=linux-usb@vger.kernel.org \
--cc=oss-lists@triops.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.