From: Greg KH <gregkh@linuxfoundation.org>
To: dengjie03 <dengjie03@kylinos.cn>
Cc: rjw@rjwysocki.net, pavel@ucw.cz, len.brown@intel.com,
linux-usb@vger.kernel.org, linux-kernel@vger.kernel.org,
linux-pm@vger.kernel.org, xiehongyu1@kylinos.cn,
duanchenghao@kylinos.cn, xiongxin <xiongxin@kylinos.cn>
Subject: Re: [PATCH] KYLIN: USB: Fix the issue of S4 wakeup queisce phase where task resumption fails due to USB status
Date: Mon, 23 Sep 2024 15:06:58 +0200 [thread overview]
Message-ID: <2024092355-chip-stuffy-bd93@gregkh> (raw)
In-Reply-To: <20240923100553.119324-1-dengjie03@kylinos.cn>
On Mon, Sep 23, 2024 at 06:05:53PM +0800, dengjie03 wrote:
> Reproduction of the problem: During the S4 stress test,
> when a USB device is inserted or removed, there is a
> probability that the S4 wakeup will turn into a reboot.
> The following two points describe how to analyze and
> locate the problem points:
>
> 1. During the boot stage when S4 is awakened, after
> the USB RootHub is initialized,it will enter the
> runtime suspend state. From then on, whenever an
> xhci port change event occurs, it will trigger a
> remote wakeup request event and add wakeup_work to
> pm_wq, where the subsequent RootHub runtime resume
> process will be handled by pm_wq.
>
> xhci runtime suspend flow:
> S4 boot
> |->xhci init
> |->register_root_hub
> |->hub_probe
> |->callback = RPM_GET_CALLBACK(dev, runtime_suspend) /* xhci RootHub runtime suspend */
>
> xhci runtime resume flow :
> xhci_irq()
> |->xhci_handle_event()
> |->handle_port_status()
> |->if(hcd->state == HC_STATE_SUSPENDED)
> |->usb_hcd_resume_root_hub()
> |->set_bit(HCD_FLAG_WAKEUP_PENDING, &hcd->flags) /* wakeup pending signal to be set */
> |->queue_work(pm_wq, &hcd->wakeup_work)
> |->hcd_resume_work() /* hcd->wakeup_work */
> |->usb_remote_wakeup()
> |->callback = RPM_GET_CALLBACK(dev, runtime_resume)
> |->usb_runtime_resume() /* usb runtime resume */
> |->generic_resume()
> |->hcd_bus_resume()
> |->clear_bit(HCD_FLAG_WAKEUP_PENDING, &hcd->flags);
> /* wakeup pending signal to be clear */
>
> 2. However, during the quiesce phase of S4 wakeup,
> freeze_kernel_threads() will freeze this pm_wq,
> and between freeze_kernel_threads() and dpm_suspend_start(),
> there exists a very time-consuming S4 image loading process.
> This leads to a situation where, if an xhci port change event occurs
> after freeze_kernel_threads(), triggering the wakeup pending
> signal to be set,but it cannot be processed by pm_wq to clear
> this wakeup_pending bit, it will result in a subsequent
> dpm_suspend_start() where USB suspend_common() detects the
> wakeup pending signal being set and returns an -EBUSY error,
> interrupting the S4 quiesce process and reverting to a reboot.
>
> S4 wakeup
> |->resume_store
> |->software_resume()
> |->freeze_kernel_threads() /* will freeze pm_wq */
> |->load_image_and_restore()
> |->swsusp_read() /* S4 image loading: time-consuming .
> When an xhci port change event occurs at this point, it triggers the wakeup pending signal to be set.
> However, since the pm_wq is in a frozen state, the wakeup_pending bit cannot be cleared.*/
> |->hibernation_restore
> |->dpm_suspend_start(PMSG_QUIESCE)
> |->hcd_pci_suspend()
> |->suspend_common()
> |->if (do_wakeup && HCD_WAKEUP_PENDING(hcd)) return -EBUSY;
>
> Below is a description of the countermeasures taken to address this issue:
> 1. Considering the restore process that occurs after
> the quiesce phase during S4 wakeup, which essentially
> resets all root hubs,checking this wakeup pending status
> in USB suspend_common() during the quiesce phase is of
> little significance and should therefore be filtered out.
>
> S4 wakeup restore phase
> |->dpm_resume(PMSG_RESTORE)
> |->hcd_pci_restore()
> |->xhci_resume() /* reset all root hubs */
>
> Fixes: 3904bdf0821c40352495f632862637080e6bd744(PM: hibernate: Freeze kernel threads in software_resume())
Please read the documentation for how to list a Fixes: tag, it should be
a bit smaller than this line :)
> Signed-off-by: xiongxin <xiongxin@kylinos.cn>
> Co-developed-by: dengjie03 <dengjie03@kylinos.cn>
As the documentation states, we need real names, not email aliases.
And fix up how you use co-developed-by please, again, the documentation
shows how to do so.
> ---
> drivers/base/power/main.c | 5 +++++
> drivers/usb/core/hcd-pci.c | 21 ++++++++++++++-------
> include/linux/pm.h | 1 +
> 3 files changed, 20 insertions(+), 7 deletions(-)
>
> diff --git a/drivers/base/power/main.c b/drivers/base/power/main.c
> index fb4d18a0b185..264d08b9e331 100644
> --- a/drivers/base/power/main.c
> +++ b/drivers/base/power/main.c
> @@ -559,6 +559,11 @@ bool dev_pm_may_skip_resume(struct device *dev)
> return !dev->power.must_resume && pm_transition.event != PM_EVENT_RESTORE;
> }
>
> +bool is_pm_queisce(void)
Bad name for a global function :(
> +{
> + return pm_transition.event == PM_EVENT_QUIESCE;
What happens if it changes right after this? Where is the locking
involved? And why does USB only care about this and no other subsystem?
thanks,
greg k-h
next prev parent reply other threads:[~2024-09-23 13:07 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-09-23 10:05 [PATCH] KYLIN: USB: Fix the issue of S4 wakeup queisce phase where task resumption fails due to USB status dengjie03
2024-09-23 13:06 ` Greg KH [this message]
2024-09-23 13:16 ` Greg KH
2024-09-24 10:59 ` dengjie
2024-09-25 2:50 ` [PATCH v2] " dengjie
2024-09-25 6:40 ` Greg KH
2024-09-25 8:06 ` dengjie
2024-09-25 8:54 ` Greg KH
2024-09-26 8:20 ` Deng Jie
2024-09-25 8:18 ` dengjie
2024-09-25 6:40 ` Greg KH
2024-09-30 19:38 ` Alan Stern
2024-10-10 0:46 ` Deng Jie
2024-10-10 14:01 ` Alan Stern
2024-10-11 7:17 ` Deng Jie
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2024092355-chip-stuffy-bd93@gregkh \
--to=gregkh@linuxfoundation.org \
--cc=dengjie03@kylinos.cn \
--cc=duanchenghao@kylinos.cn \
--cc=len.brown@intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=linux-usb@vger.kernel.org \
--cc=pavel@ucw.cz \
--cc=rjw@rjwysocki.net \
--cc=xiehongyu1@kylinos.cn \
--cc=xiongxin@kylinos.cn \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.