From: Stephan Gerhold <stephan.gerhold@linaro.org>
To: "Aiqun(Maria) Yu" <aiqun.yu@oss.qualcomm.com>
Cc: Jingyi Wang <jingyi.wang@oss.qualcomm.com>,
	Bjorn Andersson <andersson@kernel.org>,
	Mathieu Poirier <mathieu.poirier@linaro.org>,
	Rob Herring <robh@kernel.org>,
	Krzysztof Kozlowski <krzk+dt@kernel.org>,
	Conor Dooley <conor+dt@kernel.org>,
	Manivannan Sadhasivam <mani@kernel.org>,
	Luca Weiss <luca.weiss@fairphone.com>,
	Bartosz Golaszewski <brgl@kernel.org>,
	Konrad Dybcio <konradybcio@kernel.org>,
	shengchao.guo@oss.qualcomm.com, tingwei.zhang@oss.qualcomm.com,
	trilok.soni@oss.qualcomm.com, yijie.yang@oss.qualcomm.com,
	linux-arm-msm@vger.kernel.org, linux-remoteproc@vger.kernel.org,
	devicetree@vger.kernel.org, linux-kernel@vger.kernel.org,
	Gokul Krishna Krishnakumar <gokul.krishnakumar@oss.qualcomm.com>
Subject: Re: [PATCH v6 5/6] remoteproc: qcom: pas: Add late attach support for subsystems
Date: Thu, 11 Jun 2026 11:10:21 +0200
Message-ID: <aip7feoTn0ncwzL7@linaro.org>
In-Reply-To: <6df4c351-7287-4fb9-8af8-83b5deabfa07@oss.qualcomm.com>

On Thu, Jun 11, 2026 at 11:10:25AM +0800, Aiqun(Maria) Yu wrote:
> On 5/22/2026 8:07 PM, Stephan Gerhold wrote:
> > On Tue, May 19, 2026 at 12:24:23AM -0700, Jingyi Wang wrote:
> >> Subsystems can be brought out of reset by entities such as bootloaders.
> >> As the irq enablement could be later than subsystem bring up, the state
> >> of subsystem should be checked by reading SMP2P bits.
> >>
> >> A new qcom_pas_attach() function is introduced. If a crash state is
> >> detected for the subsystem, rproc_report_crash() is called. If the ready
> >> state is detected, it will be marked as "attached". Otherwise, it could
> >> be that the early boot feature is not supported by other entities. In this
> >> case, the state will be marked as RPROC_OFFLINE so that the PAS driver
> >> can load the firmware and start the remoteproc.
> >>
> >> Co-developed-by: Gokul Krishna Krishnakumar <gokul.krishnakumar@oss.qualcomm.com>
> >> Signed-off-by: Gokul Krishna Krishnakumar <gokul.krishnakumar@oss.qualcomm.com>
> >> Signed-off-by: Jingyi Wang <jingyi.wang@oss.qualcomm.com>
> > 
> > Unfortunately, removing the ping-pong functionality that was present in
> > previous patch versions makes the whole mechanism a lot more fragile.
> > I'm not entirely sure if this has changed in SMP2P v2 or more recent
> > firmware versions, but in my experience the SMP2P "ready" bit does not
> > tell you if the remoteproc is actually running. The problem is that the
> > "ready" bit is asserted by the remoteproc when the firmware is ready,
> > but it is not cleared when you shut down or forcibly stop the remoteproc.
> > 
> > If this is still the case, you can easily reproduce that with the
> > following test:
> > 
> >  1. Start the system as usual and let it attach the remoteproc
> >  2. Manually stop the remoteproc in sysfs (echo stop > state)
> >  3. modprobe -r qcom_q6v5_pas
> >  4. modprobe qcom_q6v5_pas
> >  5. If the "ready" bit is still set, the driver will try attaching the
> >     remoteproc, but it's actually not running. No recovery will happen.
> > 
> > In this situation, it is very difficult to detect the correct remoteproc
> > state without relying on an additional query mechanism like the
> > ping-pong feature.
> 
> This is a valid use case and concern. We had a discussion with Bjorn, and
> want to take this scenario into consideration in the separate robustness
> improvement series[1].
> Stephan, could you agree to let the basic functionality in this series
> go in first?
> 
> [1]
> https://lore.kernel.org/all/20260519-rproc-attach-issue-v2-0-caa1eaf75081@oss.qualcomm.com/
> 
> > 
> > You can make it a bit more reliable if you also check the status of the
> > "stop-ack" bit. This would tell you if the remoteproc was cleanly
> > stopped with the SMP2P "stop" mechanism. However, that will typically
> > still not fix the case above since nowadays remoteprocs are typically
> > stopped via the QMI qcom_sysmon and the "stop-ack" is not set in that
> > case. I believe this might instead set the separate "shutdown-ack" bit
> > that is described for some SoCs, but I never finished testing that.
> > 
> > And even if you check both "stop-ack" and "shutdown-ack", that doesn't
> > tell you if the remoteproc was forcibly killed using
> > qcom_scm_pas_shutdown() without gracefully stopping it first. The ideal
> > solution would be querying the PAS API to tell us if the remoteproc is
> > actively running, but the last time I checked I was unfortunately not
> > able to find a documented call that would tell us that.
> 
> Currently the kernel cannot tell whether the remoteproc is offline or
> crashed in the ready==1 && error==0 && ping-pong==0 scenario. After a
> re-modprobe, the software has no state data, and as far as I understand
> only the firmware can tell us whether it is active or not.
> 
> Maybe let's have this scenario and solution discussion in the other
> series I mentioned before.
> 

If you add a new feature upstream, you must make sure that it is
reasonably robust and reliable. The other series is about generic
limitations in the remoteproc subsystem, so I don't think you should
move QC-specific parts over there as well (personally, I would have
probably kept all of it in one series to make it easier to understand,
but that's subjective).

With the current firmware design, it's hard - probably impossible - to
make the status detection perfectly reliable. I would therefore choose
some reasonable compromise to start with. Given that Shawn (and actually
me as well) would like to have attach working without firmware support
for the ping-pong functionality, I think it would be reasonable to start
with the basic detection scheme discussed above, i.e.

  ready==1 && handover==1 && fatal==0 && stop-ack==0 && shutdown-ack==0

The ping-pong functionality could be added later for platforms that
support it. It would be good to have the interrupts already defined in
the device tree, so you can tweak the driver without making DT changes
later.
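
For illustration, the basic detection scheme could be sketched in plain C
roughly as follows. This is only a sketch under assumptions: the struct,
its fields, and the function names are hypothetical and do not come from
the patch or from the existing qcom_q6v5 code. It assumes the five SMP2P
bits have already been read into a snapshot.

```c
#include <assert.h>
#include <stdbool.h>

/*
 * Hypothetical snapshot of the SMP2P state bits of one remoteproc.
 * Names are illustrative only, not taken from the driver.
 */
struct smp2p_snapshot {
	bool ready;
	bool handover;
	bool fatal;
	bool stop_ack;
	bool shutdown_ack;
};

/*
 * Returns true if the bits suggest the remoteproc is up and can be
 * attached:
 *   ready==1 && handover==1 && fatal==0 && stop-ack==0 && shutdown-ack==0
 */
bool smp2p_looks_running(const struct smp2p_snapshot *s)
{
	return s->ready && s->handover && !s->fatal &&
	       !s->stop_ack && !s->shutdown_ack;
}

/* Usage example: a few snapshots and the decision each one leads to. */
void smp2p_self_check(void)
{
	const struct smp2p_snapshot running = {
		.ready = true, .handover = true,
	};
	const struct smp2p_snapshot stopped = {
		.ready = true, .handover = true, .stop_ack = true,
	};
	const struct smp2p_snapshot crashed = {
		.ready = true, .handover = true, .fatal = true,
	};
	const struct smp2p_snapshot never_booted = { 0 };

	assert(smp2p_looks_running(&running));
	assert(!smp2p_looks_running(&stopped));      /* clean SMP2P stop */
	assert(!smp2p_looks_running(&crashed));      /* fatal bit set */
	assert(!smp2p_looks_running(&never_booted)); /* no early boot */
}
```

Note that such a check still cannot detect a remoteproc that was forcibly
killed with qcom_scm_pas_shutdown(), since none of these bits change in
that case.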

Thanks,
Stephan
