From: Umang Chheda <umang.chheda@oss.qualcomm.com>
To: Ruidong Tian <tianruidond@linux.alibaba.com>,
Tony Luck <tony.luck@intel.com>, Borislav Petkov <bp@alien8.de>,
Rob Herring <robh@kernel.org>,
Krzysztof Kozlowski <krzk+dt@kernel.org>,
Conor Dooley <conor+dt@kernel.org>,
Bjorn Andersson <andersson@kernel.org>,
Konrad Dybcio <konradybcio@kernel.org>,
catalin.marinas@arm.com, will@kernel.org, lpieralisi@kernel.org,
rafael@kernel.org, mark.rutland@arm.com,
Sudeep Holla <sudeep.holla@kernel.org>
Cc: linux-arm-msm@vger.kernel.org, linux-acpi@vger.kernel.org,
linux-arm-kernel@lists.infradead.org, linux-edac@vger.kernel.org,
linux-kernel@vger.kernel.org, devicetree@vger.kernel.org,
linux-edac@vger.kernel.org,
Umang Chheda <umang.chheda@oss.qualcomm.com>
Subject: [PATCH 2/8] ras: aest: Fix CE/UE error counts not incrementing in debugfs
Date: Tue, 05 May 2026 17:53:46 +0530 [thread overview]
Message-ID: <20260505-aest-devicetree-support-v1-2-d5d6ffacf0a5@oss.qualcomm.com> (raw)
In-Reply-To: <20260505-aest-devicetree-support-v1-0-d5d6ffacf0a5@oss.qualcomm.com>
The error counts visible under:
/sys/kernel/debug/aest/<dev>/processor<cpu>/<node>/err_count
always reported zero, even though corrected errors (CEs) were being
serviced by the interrupt handler. aest_oncore_dev_init_debugfs() sets
up per CPU debugfs entries but wired them up incorrectly in two places:
- this_cpu_ptr(adev->adev_oncore) was used inside for_each_possible_cpu().
This always selects the slot for the CPU executing the init code, so all
debugfs files ended up referencing the same per CPU aest_device instance
instead of the CPU indicated by the loop variable.
- The code referenced adev->nodes[i], i.e. the template nodes allocated
before __setup_ppi, rather than the per-CPU copies at
percpu_dev->nodes[i]. The IRQ handler updates CE counters in the per-CPU
records created by __setup_ppi, the template records are never touched
at runtime, so err_count always read as zero.
Fix this by:
- Using per_cpu_ptr(adev->adev_oncore, cpu) when iterating over CPUs.
Wiring debugfs files to percpu_dev->nodes[i] so counters reflect the
data updated by the IRQ handler.
- Using adev->nodes[i].name for debugfs directory names. The per-CPU node
receives name via a shallow memcpy and is not the authoritative source.
Signed-off-by: Umang Chheda <umang.chheda@oss.qualcomm.com>
---
drivers/ras/aest/aest-sysfs.c | 19 +++++++++++++------
1 file changed, 13 insertions(+), 6 deletions(-)
diff --git a/drivers/ras/aest/aest-sysfs.c b/drivers/ras/aest/aest-sysfs.c
index 66e9c1103f99..f710503e4d74 100644
--- a/drivers/ras/aest/aest-sysfs.c
+++ b/drivers/ras/aest/aest-sysfs.c
@@ -189,16 +189,23 @@ aest_oncore_dev_init_debugfs(struct aest_device *adev)
char name[16];
for_each_possible_cpu(cpu) {
- percpu_dev = this_cpu_ptr(adev->adev_oncore);
+ percpu_dev = per_cpu_ptr(adev->adev_oncore, cpu);
- snprintf(name, sizeof(name), "processor%u%u", cpu);
+ snprintf(name, sizeof(name), "processor%u", cpu);
percpu_dev->debugfs = debugfs_create_dir(name, adev->debugfs);
for (i = 0; i < adev->node_cnt; i++) {
- node = &adev->nodes[i];
-
- node->debugfs = debugfs_create_dir(node->name,
- percpu_dev->debugfs);
+ node = &percpu_dev->nodes[i];
+
+ /*
+ * Use adev->nodes[i].name (the original) rather than
+ * node->name from the per-CPU copy. The per-CPU copy
+ * receives node->name via shallow memcpy in __setup_ppi;
+ * the original is the authoritative, guaranteed-valid
+ * string.
+ */
+ node->debugfs = debugfs_create_dir(adev->nodes[i].name,
+ percpu_dev->debugfs);
aest_node_init_debugfs(node);
}
}
--
2.34.1
next prev parent reply other threads:[~2026-05-05 12:25 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-05 12:23 [PATCH 0/8] ras: aest: extend AEST support to Device Tree frontend Umang Chheda
2026-05-05 12:23 ` [PATCH 1/8] ras: aest: Fix shared processor node handling and error log messages Umang Chheda
2026-05-05 12:23 ` Umang Chheda [this message]
2026-05-05 12:23 ` [PATCH 3/8] ras: aest: Skip unimplemented records in debugfs Umang Chheda
2026-05-05 12:23 ` [PATCH 4/8] ras: aest: Add panic_on_ue module parameter Umang Chheda
2026-05-05 12:23 ` [PATCH 5/8] dt-bindings: arm: ras: Introduce bindings for ARM AEST Umang Chheda
2026-05-05 12:23 ` [PATCH 6/8] ras: aest: Add DT frontend for ARM AEST RAS error sources Umang Chheda
2026-05-05 12:23 ` [PATCH 7/8] arm64: dts: qcom: lemans: add AEST error nodes Umang Chheda
2026-05-05 12:23 ` [PATCH 8/8] arm64: dts: qcom: monaco: " Umang Chheda
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260505-aest-devicetree-support-v1-2-d5d6ffacf0a5@oss.qualcomm.com \
--to=umang.chheda@oss.qualcomm.com \
--cc=andersson@kernel.org \
--cc=bp@alien8.de \
--cc=catalin.marinas@arm.com \
--cc=conor+dt@kernel.org \
--cc=devicetree@vger.kernel.org \
--cc=konradybcio@kernel.org \
--cc=krzk+dt@kernel.org \
--cc=linux-acpi@vger.kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=linux-edac@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=lpieralisi@kernel.org \
--cc=mark.rutland@arm.com \
--cc=rafael@kernel.org \
--cc=robh@kernel.org \
--cc=sudeep.holla@kernel.org \
--cc=tianruidond@linux.alibaba.com \
--cc=tony.luck@intel.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox