From: sashiko-bot@kernel.org
To: "Dapeng Mi" <dapeng1.mi@linux.intel.com>
Cc: linux-perf-users@vger.kernel.org
Subject: Re: [Patch v8 21/23] perf/x86/intel: Enable PERF_PMU_CAP_SIMD_REGS capability
Date: Fri, 29 May 2026 10:43:09 +0000 [thread overview]
Message-ID: <20260529104309.C69DB1F00893@smtp.kernel.org> (raw)
In-Reply-To: <20260529075645.580362-22-dapeng1.mi@linux.intel.com>
Thank you for your contribution! Sashiko AI review found 2 potential issue(s) to consider:
- [High] The new SIMD registers capability and mask initialization are skipped for older PEBS V4+ architectures (like Icelake, Sapphire Rapids, Alder Lake, and Raptor Lake), leaving them without support for YMM, ZMM, eGPRs, and SSP sampling.
- [Medium] The logic to enable PERF_PMU_CAP_SIMD_REGS incorrectly couples the capability to the presence of non-SIMD extended features like CET_USER (SSP) and APX (eGPRs).
--
commit f179abf9419fef51aa84e8c413e619d7004c64c2
Author: Kan Liang <kan.liang@linux.intel.com>
perf/x86/intel: Enable PERF_PMU_CAP_SIMD_REGS capability
Enable the PERF_PMU_CAP_SIMD_REGS capability if XSAVES support is
available for YMM, ZMM, OPMASK, eGPRs, or SSP.
> diff --git a/arch/x86/events/intel/core.c b/arch/x86/events/intel/core.c
> index 679781519f8cf..eef5d116aa068 100644
> --- a/arch/x86/events/intel/core.c
> +++ b/arch/x86/events/intel/core.c
> @@ -6316,6 +6316,26 @@ static void intel_extended_regs_init(struct pmu *pmu)
> */
> x86_pmu.ext_regs_mask |= XFEATURE_MASK_SSE;
> dest_pmu->capabilities |= PERF_PMU_CAP_EXTENDED_REGS;
[Severity: High]
Does this skip the new SIMD capabilities on older PEBS V4+ architectures?
The initialization for YMM, ZMM, eGPRs, and SSP is only added to
intel_extended_regs_init(). Since intel_extended_regs_init() is called
by update_pmu_cap(), which is guarded by X86_FEATURE_ARCH_PERFMON_EXT,
architectures like Sapphire Rapids and Alder Lake appear to bypass this.
Should this logic also be replicated in intel_ds_pebs_init(), or should
intel_extended_regs_init() be called for those processors so they can
also support the new SIMD capabilities?
> +
> + if (boot_cpu_has(X86_FEATURE_AVX) &&
> + cpu_has_xfeatures(XFEATURE_MASK_YMM, NULL))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_YMM;
> + if (boot_cpu_has(X86_FEATURE_APX) &&
> + cpu_has_xfeatures(XFEATURE_MASK_APX, NULL))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_APX;
> + if (boot_cpu_has(X86_FEATURE_AVX512F)) {
> + if (cpu_has_xfeatures(XFEATURE_MASK_OPMASK, NULL))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_OPMASK;
> + if (cpu_has_xfeatures(XFEATURE_MASK_ZMM_Hi256, NULL))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_ZMM_Hi256;
> + if (cpu_has_xfeatures(XFEATURE_MASK_Hi16_ZMM, NULL))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_Hi16_ZMM;
> + }
> + if (cpu_feature_enabled(X86_FEATURE_USER_SHSTK))
> + x86_pmu.ext_regs_mask |= XFEATURE_MASK_CET_USER;
> +
> + if (x86_pmu.ext_regs_mask != XFEATURE_MASK_SSE)
> + dest_pmu->capabilities |= PERF_PMU_CAP_SIMD_REGS;
[Severity: Medium]
Does this incorrectly couple the SIMD capability to non-SIMD features here?
If a processor supports SSE and CET_USER (SSP), ext_regs_mask will not equal
XFEATURE_MASK_SSE, and PERF_PMU_CAP_SIMD_REGS will be enabled.
However, if a processor only supports SSE, this condition evaluates to false,
and PERF_PMU_CAP_SIMD_REGS is omitted.
Could this regression force userspace tools to fall back to a separate legacy
path to sample XMM registers on SSE-only processors, rather than using the
unified sample_simd_regs interface?
> }
>
> #define counter_mask(_gp, _fixed) ((_gp) | ((u64)(_fixed) << INTEL_PMC_IDX_FIXED))
--
Sashiko AI review · https://sashiko.dev/#/patchset/20260529075645.580362-1-dapeng1.mi@linux.intel.com?part=21
next prev parent reply other threads:[~2026-05-29 10:43 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-29 7:56 [Patch v8 00/23] Support SIMD/eGPRs/SSP registers sampling for perf Dapeng Mi
2026-05-29 7:56 ` [Patch v8 01/23] perf/x86/intel: Validate return value of intel_pmu_init_hybrid() Dapeng Mi
2026-05-29 8:53 ` sashiko-bot
2026-05-29 11:11 ` Peter Zijlstra
2026-05-29 7:56 ` [Patch v8 02/23] perf/x86: Move hybrid PMU initialization before x86_pmu_starting_cpu() Dapeng Mi
2026-05-29 8:51 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 03/23] perf/x86/intel: Enable large PEBS sampling for XMMs Dapeng Mi
2026-05-29 7:56 ` [Patch v8 04/23] perf/x86/intel: Convert x86_perf_regs to per-cpu variables Dapeng Mi
2026-05-29 7:56 ` [Patch v8 05/23] perf: Eliminate duplicate arch-specific functions definations Dapeng Mi
2026-05-29 7:56 ` [Patch v8 06/23] perf/x86: Use x86_perf_regs in the x86 nmi handlers Dapeng Mi
2026-05-29 7:56 ` [Patch v8 07/23] x86/fpu/xstate: Add xsaves_nmi() helper Dapeng Mi
2026-05-29 8:56 ` sashiko-bot
2026-05-29 11:32 ` Peter Zijlstra
2026-05-29 7:56 ` [Patch v8 08/23] x86/fpu: Ensure TIF_NEED_FPU_LOAD is set after saving FPU state Dapeng Mi
2026-05-29 7:56 ` [Patch v8 09/23] perf: Move and enhance has_extended_regs() for arch-specific use Dapeng Mi
2026-05-29 7:56 ` [Patch v8 10/23] perf/x86: Enable XMM Register Sampling for Non-PEBS Events Dapeng Mi
2026-05-29 9:02 ` sashiko-bot
2026-05-29 11:38 ` Peter Zijlstra
2026-05-29 7:56 ` [Patch v8 11/23] perf/x86: Enable XMM register sampling for REGS_USER case Dapeng Mi
2026-05-29 9:24 ` sashiko-bot
2026-05-29 11:42 ` Peter Zijlstra
2026-05-29 7:56 ` [Patch v8 12/23] perf: Add sampling support for SIMD registers Dapeng Mi
2026-05-29 8:36 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 13/23] perf/x86: Support XMM sampling using sample_simd_vec_reg_* fields Dapeng Mi
2026-05-29 8:49 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 14/23] perf/x86: Support YMM " Dapeng Mi
2026-05-29 8:47 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 15/23] perf/x86: Support ZMM " Dapeng Mi
2026-05-29 7:56 ` [Patch v8 16/23] perf/x86: Support OPMASK sampling using sample_simd_pred_reg_* fields Dapeng Mi
2026-05-29 9:21 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 17/23] perf: Enhance perf_reg_validate() with simd_enabled argument Dapeng Mi
2026-05-29 7:56 ` [Patch v8 18/23] perf/x86: Support eGPRs sampling using sample_regs_* fields Dapeng Mi
2026-05-29 9:31 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 19/23] perf/x86: Support SSP " Dapeng Mi
2026-05-29 10:03 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 20/23] perf/x86/intel: Support arch-PEBS based SIMD/eGPRs/SSP sampling Dapeng Mi
2026-05-29 9:45 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 21/23] perf/x86/intel: Enable PERF_PMU_CAP_SIMD_REGS capability Dapeng Mi
2026-05-29 10:43 ` sashiko-bot [this message]
2026-05-29 7:56 ` [Patch v8 22/23] perf/x86: Activate back-to-back NMI detection for arch-PEBS induced NMIs Dapeng Mi
2026-05-29 9:34 ` sashiko-bot
2026-05-29 7:56 ` [Patch v8 23/23] perf/x86/intel: Add sanity check for PEBS fragment size Dapeng Mi
2026-05-29 9:54 ` sashiko-bot
2026-05-29 8:32 ` [Patch v8 00/23] Support SIMD/eGPRs/SSP registers sampling for perf Mi, Dapeng
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260529104309.C69DB1F00893@smtp.kernel.org \
--to=sashiko-bot@kernel.org \
--cc=dapeng1.mi@linux.intel.com \
--cc=linux-perf-users@vger.kernel.org \
--cc=sashiko-reviews@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox