From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-oa1-f73.google.com (mail-oa1-f73.google.com [209.85.160.73]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 5D4DA34F48D for ; Thu, 12 Mar 2026 22:37:42 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.160.73 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1773355064; cv=none; b=sa75N1f+Ve6rjVcbHed13Qdq/q/FdBbeqHIGDOo0yDQGg0a9aasT8ZNzBR7+Yjhi+l0XE2i3PS/sKvPpUvZACk3yVNo68/JGq7MUspxuegz8l9jA2IDWuKUFfjMR6dy5vE0MLacYmtF4rb2ANRz+VQMjLaPY9uE8YoI9U+7akVc= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1773355064; c=relaxed/simple; bh=riRdHtbC6MOlF1LQ1+Xi70vycwoUYV3bcD9E2Zuu6NY=; h=Date:In-Reply-To:Mime-Version:Message-ID:Subject:From:To:Cc: Content-Type; b=P5ktbM1dzaALcucAHrENRtxXOTpY1i62UZy8ImchMlcrig4gkAUz0DcD/j63702ECEys1oZd8fcvd2UJ7UkmvllliLLj/9HturapHU4OHtLnAByuxW3bNawvrsSEsVxNREkU/+h61N/DtskrnKKtS+XYAUKJVA9/acj4lojch3E= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com; spf=pass smtp.mailfrom=flex--coltonlewis.bounces.google.com; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b=RH2cAsk2; arc=none smtp.client-ip=209.85.160.73 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=flex--coltonlewis.bounces.google.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b="RH2cAsk2" Received: by mail-oa1-f73.google.com with SMTP id 586e51a60fabf-40f192cf4b6so6335960fac.1 for ; Thu, 12 Mar 2026 15:37:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1773355061; x=1773959861; darn=vger.kernel.org; h=cc:to:from:subject:message-id:mime-version:in-reply-to:date:from:to :cc:subject:date:message-id:reply-to; bh=BOQPFUygErdEhR+VhLQ8d5SqWsIL2z8SybdB1Lqp5qg=; b=RH2cAsk2jYAgTt+a/aeaeNNT1KKZTEbP0e25KtkfYRphEHCle1uqF5sWn48LsXH61n 27FSYPhontTGRHR9uTfEWKJ76qAcZ+/gcoxtPPx9qPf3PPEY5qFd7tBEHJZTWTkcuDp6 GCK0BOu3rW24a3+XqRN2hBtOFG2EcQDR3+P+mLkV/qLelsx4Fxa/uwD+S6fLdzgf8j3g OIl0W2vJejkyQrJPZKnE6uToPLc21JZ9vp8HKALlRwq+jyHkwYjDqowg2dQGNgFQ5qup 7w2H3qtdLRYhzqoOIMfX4xiGBB38fgcscDzjaC34lENs4P817ZUNwrzd+wxZhHpO17wF 05Mw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20251104; t=1773355061; x=1773959861; h=cc:to:from:subject:message-id:mime-version:in-reply-to:date :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=BOQPFUygErdEhR+VhLQ8d5SqWsIL2z8SybdB1Lqp5qg=; b=cymMadYjFALaQ+u6/z/lr9lIerKoCnP7q2kYcQNMfS/JbEYptt/sxCT7HRoq3sBbYK 5yVpMao79AjDa/UBesApgN0F8kzRhoE1L7GEyVQuBHyrjw0+G6s41juqKWQXz8Qg9NQd 9iyfo7Bh8Ret1QwxDN8s0t9ST1XsXJAzHTg6nAO73IaWBcd1SGF1CPxDX46rPRSpTQaf gWkoGus5zjWQjowLk/cB7KWQBFPquwqTgkXbOCv4ZPtgENWQWVE9opLJcRw1kbPvYeCu UXRc6outKWjYLnfVNCjqVzwMyPR3NTWTfJyiS47vFDEJ7Hd7KcKCqi4vkPN9GqRkqhHq U2cA== X-Forwarded-Encrypted: i=1; AJvYcCWsZldSXcBE/vyjvot1nP4kjFK2rlX9i+oLd/r63B/EA1FTWQ6+Rb3YzS66Q7Ul0r9U9DevAdkNsPQ=@vger.kernel.org X-Gm-Message-State: AOJu0Yxvv12hSjgUdXtDBeH92o2aR+uAsqpGutLjfcyvCwW29aIiKo5N WsO7AsuhRyjBGHGeAfCLtrEuYEUf1h4NUU06QjeuUcpTn8YterYWMGLQXX2nsDraH2SVDz2GIKr 6Pmb7+3fUHofrxp+q0W8C+atY9g== X-Received: from iobif40.prod.google.com ([2002:a05:6602:1e28:b0:960:f5a:2cea]) (user=coltonlewis job=prod-delivery.src-stubby-dispatcher) by 2002:a05:6820:228b:b0:662:f8bd:fc9 with SMTP id 006d021491bc7-67bdaa2ddeamr684027eaf.41.1773355061163; Thu, 12 Mar 2026 15:37:41 -0700 (PDT) Date: Thu, 12 Mar 2026 22:37:40 +0000 In-Reply-To: (message from James Clark on Wed, 11 Mar 2026 17:45:55 +0000) Precedence: bulk X-Mailing-List: linux-doc@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: Mime-Version: 1.0 Message-ID: Subject: Re: [PATCH v6 04/19] perf: arm_pmuv3: Introduce method to partition the PMU From: Colton Lewis To: James Clark Cc: kvm@vger.kernel.org, alexandru.elisei@arm.com, pbonzini@redhat.com, corbet@lwn.net, linux@armlinux.org.uk, catalin.marinas@arm.com, will@kernel.org, maz@kernel.org, oliver.upton@linux.dev, mizhang@google.com, joey.gouly@arm.com, suzuki.poulose@arm.com, yuzenghui@huawei.com, mark.rutland@arm.com, shuah@kernel.org, gankulkarni@os.amperecomputing.com, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, kvmarm@lists.linux.dev, linux-perf-users@vger.kernel.org, linux-kselftest@vger.kernel.org Content-Type: text/plain; charset="UTF-8"; format=flowed; delsp=yes James Clark writes: > On 09/02/2026 10:13 pm, Colton Lewis wrote: >> For PMUv3, the register field MDCR_EL2.HPMN partitiones the PMU >> counters into two ranges where counters 0..HPMN-1 are accessible by >> EL1 and, if allowed, EL0 while counters HPMN..N are only accessible by >> EL2. >> Create module parameter reserved_host_counters to reserve a number of >> counters for the host. This number is set at boot because the perf >> subsystem assumes the number of counters will not change after the PMU >> is probed. >> Introduce the function armv8pmu_partition() to modify the PMU driver's >> cntr_mask of available counters to exclude the counters being reserved >> for the guest and record reserved_guest_counters as the maximum >> allowable value for HPMN. >> Due to the difficulty this feature would create for the driver running >> in nVHE mode, partitioning is only allowed in VHE mode. In order to >> support a partitioning on nVHE we'd need to explicitly disable guest >> counters on every exit and reset HPMN to place all counters in the >> first range. >> Signed-off-by: Colton Lewis >> --- >> arch/arm/include/asm/arm_pmuv3.h | 4 ++ >> arch/arm64/include/asm/arm_pmuv3.h | 5 ++ >> arch/arm64/kvm/Makefile | 2 +- >> arch/arm64/kvm/pmu-direct.c | 22 +++++++++ >> drivers/perf/arm_pmuv3.c | 78 +++++++++++++++++++++++++++++- >> include/kvm/arm_pmu.h | 8 +++ >> include/linux/perf/arm_pmu.h | 1 + >> 7 files changed, 117 insertions(+), 3 deletions(-) >> create mode 100644 arch/arm64/kvm/pmu-direct.c >> diff --git a/arch/arm/include/asm/arm_pmuv3.h >> b/arch/arm/include/asm/arm_pmuv3.h >> index 2ec0e5e83fc98..154503f054886 100644 >> --- a/arch/arm/include/asm/arm_pmuv3.h >> +++ b/arch/arm/include/asm/arm_pmuv3.h >> @@ -221,6 +221,10 @@ static inline bool kvm_pmu_counter_deferred(struct >> perf_event_attr *attr) >> return false; >> } >> +static inline bool has_host_pmu_partition_support(void) >> +{ >> + return false; >> +} >> static inline bool kvm_set_pmuserenr(u64 val) >> { >> return false; >> diff --git a/arch/arm64/include/asm/arm_pmuv3.h >> b/arch/arm64/include/asm/arm_pmuv3.h >> index cf2b2212e00a2..27c4d6d47da31 100644 >> --- a/arch/arm64/include/asm/arm_pmuv3.h >> +++ b/arch/arm64/include/asm/arm_pmuv3.h >> @@ -171,6 +171,11 @@ static inline bool pmuv3_implemented(int pmuver) >> pmuver == ID_AA64DFR0_EL1_PMUVer_NI); >> } >> +static inline bool is_pmuv3p1(int pmuver) >> +{ >> + return pmuver >= ID_AA64DFR0_EL1_PMUVer_V3P1; >> +} >> + >> static inline bool is_pmuv3p4(int pmuver) >> { >> return pmuver >= ID_AA64DFR0_EL1_PMUVer_V3P4; >> diff --git a/arch/arm64/kvm/Makefile b/arch/arm64/kvm/Makefile >> index 3ebc0570345cc..baf0f296c0e53 100644 >> --- a/arch/arm64/kvm/Makefile >> +++ b/arch/arm64/kvm/Makefile >> @@ -26,7 +26,7 @@ kvm-y += arm.o mmu.o mmio.o psci.o hypercalls.o >> pvtime.o \ >> vgic/vgic-its.o vgic/vgic-debug.o vgic/vgic-v3-nested.o \ >> vgic/vgic-v5.o >> -kvm-$(CONFIG_HW_PERF_EVENTS) += pmu-emul.o pmu.o >> +kvm-$(CONFIG_HW_PERF_EVENTS) += pmu-emul.o pmu-direct.o pmu.o >> kvm-$(CONFIG_ARM64_PTR_AUTH) += pauth.o >> kvm-$(CONFIG_PTDUMP_STAGE2_DEBUGFS) += ptdump.o >> diff --git a/arch/arm64/kvm/pmu-direct.c b/arch/arm64/kvm/pmu-direct.c >> new file mode 100644 >> index 0000000000000..74e40e4915416 >> --- /dev/null >> +++ b/arch/arm64/kvm/pmu-direct.c >> @@ -0,0 +1,22 @@ >> +// SPDX-License-Identifier: GPL-2.0-only >> +/* >> + * Copyright (C) 2025 Google LLC >> + * Author: Colton Lewis >> + */ >> + >> +#include >> + >> +#include >> + >> +/** >> + * has_host_pmu_partition_support() - Determine if partitioning is >> possible >> + * >> + * Partitioning is only supported in VHE mode with PMUv3 >> + * >> + * Return: True if partitioning is possible, false otherwise >> + */ >> +bool has_host_pmu_partition_support(void) >> +{ >> + return has_vhe() && >> + system_supports_pmuv3(); >> +} >> diff --git a/drivers/perf/arm_pmuv3.c b/drivers/perf/arm_pmuv3.c >> index 8d3b832cd633a..798c93678e97c 100644 >> --- a/drivers/perf/arm_pmuv3.c >> +++ b/drivers/perf/arm_pmuv3.c >> @@ -42,6 +42,13 @@ >> #define ARMV8_THUNDER_PERFCTR_L1I_CACHE_PREF_ACCESS 0xEC >> #define ARMV8_THUNDER_PERFCTR_L1I_CACHE_PREF_MISS 0xED >> +static int reserved_host_counters __read_mostly = -1; >> +int armv8pmu_max_guest_counters = -1; >> + >> +module_param(reserved_host_counters, int, 0); >> +MODULE_PARM_DESC(reserved_host_counters, >> + "PMU Partition: -1 = No partition; +N = Reserve N counters for the >> host"); >> + >> /* >> * ARMv8 Architectural defined events, not all of these may >> * be supported on any given implementation. Unsupported events will >> @@ -532,6 +539,11 @@ static void armv8pmu_pmcr_write(u64 val) >> write_pmcr(val); >> } >> +static u64 armv8pmu_pmcr_n_read(void) >> +{ >> + return FIELD_GET(ARMV8_PMU_PMCR_N, armv8pmu_pmcr_read()); >> +} >> + >> static int armv8pmu_has_overflowed(u64 pmovsr) >> { >> return !!(pmovsr & ARMV8_PMU_OVERFLOWED_MASK); >> @@ -1309,6 +1321,61 @@ struct armv8pmu_probe_info { >> bool present; >> }; >> +/** >> + * armv8pmu_reservation_is_valid() - Determine if reservation is allowed >> + * @host_counters: Number of host counters to reserve >> + * >> + * Determine if the number of host counters in the argument is an >> + * allowed reservation, 0 to NR_COUNTERS inclusive. >> + * >> + * Return: True if reservation allowed, false otherwise >> + */ >> +static bool armv8pmu_reservation_is_valid(int host_counters) >> +{ >> + return host_counters >= 0 && >> + host_counters <= armv8pmu_pmcr_n_read(); >> +} >> + >> +/** >> + * armv8pmu_partition() - Partition the PMU >> + * @pmu: Pointer to pmu being partitioned >> + * @host_counters: Number of host counters to reserve >> + * >> + * Partition the given PMU by taking a number of host counters to >> + * reserve and, if it is a valid reservation, recording the >> + * corresponding HPMN value in the max_guest_counters field of the PMU >> and >> + * clearing the guest-reserved counters from the counter mask. >> + * >> + * Return: 0 on success, -ERROR otherwise >> + */ >> +static int armv8pmu_partition(struct arm_pmu *pmu, int host_counters) >> +{ >> + u8 nr_counters; >> + u8 hpmn; >> + >> + if (!armv8pmu_reservation_is_valid(host_counters)) { >> + pr_err("PMU partition reservation of %d host counters is not valid", >> host_counters); >> + return -EINVAL; >> + } >> + >> + nr_counters = armv8pmu_pmcr_n_read(); >> + hpmn = nr_counters - host_counters; >> + >> + pmu->max_guest_counters = hpmn; >> + armv8pmu_max_guest_counters = hpmn; >> + >> + bitmap_clear(pmu->cntr_mask, 0, hpmn); >> + bitmap_set(pmu->cntr_mask, hpmn, host_counters); >> + clear_bit(ARMV8_PMU_CYCLE_IDX, pmu->cntr_mask); >> + >> + if (pmuv3_has_icntr()) >> + clear_bit(ARMV8_PMU_INSTR_IDX, pmu->cntr_mask); > We take the fixed instruction counter away from the host here but then > guest never gets it because AA64DFR1 is RAZ. Probably doesn't need to be > a blocker to expose the instruction counter, but worth noting that using > this feature results in losing a counter completely. > There's a comment above kvm_pmu_guest_counter_mask() that suggests the > instruction counter is available for guests, which is why I was looking > here. I think "Compute the bitmask that selects the guest-reserved > counters ... These are the counters in 0..HPMN and the cycle and > instruction counters." shouldn't include "instruction counters". Good point. Early iterations intended to expose the instruction counter to guests but that was dropped. I will drop it here too.