From: "Mi, Dapeng" <dapeng1.mi@linux.intel.com>
To: Sean Christopherson <seanjc@google.com>
Cc: Jim Mattson <jmattson@google.com>,
Peter Zijlstra <peterz@infradead.org>,
Ingo Molnar <mingo@redhat.com>,
Arnaldo Carvalho de Melo <acme@kernel.org>,
Namhyung Kim <namhyung@kernel.org>,
Thomas Gleixner <tglx@kernel.org>, Borislav Petkov <bp@alien8.de>,
Dave Hansen <dave.hansen@linux.intel.com>,
x86@kernel.org, Paolo Bonzini <pbonzini@redhat.com>,
linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org,
kvm@vger.kernel.org, Mingwei Zhang <mizhang@google.com>,
Stephane Eranian <eranian@google.com>
Subject: Re: [PATCH v2 1/4] perf/x86/intel: Don't write PEBS_ENABLED on host<=>guest xfers if CPU has isolation
Date: Tue, 28 Apr 2026 10:32:40 +0800 [thread overview]
Message-ID: <cc700a57-4620-4059-b4bc-4c132048605b@linux.intel.com> (raw)
In-Reply-To: <ae-l9qDd4Qnvq752@google.com>
On 4/28/2026 2:07 AM, Sean Christopherson wrote:
> On Mon, Apr 27, 2026, Dapeng Mi wrote:
>> On 4/24/2026 1:59 AM, Jim Mattson wrote:
>>>> arch/x86/events/intel/core.c | 42 ++++++++++++++++++++----------------
>>>> 1 file changed, 23 insertions(+), 19 deletions(-)
>>>>
>>>> diff --git a/arch/x86/events/intel/core.c b/arch/x86/events/intel/core.c
>>>> index 793335c3ce78..002d809f82ef 100644
>>>> --- a/arch/x86/events/intel/core.c
>>>> +++ b/arch/x86/events/intel/core.c
>>>> @@ -4999,12 +4999,15 @@ static struct perf_guest_switch_msr *intel_guest_get_msrs(int *nr, void *data)
>>>> struct kvm_pmu *kvm_pmu = (struct kvm_pmu *)data;
>>>> u64 intel_ctrl = hybrid(cpuc->pmu, intel_ctrl);
>>>> u64 pebs_mask = cpuc->pebs_enabled & x86_pmu.pebs_capable;
>>>> - int global_ctrl, pebs_enable;
>>>> + u64 guest_pebs_mask = pebs_mask & ~cpuc->intel_ctrl_host_mask;
>>>> + int global_ctrl;
>>> Is it worth noting somewhere that pebs_ept is not supported on any
>>> CPUs with PMU version < 5, where a single event can set two
>>> PEBS_ENABLE bits (cf. intel_pmu_pebs_enable)?
> Is that a hardware limitation, or a "perf hasn't added pebs_ept for PMU v5 yet"
> thing? I assume it's the latter?
>
>>>> + /*
>>>> + * Disable counters where the guest PMC is different than the host PMC
>>>> + * being used on behalf of the guest, as the PEBS record includes
>>>> + * PERF_GLOBAL_STATUS, i.e. the guest will see overflow status for the
>>>> + * wrong counter(s). Similarly, disallow PEBS in the guest if the host
>>>> + * is using PEBS, to avoid bleeding host state into PEBS records.
>>>> + */
>>>> + guest_pebs_mask &= kvm_pmu->pebs_enable & ~kvm_pmu->host_cross_mapped_mask;
>>>> + if (pebs_mask & ~cpuc->intel_ctrl_guest_mask)
>>>> + guest_pebs_mask = 0;
>>> I don't understand this clause. IIUC, it says that if we don't have
>>> any exclude-host PEBS events, then clear PEBS_ENABLE for the guest.
>> I suppose it says all guest PEBS events need to be disabled if there is any
>> event using PEBS on host side, and it's clearing GLOBAL_CTRL instead of
>> PEBS_ENABLE to disable guest PEBS events.
> Yeah, but why disable _everything_?
>
>>> Yes, any guest-programmed PEBS event should be exclude-host, but if
>>> there is an inconsistency, shouldn't we apply a mask? What if there is
>>> only one exclude-host PEBS event, but there are two bits set in
>>> guest_pebs_mask?
> I'm confused about this as well. The comment above about not bleeding host state
> into the PEBS records is my best guess (and it's probably not a very good guess)
> as to why the code does what it does. The changelog from commit 854250329c02
> ("KVM: x86/pmu: Disable guest PEBS temporarily in two rare situations") just says:
>
> The guest PEBS will be disabled when some users try to perf KVM and
> its user-space through the same PEBS facility
>
> That doesn't entirely make sense to me though, because I would think disabling
> the host counters iva GLOBAL_CTRL would suffice. I.e. there's no need to disallow
> PEBS in the guest just because the host is also using PEBS. But I can't think of
> any other reason to fully disable PEBS.
>
> FWIW, I was going off the previous code which effectively did:
>
> if (cpuc->pebs_enabled & ~cpuc->intel_ctrl_guest_mask) {
> /* Disable guest PEBS if host PEBS is enabled. */
> arr[pebs_enable].guest = 0;
> }
>
> One idea would be to add a FIXME, and then address the FIXME in a follow-up patch
> (in the same series)? And then see what breaks? :-)
I'm confused with this. does this go back to manipulate PEBS_ENABLE again?
I suppose what Jim mentioned is that the PEBS_LDLAT or PEBS_ST events could
set multiple bits in PEBS_ENABLE MSR before Perfmon v5 as below code shows.
```
void intel_pmu_pebs_enable(struct perf_event *event)
{
......
if ((event->hw.flags & PERF_X86_EVENT_PEBS_LDLAT) && (x86_pmu.version < 5))
cpuc->pebs_enabled |= 1ULL << (hwc->idx + 32);
else if (event->hw.flags & PERF_X86_EVENT_PEBS_ST)
cpuc->pebs_enabled |= 1ULL << 63;
......
```
This would lead to multiple bits set in pebs_mask and may cause incorrect
results.
If so, maybe we can bit-and the pebs_mask with intel_ctrl to filter out
these extra bits?
next prev parent reply other threads:[~2026-04-28 2:32 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-23 15:03 [PATCH v2 0/4] perf/x86: Don't write PEBS_ENABLED on KVM transitions Sean Christopherson
2026-04-23 15:03 ` [PATCH v2 1/4] perf/x86/intel: Don't write PEBS_ENABLED on host<=>guest xfers if CPU has isolation Sean Christopherson
2026-04-23 16:22 ` Peter Zijlstra
2026-04-23 17:59 ` Jim Mattson
2026-04-27 2:10 ` Mi, Dapeng
2026-04-27 18:07 ` Sean Christopherson
2026-04-28 2:32 ` Mi, Dapeng [this message]
2026-04-23 15:03 ` [PATCH v2 2/4] perf/x86/intel: Don't context switch DS_AREA (and PEBS config) if PEBS is unused Sean Christopherson
2026-04-27 2:24 ` Mi, Dapeng
2026-04-27 17:34 ` Sean Christopherson
2026-04-23 15:03 ` [PATCH v2 3/4] perf/x86/intel: Make @data a mandatory param for intel_guest_get_msrs() Sean Christopherson
2026-04-27 2:28 ` Mi, Dapeng
2026-04-23 15:03 ` [PATCH v2 4/4] perf/x86: KVM: Have perf define a dedicated struct for getting guest PEBS data Sean Christopherson
2026-04-23 18:14 ` Jim Mattson
2026-04-23 23:31 ` sashiko-bot
2026-04-27 17:37 ` Sean Christopherson
2026-04-23 15:33 ` [PATCH v2 0/4] perf/x86: Don't write PEBS_ENABLED on KVM transitions Jim Mattson
2026-04-23 16:16 ` Peter Zijlstra
2026-04-24 12:17 ` Mi, Dapeng
2026-04-24 12:23 ` Peter Zijlstra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=cc700a57-4620-4059-b4bc-4c132048605b@linux.intel.com \
--to=dapeng1.mi@linux.intel.com \
--cc=acme@kernel.org \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=eranian@google.com \
--cc=jmattson@google.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=mizhang@google.com \
--cc=namhyung@kernel.org \
--cc=pbonzini@redhat.com \
--cc=peterz@infradead.org \
--cc=seanjc@google.com \
--cc=tglx@kernel.org \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox