From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 8D198FD9E2A for ; Fri, 27 Feb 2026 00:52:05 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender:List-Subscribe:List-Help :List-Post:List-Archive:List-Unsubscribe:List-Id:Content-Transfer-Encoding: Content-Type:In-Reply-To:From:References:To:Subject:MIME-Version:Date: Message-ID:Reply-To:Cc:Content-ID:Content-Description:Resent-Date:Resent-From :Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Owner; bh=8EPOhPBT82DUAPyk65MIdni4aC6gP1uhddv+5nM5ruo=; b=kgHYVT7E2w2kK2aHRS0uBI8BYU 79+720Y4nNiZoOjXm3qdiaSIY7u2BRrOA9qTjXBrpTF8uJhNBQFp2U4CcDSWOlIrXcxD/+33iiHzh NbXaTzCk5+0dmgQRl4KtlKjgQ+BSh8OmF/5uW1n7IXwbKsWesBVGNiy6Gw9/RIgsox7w5BBgs+mfn 5i761uXJRGVo1Mu7DB9HpNXM6Jnb1h222HOvX/SzZbjzNJarjOrb1ugHqprNXvjTnQr6g4t2Tt8HZ EkEcNNvvlRWCkeL6OHb8aGBZppPRIRkLJJ92u0kqBop2NQyry/p1syxocaBJP93EBj3XA28gNhtQ/ sUN7WMtw==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.98.2 #2 (Red Hat Linux)) id 1vvm5F-00000007Rdh-0tLD; Fri, 27 Feb 2026 00:51:57 +0000 Received: from mgamail.intel.com ([198.175.65.14]) by bombadil.infradead.org with esmtps (Exim 4.98.2 #2 (Red Hat Linux)) id 1vvm59-00000007RcZ-1LhI for linux-arm-kernel@lists.infradead.org; Fri, 27 Feb 2026 00:51:55 +0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1772153511; x=1803689511; h=message-id:date:mime-version:subject:to:references:from: in-reply-to:content-transfer-encoding; bh=TL6MSvdW1ZjSt5kNmBZzsCS5fHHor4X8yrtubaFUSdQ=; b=W2qG1pDJfA5URgXAAiSNPsiitYBotAWZ6BpzE+6ntC4AHVa7f1Dg3X+l ogGMCDTYY82zzs0DjUaDyNqcmXvzUdr+6fML7zIG4ekfMYbKxD+2xQUW+ T7kP2fU4RJZQFaBPNSv2+HSEfzTBX0HteZRUZ6aZwLS+fMiCmyGpR6fQu g7QMsfGiBWmHNa5BnIyrgANs8UxOi4elZ3eOLjdRVwXRnHSdnk6v9b4ih RMaVzfp4bdlQRHygTJPtqWCgg4vdsEsatkqAkH1sQb9Zjx8QLhShUvTRd wwwTh30Yx/t84YIc9MIpl6CnOzlCz064op+mCXEzPNluCiYEL5vSGNyy3 w==; X-CSE-ConnectionGUID: 9IHn+iZCQrSfYezJcWDkjg== X-CSE-MsgGUID: SURyfBQET3mvmhKREk5s2A== X-IronPort-AV: E=McAfee;i="6800,10657,11713"; a="77067112" X-IronPort-AV: E=Sophos;i="6.21,313,1763452800"; d="scan'208";a="77067112" Received: from fmviesa005.fm.intel.com ([10.60.135.145]) by orvoesa106.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Feb 2026 16:51:47 -0800 X-CSE-ConnectionGUID: whHZ2Z2STH6tdsp/oySNOA== X-CSE-MsgGUID: +noUpSXGRrSeXcmuEEKzfQ== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.21,313,1763452800"; d="scan'208";a="221361516" Received: from dapengmi-mobl1.ccr.corp.intel.com (HELO [10.124.241.147]) ([10.124.241.147]) by fmviesa005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Feb 2026 16:51:42 -0800 Message-ID: Date: Fri, 27 Feb 2026 08:51:40 +0800 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v2 01/10] perf vendor events intel: Update alderlake events from 1.35 to 1.37 To: Ian Rogers , Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , Namhyung Kim , Alexander Shishkin , Jiri Olsa , Adrian Hunter , James Clark , =?UTF-8?Q?Andreas_F=C3=A4rber?= , Manivannan Sadhasivam , linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org References: <20260226175936.593159-1-irogers@google.com> Content-Language: en-US From: "Mi, Dapeng" In-Reply-To: <20260226175936.593159-1-irogers@google.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20260226_165152_542691_7614DC03 X-CRM114-Status: GOOD ( 14.48 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org The whole patchset looks good to me. Thanks. Reviewed-by: Dapeng Mi On 2/27/2026 1:59 AM, Ian Rogers wrote: > The updated events were published in: > https://github.com/intel/perfmon/commit/632936400cfc5978c7b4519c865c137de523bfdd > https://github.com/intel/perfmon/commit/a96d6bf4b50d6ce31e2ffd0be8d13022d07ae319 > > Signed-off-by: Ian Rogers > --- > .../pmu-events/arch/x86/alderlake/cache.json | 27 +++----- > .../arch/x86/alderlake/frontend.json | 18 +++++ > .../arch/x86/alderlake/pipeline.json | 66 +++++++++++++++++-- > .../pmu-events/arch/x86/alderlaken/cache.json | 27 +++----- > .../arch/x86/alderlaken/pipeline.json | 60 +++++++++++++++-- > tools/perf/pmu-events/arch/x86/mapfile.csv | 4 +- > 6 files changed, 152 insertions(+), 50 deletions(-) > > diff --git a/tools/perf/pmu-events/arch/x86/alderlake/cache.json b/tools/perf/pmu-events/arch/x86/alderlake/cache.json > index be15a7f83717..5d0d824f3e7e 100644 > --- a/tools/perf/pmu-events/arch/x86/alderlake/cache.json > +++ b/tools/perf/pmu-events/arch/x86/alderlake/cache.json > @@ -876,105 +876,97 @@ > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 128 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 128. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_128", > "MSRIndex": "0x3F6", > "MSRValue": "0x80", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 128 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 16 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 16. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_16", > "MSRIndex": "0x3F6", > "MSRValue": "0x10", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 16 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 256 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 256. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_256", > "MSRIndex": "0x3F6", > "MSRValue": "0x100", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 256 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 32 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 32. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_32", > "MSRIndex": "0x3F6", > "MSRValue": "0x20", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 32 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 4 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 4. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_4", > "MSRIndex": "0x3F6", > "MSRValue": "0x4", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 4 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 512 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 512. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_512", > "MSRIndex": "0x3F6", > "MSRValue": "0x200", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 512 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 64 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 64. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_64", > "MSRIndex": "0x3F6", > "MSRValue": "0x40", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 64 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 8 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 8. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_8", > "MSRIndex": "0x3F6", > "MSRValue": "0x8", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 8 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5", > "Unit": "cpu_atom" > @@ -1030,12 +1022,11 @@ > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of stores uops retired. Counts with or without PEBS enabled.", > + "BriefDescription": "Counts the number of stores uops retired.", > "Counter": "0,1,2,3,4,5", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.STORE_LATENCY", > - "PublicDescription": "Counts the number of stores uops retired. Counts with or without PEBS enabled. If PEBS is enabled and a PEBS record is generated, will populate PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x6", > "Unit": "cpu_atom" > diff --git a/tools/perf/pmu-events/arch/x86/alderlake/frontend.json b/tools/perf/pmu-events/arch/x86/alderlake/frontend.json > index ff3b30c2619a..11fc853f2d0b 100644 > --- a/tools/perf/pmu-events/arch/x86/alderlake/frontend.json > +++ b/tools/perf/pmu-events/arch/x86/alderlake/frontend.json > @@ -327,6 +327,24 @@ > "UMask": "0x4", > "Unit": "cpu_core" > }, > + { > + "BriefDescription": "ICACHE_TAG.STALLS_INUSE", > + "Counter": "0,1,2,3", > + "EventCode": "0x83", > + "EventName": "ICACHE_TAG.STALLS_INUSE", > + "SampleAfterValue": "200003", > + "UMask": "0x10", > + "Unit": "cpu_core" > + }, > + { > + "BriefDescription": "ICACHE_TAG.STALLS_ISB", > + "Counter": "0,1,2,3", > + "EventCode": "0x83", > + "EventName": "ICACHE_TAG.STALLS_ISB", > + "SampleAfterValue": "200003", > + "UMask": "0x8", > + "Unit": "cpu_core" > + }, > { > "BriefDescription": "Cycles Decode Stream Buffer (DSB) is delivering any Uop", > "Counter": "0,1,2,3", > diff --git a/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json b/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json > index 57a8c78cdc49..80cad3c49d20 100644 > --- a/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json > +++ b/tools/perf/pmu-events/arch/x86/alderlake/pipeline.json > @@ -244,6 +244,15 @@ > "UMask": "0xfb", > "Unit": "cpu_atom" > }, > + { > + "BriefDescription": "Counts the number of near indirect JMP branch instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xc4", > + "EventName": "BR_INST_RETIRED.INDIRECT_JMP", > + "SampleAfterValue": "200003", > + "UMask": "0xef", > + "Unit": "cpu_atom" > + }, > { > "BriefDescription": "This event is deprecated. Refer to new event BR_INST_RETIRED.INDIRECT_CALL", > "Counter": "0,1,2,3,4,5", > @@ -464,6 +473,15 @@ > "UMask": "0x2", > "Unit": "cpu_core" > }, > + { > + "BriefDescription": "Counts the number of mispredicted near indirect JMP branch instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xc5", > + "EventName": "BR_MISP_RETIRED.INDIRECT_JMP", > + "SampleAfterValue": "200003", > + "UMask": "0xef", > + "Unit": "cpu_atom" > + }, > { > "BriefDescription": "This event is deprecated. Refer to new event BR_MISP_RETIRED.INDIRECT_CALL", > "Counter": "0,1,2,3,4,5", > @@ -573,7 +591,7 @@ > "Unit": "cpu_core" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.THREAD]", > "Counter": "Fixed counter 1", > "EventName": "CPU_CLK_UNHALTED.CORE", > "PublicDescription": "Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. This event uses fixed counter 1.", > @@ -582,7 +600,7 @@ > "Unit": "cpu_atom" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles.", > + "BriefDescription": "Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.THREAD_P]", > "Counter": "0,1,2,3,4,5", > "EventCode": "0x3c", > "EventName": "CPU_CLK_UNHALTED.CORE_P", > @@ -651,7 +669,7 @@ > "Unit": "cpu_core" > }, > { > - "BriefDescription": "Counts the number of unhalted reference clock cycles at TSC frequency. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted reference clock cycles at TSC frequency.", > "Counter": "Fixed counter 2", > "EventName": "CPU_CLK_UNHALTED.REF_TSC", > "PublicDescription": "Counts the number of reference cycles that the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. This event is not affected by core frequency changes and increments at a fixed frequency that is also used for the Time Stamp Counter (TSC). This event uses fixed counter 2.", > @@ -689,7 +707,7 @@ > "Unit": "cpu_core" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.CORE]", > "Counter": "Fixed counter 1", > "EventName": "CPU_CLK_UNHALTED.THREAD", > "PublicDescription": "Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. This event uses fixed counter 1.", > @@ -707,7 +725,7 @@ > "Unit": "cpu_core" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles.", > + "BriefDescription": "Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.CORE_P]", > "Counter": "0,1,2,3,4,5", > "EventCode": "0x3c", > "EventName": "CPU_CLK_UNHALTED.THREAD_P", > @@ -875,7 +893,7 @@ > "Unit": "cpu_core" > }, > { > - "BriefDescription": "Counts the total number of instructions retired. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the total number of instructions retired.", > "Counter": "Fixed counter 0", > "EventName": "INST_RETIRED.ANY", > "PublicDescription": "Counts the total number of instructions that retired. For instructions that consist of multiple uops, this event counts the retirement of the last uop of the instruction. This event continues counting during hardware interrupts, traps, and inside interrupt handlers. This event uses fixed counter 0. Available PDIST counters: 32", > @@ -1273,6 +1291,42 @@ > "UMask": "0x20", > "Unit": "cpu_core" > }, > + { > + "BriefDescription": "Counts the number of CLFLUSH, CLWB, and CLDEMOTE instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe0", > + "EventName": "MISC_RETIRED1.CL_INST", > + "SampleAfterValue": "1000003", > + "UMask": "0xff", > + "Unit": "cpu_atom" > + }, > + { > + "BriefDescription": "Counts the number of LFENCE instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe0", > + "EventName": "MISC_RETIRED1.LFENCE", > + "SampleAfterValue": "1000003", > + "UMask": "0x2", > + "Unit": "cpu_atom" > + }, > + { > + "BriefDescription": "Counts the number of accesses to KeyLocker cache.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe1", > + "EventName": "MISC_RETIRED2.KEYLOCKER_ACCESS", > + "SampleAfterValue": "1000003", > + "UMask": "0x10", > + "Unit": "cpu_atom" > + }, > + { > + "BriefDescription": "Counts the number of misses to KeyLocker cache.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe1", > + "EventName": "MISC_RETIRED2.KEYLOCKER_MISS", > + "SampleAfterValue": "1000003", > + "UMask": "0x11", > + "Unit": "cpu_atom" > + }, > { > "BriefDescription": "Cycles stalled due to no store buffers available. (not including draining form sync).", > "Counter": "0,1,2,3,4,5,6,7", > diff --git a/tools/perf/pmu-events/arch/x86/alderlaken/cache.json b/tools/perf/pmu-events/arch/x86/alderlaken/cache.json > index 76a841675337..1f97a4dc6fb1 100644 > --- a/tools/perf/pmu-events/arch/x86/alderlaken/cache.json > +++ b/tools/perf/pmu-events/arch/x86/alderlaken/cache.json > @@ -246,98 +246,90 @@ > "UMask": "0x82" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 128 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 128. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_128", > "MSRIndex": "0x3F6", > "MSRValue": "0x80", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 128 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 16 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 16. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_16", > "MSRIndex": "0x3F6", > "MSRValue": "0x10", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 16 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 256 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 256. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_256", > "MSRIndex": "0x3F6", > "MSRValue": "0x100", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 256 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 32 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 32. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_32", > "MSRIndex": "0x3F6", > "MSRValue": "0x20", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 32 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 4 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 4. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_4", > "MSRIndex": "0x3F6", > "MSRValue": "0x4", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 4 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 512 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 512. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_512", > "MSRIndex": "0x3F6", > "MSRValue": "0x200", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 512 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 64 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 64. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_64", > "MSRIndex": "0x3F6", > "MSRValue": "0x40", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 64 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > { > - "BriefDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 8 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled.", > + "BriefDescription": "Counts the number of tagged load uops retired that exceed the latency threshold of 8. Only counts with PEBS enabled.", > "Counter": "0,1", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.LOAD_LATENCY_GT_8", > "MSRIndex": "0x3F6", > "MSRValue": "0x8", > - "PublicDescription": "Counts the number of tagged loads with an instruction latency that exceeds or equals the threshold of 8 cycles as defined in MEC_CR_PEBS_LD_LAT_THRESHOLD (3F6H). Only counts with PEBS enabled. If a PEBS record is generated, will populate the PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x5" > }, > @@ -387,12 +379,11 @@ > "UMask": "0x12" > }, > { > - "BriefDescription": "Counts the number of stores uops retired. Counts with or without PEBS enabled.", > + "BriefDescription": "Counts the number of stores uops retired.", > "Counter": "0,1,2,3,4,5", > "Data_LA": "1", > "EventCode": "0xd0", > "EventName": "MEM_UOPS_RETIRED.STORE_LATENCY", > - "PublicDescription": "Counts the number of stores uops retired. Counts with or without PEBS enabled. If PEBS is enabled and a PEBS record is generated, will populate PEBS Latency and PEBS Data Source fields accordingly.", > "SampleAfterValue": "1000003", > "UMask": "0x6" > }, > diff --git a/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json b/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json > index d650cbd48c1f..a13851071624 100644 > --- a/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json > +++ b/tools/perf/pmu-events/arch/x86/alderlaken/pipeline.json > @@ -108,6 +108,14 @@ > "SampleAfterValue": "200003", > "UMask": "0xfb" > }, > + { > + "BriefDescription": "Counts the number of near indirect JMP branch instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xc4", > + "EventName": "BR_INST_RETIRED.INDIRECT_JMP", > + "SampleAfterValue": "200003", > + "UMask": "0xef" > + }, > { > "BriefDescription": "This event is deprecated. Refer to new event BR_INST_RETIRED.INDIRECT_CALL", > "Counter": "0,1,2,3,4,5", > @@ -225,6 +233,14 @@ > "SampleAfterValue": "200003", > "UMask": "0xfb" > }, > + { > + "BriefDescription": "Counts the number of mispredicted near indirect JMP branch instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xc5", > + "EventName": "BR_MISP_RETIRED.INDIRECT_JMP", > + "SampleAfterValue": "200003", > + "UMask": "0xef" > + }, > { > "BriefDescription": "This event is deprecated. Refer to new event BR_MISP_RETIRED.INDIRECT_CALL", > "Counter": "0,1,2,3,4,5", > @@ -278,7 +294,7 @@ > "UMask": "0xfe" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.THREAD]", > "Counter": "Fixed counter 1", > "EventName": "CPU_CLK_UNHALTED.CORE", > "PublicDescription": "Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. This event uses fixed counter 1.", > @@ -286,7 +302,7 @@ > "UMask": "0x2" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles.", > + "BriefDescription": "Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.THREAD_P]", > "Counter": "0,1,2,3,4,5", > "EventCode": "0x3c", > "EventName": "CPU_CLK_UNHALTED.CORE_P", > @@ -303,7 +319,7 @@ > "UMask": "0x1" > }, > { > - "BriefDescription": "Counts the number of unhalted reference clock cycles at TSC frequency. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted reference clock cycles at TSC frequency.", > "Counter": "Fixed counter 2", > "EventName": "CPU_CLK_UNHALTED.REF_TSC", > "PublicDescription": "Counts the number of reference cycles that the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. This event is not affected by core frequency changes and increments at a fixed frequency that is also used for the Time Stamp Counter (TSC). This event uses fixed counter 2.", > @@ -320,7 +336,7 @@ > "UMask": "0x1" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.CORE]", > "Counter": "Fixed counter 1", > "EventName": "CPU_CLK_UNHALTED.THREAD", > "PublicDescription": "Counts the number of core cycles while the core is not in a halt state. The core enters the halt state when it is running the HLT instruction. The core frequency may change from time to time. For this reason this event may have a changing ratio with regards to time. This event uses fixed counter 1.", > @@ -328,7 +344,7 @@ > "UMask": "0x2" > }, > { > - "BriefDescription": "Counts the number of unhalted core clock cycles.", > + "BriefDescription": "Counts the number of unhalted core clock cycles. [This event is alias to CPU_CLK_UNHALTED.CORE_P]", > "Counter": "0,1,2,3,4,5", > "EventCode": "0x3c", > "EventName": "CPU_CLK_UNHALTED.THREAD_P", > @@ -336,7 +352,7 @@ > "SampleAfterValue": "2000003" > }, > { > - "BriefDescription": "Counts the total number of instructions retired. (Fixed event)", > + "BriefDescription": "Fixed Counter: Counts the total number of instructions retired.", > "Counter": "Fixed counter 0", > "EventName": "INST_RETIRED.ANY", > "PublicDescription": "Counts the total number of instructions that retired. For instructions that consist of multiple uops, this event counts the retirement of the last uop of the instruction. This event continues counting during hardware interrupts, traps, and inside interrupt handlers. This event uses fixed counter 0. Available PDIST counters: 32", > @@ -426,6 +442,38 @@ > "SampleAfterValue": "1000003", > "UMask": "0x1" > }, > + { > + "BriefDescription": "Counts the number of CLFLUSH, CLWB, and CLDEMOTE instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe0", > + "EventName": "MISC_RETIRED1.CL_INST", > + "SampleAfterValue": "1000003", > + "UMask": "0xff" > + }, > + { > + "BriefDescription": "Counts the number of LFENCE instructions retired.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe0", > + "EventName": "MISC_RETIRED1.LFENCE", > + "SampleAfterValue": "1000003", > + "UMask": "0x2" > + }, > + { > + "BriefDescription": "Counts the number of accesses to KeyLocker cache.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe1", > + "EventName": "MISC_RETIRED2.KEYLOCKER_ACCESS", > + "SampleAfterValue": "1000003", > + "UMask": "0x10" > + }, > + { > + "BriefDescription": "Counts the number of misses to KeyLocker cache.", > + "Counter": "0,1,2,3,4,5", > + "EventCode": "0xe1", > + "EventName": "MISC_RETIRED2.KEYLOCKER_MISS", > + "SampleAfterValue": "1000003", > + "UMask": "0x11" > + }, > { > "BriefDescription": "Counts the number of issue slots in a UMWAIT or TPAUSE instruction where no uop issues due to the instruction putting the CPU into the C0.1 activity state. For Tremont, UMWAIT and TPAUSE will only put the CPU into C0.1 activity state (not C0.2 activity state)", > "Counter": "0,1,2,3,4,5", > diff --git a/tools/perf/pmu-events/arch/x86/mapfile.csv b/tools/perf/pmu-events/arch/x86/mapfile.csv > index 149bbe7abaf5..9370722dc564 100644 > --- a/tools/perf/pmu-events/arch/x86/mapfile.csv > +++ b/tools/perf/pmu-events/arch/x86/mapfile.csv > @@ -1,6 +1,6 @@ > Family-model,Version,Filename,EventType > -GenuineIntel-6-(97|9A|B7|BA|BF),v1.35,alderlake,core > -GenuineIntel-6-BE,v1.35,alderlaken,core > +GenuineIntel-6-(97|9A|B7|BA|BF),v1.37,alderlake,core > +GenuineIntel-6-BE,v1.37,alderlaken,core > GenuineIntel-6-C[56],v1.14,arrowlake,core > GenuineIntel-6-(1C|26|27|35|36),v5,bonnell,core > GenuineIntel-6-(3D|47),v30,broadwell,core