Date: Tue, 26 Aug 2025 19:58:58 -0700
Message-ID: <85h5xtjwr1.wl-ashutosh.dixit@intel.com>
From: "Dixit, Ashutosh"
To: Sushma Venkatesh Reddy
Subject: Re: [PATCH 2/2] lib/xe/oa: Update PTL oa-metrics with new compute, memory, and stall counters
In-Reply-To: <20250826211545.17076-2-sushma.venkatesh.reddy@intel.com>
References: <20250826211545.17076-1-sushma.venkatesh.reddy@intel.com> <20250826211545.17076-2-sushma.venkatesh.reddy@intel.com>
List-Id: Development mailing list for IGT GPU Tools

On Tue, 26 Aug 2025 14:15:45 -0700, Sushma Venkatesh Reddy wrote:
>

Hi Sushma,

> - Add new metric sets and counters for compute, memory, and device cache
> - Update descriptions and equations

You have added Media formats but I am seeing the media OA formats are still
incorrect. But that is a separate fix and since you are Mesa, I will not ask
you to fix those :)

But you have other great fixes in this patch, which now at least allow us to
import Media OA metric sets, which we couldn't do previously. So thank you for
doing that. I will try to do another import using your updated scripts even for
LNL and BMG.
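For anyone reading the quoted patch: the new percentage counters carry postfix (RPN) equations such as `PEC 54 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV`. Below is a minimal Python sketch of how such a string could be evaluated. The function name `eval_rpn`, the sample counter values, and the choice to return zero on division by zero are assumptions made for illustration; this is not the IGT code generator.

```python
def eval_rpn(equation, pec, variables):
    """Evaluate a postfix metric equation (illustrative sketch only).

    pec       -- hypothetical mapping of PEC counter index to raw value
    variables -- hypothetical mapping for $Name operands (name without '$')
    """
    stack = []
    for tok in equation.split():
        if tok == "READ":
            # "PEC 54 READ": pop the index, then the section name.
            index = stack.pop()
            section = stack.pop()
            if section != "PEC":
                raise ValueError(f"unsupported section: {section}")
            stack.append(pec[index])
        elif tok in ("UDIV", "UMUL", "FDIV"):
            rhs = stack.pop()
            lhs = stack.pop()
            if tok == "UDIV":
                # Assumed: unsigned integer division, zero divisor yields 0.
                stack.append(lhs // rhs if rhs else 0)
            elif tok == "UMUL":
                stack.append(lhs * rhs)
            else:
                # Assumed: floating-point division, zero divisor yields 0.0.
                stack.append(lhs / rhs if rhs else 0.0)
        elif tok.startswith("$"):
            stack.append(variables[tok[1:]])
        elif tok.isdigit():
            stack.append(int(tok))
        else:
            # Section names such as "PEC" stay on the stack until READ.
            stack.append(tok)
    if len(stack) != 1:
        raise ValueError("malformed equation")
    return stack[0]


equation = "PEC 54 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV"
sample = eval_rpn(equation, {54: 2000},
                  {"SqidiTotalCount": 2, "GpuCoreClocks": 4000})
print(sample)  # 25.0
```

Read this way, the equation is the raw counter divided by the unit count, scaled to a percentage of elapsed GPU core clocks.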
So this patch is:

Reviewed-by: Ashutosh Dixit

If you want to do any of this other work let me know, otherwise we will take it
from here.

Thanks.
--
Ashutosh

>
> Signed-off-by: Sushma Venkatesh Reddy
> ---
>  lib/xe/oa-configs/guids.xml            |    4 +
>  lib/xe/oa-configs/mdapi-xml-convert.py |   17 +-
>  lib/xe/oa-configs/oa-ptl.xml           | 1976 ++++++++++++++++++++++--
>  3 files changed, 1846 insertions(+), 151 deletions(-)
>
> diff --git a/lib/xe/oa-configs/guids.xml b/lib/xe/oa-configs/guids.xml
> index a7ca3e00f..10b43d6f1 100644
> --- a/lib/xe/oa-configs/guids.xml
> +++ b/lib/xe/oa-configs/guids.xml
> @@ -2780,4 +2780,8 @@
>
>
>
> +
> +
> +
> +
>
> diff --git a/lib/xe/oa-configs/mdapi-xml-convert.py b/lib/xe/oa-configs/mdapi-xml-convert.py
> index 9abc7e529..add6aa373 100755
> --- a/lib/xe/oa-configs/mdapi-xml-convert.py
> +++ b/lib/xe/oa-configs/mdapi-xml-convert.py
> @@ -182,8 +182,6 @@ mtl_chipset_oa_formats = {
>
>  xe2_chipset_oa_formats = {
>      '576B_PEC64LL': xe2_chipset_params_pec,
> -    # We only care about 576B_PEC64LL metrics in Xe2, entries below is just to
> -    # surpress warnings
>      '256B_GENERIC_NOA16': xehpsdv_chipset_params,
>      '128B_MPEC8_NOA16': mtl_chipset_oam_samedia_params,
>  }
> @@ -262,11 +260,12 @@ def read_token_to_rpn_read_oam(chipset, token, raw_offsets, oa_format):
>          report_size = chipsets[chipset][oa_format]['oa_report_size']
>
>          if offset < a_offset:
> -            if offset == 8:
> +            if offset in (0, 4, 8):
>                  return "GPU_TIME 0 READ"
> -            elif offset == 24:
> +            elif offset in (12, 24):
>                  return "GPU_CLOCK 0 READ"
>              else:
> +                print_err(f"Unhandled offset in read_token_to_rpn_read_oam: {offset} (token: {token}, chipset: {chipset}, oa_format: {oa_format})")
>                  assert 0
>          elif offset < b_offset:
>              a_cnt_offset = int((offset - a_offset) / den)
> @@ -297,6 +296,14 @@ def read_token_to_rpn_read_oam(chipset, token, raw_offsets, oa_format):
>                  return "C " + str(idx - 48) + " READ"
>              else:
>                  return "{0} READ".format(read_value(chipset, offset, oa_format))
> +        else:
> +            if idx == 0:
> +                return "GPU_TIME 0 READ"
> +            elif idx == 1:
> +                return "GPU_CLOCK 0 READ"
> +            else:
> +                print_err(f"Unhandled delta offset in read_token_to_rpn_read_oam: {offset} (token: {token}, chipset: {chipset}, oa_format: {oa_format})")
> +                assert 0
>
>      assert 0
>
> @@ -359,7 +366,7 @@ def read_token_to_rpn_read_oag(chipset, token, raw_offsets, oa_format):
>          report_size = chipsets[chipset][oa_format]['oa_report_size']
>
>          if offset < a_offset:
> -            if offset == 4:
> +            if offset in (0, 4, 8):
>                  return "GPU_TIME 0 READ"
>              elif offset == 12:
>                  assert chipset != "HSW" # Only for Gen8+
> diff --git a/lib/xe/oa-configs/oa-ptl.xml b/lib/xe/oa-configs/oa-ptl.xml
> index a9fe08a10..1878ccaf2 100644
> --- a/lib/xe/oa-configs/oa-ptl.xml
> +++ b/lib/xe/oa-configs/oa-ptl.xml
> @@ -1,5 +1,5 @@
>
> -
> +
>         chipset="PTL"
>         symbol_name="RenderBasic"
> @@ -142,7 +142,7 @@
>               symbol_name="POSTPS_ALPHA_TEST_FAIL"
>               underscore_name="postps_alpha_test_fail"
> -             description="Number of 2x2 that were lit from Rasterizer but failed the alpha test"
> +             description="Number of 2x2 that were lit from Rasterizer but failed the Alpha Test or Alpha-to-Coverage"
>               data_type="uint64"
>               units="events"
>               semantic_type="event"
> @@ -155,7 +155,7 @@
>               symbol_name="SHADER_PIX_KILL"
>               underscore_name="shader_pix_kill"
> -             description="Number of pixels that were lit from Rasterizer but killed by the Pixel Shader"
> +             description="Number of pixels that were lit from Rasterizer but killed by the Pixel Shader or OOB Check"
>               data_type="uint64"
>               units="events"
>               semantic_type="event"
> @@ -399,6 +399,19 @@
>               mdapi_supported_apis=""
>               mdapi_hw_unit_type="gpu"
>               />
> +
> +             symbol_name="GPU_MEMORY_REQUEST_QUEUE_FULL"
> +             underscore_name="gpu_memory_request_queue_full"
> +             description="Percentage of time in which SQ is filled above a threshold (usually 48 entries)"
> +             data_type="float"
> +             units="percent"
> +             semantic_type="duration"
> +             equation="PEC 54 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV"
> +
mdapi_group="Memory" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="SAMPLER_2X2_READ" > underscore_name="sampler_2_x2_read" > @@ -1529,6 +1542,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_STALL" > + underscore_name="l3_stall" > + description="Percentage of time in which Device Cache bank stalled" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 40 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="COMPRESSOR_INPUT" > underscore_name="compressor_input" > @@ -1633,6 +1659,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="GPU_MEMORY_REQUEST_QUEUE_FULL" > + underscore_name="gpu_memory_request_queue_full" > + description="Percentage of time in which SQ is filled above a threshold (usually 48 entries)" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 47 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Memory" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="TLB_MISS" > underscore_name="tlb_miss" > @@ -2216,6 +2255,45 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="HIZ_SUBSPAN_LATENCY_FIFOFULL" > + underscore_name="hiz_subspan_latency_fifofull" > + description="Percentage of time in which for which HiZ latency hiding structure full" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 42 READ $DepthPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Depth Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="IZ_OUTPUT_READY" > + 
underscore_name="iz_output_ready" > + description="Percentage of time in which IZ has requests to different clients" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 43 READ $DepthPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Depth Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="IZ_SUBSPAN_LATENCY_FIFOFULL" > + underscore_name="iz_subspan_latency_fifofull" > + description="Percentage of time in which for which IZ latency hiding structure full" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 44 READ $DepthPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Depth Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="POSTPS_DEPTH_STENCIL_TEST_FAIL" > underscore_name="postps_depth_stencil_test_fail" > @@ -2320,6 +2398,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="GPU_MEMORY_ACTIVE" > + underscore_name="gpu_memory_active" > + description="Percentage of time in which device local memory (HBM, GDDR, LPDDR, etc.) 
is active" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 53 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Memory" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > > > > @@ -2768,6 +2859,58 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_BUSY" > + underscore_name="l3_busy" > + description="Percentage of time in which Device Cache request queue has one or more requests pending" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 50 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="L3_INPUT_AVAILABLE" > + underscore_name="l3_input_available" > + description="Percentage of time in which Device Cache bank has input requests" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 51 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="L3_OUTPUT_READY" > + underscore_name="l3_output_ready" > + description="Percentage of time in which Device Cache bank has output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 52 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="L3_SUPERQ_FULL" > + underscore_name="l3_superq_full" > + description="Percentage of time in which all slots in Device Cache request queue are waiting for data return / response" > + data_type="float" > + units="percent" > + semantic_type="duration" > + 
equation="PEC 53 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_ATOMIC_ACCESS" > underscore_name="l3_atomic_access" > @@ -2820,6 +2963,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_STALL" > + underscore_name="l3_stall" > + description="Percentage of time in which Device Cache bank stalled" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 58 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_WRITE" > underscore_name="l3_write" > @@ -3227,7 +3383,7 @@ > symbol_name="GPU_MEMORY_32B_TRANSACTION_READ" > underscore_name="gpu_memory_32_b_transaction_read" > - description="Number of device local memory (HBM, GDDR, LPDDR, etc.) reads (32B)" > + description="Number of device local memory (HBM, GDDR, LPDDR, etc.) non coherent reads (32B)" > data_type="uint64" > units="events" > semantic_type="event" > @@ -3240,7 +3396,7 @@ > symbol_name="GPU_MEMORY_32B_TRANSACTION_WRITE" > underscore_name="gpu_memory_32_b_transaction_write" > - description="Number of device local memory (HBM, GDDR, LPDDR, etc.) writes (32B)" > + description="Number of device local memory (HBM, GDDR, LPDDR, etc.) non coherent writes (32B)" > data_type="uint64" > units="events" > semantic_type="event" > @@ -3253,7 +3409,7 @@ > symbol_name="GPU_MEMORY_64B_TRANSACTION_READ" > underscore_name="gpu_memory_64_b_transaction_read" > - description="Number of device local memory (HBM, GDDR, LPDDR, etc.) reads (64B)" > + description="Number of device local memory (HBM, GDDR, LPDDR, etc.) 
non coherent reads (64B)" > data_type="uint64" > units="events" > semantic_type="event" > @@ -3266,7 +3422,7 @@ > symbol_name="GPU_MEMORY_64B_TRANSACTION_WRITE" > underscore_name="gpu_memory_64_b_transaction_write" > - description="Number of device local memory (HBM, GDDR, LPDDR, etc.) writes (64B)" > + description="Number of device local memory (HBM, GDDR, LPDDR, etc.) non coherent writes (64B)" > data_type="uint64" > units="events" > semantic_type="event" > @@ -3354,6 +3510,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="GPU_MEMORY_REQUEST_QUEUE_FULL" > + underscore_name="gpu_memory_request_queue_full" > + description="Percentage of time in which SQ is filled above a threshold (usually 48 entries)" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 45 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Memory" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="TLB_MISS" > underscore_name="tlb_miss" > @@ -3380,6 +3549,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_BUSY" > + underscore_name="l3_busy" > + description="Percentage of time in which Device Cache request queue has one or more requests pending" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 48 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_HIT" > underscore_name="l3_hit" > @@ -3393,6 +3575,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_INPUT_AVAILABLE" > + underscore_name="l3_input_available" > + description="Percentage of time in which Device Cache bank has input requests" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 50 
READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_MISS" > underscore_name="l3_miss" > @@ -3406,6 +3601,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_OUTPUT_READY" > + underscore_name="l3_output_ready" > + description="Percentage of time in which Device Cache bank has output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 52 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_READ" > underscore_name="l3_read" > @@ -3419,6 +3627,32 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_STALL" > + underscore_name="l3_stall" > + description="Percentage of time in which Device Cache bank stalled" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 54 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="L3_SUPERQ_FULL" > + underscore_name="l3_superq_full" > + description="Percentage of time in which all slots in Device Cache request queue are waiting for data return / response" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 55 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_WRITE" > underscore_name="l3_write" > @@ -4120,6 +4354,32 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="CLIPPER_INPUT_AVAILABLE" > + 
underscore_name="clipper_input_available" > + description="Percentage of time in which Clipper has input available (from Vertex Shader or SOL)" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 35 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="CLIPPER_OUTPUT_READY" > + underscore_name="clipper_output_ready" > + description="Percentage of time in which Clipper output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 36 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="CLIPPER_TRANSACTION_OUTPUT" > underscore_name="clipper_transaction_output" > @@ -4146,6 +4406,71 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="STRIPSFAN_OUTPUT_READY" > + underscore_name="stripsfan_output_ready" > + description="Percentage of time in which in which geometry pipeline output is ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 39 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VERTEX_FETCH_INPUT_AVAILABLE" > + underscore_name="vertex_fetch_input_available" > + description="Percentage of time in which Vertex Fetch input is available" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 40 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + 
symbol_name="VERTEX_FETCH_OUTPUT_READY" > + underscore_name="vertex_fetch_output_ready" > + description="Percentage of time in which Vertex Fetch output is ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 41 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VS_OUTPUT_READY" > + underscore_name="vs_output_ready" > + description="Percentage of time in which Vertex Shader output is ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 42 READ $GeometryPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="COLOR_PIPE_CACHE_LATENCY1_STALL" > + underscore_name="color_pipe_cache_latency1_stall" > + description="Percentage of time in which Color Pipeline stalled due to MultiSampling Cache latency hiding structure full" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 43 READ $ColorPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Color Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="COLOR_PIPE_CACHE_LATENCY2_STALL" > underscore_name="color_pipe_cache_latency2_stall" > @@ -4159,6 +4484,32 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="PIXEL_POST_PROCESS_INPUT_AVAILABLE" > + underscore_name="pixel_post_process_input_available" > + description="Percentage of time in which Color Pipeline input ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 45 READ $ColorPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Color Pipe" > + mdapi_usage_flags="Frame Batch Draw" 
> + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="PIXEL_POST_PROCESS_OUTPUT_READY" > + underscore_name="pixel_post_process_output_ready" > + description="Percentage of time in which Color Pipeline pixel output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 46 READ $ColorPipeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Color Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="PS_OUTPUT_AVAILABLE" > underscore_name="ps_output_available" > @@ -4172,6 +4523,32 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="RENDER_CACHE_INPUT_AVAILABLE" > + underscore_name="render_cache_input_available" > + description="Percentage of time in which Render Cache input available" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 47 READ 2 UDIV $L3NodeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Color Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="RENDER_CACHE_OUTPUT_READY" > + underscore_name="render_cache_output_ready" > + description="Percentage of time in which Render Cache output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 48 READ 2 UDIV $L3NodeTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Color Pipe" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="AMFS_STALL_ALL_INPUT" > underscore_name="amfs_stall_all_input" > @@ -5776,6 +6153,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_BUSY" > + underscore_name="l3_busy" > + description="Percentage of time in which Device Cache request queue has one or more requests pending" > + data_type="float" > + units="percent" > + 
semantic_type="duration" > + equation="PEC 36 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_HIT" > underscore_name="l3_hit" > @@ -5789,6 +6179,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_INPUT_AVAILABLE" > + underscore_name="l3_input_available" > + description="Percentage of time in which Device Cache bank has input requests" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 38 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_MISS" > underscore_name="l3_miss" > @@ -5802,6 +6205,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_OUTPUT_READY" > + underscore_name="l3_output_ready" > + description="Percentage of time in which Device Cache bank has output ready" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 40 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_READ" > underscore_name="l3_read" > @@ -5815,6 +6231,32 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="L3_STALL" > + underscore_name="l3_stall" > + description="Percentage of time in which Device Cache bank stalled" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 42 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="L3_SUPERQ_FULL" > + 
underscore_name="l3_superq_full" > + description="Percentage of time in which all slots in Device Cache request queue are waiting for data return / response" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 43 READ $L3BankTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Device Cache" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > symbol_name="L3_WRITE" > underscore_name="l3_write" > @@ -5971,6 +6413,19 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > + + symbol_name="GPU_MEMORY_REQUEST_QUEUE_FULL" > + underscore_name="gpu_memory_request_queue_full" > + description="Percentage of time in which SQ is filled above a threshold (usually 48 entries)" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 54 READ $SqidiTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Memory" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > > > > @@ -6658,13 +7113,13 @@ > > > > - + chipset="PTL" > - symbol_name="TestOa" > - underscore_name="test_oa" > + symbol_name="XvePipelineRasterizationProfile" > + underscore_name="xve_pipeline_rasterization_profile" > mdapi_supported_apis="VK OGL OCL IO" > oa_format="576B_PEC64LL" > - hw_config_guid="cf90eca9-c696-4e2c-8f48-0574b1745a58" > + hw_config_guid="c8770c23-1a3a-4869-89e8-2e22ea9af5cd" > > > symbol_name="GpuTime" > @@ -6706,203 +7161,1223 @@ > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="GPU_BUSY" > - underscore_name="gpu_busy" > - description="Percentage of time in which GPU is not idle including all GPU engines" > + + symbol_name="XVE_ACTIVE" > + underscore_name="xve_active" > + description="Percentage of time in which at least one pipe is active in XVE" > data_type="float" > units="percent" > semantic_type="duration" > - equation="PEC 32 READ 100 UMUL $GpuCoreClocks FDIV" > - 
mdapi_group="Front End" > + equation="PEC 0 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="IA_VERTEX" > - underscore_name="ia_vertex" > - description="Number of vertices in a draw" > - data_type="uint64" > - units="events" > - semantic_type="event" > - equation="PEC 33 READ" > - mdapi_group="Geometry" > + + symbol_name="XVE_STALL" > + underscore_name="xve_stall" > + description="Percentage of time in which any thread loaded but not even a single pipe is active in XVE" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 1 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="GPGPU_THREADGROUP_COUNT" > - underscore_name="gpgpu_threadgroup_count" > - description="Number of GPGPU threadgroups dispatched" > - data_type="uint64" > - units="events" > - semantic_type="event" > - equation="PEC 0 READ" > - mdapi_group="Thread Dispatcher" > + + symbol_name="XVE_INST_EXECUTED_ALU0_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_all_utilization" > + description="Percentage of time taken by instructions executed by ALL threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 2 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="ASYNC_GPGPU_THREADGROUP_COUNT" > - underscore_name="async_gpgpu_threadgroup_count" > - description="Number of Async GPGPU threadgroups dispatched" > - data_type="uint64" > - units="events" > - semantic_type="event" > - equation="PEC 1 READ" > - mdapi_group="Thread Dispatcher" > + + 
symbol_name="XVE_INST_EXECUTED_ALU1_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_all_utilization" > + description="Percentage of time taken by instructions executed by ALL threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 3 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="RASTERIZER_SAMPLE_OUTPUT" > - underscore_name="rasterizer_sample_output" > - description="Number of lit samples emitted by Rasterizer" > - data_type="uint64" > - units="events" > - semantic_type="event" > - equation="PEC 34 READ" > - mdapi_group="Rasterizer" > + + symbol_name="XVE_INST_EXECUTED_ALU2_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu2_all_utilization" > + description="Percentage of time taken by instructions executed in ALU2 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 4 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1" > - underscore_name="test_event1" > - description="Number of GPU (gt) clock cycles at every clock period" > + + symbol_name="XVE_PIPE_ALU0_AND_ALU1_ACTIVE" > + underscore_name="xve_pipe_alu0_and_alu1_active" > + description="Percentage of time in which ALU0 and ALU1 pipes are both actively executing a Gen ISA instruction" > data_type="float" > units="percent" > semantic_type="duration" > - equation="PEC 2 READ $XeCoreTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > - mdapi_group="Test" > + equation="PEC 5 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > 
mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES" > - underscore_name="test_event1_cycles" > - description="Number of GPU (gt) clock cycles at every clock period" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 2 READ" > - mdapi_group="Test" > + + symbol_name="XVE_PIPE_ALU0_AND_ALU2_ACTIVE" > + underscore_name="xve_pipe_alu0_and_alu2_active" > + description="Percentage of time in which ALU0 and ALU2 pipes are both actively executing a Gen ISA instruction" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 6 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE0" > - underscore_name="test_event1_cycles_xecore0" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE0)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 3 READ" > - availability="$GtXeCore0" > - mdapi_group="Test" > + + symbol_name="XVE_MULTIPLE_PIPE_ACTIVE" > + underscore_name="xve_multiple_pipe_active" > + description="Percentage of time in which at least two pipes are actively executing a Gen ISA instruction among ALU0, ALU1 and ALU2 pipes" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 21 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE1" > - underscore_name="test_event1_cycles_xecore1" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE1)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 4 READ" > - availability="$GtXeCore1" > - mdapi_group="Test" > + + 
symbol_name="XVE_INST_EXECUTED_ALU0_VS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_vs_utilization" > + description="Percentage of time taken by instructions executed by VS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 7 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE2" > - underscore_name="test_event1_cycles_xecore2" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE2)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 5 READ" > - availability="$GtXeCore2" > - mdapi_group="Test" > + + symbol_name="XVE_INST_EXECUTED_ALU0_MS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_ms_utilization" > + description="Percentage of time taken by instructions executed by MS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 8 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE3" > - underscore_name="test_event1_cycles_xecore3" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE3)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 6 READ" > - availability="$GtXeCore3" > - mdapi_group="Test" > + + symbol_name="XVE_INST_EXECUTED_ALU0_TS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_ts_utilization" > + description="Percentage of time taken by instructions executed by TS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 9 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + 
mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE4" > - underscore_name="test_event1_cycles_xecore4" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE4)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 21 READ" > - availability="$GtXeCore4" > - mdapi_group="Test" > + + symbol_name="XVE_INST_EXECUTED_ALU0_PS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_ps_utilization" > + description="Percentage of time taken by instructions executed by PS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 10 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE5" > - underscore_name="test_event1_cycles_xecore5" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE5)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 22 READ" > - availability="$GtXeCore5" > - mdapi_group="Test" > + + symbol_name="XVE_INST_EXECUTED_ALU0_ASYNC_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_async_cs_utilization" > + description="Percentage of time taken by instructions executed by ASYNC_CS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 11 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE6" > - underscore_name="test_event1_cycles_xecore6" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE6)" > - data_type="uint64" > - 
units="cycles" > - semantic_type="event" > - equation="PEC 23 READ" > - availability="$GtXeCore6" > - mdapi_group="Test" > + + symbol_name="XVE_INST_EXECUTED_ALU0_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_cs_utilization" > + description="Percentage of time taken by instructions executed by CS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 12 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > mdapi_usage_flags="Frame Batch Draw" > mdapi_supported_apis="" > mdapi_hw_unit_type="gpu" > /> > - - symbol_name="TEST_EVENT1_CYCLES_XECORE7" > - underscore_name="test_event1_cycles_xecore7" > - description="Number of GPU (gt) clock cycles at every clock period (XECORE7)" > - data_type="uint64" > - units="cycles" > - semantic_type="event" > - equation="PEC 24 READ" > + + symbol_name="XVE_INST_EXECUTED_ALU1_VS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_vs_utilization" > + description="Percentage of time taken by instructions executed by VS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 13 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_MS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_ms_utilization" > + description="Percentage of time taken by instructions executed by MS threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 14 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_TS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_ts_utilization" > + description="Percentage of time taken by instructions executed by TS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 15 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_PS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_ps_utilization" > + description="Percentage of time taken by instructions executed by PS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 16 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_ASYNC_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_async_cs_utilization" > + description="Percentage of time taken by instructions executed by ASYNC_CS threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 17 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_cs_utilization" > + description="Percentage of time taken by instructions executed by CS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 18 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_THREADS_OCCUPANCY_ALL" > + underscore_name="xve_threads_occupancy_all" > + description="Percentage of thread slots occupied by ALL threads" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 22 READ $VectorEngineThreadsCount $VectorEngineTotalCount UMUL UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_ALUWR" > + underscore_name="xve_stall_aluwr" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for ALU to write GRF/ACC register. 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 23 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_BARRIER" > + underscore_name="xve_stall_barrier" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Gateway to write Notify register. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 24 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_CONTROL" > + underscore_name="xve_stall_control" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for JEU to complete branch instruction. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 25 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_INSTFETCH" > + underscore_name="xve_stall_instfetch" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Instruction Fetch. 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 26 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_OTHER" > + underscore_name="xve_stall_other" > + description="Percentage of time in which XVE stalled, with at least one thread waiting on any other dependency (Flag/EoT etc). Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 27 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_PIPESTALL" > + underscore_name="xve_stall_pipestall" > + description="Percentage of time in which XVE stalled, with at least one thread ready to be scheduled (Grf conf/send holds etc). 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 28 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_PS_DEPENDENCY" > + underscore_name="xve_stall_ps_dependency" > + description="Percentage of time in which XVE stalled, with at least one thread waiting on Pixel Shader dependency" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 29 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_SBID" > + underscore_name="xve_stall_sbid" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Scoreboard token to be available. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 30 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_SENDWR" > + underscore_name="xve_stall_sendwr" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for SEND message to be dispatched from XVE. 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 31 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + > + + chipset="PTL" > + symbol_name="XvePipelineRaytracingProfile" > + underscore_name="xve_pipeline_raytracing_profile" > + mdapi_supported_apis="VK OGL OCL IO" > + oa_format="576B_PEC64LL" > + hw_config_guid="dac47689-630c-4a44-a9e9-e8c94e3a33c4" > + > > + + symbol_name="GpuTime" > + underscore_name="gpu_time" > + description="Time elapsed on the GPU during the measurement." > + data_type="uint64" > + units="ns" > + semantic_type="event" > + equation="GPU_TIME 0 READ 1000000000 UMUL $GpuTimestampFrequency UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GpuCoreClocks" > + underscore_name="gpu_core_clocks" > + description="The total number of GPU core clocks elapsed during the measurement." > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="GPU_CLOCK 0 READ" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="AvgGpuCoreFrequency" > + underscore_name="avg_gpu_core_frequency" > + description="Average GPU Core Frequency in the measurement."
> + data_type="uint64" > + max_equation="$GpuMaxFrequency" > + units="hz" > + semantic_type="event" > + equation="$GpuCoreClocks 1000000000 UMUL $GpuTime UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_ACTIVE" > + underscore_name="xve_active" > + description="Percentage of time in which at least one pipe is active in XVE" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 0 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL" > + underscore_name="xve_stall" > + description="Percentage of time in which any thread loaded but not even a single pipe is active in XVE" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 1 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_all_utilization" > + description="Percentage of time taken by instructions executed by ALL threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 2 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_all_utilization" > + description="Percentage of time taken by instructions executed by ALL threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 3 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU2_ALL_UTILIZATION" > + underscore_name="xve_inst_executed_alu2_all_utilization" > + description="Percentage of time taken by instructions executed in ALU2 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 4 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_PIPE_ALU0_AND_ALU1_ACTIVE" > + underscore_name="xve_pipe_alu0_and_alu1_active" > + description="Percentage of time in which ALU0 and ALU1 pipes are both actively executing a Gen ISA instruction" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 5 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_PIPE_ALU0_AND_ALU2_ACTIVE" > + underscore_name="xve_pipe_alu0_and_alu2_active" > + description="Percentage of time in which ALU0 and ALU2 pipes are both actively executing a Gen ISA instruction" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 6 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_MULTIPLE_PIPE_ACTIVE" > + underscore_name="xve_multiple_pipe_active" > + description="Percentage of time in which at least two pipes are actively executing a Gen ISA instruction among ALU0, 
ALU1 and ALU2 pipes" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 21 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_RT_AHS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_rt_ahs_utilization" > + description="Percentage of time taken by instructions executed by RT_AHS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 7 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_RT_CHS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_rt_chs_utilization" > + description="Percentage of time taken by instructions executed by RT_CHS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 8 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_RT_MS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_rt_ms_utilization" > + description="Percentage of time taken by instructions executed by RT_MS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 9 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_PS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_ps_utilization" > + description="Percentage of time 
taken by instructions executed by PS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 10 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_ASYNC_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_async_cs_utilization" > + description="Percentage of time taken by instructions executed by ASYNC_CS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 11 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU0_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu0_cs_utilization" > + description="Percentage of time taken by instructions executed by CS threads on ALU0 pipe" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 12 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_RT_AHS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_rt_ahs_utilization" > + description="Percentage of time taken by instructions executed by RT_AHS threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 13 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_RT_CHS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_rt_chs_utilization" > + description="Percentage of time taken by instructions executed by RT_CHS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 14 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_RT_MS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_rt_ms_utilization" > + description="Percentage of time taken by instructions executed by RT_MS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 15 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_PS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_ps_utilization" > + description="Percentage of time taken by instructions executed by PS threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 16 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_ASYNC_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_async_cs_utilization" > + description="Percentage of time taken by instructions executed by ASYNC_CS threads on ALU1 pipe. Does not include extended math instructions." > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 17 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_INST_EXECUTED_ALU1_CS_UTILIZATION" > + underscore_name="xve_inst_executed_alu1_cs_utilization" > + description="Percentage of time taken by instructions executed by CS threads on ALU1 pipe. Does not include extended math instructions." 
> + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 18 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_THREADS_OCCUPANCY_ALL" > + underscore_name="xve_threads_occupancy_all" > + description="Percentage of thread slots occupied by ALL threads" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 22 READ $VectorEngineThreadsCount $VectorEngineTotalCount UMUL UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_ALUWR" > + underscore_name="xve_stall_aluwr" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for ALU to write GRF/ACC register. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 23 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_BARRIER" > + underscore_name="xve_stall_barrier" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Gateway to write Notify register. 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 24 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_CONTROL" > + underscore_name="xve_stall_control" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for JEU to complete branch instruction. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 25 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_INSTFETCH" > + underscore_name="xve_stall_instfetch" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Instruction Fetch. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 26 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_OTHER" > + underscore_name="xve_stall_other" > + description="Percentage of time in which XVE stalled, with at least one thread waiting on any other dependency (Flag/EoT etc). 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 27 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_PIPESTALL" > + underscore_name="xve_stall_pipestall" > + description="Percentage of time in which XVE stalled, with at least one thread ready to be scheduled (Grf conf/send holds etc). Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 28 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_PS_DEPENDENCY" > + underscore_name="xve_stall_ps_dependency" > + description="Percentage of time in which XVE stalled, with at least one thread waiting on Pixel Shader dependency" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 29 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_SBID" > + underscore_name="xve_stall_sbid" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for Scoreboard token to be available. 
Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 30 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="XVE_STALL_SENDWR" > + underscore_name="xve_stall_sendwr" > + description="Percentage of time in which XVE stalled, with at least one thread waiting for SEND message to be dispatched from XVE. Multiple stall reasons can qualify during the same cycle" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 31 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Vector Engine" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + + chipset="PTL" > + symbol_name="TestOa" > + underscore_name="test_oa" > + mdapi_supported_apis="VK OGL OCL IO" > + oa_format="576B_PEC64LL" > + hw_config_guid="cf90eca9-c696-4e2c-8f48-0574b1745a58" > + > > + + symbol_name="GpuTime" > + underscore_name="gpu_time" > + description="Time elapsed on the GPU during the measurement." > + data_type="uint64" > + units="ns" > + semantic_type="event" > + equation="GPU_TIME 0 READ 1000000000 UMUL $GpuTimestampFrequency UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GpuCoreClocks" > + underscore_name="gpu_core_clocks" > + description="The total number of GPU core clocks elapsed during the measurement." 
> + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="GPU_CLOCK 0 READ" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="AvgGpuCoreFrequency" > + underscore_name="avg_gpu_core_frequency" > + description="Average GPU Core Frequency in the measurement." > + data_type="uint64" > + max_equation="$GpuMaxFrequency" > + units="hz" > + semantic_type="event" > + equation="$GpuCoreClocks 1000000000 UMUL $GpuTime UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GPU_BUSY" > + underscore_name="gpu_busy" > + description="Percentage of time in which GPU is not idle including all GPU engines" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 32 READ 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Front End" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="IA_VERTEX" > + underscore_name="ia_vertex" > + description="Number of vertices in a draw" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="PEC 33 READ" > + mdapi_group="Geometry" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GPGPU_THREADGROUP_COUNT" > + underscore_name="gpgpu_threadgroup_count" > + description="Number of GPGPU threadgroups dispatched" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="PEC 0 READ" > + mdapi_group="Thread Dispatcher" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="ASYNC_GPGPU_THREADGROUP_COUNT" > + underscore_name="async_gpgpu_threadgroup_count" > + description="Number of Async GPGPU threadgroups dispatched" > + 
data_type="uint64" > + units="events" > + semantic_type="event" > + equation="PEC 1 READ" > + mdapi_group="Thread Dispatcher" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="RASTERIZER_SAMPLE_OUTPUT" > + underscore_name="rasterizer_sample_output" > + description="Number of lit samples emitted by Rasterizer" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="PEC 34 READ" > + mdapi_group="Rasterizer" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1" > + underscore_name="test_event1" > + description="Number of GPU (gt) clock cycles at every clock period" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="PEC 2 READ $XeCoreTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES" > + underscore_name="test_event1_cycles" > + description="Number of GPU (gt) clock cycles at every clock period" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 2 READ" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE0" > + underscore_name="test_event1_cycles_xecore0" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE0)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 3 READ" > + availability="$GtXeCore0" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE1" > + underscore_name="test_event1_cycles_xecore1" > + description="Number of GPU (gt) clock cycles at every clock period 
(XECORE1)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 4 READ" > + availability="$GtXeCore1" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE2" > + underscore_name="test_event1_cycles_xecore2" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE2)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 5 READ" > + availability="$GtXeCore2" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE3" > + underscore_name="test_event1_cycles_xecore3" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE3)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 6 READ" > + availability="$GtXeCore3" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE4" > + underscore_name="test_event1_cycles_xecore4" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE4)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 21 READ" > + availability="$GtXeCore4" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE5" > + underscore_name="test_event1_cycles_xecore5" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE5)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 22 READ" > + availability="$GtXeCore5" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + 
symbol_name="TEST_EVENT1_CYCLES_XECORE6" > + underscore_name="test_event1_cycles_xecore6" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE6)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 23 READ" > + availability="$GtXeCore6" > + mdapi_group="Test" > + mdapi_usage_flags="Frame Batch Draw" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="TEST_EVENT1_CYCLES_XECORE7" > + underscore_name="test_event1_cycles_xecore7" > + description="Number of GPU (gt) clock cycles at every clock period (XECORE7)" > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="PEC 24 READ" > availability="$GtXeCore7" > mdapi_group="Test" > mdapi_usage_flags="Frame Batch Draw" > @@ -7062,4 +8537,213 @@ > > > > + + chipset="PTL" > + symbol_name="MediaSet2" > + underscore_name="media_set2" > + mdapi_supported_apis="IO" > + oa_format="128B_MPEC8_NOA16" > + hw_config_guid="36cf7053-8097-4389-bc5d-a652553de240" > + > > + + symbol_name="GpuTime" > + underscore_name="gpu_time" > + description="Time elapsed on the GPU during the measurement." > + data_type="uint64" > + units="ns" > + semantic_type="event" > + equation="GPU_TIME 0 READ 1000000000 UMUL $GpuTimestampFrequency UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GpuCoreClocks" > + underscore_name="gpu_core_clocks" > + description="The total number of media slice clocks elapsed during the measurement." > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="GPU_CLOCK 0 READ" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="AvgGpuCoreFrequency" > + underscore_name="avg_gpu_core_frequency" > + description="Average GPU Media Frequency in the measurement." 
> + data_type="uint64" > + max_equation="$GpuMaxFrequency" > + units="hz" > + semantic_type="event" > + equation="$GpuCoreClocks 1000000000 UMUL $GpuTime UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="COMMAND_PARSER_VIDEO_ENHANCEMENT_ENGINE0_BUSY" > + underscore_name="command_parser_video_enhancement_engine0_busy" > + description="Number of VECS0 (VEBOX0) activities" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="A 0 READ 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VIDEO_ENHANCEMENT_BOX0_READ_REQUEST" > + underscore_name="video_enhancement_box0_read_request" > + description="Number of VEBOX0 read requests" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="A 1 READ" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VIDEO_ENHANCEMENT_BOX0_WRITE_REQUEST" > + underscore_name="video_enhancement_box0_write_request" > + description="Number of VEBOX0 write requests" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="A 2 READ" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="COMMAND_PARSER_VIDEO_ENHANCEMENT_ENGINE1_BUSY" > + underscore_name="command_parser_video_enhancement_engine1_busy" > + description="Number of VECS1 (VEBOX1) activities" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="A 3 READ 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VIDEO_ENHANCEMENT_BOX1_READ_REQUEST" > + 
underscore_name="video_enhancement_box1_read_request" > + description="Number of VEBOX1 read requests" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="A 4 READ" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="VIDEO_ENHANCEMENT_BOX1_WRITE_REQUEST" > + underscore_name="video_enhancement_box1_write_request" > + description="Number of VEBOX1 write requests" > + data_type="uint64" > + units="events" > + semantic_type="event" > + equation="A 5 READ" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + > + + chipset="PTL" > + symbol_name="MediaSet1" > + underscore_name="media_set1" > + mdapi_supported_apis="IO" > + oa_format="128B_MPEC8_NOA16" > + hw_config_guid="2d53df97-44b8-4bf1-aa33-2de98fe6f6ad" > + > > + + symbol_name="GpuTime" > + underscore_name="gpu_time" > + description="Time elapsed on the GPU during the measurement." > + data_type="uint64" > + units="ns" > + semantic_type="event" > + equation="GPU_TIME 0 READ 1000000000 UMUL $GpuTimestampFrequency UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="GpuCoreClocks" > + underscore_name="gpu_core_clocks" > + description="The total number of media slice clocks elapsed during the measurement." > + data_type="uint64" > + units="cycles" > + semantic_type="event" > + equation="GPU_CLOCK 0 READ" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="AvgGpuCoreFrequency" > + underscore_name="avg_gpu_core_frequency" > + description="Average GPU Media Frequency in the measurement." 
> + data_type="uint64" > + max_equation="$GpuMaxFrequency" > + units="hz" > + semantic_type="event" > + equation="$GpuCoreClocks 1000000000 UMUL $GpuTime UDIV" > + mdapi_group="GPU" > + mdapi_usage_flags="Tier1 Overview System Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + + symbol_name="COMMAND_PARSER_GRAPHICS_SECURITY_CONTROLLER_BUSY" > + underscore_name="command_parser_graphics_security_controller_busy" > + description="Number of GSCCS activities" > + data_type="float" > + units="percent" > + semantic_type="duration" > + equation="A 0 READ 100 UMUL $GpuCoreClocks FDIV" > + mdapi_group="Media" > + mdapi_usage_flags="Batch" > + mdapi_supported_apis="" > + mdapi_hw_unit_type="gpu" > + /> > + > + > + > + > + > + > > -- > 2.43.0 >
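For readers following the equation= attributes quoted above: they are written in postfix (RPN) form, so "PEC 24 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV" means "read PEC counter 24, divide by the EU core count, multiply by 100, then divide by the GPU core clocks". Below is a minimal Python sketch of how such a string could be evaluated. It is not the IGT code generator. The function name eval_rpn, the counters and variables dictionaries, and the sample values are all hypothetical, and it assumes that division by zero yields 0 and that $GpuCoreClocks can be supplied as a plain precomputed value.

```python
def eval_rpn(equation, counters, variables):
    """Evaluate a postfix metric equation (illustrative sketch only).

    counters:  hypothetical dict mapping (bank, index) -> raw value,
               e.g. {("PEC", 24): 6400}
    variables: hypothetical dict mapping a $Name (without '$') -> value
    """
    stack = []
    for tok in equation.split():
        if tok == "READ":
            # READ consumes the counter index and the bank name below it.
            index = stack.pop()
            bank = stack.pop()
            stack.append(counters[(bank, int(index))])
        elif tok in ("UMUL", "UDIV", "FDIV"):
            # Binary operators: the right operand is on top of the stack.
            rhs = stack.pop()
            lhs = stack.pop()
            if tok == "UMUL":
                stack.append(lhs * rhs)
            elif tok == "UDIV":
                # Assumption: integer division, guarded against zero.
                stack.append(lhs // rhs if rhs else 0)
            else:
                # Assumption: floating-point division, guarded against zero.
                stack.append(lhs / rhs if rhs else 0.0)
        elif tok.startswith("$"):
            stack.append(variables[tok[1:]])
        elif tok.isdigit():
            stack.append(int(tok))
        else:
            # Anything else is treated as a bank name such as PEC or GPU_TIME.
            stack.append(tok)
    if len(stack) != 1:
        raise ValueError("malformed equation: " + equation)
    return stack[0]


# Hypothetical sample values, chosen only to make the arithmetic easy to follow.
stall_pct = eval_rpn(
    "PEC 24 READ $EuCoresTotalCount UDIV 100 UMUL $GpuCoreClocks FDIV",
    {("PEC", 24): 6400},
    {"EuCoresTotalCount": 64, "GpuCoreClocks": 1000},
)
print(stall_pct)
```

Only the three operators that appear in the quoted sets (UMUL, UDIV, FDIV) are handled here; a real evaluator would need the full operator list used by the metrics XML.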