From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 4384CC001DB for ; Mon, 14 Aug 2023 13:08:12 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 105DD10E1F6; Mon, 14 Aug 2023 13:08:12 +0000 (UTC) Received: from mgamail.intel.com (mgamail.intel.com [134.134.136.24]) by gabe.freedesktop.org (Postfix) with ESMTPS id 13E6810E1F6 for ; Mon, 14 Aug 2023 13:08:10 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1692018490; x=1723554490; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=DuETrMXnmjHxxT87hYqMJmdYO6fPLxE9aNVNlkWYRZg=; b=cYpVfu6vPDLbJ1DaYyfQhmQdlfs4lE9ucmr0vNTgoIs8f42zQGTNcYQI +Tl1ZFLwarSqX54P5S4bdDF9qAof51nqA09g7RmTD0c9lr38oo9bk5qan Wb2+a8bfTclwFqxAClsrG9MoDh3i+XPitpu/E3lYLdwk9kOBmuVomZ6Vo dPZP0I4aYv+OljlY67tsYtrwDH/DjvUf5WLSl5QDxCKyFDDXv5nMj5Ris 2S8KrAabMAE5N/BqTW5FwMfV6s+WgBxQeu4j63BaaGzD3AbDX0jm+yYJO OC/Lv8Dko2CZWmeyfHoxEX8D6LPXGJA47OmaUSHbKMH+o/n3l7hK6PYvY w==; X-IronPort-AV: E=McAfee;i="6600,9927,10802"; a="374803972" X-IronPort-AV: E=Sophos;i="6.01,172,1684825200"; d="scan'208";a="374803972" Received: from orsmga005.jf.intel.com ([10.7.209.41]) by orsmga102.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Aug 2023 06:08:08 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10802"; a="907231761" X-IronPort-AV: E=Sophos;i="6.01,172,1684825200"; d="scan'208";a="907231761" Received: from aravind-dev.iind.intel.com ([10.145.162.80]) by orsmga005-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Aug 2023 06:08:07 -0700 From: Aravind Iddamsetty To: intel-xe@lists.freedesktop.org Date: Mon, 14 Aug 2023 18:42:11 +0530 Message-Id: <20230814131213.1620252-1-aravind.iddamsetty@intel.com> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Subject: [Intel-xe] [PATCH v4 0/2] drm/xe/pmu: Enable PMU interface X-BeenThere: intel-xe@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel Xe graphics driver List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: intel-xe-bounces@lists.freedesktop.org Sender: "Intel-xe" There are a set of engine group busyness counters provided by HW which are perfect fit to be exposed via PMU perf events. BSPEC: 46559, 46560, 46722, 46729, 52071, 71028 events can be listed using: perf list xe_0000_03_00.0/any-engine-group-busy-gt0/ [Kernel PMU event] xe_0000_03_00.0/copy-group-busy-gt0/ [Kernel PMU event] xe_0000_03_00.0/interrupts/ [Kernel PMU event] xe_0000_03_00.0/media-group-busy-gt0/ [Kernel PMU event] xe_0000_03_00.0/render-group-busy-gt0/ [Kernel PMU event] and can be read using: perf stat -e "xe_0000_8c_00.0/render-group-busy-gt0/" -I 1000 time counts unit events 1.001139062 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 2.003294678 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 3.005199582 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 4.007076497 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 5.008553068 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 6.010531563 43520 ns xe_0000_8c_00.0/render-group-busy-gt0/ 7.012468029 44800 ns xe_0000_8c_00.0/render-group-busy-gt0/ 8.013463515 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 9.015300183 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 10.017233010 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ 10.971934120 0 ns xe_0000_8c_00.0/render-group-busy-gt0/ The pmu base implementation is taken from i915. v4: Minor nits. v3: 1. drop init_samples, as storing counters before going to suspend should be sufficient. 2. ported the "drm/i915/pmu: Make PMU sample array two-dimensional" and dropped helpers to store and read samples. 3. use xe_device_mem_access_get_if_ongoing to check if device is active before reading the OA registers. 4. dropped format attr as no longer needed 5. introduce xe_pmu_suspend to call engine_group_busyness_store 6. few other nits. v2: Store last known value when device is awake return that while the GT is suspended and then update the driver copy when read during awake. Aravind Iddamsetty (2): drm/xe: Get GT clock to nanosecs drm/xe/pmu: Enable PMU interface drivers/gpu/drm/xe/Makefile | 2 + drivers/gpu/drm/xe/regs/xe_gt_regs.h | 5 + drivers/gpu/drm/xe/xe_device.c | 2 + drivers/gpu/drm/xe/xe_device_types.h | 4 + drivers/gpu/drm/xe/xe_gt.c | 2 + drivers/gpu/drm/xe/xe_gt_clock.c | 5 + drivers/gpu/drm/xe/xe_gt_clock.h | 4 +- drivers/gpu/drm/xe/xe_irq.c | 18 + drivers/gpu/drm/xe/xe_module.c | 5 + drivers/gpu/drm/xe/xe_pmu.c | 675 +++++++++++++++++++++++++++ drivers/gpu/drm/xe/xe_pmu.h | 25 + drivers/gpu/drm/xe/xe_pmu_types.h | 76 +++ include/uapi/drm/xe_drm.h | 16 + 13 files changed, 838 insertions(+), 1 deletion(-) create mode 100644 drivers/gpu/drm/xe/xe_pmu.c create mode 100644 drivers/gpu/drm/xe/xe_pmu.h create mode 100644 drivers/gpu/drm/xe/xe_pmu_types.h -- 2.25.1