From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.13]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id AED1A1E7C12 for ; Fri, 10 Apr 2026 14:06:23 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=192.198.163.13 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1775829985; cv=none; b=FQTKv11TqA168AFIW3TTn8KIDeoW7PBs3swHiV8pFpHuJE2eNA8O+xb0DwtCCinfF3HpUx0nAmwh/wISbpflwrjrswza5MDKt10xdFsIy4dCoSFC9jfdLwlfvy7J2M1vLX3Zj/4hhQQUWbSrHv8lfAKr7cCJvadqKvHL0NI338c= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1775829985; c=relaxed/simple; bh=3ClDLJ4f/upEWN9RmLVmi1H9UKP4WNemLlet5o1DJ6Y=; h=From:To:Cc:Subject:Date:Message-ID:MIME-Version; b=kfehw4ZqIF4MaJcsoB9g7/EAqC/Q1IysHZ5W4tnfIbsN4Sqajpp+zeLJh+OZgJe6lFHt0dJIVnFw3XIHToFLA/+qqWttPNX/KLsvuEzmgWIz2XTaASrAIertXyNvzvknJoleTF75XmYZQazTw2Ot+RQreQgvUKHqXHnb5CQZjbs= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com; spf=pass smtp.mailfrom=intel.com; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b=VhyfXnsz; arc=none smtp.client-ip=192.198.163.13 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=intel.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="VhyfXnsz" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1775829983; x=1807365983; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=3ClDLJ4f/upEWN9RmLVmi1H9UKP4WNemLlet5o1DJ6Y=; b=VhyfXnszfJRK1YEfuLZoZU2V50a9RsW2di/yEt2OZ+kY95G4AawQ9p/a ZpZocO7J/phqIutT6uj0kbRleAkd5aTMre4t/kopVsa5vRs5Cl6d1amV2 SzICWhRQGh66qVYZolcKhcOrZXmN6XAfQwrhuZWOGd969eW8zyMtuD2Oi bK32aLBm7ZFUB24nOKMmOffAzmQrqRX3YzLPpTmrpgWmzVp173fVMYNDc BfQlazBwHyVcLyXNCctbk/lXyT+L2yZU3vvTL3Vs2U5tobF7s0a7KoquO XGR17ZlBb9QFJOv2RmQYX3eWT4I6Ow2fd/oglPjoIQh6sjRrsf5d9LQbz w==; X-CSE-ConnectionGUID: 7EbrOxNWTmmvGsHC39/kng== X-CSE-MsgGUID: Q5VzAqcQTkuXZENumtVJoA== X-IronPort-AV: E=McAfee;i="6800,10657,11755"; a="79434713" X-IronPort-AV: E=Sophos;i="6.23,171,1770624000"; d="scan'208";a="79434713" Received: from fmviesa009.fm.intel.com ([10.60.135.149]) by fmvoesa107.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 10 Apr 2026 07:06:23 -0700 X-CSE-ConnectionGUID: mKPQS7paT8+dxzlagc+bwA== X-CSE-MsgGUID: 4fX0oQdESwagMNFrZejwgA== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.23,171,1770624000"; d="scan'208";a="222606965" Received: from dut6094bmgfrd.fm.intel.com ([10.80.55.31]) by fmviesa009.fm.intel.com with ESMTP; 10 Apr 2026 07:06:23 -0700 From: Jia Yao To: intel-gfx@lists.freedesktop.org Cc: Jia Yao , stable@vger.kernel.org, Shuicheng Lin , Matt Roper , Joonas Lahtinen , Rodrigo Vivi , Maciej Plewka Subject: [PATCH v2] drm/i915/dg2: Add per-context control for Wa_22013059131 Date: Fri, 10 Apr 2026 14:06:19 +0000 Message-ID: <20260410140619.736008-1-jia.yao@intel.com> X-Mailer: git-send-email 2.43.0 Precedence: bulk X-Mailing-List: stable@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Wa_22013059131 sets FORCE_1_SUB_MESSAGE_PER_FRAGMENT in LSC_CHICKEN_BIT_0 at engine init, but this is known to cause GPU hangs in certain workloads. Add I915_CONTEXT_PARAM_WA_22013059131 so userspace that handles the workaround itself (e.g. by limiting SLM size) can set it to 1 to let the kernel know bit 15 programming is not needed for that context. LSC_CHICKEN_BIT_0 is not context-saved by hardware, so the kernel restores the correct value on every context switch via the indirect context batchbuffer to avoid leaking state between contexts. The old unconditional application of Wa22013059131 in intel_workarounds.c is removed. Bspec: 54833 Fixes: 645cc0b9d972 ("drm/i915/dg2: Add initial gt/ctx/engine workarounds") Cc: stable@vger.kernel.org Cc: Shuicheng Lin Cc: Matt Roper Cc: Joonas Lahtinen Cc: Rodrigo Vivi Cc: Maciej Plewka Signed-off-by: Jia Yao --- drivers/gpu/drm/i915/gem/i915_gem_context.c | 12 ++++++ .../gpu/drm/i915/gem/i915_gem_context_types.h | 1 + drivers/gpu/drm/i915/gt/intel_context_types.h | 1 + drivers/gpu/drm/i915/gt/intel_lrc.c | 41 ++++++++++++++++++- drivers/gpu/drm/i915/gt/intel_workarounds.c | 10 ++--- include/uapi/drm/i915_drm.h | 10 +++++ 6 files changed, 69 insertions(+), 6 deletions(-) diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context.c b/drivers/gpu/drm/i915/gem/i915_gem_context.c index 6ac0f23570f3..d24e449f1eb3 100644 --- a/drivers/gpu/drm/i915/gem/i915_gem_context.c +++ b/drivers/gpu/drm/i915/gem/i915_gem_context.c @@ -911,6 +911,15 @@ static int set_proto_ctx_param(struct drm_i915_file_private *fpriv, ret = -EINVAL; break; + case I915_CONTEXT_PARAM_WA_22013059131: + if (args->size) + ret = -EINVAL; + else if (args->value) + pc->user_flags |= BIT(UCONTEXT_WA_22013059131); + else + pc->user_flags &= ~BIT(UCONTEXT_WA_22013059131); + break; + case I915_CONTEXT_PARAM_RECOVERABLE: if (args->size) ret = -EINVAL; @@ -1003,6 +1012,9 @@ static int intel_context_set_gem(struct intel_context *ce, if (test_bit(UCONTEXT_LOW_LATENCY, &ctx->user_flags)) __set_bit(CONTEXT_LOW_LATENCY, &ce->flags); + if (test_bit(UCONTEXT_WA_22013059131, &ctx->user_flags)) + __set_bit(CONTEXT_WA_22013059131, &ce->flags); + return ret; } diff --git a/drivers/gpu/drm/i915/gem/i915_gem_context_types.h b/drivers/gpu/drm/i915/gem/i915_gem_context_types.h index 0267c924634b..4efc0e758d3b 100644 --- a/drivers/gpu/drm/i915/gem/i915_gem_context_types.h +++ b/drivers/gpu/drm/i915/gem/i915_gem_context_types.h @@ -338,6 +338,7 @@ struct i915_gem_context { #define UCONTEXT_RECOVERABLE 3 #define UCONTEXT_PERSISTENCE 4 #define UCONTEXT_LOW_LATENCY 5 +#define UCONTEXT_WA_22013059131 6 /** * @flags: small set of booleans diff --git a/drivers/gpu/drm/i915/gt/intel_context_types.h b/drivers/gpu/drm/i915/gt/intel_context_types.h index 10070ee4d74c..84011ce7c84d 100644 --- a/drivers/gpu/drm/i915/gt/intel_context_types.h +++ b/drivers/gpu/drm/i915/gt/intel_context_types.h @@ -133,6 +133,7 @@ struct intel_context { #define CONTEXT_EXITING 13 #define CONTEXT_LOW_LATENCY 14 #define CONTEXT_OWN_STATE 15 +#define CONTEXT_WA_22013059131 16 struct { u64 timeout_us; diff --git a/drivers/gpu/drm/i915/gt/intel_lrc.c b/drivers/gpu/drm/i915/gt/intel_lrc.c index 147d22907960..9bae82f9746a 100644 --- a/drivers/gpu/drm/i915/gt/intel_lrc.c +++ b/drivers/gpu/drm/i915/gt/intel_lrc.c @@ -1348,6 +1348,35 @@ gen12_invalidate_state_cache(u32 *cs) return cs; } +static u32 * +dg2_g11_emit_wa_22013059131(const struct intel_context *ce, u32 *cs) +{ + /* + * While re-writing LSC_CHICKEN_BIT_0 for Wa_22013059131, the + * other bits of the register will also get overwritten. The + * hardware default for all other bits is 0, but any workarounds + * that adjust the other bits in the lower dword of the register + * also need to be re-applied here. At the moment that's just + * Wa_22014226127, which is always set for DG2-G11 platforms. + */ + u32 val = DISABLE_D8_D16_COASLESCE; + + /* + * i915 should only set LSC_CHICKEN_BIT_0 as a solution for + * Wa_22013059131 on contexts for which the userspace driver is + * _not_ applying the preferred workaround implementation in + * userspace. + */ + if (!test_bit(CONTEXT_WA_22013059131, &ce->flags)) + val |= FORCE_1_SUB_MESSAGE_PER_FRAGMENT; + + *cs++ = MI_LOAD_REGISTER_IMM(1); + *cs++ = i915_mmio_reg_offset(LSC_CHICKEN_BIT_0); + *cs++ = val; + + return cs; +} + static u32 * gen12_emit_indirect_ctx_rcs(const struct intel_context *ce, u32 *cs) { @@ -1371,6 +1400,10 @@ gen12_emit_indirect_ctx_rcs(const struct intel_context *ce, u32 *cs) IS_DG2(ce->engine->i915)) cs = dg2_emit_draw_watermark_setting(cs); + /* Wa_22013059131:dg2 */ + if (IS_DG2_G11(ce->engine->i915)) + cs = dg2_g11_emit_wa_22013059131(ce, cs); + return cs; } @@ -1387,7 +1420,13 @@ gen12_emit_indirect_ctx_xcs(const struct intel_context *ce, u32 *cs) PIPE_CONTROL_INSTRUCTION_CACHE_INVALIDATE, 0); - return gen12_emit_aux_table_inv(ce->engine, cs); + cs = gen12_emit_aux_table_inv(ce->engine, cs); + + /* Wa_22013059131:dg2 */ + if (IS_DG2_G11(ce->engine->i915)) + cs = dg2_g11_emit_wa_22013059131(ce, cs); + + return cs; } static u32 *xehp_emit_fastcolor_blt_wabb(const struct intel_context *ce, u32 *cs) diff --git a/drivers/gpu/drm/i915/gt/intel_workarounds.c b/drivers/gpu/drm/i915/gt/intel_workarounds.c index 24ea5d8d529c..ef6eea3ab597 100644 --- a/drivers/gpu/drm/i915/gt/intel_workarounds.c +++ b/drivers/gpu/drm/i915/gt/intel_workarounds.c @@ -2840,7 +2840,11 @@ general_render_compute_wa_init(struct intel_engine_cs *engine, struct i915_wa_li if (IS_GFX_GT_IP_STEP(gt, IP_VER(12, 70), STEP_A0, STEP_B0) || IS_GFX_GT_IP_STEP(gt, IP_VER(12, 71), STEP_A0, STEP_B0) || IS_DG2(i915)) { - /* Wa_22014226127 */ + /* + * Wa_22014226127: Note that this workaround also needs to be + * re-applied in intel_lrc.c when LSC_CHICKEN_BIT_0 is + * re-written for Wa_22013059131. + */ wa_mcr_write_or(wal, LSC_CHICKEN_BIT_0, DISABLE_D8_D16_COASLESCE); } @@ -2867,10 +2871,6 @@ general_render_compute_wa_init(struct intel_engine_cs *engine, struct i915_wa_li MAXREQS_PER_BANK, REG_FIELD_PREP(MAXREQS_PER_BANK, 2)); - /* Wa_22013059131:dg2 */ - wa_mcr_write_or(wal, LSC_CHICKEN_BIT_0, - FORCE_1_SUB_MESSAGE_PER_FRAGMENT); - /* * Wa_22012654132 * diff --git a/include/uapi/drm/i915_drm.h b/include/uapi/drm/i915_drm.h index 535cb68fdb5c..0f553bb12fb0 100644 --- a/include/uapi/drm/i915_drm.h +++ b/include/uapi/drm/i915_drm.h @@ -2172,6 +2172,16 @@ struct drm_i915_gem_context_param { * Note that this is a debug API not available on production kernel builds. */ #define I915_CONTEXT_PARAM_CONTEXT_IMAGE 0xf + +/* + * I915_CONTEXT_PARAM_WA_22013059131: + * + * Default value 0 means the kernel programs Wa_22013059131 for this context. + * Set to 1 to inform the kernel that userspace is taking responsibility for + * applying the preferred workaround implementation, so the kernel programming + * of LSC_CHICKEN_BIT_0 bit 15 is not needed for this context. DG2-G11 only. + */ +#define I915_CONTEXT_PARAM_WA_22013059131 0x10 /* Must be kept compact -- no holes and well documented */ /** @value: Context parameter value to be set or queried */ -- 2.43.0