From: Sasha Levin <sashal@kernel.org>
To: patches@lists.linux.dev, stable@vger.kernel.org
Cc: TungYu Lu <tungyu.lu@amd.com>,
Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>,
Zaeem Mohamed <zaeem.mohamed@amd.com>,
Mark Broadworth <mark.broadworth@amd.com>,
Alex Deucher <alexander.deucher@amd.com>,
Sasha Levin <sashal@kernel.org>,
austin.zheng@amd.com, jun.lei@amd.com, harry.wentland@amd.com,
sunpeng.li@amd.com, christian.koenig@amd.com, airlied@gmail.com,
simona@ffwll.ch, alex.hung@amd.com, rodrigo.siqueira@amd.com,
hamzamahfooz@linux.microsoft.com, v.shevtsov@mt-integration.ru,
ivlipski@amd.com, amd-gfx@lists.freedesktop.org,
dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org
Subject: [PATCH AUTOSEL 6.6 36/66] drm/amd/display: Correct prefetch calculation
Date: Sun, 1 Jun 2025 19:37:13 -0400 [thread overview]
Message-ID: <20250601233744.3514795-36-sashal@kernel.org> (raw)
In-Reply-To: <20250601233744.3514795-1-sashal@kernel.org>
From: TungYu Lu <tungyu.lu@amd.com>
[ Upstream commit 33bc89949b4366dff2dca30bc61ba1c0cbcd2ab2 ]
[Why]
The minimum value of the dst_y_prefetch_equ was not correct
in prefetch calculation whice causes OPTC underflow.
[How]
Add the min operation of dst_y_prefetch_equ in prefetch calculation
for legacy DML.
Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
Signed-off-by: TungYu Lu <tungyu.lu@amd.com>
Signed-off-by: Zaeem Mohamed <zaeem.mohamed@amd.com>
Tested-by: Mark Broadworth <mark.broadworth@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Sasha Levin <sashal@kernel.org>
---
**YES** ## Analysis This commit should be backported to stable kernel
trees. Here's my detailed analysis: ### **Bug Description and Impact**
The commit fixes a critical hardware register overflow issue in AMD
display's prefetch calculation that causes OPTC (Output Pipe and Timing
Control) underflow. OPTC underflow is a serious display issue that can
cause: - Screen corruption or artifacts - Display timing problems -
Potential system instability during display operations ### **Code Change
Analysis** The fix adds a single line in three DCN (Display Core Next)
versions (30, 31, 314): ```c dst_y_prefetch_equ =
dml_min(dst_y_prefetch_equ, 63.75); // limit to the reg limit of U6.2
for DST_Y_PREFETCH ``` This change: 1. **Prevents register overflow**:
The DST_Y_PREFETCH register is limited to 8 bits with U6.2 format,
meaning maximum value is 63.75 2. **Applies early in calculation**: The
limit is applied to `dst_y_prefetch_equ` before it's used in subsequent
calculations 3. **Matches existing pattern**: DCN21 already has this
exact fix (commit 9857bb9457fe5 from 2021), establishing this as a
known, proven solution ### **Why This Should Be Backported** 1.
**Important Bug Fix**: Fixes a hardware register overflow that causes
visible display issues affecting end users 2. **Minimal Risk**: - Single
line addition in each file - No architectural changes - Simply enforces
hardware register limits that should have been there - Proven safe
(already in DCN21 for 3+ years) 3. **Contained Scope**: Only affects AMD
display prefetch calculations in legacy DML (Display Mode Library)
versions 4. **Clear User Impact**: OPTC underflow causes noticeable
display problems that users would report 5. **Follows Stable Criteria**:
- Small, obvious fix - Addresses hardware limitation - Low regression
risk - Similar to reference commits marked "YES" for backporting ###
**Comparison with Similar Commits** Looking at the provided examples: -
**Similar Commit #1** (YES): Also fixes prefetch calculation with min
operation and has `Cc: stable@vger.kernel.org` - **Similar Commit #2**
(YES): Fixes prefetch calculation loop variables and explicitly tagged
for stable backport - **Similar Commit #4** (NO): Same type of
DST_Y_PREFETCH guard fix but wasn't tagged for stable The pattern shows
that prefetch calculation fixes that prevent register overflow and have
clear user impact are good backport candidates. ### **Hardware Context**
This affects DCN 3.0, 3.1, and 3.14 hardware generations. The fact that
DCN 2.1 already had this fix suggests this was a systematic issue across
multiple hardware generations that needed addressing. The fix is
essential for proper display functionality and represents a critical
hardware constraint that must be enforced to prevent register overflow
and subsequent display corruption.
drivers/gpu/drm/amd/display/dc/dml/dcn30/display_mode_vba_30.c | 1 +
drivers/gpu/drm/amd/display/dc/dml/dcn31/display_mode_vba_31.c | 1 +
drivers/gpu/drm/amd/display/dc/dml/dcn314/display_mode_vba_314.c | 1 +
3 files changed, 3 insertions(+)
diff --git a/drivers/gpu/drm/amd/display/dc/dml/dcn30/display_mode_vba_30.c b/drivers/gpu/drm/amd/display/dc/dml/dcn30/display_mode_vba_30.c
index 72ffa1abebaa1..d799ae7a9048d 100644
--- a/drivers/gpu/drm/amd/display/dc/dml/dcn30/display_mode_vba_30.c
+++ b/drivers/gpu/drm/amd/display/dc/dml/dcn30/display_mode_vba_30.c
@@ -1002,6 +1002,7 @@ static bool CalculatePrefetchSchedule(
dst_y_prefetch_equ = VStartup - (Tsetup + dml_max(TWait + TCalc, *Tdmdl)) / LineTime
- (*DSTYAfterScaler + *DSTXAfterScaler / myPipe->HTotal);
+ dst_y_prefetch_equ = dml_min(dst_y_prefetch_equ, 63.75); // limit to the reg limit of U6.2 for DST_Y_PREFETCH
Lsw_oto = dml_max(PrefetchSourceLinesY, PrefetchSourceLinesC);
Tsw_oto = Lsw_oto * LineTime;
diff --git a/drivers/gpu/drm/amd/display/dc/dml/dcn31/display_mode_vba_31.c b/drivers/gpu/drm/amd/display/dc/dml/dcn31/display_mode_vba_31.c
index adea459e7d363..338a295c464dc 100644
--- a/drivers/gpu/drm/amd/display/dc/dml/dcn31/display_mode_vba_31.c
+++ b/drivers/gpu/drm/amd/display/dc/dml/dcn31/display_mode_vba_31.c
@@ -1105,6 +1105,7 @@ static bool CalculatePrefetchSchedule(
Tr0_oto_lines = dml_ceil(4.0 * Tr0_oto / LineTime, 1) / 4.0;
dst_y_prefetch_oto = Tvm_oto_lines + 2 * Tr0_oto_lines + Lsw_oto;
dst_y_prefetch_equ = VStartup - (*TSetup + dml_max(TWait + TCalc, *Tdmdl)) / LineTime - (*DSTYAfterScaler + *DSTXAfterScaler / myPipe->HTotal);
+ dst_y_prefetch_equ = dml_min(dst_y_prefetch_equ, 63.75); // limit to the reg limit of U6.2 for DST_Y_PREFETCH
dst_y_prefetch_equ = dml_floor(4.0 * (dst_y_prefetch_equ + 0.125), 1) / 4.0;
Tpre_rounded = dst_y_prefetch_equ * LineTime;
diff --git a/drivers/gpu/drm/amd/display/dc/dml/dcn314/display_mode_vba_314.c b/drivers/gpu/drm/amd/display/dc/dml/dcn314/display_mode_vba_314.c
index a94aa0f21a7f4..06cb11bc60905 100644
--- a/drivers/gpu/drm/amd/display/dc/dml/dcn314/display_mode_vba_314.c
+++ b/drivers/gpu/drm/amd/display/dc/dml/dcn314/display_mode_vba_314.c
@@ -1126,6 +1126,7 @@ static bool CalculatePrefetchSchedule(
Tr0_oto_lines = dml_ceil(4.0 * Tr0_oto / LineTime, 1) / 4.0;
dst_y_prefetch_oto = Tvm_oto_lines + 2 * Tr0_oto_lines + Lsw_oto;
dst_y_prefetch_equ = VStartup - (*TSetup + dml_max(TWait + TCalc, *Tdmdl)) / LineTime - (*DSTYAfterScaler + *DSTXAfterScaler / myPipe->HTotal);
+ dst_y_prefetch_equ = dml_min(dst_y_prefetch_equ, 63.75); // limit to the reg limit of U6.2 for DST_Y_PREFETCH
dst_y_prefetch_equ = dml_floor(4.0 * (dst_y_prefetch_equ + 0.125), 1) / 4.0;
Tpre_rounded = dst_y_prefetch_equ * LineTime;
--
2.39.5
next prev parent reply other threads:[~2025-06-01 23:39 UTC|newest]
Thread overview: 66+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-01 23:36 [PATCH AUTOSEL 6.6 01/66] drm/amdgpu/gfx6: fix CSIB handling Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 02/66] media: imx-jpeg: Check decoding is ongoing for motion-jpeg Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 03/66] drm/dp: add option to disable zero sized address only transactions Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 04/66] sunrpc: update nextcheck time when adding new cache entries Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 05/66] drm/amd/display: DCN32 null data check Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 06/66] drm/bridge: analogix_dp: Add irq flag IRQF_NO_AUTOEN instead of calling disable_irq() Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 07/66] workqueue: Fix race condition in wq->stats incrementation Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 08/66] exfat: fix double free in delayed_free Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 09/66] arm64/cpuinfo: only show one cpu's info in c_show() Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 10/66] drm/bridge: anx7625: change the gpiod_set_value API Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 11/66] drm/amdgpu/gfx11: fix CSIB handling Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 12/66] media: i2c: imx334: Enable runtime PM before sub-device registration Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 13/66] drm/msm/hdmi: add runtime PM calls to DDC transfer function Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 14/66] media: uapi: v4l: Fix V4L2_TYPE_IS_OUTPUT condition Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 15/66] drm/amd/display: Add NULL pointer checks in dm_force_atomic_commit() Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 16/66] media: verisilicon: Enable wide 4K in AV1 decoder Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 17/66] drm/amd/display: Skip to enable dsc if it has been off Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 18/66] drm/msm/a6xx: Increase HFI response timeout Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 19/66] drm/amd/display: Do Not Consider DSC if Valid Config Not Found Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 20/66] media: i2c: imx334: Fix runtime PM handling in remove function Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 21/66] drm/amdgpu/gfx10: fix CSIB handling Sasha Levin
2025-06-01 23:36 ` [PATCH AUTOSEL 6.6 22/66] drm: panel-orientation-quirks: Add ZOTAC Gaming Zone Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 23/66] media: ccs-pll: Better validate VT PLL branch Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 24/66] media: uapi: v4l: Change V4L2_TYPE_IS_CAPTURE condition Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 25/66] drm/amdgpu/gfx7: fix CSIB handling Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 26/66] ext4: ext4: unify EXT4_EX_NOCACHE|NOFAIL flags in ext4_ext_remove_space() Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 27/66] jfs: fix array-index-out-of-bounds read in add_missing_indices Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 28/66] media: ti: cal: Fix wrong goto on error path Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 29/66] media: rkvdec: h264: Use bytesperline and buffer height as virstride Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 30/66] media: rkvdec: Initialize the m2m context before the controls Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 31/66] sunrpc: fix race in cache cleanup causing stale nextcheck time Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 32/66] ext4: prevent stale extent cache entries caused by concurrent get es_cache Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 33/66] drm/amdgpu/gfx8: fix CSIB handling Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 34/66] drm/amdgpu/gfx9: " Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 35/66] jfs: Fix null-ptr-deref in jfs_ioc_trim Sasha Levin
2025-06-01 23:37 ` Sasha Levin [this message]
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 37/66] drm/msm/dpu: don't select single flush for active CTL blocks Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 38/66] drm/amdkfd: Set SDMA_RLCx_IB_CNTL/SWITCH_INSIDE_IB Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 39/66] media: tc358743: ignore video while HPD is low Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 40/66] media: platform: exynos4-is: Add hardware sync wait to fimc_is_hw_change_mode() Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 41/66] media: i2c: imx334: update mode_3840x2160_regs array Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 42/66] nios2: force update_mmu_cache on spurious tlb-permission--related pagefaults Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 43/66] media: rcar-vin: Fix stride setting for RAW8 formats Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 44/66] media: qcom: venus: Fix uninitialized variable warning Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 45/66] ACPI: bus: Bail out if acpi_kobj registration fails Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 46/66] pmdomain: ti: Fix STANDBY handling of PER power domain Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 47/66] PM: runtime: fix denying of auto suspend in pm_suspend_timer_fn() Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 48/66] ASoC: amd: yc: Add quirk for Lenovo Yoga Pro 7 14ASP9 Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 49/66] thermal/drivers/qcom/tsens: Update conditions to strictly evaluate for IP v2+ Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 50/66] clocksource/drivers/timer-tegra186: Fix watchdog self-pinging Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 51/66] gpio: pxa: Make irq_chip immutable Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 52/66] gpio: grgpio: " Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 53/66] gpio: xgene-sb: " Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 54/66] genirq: Retain disable depth for managed interrupts across CPU hotplug Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 55/66] mmc: sdhci-esdhc-imx: Save tuning value when card stays powered in suspend Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 56/66] mmc: Add quirk to disable DDR50 tuning Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 57/66] clocksource: Fix the CPUs' choice in the watchdog per CPU verification Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 58/66] ACPICA: Avoid sequence overread in call to strncmp() Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 59/66] ACPICA: utilities: Fix overflow check in vsnprintf() Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 60/66] ACPI: EC: Add device to acpi_ec_no_wakeup[] qurik list Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 61/66] ALSA: seq: Remove unused snd_seq_queue_client_leave_cells Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 62/66] cpufreq: Force sync policy boost with global boost on sysfs update Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 63/66] power: supply: bq27xxx: Retrieve again when busy Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 64/66] tools/nolibc: use intmax definitions from compiler Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 65/66] gpio: ds4520: don't check the 'ngpios' property in the driver Sasha Levin
2025-06-01 23:37 ` [PATCH AUTOSEL 6.6 66/66] ASoC: tas2770: Power cycle amp on ISENSE/VSENSE change Sasha Levin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250601233744.3514795-36-sashal@kernel.org \
--to=sashal@kernel.org \
--cc=airlied@gmail.com \
--cc=alex.hung@amd.com \
--cc=alexander.deucher@amd.com \
--cc=amd-gfx@lists.freedesktop.org \
--cc=austin.zheng@amd.com \
--cc=christian.koenig@amd.com \
--cc=dri-devel@lists.freedesktop.org \
--cc=hamzamahfooz@linux.microsoft.com \
--cc=harry.wentland@amd.com \
--cc=ivlipski@amd.com \
--cc=jun.lei@amd.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mark.broadworth@amd.com \
--cc=nicholas.kazlauskas@amd.com \
--cc=patches@lists.linux.dev \
--cc=rodrigo.siqueira@amd.com \
--cc=simona@ffwll.ch \
--cc=stable@vger.kernel.org \
--cc=sunpeng.li@amd.com \
--cc=tungyu.lu@amd.com \
--cc=v.shevtsov@mt-integration.ru \
--cc=zaeem.mohamed@amd.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox