From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 82192D1951F for ; Mon, 26 Jan 2026 21:41:02 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id EE48510E265; Mon, 26 Jan 2026 21:41:01 +0000 (UTC) Authentication-Results: gabe.freedesktop.org; dkim=pass (2048-bit key; unprotected) header.d=kernel.org header.i=@kernel.org header.b="Uvh5MhqY"; dkim-atps=neutral Received: from tor.source.kernel.org (tor.source.kernel.org [172.105.4.254]) by gabe.freedesktop.org (Postfix) with ESMTPS id 145AA10E265 for ; Mon, 26 Jan 2026 21:41:01 +0000 (UTC) Received: from smtp.kernel.org (transwarp.subspace.kernel.org [100.75.92.58]) by tor.source.kernel.org (Postfix) with ESMTP id 1B52D60051 for ; Mon, 26 Jan 2026 21:41:00 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPS id C61BEC19422 for ; Mon, 26 Jan 2026 21:40:59 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1769463659; bh=6tf9knWeOV04XomUGTKDYNXE08AEtmErODAiYhZLgQw=; h=From:To:Subject:Date:From; b=Uvh5MhqYmieJpD8G70yIvxszcG4NEwQUG8TF4jMNtF2G02h0sv6QXmSj2PvwdnNUK 1Z3eRPAPbWbSciPzY3ihwFYnu8dkvpGrQft8ukZ4zBjUX0WyIFYbrlqW+o5E8CjKlw sFW7BjafplUztHPW/vt3UaCZfwlfGEp2Ox36W7t3CTr4msW1xgVy7P6lvW221ykoK3 1uK1uUHQULV0yKPhbyPuTroF6pfaTU2bPLBEni5rww+s9sw6SCbsALvxGbg0M5XPG4 hYx2WkMrDo6zB+yLFVD2My0ThXl+oTL8KNi5yVqme0T2GZEIi470kfyUTHtR28Ij8W VW7Oj7xXUCPGA== Received: by aws-us-west-2-korg-bugzilla-1.web.codeaurora.org (Postfix, from userid 48) id C10DAC41612; Mon, 26 Jan 2026 21:40:59 +0000 (UTC) From: bugzilla-daemon@kernel.org To: dri-devel@lists.freedesktop.org Subject: [Bug 221012] New: GPU page fault on AMD RX 7600 XT after commit bf2084a7b1d75d093b6a79df4c10142d49fbaa0e Date: Mon, 26 Jan 2026 21:40:59 +0000 X-Bugzilla-Reason: None X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: AssignedTo drivers_video-dri@kernel-bugs.osdl.org X-Bugzilla-Product: Drivers X-Bugzilla-Component: Video(DRI - non Intel) X-Bugzilla-Version: 2.5 X-Bugzilla-Keywords: X-Bugzilla-Severity: high X-Bugzilla-Who: geschw@pm.me X-Bugzilla-Status: NEW X-Bugzilla-Resolution: X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: drivers_video-dri@kernel-bugs.osdl.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version rep_platform op_sys bug_status bug_severity priority component assigned_to reporter cf_regression attachments.created Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugzilla.kernel.org/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: dri-devel@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Direct Rendering Infrastructure - Development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" https://bugzilla.kernel.org/show_bug.cgi?id=3D221012 Bug ID: 221012 Summary: GPU page fault on AMD RX 7600 XT after commit bf2084a7b1d75d093b6a79df4c10142d49fbaa0e Product: Drivers Version: 2.5 Hardware: AMD OS: Linux Status: NEW Severity: high Priority: P3 Component: Video(DRI - non Intel) Assignee: drivers_video-dri@kernel-bugs.osdl.org Reporter: geschw@pm.me Regression: No Created attachment 309237 --> https://bugzilla.kernel.org/attachment.cgi?id=3D309237&action=3Dedit dmesg, lspci GPU page fault occurs when running HIP/ROCm workloads on the AMD Radeon RX = 7600 XT with kernel version 6.18.2. This issue is introduced by commit bf2084a7b1d75d093b6a79df4c10142d49fbaa0e, which modifies the alignment logic for split SVM ranges and introduces the use of huge pages. The error can be resolved by reverting this commit. The fault is reproducible under consistent workloads, and it prevents successful execution of HIP-based tasks, such as PyTorch model training, on this GPU. The error manifests as permission faults in the GPU driver logs. --=20 You may reply to this email to add a comment. You are receiving this mail because: You are watching the assignee of the bug.=