dri-devel.lists.freedesktop.org archive mirror
* [Bug 218900] New: amdgpu: Fatal error during GPU init
@ 2024-05-27 14:52 bugzilla-daemon
  2024-05-27 15:08 ` [Bug 218900] " bugzilla-daemon
                   ` (26 more replies)
  0 siblings, 27 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-27 14:52 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

            Bug ID: 218900
           Summary: amdgpu: Fatal error during GPU init
           Product: Drivers
           Version: 2.5
          Hardware: AMD
                OS: Linux
            Status: NEW
          Severity: blocking
          Priority: P3
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: jean-christophe@guillain.net
        Regression: No

Hello !

After trying the new kernel RC today (6.10.0-rc1), I no longer have video output.
With 6.9.1 it works.

Lenovo ThinkCentre M715q

00:01.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Wani
[Radeon R5/R6/R7 Graphics] (rev e4)


In the journal, I have multiple entries like this one :
May 27 14:24:22 youpi kernel: iommu ivhd0: AMD-Vi: Event logged
[ILLEGAL_DEV_TABLE_ENTRY device=0000:00:01.0 pasid=0x00000 address=0x102e89980
flags=0x0080]
May 27 14:24:22 youpi kernel: AMD-Vi: DTE[0]: 7190000000000003
May 27 14:24:22 youpi kernel: AMD-Vi: DTE[1]: 00001001034f0002
May 27 14:24:22 youpi kernel: AMD-Vi: DTE[2]: 200000010022a013
May 27 14:24:22 youpi kernel: AMD-Vi: DTE[3]: 0000000000000000
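For readers decoding the dump above, here is a minimal sketch that turns the DTE[0] word into named flags. The bit positions (V=0, TV=1, PPR=52, GV=55, IR=61, IW=62) are assumptions based on the device table entry layout in the AMD IOMMU specification and should be checked against it; `decode_dte0` and `DTE0_FLAGS` are hypothetical names, not part of any existing tool.

```python
# Assumed bit positions within DTE[0]; verify against the AMD IOMMU spec.
DTE0_FLAGS = {
    "V": 0,     # entry valid
    "TV": 1,    # translation information valid
    "PPR": 52,  # peripheral page request enable
    "GV": 55,   # guest translation valid
    "IR": 61,   # I/O read permission
    "IW": 62,   # I/O write permission
}


def decode_dte0(word_hex):
    """Return the names of the assumed flags that are set in a DTE[0] hex string."""
    word = int(word_hex, 16)
    return {name for name, bit in DTE0_FLAGS.items() if (word >> bit) & 1}


# Value copied from the "DTE[0]" journal line above.
print(sorted(decode_dte0("7190000000000003")))
```

Under these assumptions, the logged value has the PPR flag set alongside the valid and permission bits.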



Then, multiple entries like that one :
May 27 14:24:22 youpi kernel: amdgpu 0000:00:01.0: [drm:amdgpu_ring_test_helper
[amdgpu]] *ERROR* ring gfx test failed (-110)
May 27 14:24:22 youpi kernel: [drm:amdgpu_device_init [amdgpu]] *ERROR* hw_init
of IP block <gfx_v8_0> failed -110
May 27 14:24:22 youpi kernel: amdgpu 0000:00:01.0: amdgpu:
amdgpu_device_ip_init failed
May 27 14:24:22 youpi kernel: amdgpu 0000:00:01.0: amdgpu: Fatal error during
GPU init
May 27 14:24:22 youpi kernel: amdgpu 0000:00:01.0: amdgpu: amdgpu: finishing
device.
May 27 14:24:22 youpi kernel: ------------[ cut here ]------------
May 27 14:24:22 youpi kernel: WARNING: CPU: 0 PID: 179 at
drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x45/0x70 [amdgpu]
May 27 14:24:22 youpi kernel: Modules linked in: sd_mod usbhid uas hid
usb_storage amdgpu(+) amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit
drm_suballoc_helper drm>
May 27 14:24:22 youpi kernel: CPU: 0 PID: 179 Comm: (udev-worker) Not tainted
6.10.0-rc1-jcg #1
May 27 14:24:22 youpi kernel: Hardware name: LENOVO 10VGS02P00/3130, BIOS
M1XKT57A 02/10/2022
May 27 14:24:22 youpi kernel: RIP: 0010:amdgpu_irq_put+0x45/0x70 [amdgpu]
May 27 14:24:22 youpi kernel: Code: 48 8b 4e 10 48 83 39 00 74 2c 89 d1 48 8d
04 88 8b 08 85 c9 74 14 f0 ff 08 b8 00 00 00 00 74 05 e9 80 d8 a3 fc e9 6b fd
ff ff <0f>
May 27 14:24:22 youpi kernel: RSP: 0018:ffffbc9c80813a48 EFLAGS: 00010246
May 27 14:24:22 youpi kernel: RAX: ffff985ad74e3780 RBX: ffff985a82f18878 RCX:
0000000000000000
May 27 14:24:22 youpi kernel: RDX: 0000000000000000 RSI: ffff985a82f254b8 RDI:
ffff985a82f00000
May 27 14:24:22 youpi kernel: RBP: ffff985a82f10208 R08: 0000000000000000 R09:
0000000000000003
May 27 14:24:22 youpi kernel: R10: ffffbc9c80813880 R11: ffffffffbdec7828 R12:
ffff985a82f105e8
May 27 14:24:22 youpi kernel: R13: ffff985a82f00010 R14: ffff985a82f00000 R15:
ffff985a82f254b8
May 27 14:24:22 youpi kernel: FS:  00007f18ca0058c0(0000)
GS:ffff985b57600000(0000) knlGS:0000000000000000
May 27 14:24:22 youpi kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 27 14:24:22 youpi kernel: CR2: 00005563a55b3a68 CR3: 000000010f8bc000 CR4:
00000000001506f0
May 27 14:24:22 youpi kernel: Call Trace:
May 27 14:24:22 youpi kernel:  <TASK>
May 27 14:24:22 youpi kernel:  ? __warn+0x7c/0x120
May 27 14:24:22 youpi kernel:  ? amdgpu_irq_put+0x45/0x70 [amdgpu]
May 27 14:24:22 youpi kernel:  ? report_bug+0x155/0x170
May 27 14:24:22 youpi kernel:  ? handle_bug+0x3f/0x80
May 27 14:24:22 youpi kernel:  ? exc_invalid_op+0x13/0x60
May 27 14:24:22 youpi kernel:  ? asm_exc_invalid_op+0x16/0x20
May 27 14:24:22 youpi kernel:  ? amdgpu_irq_put+0x45/0x70 [amdgpu]
May 27 14:24:22 youpi kernel:  amdgpu_fence_driver_hw_fini+0xfa/0x130 [amdgpu]
May 27 14:24:22 youpi kernel:  amdgpu_device_fini_hw+0xa2/0x3f0 [amdgpu]
May 27 14:24:22 youpi kernel:  amdgpu_driver_load_kms+0x79/0xb0 [amdgpu]
May 27 14:24:22 youpi kernel:  amdgpu_pci_probe+0x182/0x4f0 [amdgpu]
May 27 14:24:22 youpi kernel:  local_pci_probe+0x41/0x90
May 27 14:24:22 youpi kernel:  pci_device_probe+0xbb/0x1e0
May 27 14:24:22 youpi kernel:  really_probe+0xd6/0x390
May 27 14:24:22 youpi kernel:  ? __pfx___driver_attach+0x10/0x10
May 27 14:24:22 youpi kernel:  __driver_probe_device+0x78/0x150
May 27 14:24:22 youpi kernel:  driver_probe_device+0x1f/0x90
May 27 14:24:22 youpi kernel:  __driver_attach+0xce/0x1c0
May 27 14:24:22 youpi kernel:  bus_for_each_dev+0x84/0xd0
May 27 14:24:22 youpi kernel:  bus_add_driver+0x10e/0x240
May 27 14:24:22 youpi kernel:  driver_register+0x55/0x100
May 27 14:24:22 youpi kernel:  ? __pfx_amdgpu_init+0x10/0x10 [amdgpu]
May 27 14:24:22 youpi kernel:  do_one_initcall+0x57/0x320
May 27 14:24:22 youpi kernel:  do_init_module+0x60/0x230
May 27 14:24:22 youpi kernel:  init_module_from_file+0x86/0xc0
May 27 14:24:22 youpi kernel:  idempotent_init_module+0x11b/0x2b0
May 27 14:24:22 youpi kernel:  __x64_sys_finit_module+0x5a/0xb0
May 27 14:24:22 youpi kernel:  do_syscall_64+0x7e/0x190
May 27 14:24:22 youpi kernel:  ? ksys_mmap_pgoff+0x14e/0x1f0
May 27 14:24:22 youpi kernel:  ? syscall_exit_to_user_mode+0x71/0x1e0
May 27 14:24:22 youpi kernel:  ? do_syscall_64+0x8a/0x190
May 27 14:24:22 youpi kernel:  ? do_syscall_64+0x8a/0x190
May 27 14:24:22 youpi kernel:  ? do_syscall_64+0x8a/0x190
May 27 14:24:22 youpi kernel:  ? __irq_exit_rcu+0x38/0xb0
May 27 14:24:22 youpi kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
May 27 14:24:22 youpi kernel: RIP: 0033:0x7f18c9e79719
May 27 14:24:22 youpi kernel: Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00
00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08
0f 05 <48>
May 27 14:24:22 youpi kernel: RSP: 002b:00007ffd56f52208 EFLAGS: 00000246
ORIG_RAX: 0000000000000139
May 27 14:24:22 youpi kernel: RAX: ffffffffffffffda RBX: 00005563a558e400 RCX:
00007f18c9e79719
May 27 14:24:22 youpi kernel: RDX: 0000000000000000 RSI: 00007f18ca01defd RDI:
0000000000000015
May 27 14:24:22 youpi kernel: RBP: 00007f18ca01defd R08: 0000000000000000 R09:
00005563a55902b0
May 27 14:24:22 youpi kernel: R10: 0000000000000015 R11: 0000000000000246 R12:
0000000000020000
May 27 14:24:22 youpi kernel: R13: 0000000000000000 R14: 00005563a5591f30 R15:
000055638158bec1
May 27 14:24:22 youpi kernel:  </TASK>
May 27 14:24:22 youpi kernel: ---[ end trace 0000000000000000 ]---
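The -110 in the "ring gfx test failed" and "hw_init ... failed" lines above is a negated Linux errno value. A small sketch of the mapping follows; the table is a hand-picked subset using the kernel's generic errno numbering, and `kernel_rc_name` is a hypothetical helper.

```python
# Subset of Linux errno values (generic numbering), enough for this log.
LINUX_ERRNO = {12: "ENOMEM", 16: "EBUSY", 22: "EINVAL", 110: "ETIMEDOUT"}


def kernel_rc_name(rc):
    """Map a negative kernel return code such as -110 to its errno name."""
    return LINUX_ERRNO.get(-rc, f"unknown ({rc})")


print(kernel_rc_name(-110))  # ETIMEDOUT
```

So the gfx ring test timed out rather than failing with an explicit hardware error code.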

I suspect this commit :
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c?id=db5d28c0bfe566908719bec8e25443aabecbb802

Let me know if you need more information.

Cheers,
jC

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
@ 2024-05-27 15:08 ` bugzilla-daemon
  2024-05-27 15:10 ` bugzilla-daemon
                   ` (25 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-27 15:08 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #1 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
Created attachment 306354
  --> https://bugzilla.kernel.org/attachment.cgi?id=306354&action=edit
Full logs of the boot.

I added the full log of the boot process showing all the errors.


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
  2024-05-27 15:08 ` [Bug 218900] " bugzilla-daemon
@ 2024-05-27 15:10 ` bugzilla-daemon
  2024-05-27 15:22 ` bugzilla-daemon
                   ` (24 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-27 15:10 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Jean-Christophe Guillain (jean-christophe@guillain.net) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Bisected commit-id|                            |db5d28c0bfe566908719bec8e25
                   |                            |443aabecbb802
     Kernel Version|                            |6.10.0-rc1
         Regression|No                          |Yes


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
  2024-05-27 15:08 ` [Bug 218900] " bugzilla-daemon
  2024-05-27 15:10 ` bugzilla-daemon
@ 2024-05-27 15:22 ` bugzilla-daemon
  2024-05-27 15:46 ` bugzilla-daemon
                   ` (23 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-27 15:22 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Alex Deucher (alexdeucher@gmail.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |alexdeucher@gmail.com

--- Comment #2 from Alex Deucher (alexdeucher@gmail.com) ---
Can you bisect?  https://docs.kernel.org/admin-guide/bug-bisect.html


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (2 preceding siblings ...)
  2024-05-27 15:22 ` bugzilla-daemon
@ 2024-05-27 15:46 ` bugzilla-daemon
  2024-05-28 15:45 ` bugzilla-daemon
                   ` (22 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-27 15:46 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #3 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
Bisecting: 5720 revisions left to test after this (roughly 13 steps)

I'll try, but it will take some time. My machine is not very powerful...
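The "roughly 13 steps" estimate above follows from bisection halving the candidate range on every test. The sketch below is a simplified model of that arithmetic, not the exact formula git uses internally; `bisect_steps` is a hypothetical helper.

```python
import math


def bisect_steps(revisions):
    """Upper bound on tests needed to isolate one commit by repeated halving."""
    return math.ceil(math.log2(revisions))


print(bisect_steps(5720))  # 13
```

With one kernel build per step, that is about 13 builds on the reporter's machine.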


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (3 preceding siblings ...)
  2024-05-27 15:46 ` bugzilla-daemon
@ 2024-05-28 15:45 ` bugzilla-daemon
  2024-05-28 18:01 ` bugzilla-daemon
                   ` (21 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-28 15:45 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Jean-Christophe Guillain (jean-christophe@guillain.net) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Bisected commit-id|db5d28c0bfe566908719bec8e25 |
                   |443aabecbb802               |


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (4 preceding siblings ...)
  2024-05-28 15:45 ` bugzilla-daemon
@ 2024-05-28 18:01 ` bugzilla-daemon
  2024-05-29  6:15 ` bugzilla-daemon
                   ` (20 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-28 18:01 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Mario Limonciello (AMD) (mario.limonciello@amd.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |mario.limonciello@amd.com

--- Comment #4 from Mario Limonciello (AMD) (mario.limonciello@amd.com) ---
Possibly the same as this report:

https://lore.kernel.org/all/20240527192159.GEZlTdV7OoOuJrHmI0@fat_crate.local/


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (5 preceding siblings ...)
  2024-05-28 18:01 ` bugzilla-daemon
@ 2024-05-29  6:15 ` bugzilla-daemon
  2024-05-29  6:16 ` bugzilla-daemon
                   ` (19 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29  6:15 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Vasant Hegde (vasant.hegde@amd.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |vasant.hegde@amd.com

--- Comment #5 from Vasant Hegde (vasant.hegde@amd.com) ---
Created attachment 306364
  --> https://bugzilla.kernel.org/attachment.cgi?id=306364&action=edit
Check Enhanced PPR support before enabling PPR


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (6 preceding siblings ...)
  2024-05-29  6:15 ` bugzilla-daemon
@ 2024-05-29  6:16 ` bugzilla-daemon
  2024-05-29  7:11 ` bugzilla-daemon
                   ` (18 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29  6:16 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #6 from Vasant Hegde (vasant.hegde@amd.com) ---
Hi,

The attached patch should fix this issue. Can you please test it?

I will send a proper patch to the mailing list soon.

-Vasant


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (7 preceding siblings ...)
  2024-05-29  6:16 ` bugzilla-daemon
@ 2024-05-29  7:11 ` bugzilla-daemon
  2024-05-29  7:42 ` bugzilla-daemon
                   ` (17 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29  7:11 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #7 from Vasant Hegde (vasant.hegde@amd.com) ---
Also, can you please attach the full dmesg? I want to see the IOMMU feature
list and confirm that what I am doing is right.

-Vasant


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (8 preceding siblings ...)
  2024-05-29  7:11 ` bugzilla-daemon
@ 2024-05-29  7:42 ` bugzilla-daemon
  2024-05-29 10:41 ` bugzilla-daemon
                   ` (16 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29  7:42 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #8 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
Hi,

I plan to finish the bisection today, and I'll test your patch.

jC


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (9 preceding siblings ...)
  2024-05-29  7:42 ` bugzilla-daemon
@ 2024-05-29 10:41 ` bugzilla-daemon
  2024-05-29 11:39 ` bugzilla-daemon
                   ` (15 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 10:41 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #9 from Vasant Hegde (vasant.hegde@amd.com) ---
(In reply to Jean-Christophe Guillain from comment #8)
> Hi,
> 
> I plan to finish the bisection today, and I'll test your patch.
> 

You mean bisecting for this issue? If so, we already know the culprit commit.
The issue happens because the IOMMU driver tried to enable the PPR bit in the
DTE without first checking for Enhanced PPR support in the EFR register.
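The explanation above amounts to guarding one bit behind a capability check. The following is only a sketch of that idea in Python, not the attached patch or the kernel code: `dte0_with_ppr` is a hypothetical helper, the PPR position (bit 52) is an assumption to verify against the AMD IOMMU specification, and the EFR capability mask is deliberately passed in by the caller because its real bit position is not stated in this thread.

```python
PPR_BIT = 1 << 52  # assumed DTE[0] PPR position; check the AMD IOMMU spec


def dte0_with_ppr(dte0, efr, eppr_mask, want_ppr):
    """Set PPR in DTE[0] only if requested AND the EFR advertises support."""
    if want_ppr and (efr & eppr_mask):
        return dte0 | PPR_BIT
    # Without the capability, the bit must stay clear; setting it is what
    # the hardware reports as an illegal device table entry.
    return dte0 & ~PPR_BIT


# Placeholder capability mask, for illustration only.
EXAMPLE_EPPR_MASK = 1 << 50
print(hex(dte0_with_ppr(0x3, efr=0, eppr_mask=EXAMPLE_EPPR_MASK, want_ppr=True)))
```

On hardware whose EFR lacks the capability, the helper leaves the entry unchanged even when the caller asks for PPR.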



-Vasant


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (10 preceding siblings ...)
  2024-05-29 10:41 ` bugzilla-daemon
@ 2024-05-29 11:39 ` bugzilla-daemon
  2024-05-29 12:41 ` bugzilla-daemon
                   ` (14 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 11:39 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Mario Limonciello (AMD) (mario.limonciello@amd.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Attachment #306364|0                           |1
           is patch|                            |


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (11 preceding siblings ...)
  2024-05-29 11:39 ` bugzilla-daemon
@ 2024-05-29 12:41 ` bugzilla-daemon
  2024-05-29 12:43 ` bugzilla-daemon
                   ` (13 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 12:41 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #10 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
I applied your patch to the 6.10.0-rc1 kernel, and I confirm that it fixes this
bug.

Thank you very much !

jC

(full dmesg attached)


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (12 preceding siblings ...)
  2024-05-29 12:41 ` bugzilla-daemon
@ 2024-05-29 12:43 ` bugzilla-daemon
  2024-05-29 16:14 ` bugzilla-daemon
                   ` (12 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 12:43 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #11 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
Created attachment 306367
  --> https://bugzilla.kernel.org/attachment.cgi?id=306367&action=edit
Full dmesg after applying Vasant's patch


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (13 preceding siblings ...)
  2024-05-29 12:43 ` bugzilla-daemon
@ 2024-05-29 16:14 ` bugzilla-daemon
  2024-05-29 16:17 ` bugzilla-daemon
                   ` (11 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 16:14 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Jean-Christophe Guillain (jean-christophe@guillain.net) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
 Bisected commit-id|                            |c4cb23111103a841c2df3005859
                   |                            |7398443bcad5f


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (14 preceding siblings ...)
  2024-05-29 16:14 ` bugzilla-daemon
@ 2024-05-29 16:17 ` bugzilla-daemon
  2024-05-29 21:26 ` bugzilla-daemon
                   ` (10 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 16:17 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #12 from Jean-Christophe Guillain (jean-christophe@guillain.net) ---
(I still finished my bisection, and as you said,
c4cb23111103a841c2df30058597398443bcad5f is the first bad commit.)


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (15 preceding siblings ...)
  2024-05-29 16:17 ` bugzilla-daemon
@ 2024-05-29 21:26 ` bugzilla-daemon
  2024-05-30  6:20 ` bugzilla-daemon
                   ` (9 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-29 21:26 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Artem S. Tashkinov (aros@gmx.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |CODE_FIX


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (16 preceding siblings ...)
  2024-05-29 21:26 ` bugzilla-daemon
@ 2024-05-30  6:20 ` bugzilla-daemon
  2024-06-06 12:26 ` bugzilla-daemon
                   ` (8 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-05-30  6:20 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #13 from Vasant Hegde (vasant.hegde@amd.com) ---
Thanks Jean for testing. I will send patch with your Tested-by today.

-Vasant


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (17 preceding siblings ...)
  2024-05-30  6:20 ` bugzilla-daemon
@ 2024-06-06 12:26 ` bugzilla-daemon
  2024-06-07 15:18 ` bugzilla-daemon
                   ` (7 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-06 12:26 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Hanabishi (i.r.e.c.c.a.k.u.n+bugzilla.kernel.org@gmail.com) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |i.r.e.c.c.a.k.u.n+bugzilla.
                   |                            |kernel.org@gmail.com

--- Comment #14 from Hanabishi (i.r.e.c.c.a.k.u.n+bugzilla.kernel.org@gmail.com) ---
*** Bug 218921 has been marked as a duplicate of this bug. ***


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (18 preceding siblings ...)
  2024-06-06 12:26 ` bugzilla-daemon
@ 2024-06-07 15:18 ` bugzilla-daemon
  2024-06-10 15:02 ` bugzilla-daemon
                   ` (6 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-07 15:18 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #15 from Hanabishi (i.r.e.c.c.a.k.u.n+bugzilla.kernel.org@gmail.com) ---
(In reply to Vasant Hegde from comment #5)
> Created attachment 306364 [details]
> Check Enhanced PPR support before enabling PPR

I applied your patch on top of rc2 and also confirm that it works.
Thank you.


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (19 preceding siblings ...)
  2024-06-07 15:18 ` bugzilla-daemon
@ 2024-06-10 15:02 ` bugzilla-daemon
  2024-06-25 17:10 ` bugzilla-daemon
                   ` (5 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-10 15:02 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #16 from Vasant Hegde (vasant.hegde@amd.com) ---
(In reply to Hanabishi from comment #15)
> (In reply to Vasant Hegde from comment #5)
> > Created attachment 306364 [details]
> > Check Enhanced PPR support before enabling PPR
> 
> I applied your patch on top of rc2 and also confirm that it works.
> Thank you.

Thanks Hanabishi for testing.

FYI. Patches merged into -rc3.

-Vasant


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (20 preceding siblings ...)
  2024-06-10 15:02 ` bugzilla-daemon
@ 2024-06-25 17:10 ` bugzilla-daemon
  2024-06-25 17:11 ` bugzilla-daemon
                   ` (4 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-25 17:10 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

Jean-Denis Girard (jd.girard@sysnux.pf) changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jd.girard@sysnux.pf

--- Comment #17 from Jean-Denis Girard (jd.girard@sysnux.pf) ---
I seem to have a similar problem on 6.10-rc5 after suspend. I get a black
screen on resume.

[  269.157149] amdgpu 0000:02:00.0: amdgpu: reserve 0x400000 from 0xf41f800000
for PSP TMR
[  269.159956] iommu ivhd0: AMD-Vi: Event logged [ILLEGAL_DEV_TABLE_ENTRY
device=0000:02:00.0 pasid=0x00000 address=0x131400000 flags=0x0180]
[  269.159960] AMD-Vi: DTE[0]: 6190000000000003
[  269.159962] AMD-Vi: DTE[1]: 00001001049e000b
[  269.159963] AMD-Vi: DTE[2]: 200000013c610013
[  269.159963] AMD-Vi: DTE[3]: 0000000000000000
[  269.160104] amdgpu 0000:02:00.0: amdgpu: failed to load ucode SDMA0(0x1) 
[  269.160108] amdgpu 0000:02:00.0: amdgpu: psp gfx command LOAD_IP_FW(0x6)
failed and response status is (0xF)
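When comparing reports like the one above with the original, it can help to pull the event fields out of the log mechanically. A minimal sketch follows, assuming the event line has the field order shown in this thread; `parse_event` and `EVENT_RE` are hypothetical names, and wrapped lines are re-joined before matching.

```python
import re

# Field order assumed from the AMD-Vi event lines quoted in this thread.
EVENT_RE = re.compile(
    r"AMD-Vi: Event logged \[(?P<event>\w+) "
    r"device=(?P<device>[0-9a-f:.]+) "
    r"pasid=(?P<pasid>0x[0-9a-f]+) "
    r"address=(?P<address>0x[0-9a-f]+) "
    r"flags=(?P<flags>0x[0-9a-f]+)\]"
)


def parse_event(text):
    """Extract fields from one AMD-Vi event, or return None if absent."""
    joined = " ".join(text.split())  # undo mail line wrapping
    match = EVENT_RE.search(joined)
    return match.groupdict() if match else None


sample = """[  269.159956] iommu ivhd0: AMD-Vi: Event logged [ILLEGAL_DEV_TABLE_ENTRY
device=0000:02:00.0 pasid=0x00000 address=0x131400000 flags=0x0180]"""
print(parse_event(sample))
```

The same helper works on the journal lines from the original report once the timestamp prefix of the wrapped continuation is absent, as it is in this archive.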


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (21 preceding siblings ...)
  2024-06-25 17:10 ` bugzilla-daemon
@ 2024-06-25 17:11 ` bugzilla-daemon
  2024-06-25 17:14 ` bugzilla-daemon
                   ` (3 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-25 17:11 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #18 from Jean-Denis Girard (jd.girard@sysnux.pf) ---
Created attachment 306495
  --> https://bugzilla.kernel.org/attachment.cgi?id=306495&action=edit
Complete dmesg


* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (22 preceding siblings ...)
  2024-06-25 17:11 ` bugzilla-daemon
@ 2024-06-25 17:14 ` bugzilla-daemon
  2024-06-25 17:55 ` bugzilla-daemon
                   ` (2 subsequent siblings)
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-25 17:14 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #19 from Vasant Hegde (vasant.hegde@amd.com) ---
Unfortunately there was another bug in the suspend/resume path. Can you please
test with the patch below?

https://lore.kernel.org/linux-iommu/ZnqzXyCU8bn32j4-@8bytes.org/T/#m1cd1520facb8b758efdf7a8c0261f9ee2ec217d7



-Vasant

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (23 preceding siblings ...)
  2024-06-25 17:14 ` bugzilla-daemon
@ 2024-06-25 17:55 ` bugzilla-daemon
  2024-06-27 16:00 ` bugzilla-daemon
  2024-06-28  9:13 ` bugzilla-daemon
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-25 17:55 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #20 from Jean-Denis Girard (jd.girard@sysnux.pf) ---
Yes, I confirm that the patch "iommu/amd: Fix GT feature enablement again",
applied to 6.10-rc5, fixes resume on my machine.

Thanks for the prompt reply!

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (24 preceding siblings ...)
  2024-06-25 17:55 ` bugzilla-daemon
@ 2024-06-27 16:00 ` bugzilla-daemon
  2024-06-28  9:13 ` bugzilla-daemon
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-27 16:00 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

dreamlike_clinking040@simplelogin.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |dreamlike_clinking040@simpl
                   |                            |elogin.com

--- Comment #21 from dreamlike_clinking040@simplelogin.com ---
(In reply to Vasant Hegde from comment #19)
> Unfortunately, there was another bug in the suspend/resume path. Can you
> please test with the patch below?
> 
> https://lore.kernel.org/linux-iommu/ZnqzXyCU8bn32j4-@8bytes.org/T/#m1cd1520facb8b758efdf7a8c0261f9ee2ec217d7
> 
> 
> 
> -Vasant

I can confirm this patch also fixes my suspend/resume issue, thanks!

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [Bug 218900] amdgpu: Fatal error during GPU init
  2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
                   ` (25 preceding siblings ...)
  2024-06-27 16:00 ` bugzilla-daemon
@ 2024-06-28  9:13 ` bugzilla-daemon
  26 siblings, 0 replies; 28+ messages in thread
From: bugzilla-daemon @ 2024-06-28  9:13 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=218900

--- Comment #22 from Vasant Hegde (vasant.hegde@amd.com) ---
(In reply to dreamlike_clinking040 from comment #21)
> (In reply to Vasant Hegde from comment #19)

> 
> Can confirm this patch also fixes my suspend/resume issue, thanks!

Thanks a lot.

-Vasant

^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2024-06-28  9:13 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2024-05-27 14:52 [Bug 218900] New: amdgpu: Fatal error during GPU init bugzilla-daemon
2024-05-27 15:08 ` [Bug 218900] " bugzilla-daemon
2024-05-27 15:10 ` bugzilla-daemon
2024-05-27 15:22 ` bugzilla-daemon
2024-05-27 15:46 ` bugzilla-daemon
2024-05-28 15:45 ` bugzilla-daemon
2024-05-28 18:01 ` bugzilla-daemon
2024-05-29  6:15 ` bugzilla-daemon
2024-05-29  6:16 ` bugzilla-daemon
2024-05-29  7:11 ` bugzilla-daemon
2024-05-29  7:42 ` bugzilla-daemon
2024-05-29 10:41 ` bugzilla-daemon
2024-05-29 11:39 ` bugzilla-daemon
2024-05-29 12:41 ` bugzilla-daemon
2024-05-29 12:43 ` bugzilla-daemon
2024-05-29 16:14 ` bugzilla-daemon
2024-05-29 16:17 ` bugzilla-daemon
2024-05-29 21:26 ` bugzilla-daemon
2024-05-30  6:20 ` bugzilla-daemon
2024-06-06 12:26 ` bugzilla-daemon
2024-06-07 15:18 ` bugzilla-daemon
2024-06-10 15:02 ` bugzilla-daemon
2024-06-25 17:10 ` bugzilla-daemon
2024-06-25 17:11 ` bugzilla-daemon
2024-06-25 17:14 ` bugzilla-daemon
2024-06-25 17:55 ` bugzilla-daemon
2024-06-27 16:00 ` bugzilla-daemon
2024-06-28  9:13 ` bugzilla-daemon
