From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 106258] AMD Xorg start failes with non-4K page sizes
Date: Wed, 06 Jun 2018 20:39:04 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=106258

Comment #25 on bug 106258 from foxbat@ruin.net
(In reply to Michel Dänzer from comment #24)
> Created attachment 140046 [details] [review]
> Add some debugging output to amdgpu_sa_bo_new
>
> This patch should tell us which of the WARN_ON_ONCE in amdgpu_sa_bo_new is
> hit, and what the values are.

Hello,

I have reproduced the error with the provided patch. Here is the output:

[  175.501689] WARNING: CPU: 24 PID: 3212 at drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c:288 amdgpu_sa_bo_new+0x648/0x6d0 [amdgpu]
[  175.501692] Modules linked in: binfmt_misc snd_usb_audio snd_usbmidi_lib
snd_rawmidi snd_seq_device amdgpu evdev snd_hda_codec_hdmi chash ast gpu_sched
snd_hda_intel ttm snd_hda_codec ghash_generic drm_kms_helper gf128mul ecb
snd_hda_core drm xts snd_hwdep snd_pcm drm_panel_orientation_quirks syscopyarea
ctr sysfillrect sysimgblt snd_timer fb_sys_fops cbc vmx_crypto snd i2c_algo_bit
sg soundcore ofpart ipmi_powernv powernv_flash ipmi_devintf opal_prd mtd
ipmi_msghandler at24 ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2
fscrypto raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor
async_tx hid_generic usbhid hid xor raid6_pq libcrc32c crc32c_generic raid1
raid0 multipath linear sd_mod md_mod ses enclosure xhci_pci xhci_hcd mpt3sas
raid_class usbcore scsi_transport_sas nvme tg3 nvme_core
[  175.501731]  aacraid libphy
[  175.501735] CPU: 24 PID: 3212 Comm: spectacle Not tainted 4.17.0-foxbat-64k #2
[  175.501736] NIP:  c00800001885cd10 LR: c008000018865944 CTR: 0000000000000000
[  175.501738] REGS: c000001f618b71f0 TRAP: 0700   Not tainted  (4.17.0-foxbat-64k)
[  175.501738] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 24828848  XER: 20040000
[  175.501744] CFAR: c00800001885c724 SOFTE: 0
               GPR00: c008000018865944 c000001f618b7470 c008000018ab6400 c000001fdbea3418
               GPR04: c000201ca9246260 0000000000100000 0000000000000100 00000000000017c1
               GPR08: 3817b594461fff5f c000201ca9246000 0000000000000000 c008000018a19dd0
               GPR12: 0000000000008000 c000001ffffe4000 c000001fdbea0000 c000001feab786a0
               GPR16: 00000000027ffc4e c000201ca9246260 00000000fffe2000 ffffffffffffffef
               GPR20: 0000000009fff138 0000000000000100 c000001fdbea6980 0000000000000000
               GPR24: 00000000027ffc4e 00000000000e5fff c000001fdbea3418 00000001fe000000
               GPR28: c000001fdbea0000 0000000000000000 c000201ca9246260 0000000009fff138
[  175.501780] NIP [c00800001885cd10] amdgpu_sa_bo_new+0x648/0x6d0 [amdgpu]
[  175.501798] LR [c008000018865944] amdgpu_ib_get+0x8c/0x120 [amdgpu]
[  175.501799] Call Trace:
[  175.501820] [c000001f618b7470] [c00800001885c7f8] amdgpu_sa_bo_new+0x130/0x6d0 [amdgpu] (unreliable)
[  175.501840] [c000001f618b7710] [c008000018865944] amdgpu_ib_get+0x8c/0x120 [amdgpu]
[  175.501861] [c000001f618b7790] [c008000018942558] amdgpu_job_alloc_with_ib+0x90/0x110 [amdgpu]
[  175.501880] [c000001f618b77d0] [c008000018862124] amdgpu_vm_bo_update_mapping+0x35c/0x480 [amdgpu]
[  175.501899] [c000001f618b78c0] [c0080000188625c8] amdgpu_vm_bo_update+0x380/0x740 [amdgpu]
[  175.501916] [c000001f618b79d0] [c008000018846d10] amdgpu_gem_va_ioctl+0x5f8/0x620 [amdgpu]
[  175.501923] [c000001f618b7b20] [c008000017748ce8] drm_ioctl_kernel+0xa0/0x140 [drm]
[  175.501928] [c000001f618b7b70] [c0080000177491b4] drm_ioctl+0x1ac/0x4d0 [drm]
[  175.501943] [c000001f618b7cb0] [c008000018820078] amdgpu_drm_ioctl+0x70/0xd0 [amdgpu]
[  175.501946] [c000001f618b7d00] [c0000000003e216c] do_vfs_ioctl+0xdc/0x8a0
[  175.501948] [c000001f618b7da0] [c0000000003e2a34] ksys_ioctl+0x104/0x120
[  175.501950] [c000001f618b7df0] [c0000000003e2a90] sys_ioctl+0x40/0xa0
[  175.501953] [c000001f618b7e30] [c00000000000b9e0] system_call+0x58/0x6c
[  175.501954] Instruction dump:
[  175.501956] 892a0000 2f890000 409efd4c 3c620000 e8639aa0 39200001 7ea4ab78 992a0000
[  175.501959] 481bee31 e8410018 4bfffd2c 60000000 <0fe00000> 3d420000 e94a9a98 3bc0ffea
[  175.501963] ---[ end trace b955b8bff21188f9 ]---
[  175.501965] [drm] size=167768376 > sa_manager->size=1048576
[  175.501968] amdgpu 0000:01:00.0: failed to get a new IB (-22)
[  175.502036] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22)
[  175.502153] amdgpu 0000:01:00.0: failed to get a new IB (-22)
[  175.535050] amdgpu 0000:01:00.0: failed to get a new IB (-22)
[  175.539783] amdgpu 0000:01:00.0: failed to get a new IB (-22)
[  175.539870] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22)
[  175.540410] amdgpu 0000:01:00.0: failed to get a new IB (-22)
[  175.540489] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22)
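
For reference, the numbers in the `[drm] size=... > sa_manager->size=...` debug line can be put side by side with a small script. This is only a reading aid, not part of the patch or the driver: the helper name `parse_sa_sizes` is made up for illustration, and the only outside fact it relies on is that errno 22 is EINVAL on Linux.

```python
import errno
import re

# Debug line printed by the patched amdgpu_sa_bo_new (copied from the log above).
LINE = "[  175.501965] [drm] size=167768376 > sa_manager->size=1048576"


def parse_sa_sizes(line):
    """Return (requested, pool) in bytes from the debug line.

    Hypothetical helper for illustration; it only understands the exact
    "size=N > sa_manager->size=M" format seen in this report.
    """
    m = re.search(r"size=(\d+) > sa_manager->size=(\d+)", line)
    if m is None:
        raise ValueError("not an amdgpu_sa_bo_new debug line")
    return int(m.group(1)), int(m.group(2))


requested, pool = parse_sa_sizes(LINE)
MIB = 1 << 20

print(f"requested: {requested} bytes ({requested / MIB:.2f} MiB)")
print(f"pool:      {pool} bytes ({pool / MIB:.2f} MiB)")
print(f"ratio:     {requested / pool:.1f}x the pool size")
# The (-22) in the "failed to get a new IB" lines is a negative errno.
print(f"errno 22:  {errno.errorcode[22]}")
```

The requested size also appears in hexadecimal in the register dump (0x09fff138 in GPR20 and GPR31), and the pool size as 0x100000 in GPR05.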

