From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 98638] Panic on shutdown with AMDGPU and Ubuntu Plymouth
Date: Mon, 14 Nov 2016 09:05:05 +0000
List-Id: dri-devel@lists.freedesktop.org

https://bugs.freedesktop.org/show_bug.cgi?id=98638

Comment #6 on bug 98638 from Ernst Sjöstrand
Here's a new "backtrace".
A NULL pointer dereference in amdgpu_fence_wait_empty can't have that many
possible causes.
Is it the rcu_dereference?

https://cgit.freedesktop.org/~agd5f/linux/tree/drivers/gpu/drm/amd/amdgpu/amdgpu_fence.c?h=drm-next-4.10-wip#n261

[   85.216191] [drm] amdgpu: finishing device.
[   86.336739] Console: switching to colour VGA+ 80x25
[   86.345463] BUG: unable to handle kernel NULL pointer dereference at           (null)
[   86.345477] IP: [<ffffffffc03882ba>] amdgpu_fence_wait_empty+0x2a/0xd0 [amdgpu]
[   86.345508] PGD 0
[   86.345510]
[   86.345516] Oops: 0000 [#1] SMP
[   86.345519] Modules linked in: netconsole configfs binfmt_misc eeepc_wmi
asus_wmi sparse_keymap video intel_rapl x86_pkg_temp_thermal kvm_intel kvm
irqbypass input_leds btusb btrtl btbcm crct10dif_pclmul crc32_pclmul btintel
ghash_clmulni_intel bluetooth aesni_intel aes_x86_64 snd_hda_codec_realtek lrw
glue_helper ablk_helper snd_hda_codec_generic cryptd snd_hda_codec_hdmi
intel_cstate snd_hda_intel intel_rapl_perf snd_hda_codec snd_hda_core serio_raw
snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq
snd_seq_device snd_timer snd mei_me soundcore mei lpc_ich shpchp mac_hid sbs
sbshc max6650 coretemp parport_pc ppdev lp parport ip_tables x_tables autofs4
btrfs xor raid6_pq hid_generic usbhid hid amdkfd amd_iommu_v2 mxm_wmi amdgpu
i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect ahci sysimgblt
fb_sys_fops libahci e1000e83 ec 08 23 87 88 00 00 00 48 8b 97 90 00 00 00 48 8d
04 c2 18 48 85 db 74 6f 8b 0b 85 c9 74 69 8d 51 01 89 c8 f0 0f
[   86.346049] RIP  [<ffffffffc03882ba>] amdgpu_fence_wait_empty+0x2a/0xd0 [amdgpu]
[   86.346071]  RSP <ffffa46a017f3af0>
[   86.346073] CR2: 0000000000000000
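
For anyone reading the trace: the `symbol+offset/size` notation on the IP/RIP lines means the fault happened 0x2a bytes into amdgpu_fence_wait_empty, a function 0xd0 bytes long, and CR2 of 0 is the address that was dereferenced. The snippet below is only a rough sketch for pulling those fields out of such a line; the helper name `parse_oops_ip` and its regular expression are made up for illustration and are not part of any kernel tooling.

```python
import re

# Hypothetical helper: matches the "symbol+offset/size [module]" part of an
# oops "IP:" or "RIP" line. The pattern is an assumption based on the lines
# quoted in this report, not an exhaustive grammar for kernel oops output.
OOPS_IP = re.compile(
    r"(?P<symbol>[A-Za-z_][\w.]*)\+(?P<offset>0x[0-9a-fA-F]+)/(?P<size>0x[0-9a-fA-F]+)"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_oops_ip(line):
    """Return (symbol, offset, size, module) from an oops IP line, or None."""
    m = OOPS_IP.search(line)
    if m is None:
        return None
    return (
        m.group("symbol"),
        int(m.group("offset"), 16),  # bytes from the start of the function
        int(m.group("size"), 16),    # total length of the function in bytes
        m.group("module"),           # None if the symbol is not in a module
    )


line = "[   86.345477] IP: [<ffffffffc03882ba>] amdgpu_fence_wait_empty+0x2a/0xd0 [amdgpu]"
symbol, offset, size, module = parse_oops_ip(line)
print(f"{symbol}: fault at byte {offset} of {size} in module {module}")
```

With a build of amdgpu.ko that still has debug info, the same symbol and offset can usually be mapped to a source line in gdb with `list *(amdgpu_fence_wait_empty+0x2a)`.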

I only see the backtrace when amdgpu is unloaded before my ethernet module,
which seems to be rare.
Would be nice to force the ethernet driver to be unloaded last.

