From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 110225] Kernel panic while "modprobe amdkfd ; modprobe -r amdkfd" ; 4.14.35 kernel.
Date: Fri, 22 Mar 2019 17:06:44 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=110225

    Bug ID: 110225
   Summary: Kernel panic while "modprobe amdkfd ; modprobe -r amdkfd" ; 4.14.35 kernel.
   Product: DRI
   Version: XOrg git
  Hardware: Other
        OS: All
    Status: NEW
  Severity: major
  Priority: medium
 Component: DRM/amdkfd
  Assignee: dri-devel@lists.freedesktop.org
  Reporter: John.p.donnelly@oracle.com

Hello,


I am investigating an issue that our test group reported concerning this driver.
Their test loads and unloads every kernel module included in the Oracle
4.14.35 kernel release. You don't even need an AMD platform: it occurs on any
Intel machine, and on a KVM VM instance too.
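
The load/unload cycle can be sketched as follows. This is only an illustration of the two commands named in this report, not the test group's actual harness; the helper names `reload_commands` and `run_reload` are made up for the example, and running it for real assumes root privileges and `modprobe` in the PATH.

```python
import subprocess


def reload_commands(module, cycles=1):
    """Return the modprobe / modprobe -r argv pairs for `cycles` rounds."""
    commands = []
    for _ in range(cycles):
        commands.append(["modprobe", module])        # load the module
        commands.append(["modprobe", "-r", module])  # unload it again
    return commands


def run_reload(module, cycles=1, dry_run=True):
    """Print each command; execute it only when dry_run is False (needs root)."""
    for argv in reload_commands(module, cycles):
        print(" ".join(argv))
        if not dry_run:
            # check=True raises CalledProcessError on the first failing command.
            subprocess.run(argv, check=True)
```

Calling `run_reload("amdkfd")` only prints the two commands; passing `dry_run=False` executes them and may panic an affected kernel, so it is best tried in a disposable VM.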

Kernel panic while "modprobe amdkfd ; modprobe -r amdkfd":

[  329.425334]  ? __slab_free+0x9b/0x2ba
[  329.427836]  ? process_slab+0x3c1/0x45c
[  329.430336]  dev_printk_emit+0x4e/0x65
[  329.432829]  __dev_printk+0x46/0x8b
[  329.435183]  _dev_info+0x6c/0x85
[  329.437435]  ? kfree+0x141/0x182
[  329.439646]  kfd_module_exit+0x37/0x39 [amdkfd]
[  329.442258]  SyS_delete_module+0x1c3/0x26f
[  329.444722]  ? entry_SYSCALL_64_after_hwframe+0xaa/0x0
[  329.447479]  ? entry_SYSCALL_64_after_hwframe+0xa3/0x0
[  329.450206]  ? entry_SYSCALL_64_after_hwframe+0x9c/0x0
[  329.452912]  ? entry_SYSCALL_64_after_hwframe+0x95/0x0
[  329.455586]  do_syscall_64+0x79/0x1ae
[  329.457766]  entry_SYSCALL_64_after_hwframe+0x151/0x0
[  329.460369] RIP: 0033:0x7f1757a1b457
[  329.462502] RSP: 002b:00007ffd62ce1f48 EFLAGS: 00000206 ORIG_RAX:


Looks like some memory corruption.
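
For anyone triaging the trace above, here is a small sketch that splits each frame into symbol, offset, function size and module. Frames prefixed with "?" are addresses the kernel found on the stack but does not consider a reliable part of the call chain, so the sketch separates them out. The names `FRAME_RE`, `parse_frame` and `reliable_chain` are made up for this example.

```python
import re

# One stack frame as printed in the log, e.g.
#   [  329.439646]  kfd_module_exit+0x37/0x39 [amdkfd]
FRAME_RE = re.compile(
    r"^\[\s*(?P<time>\d+\.\d+)\]\s+"
    r"(?P<unreliable>\?\s+)?"
    r"(?P<symbol>[A-Za-z_][\w.]*)\+"
    r"(?P<offset>0x[0-9a-fA-F]+)/(?P<size>0x[0-9a-fA-F]+)"
    r"(?:\s+\[(?P<module>[^\]]+)\])?\s*$"
)


def parse_frame(line):
    """Return a dict describing one frame, or None for non-frame lines."""
    m = FRAME_RE.match(line)
    if m is None:
        return None
    return {
        "time": float(m.group("time")),
        "reliable": m.group("unreliable") is None,  # no leading "?"
        "symbol": m.group("symbol"),
        "offset": int(m.group("offset"), 16),
        "size": int(m.group("size"), 16),
        "module": m.group("module"),  # None for built-in symbols
    }


def reliable_chain(lines):
    """Return the symbols of the reliable frames, innermost first."""
    frames = (parse_frame(line) for line in lines)
    return [f["symbol"] for f in frames if f is not None and f["reliable"]]
```

Applied to the trace in this report, the reliable frames run from `dev_printk_emit` through `_dev_info` and `kfd_module_exit` out to `SyS_delete_module`, so the panic is reached from the log call in the module's exit path.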

Sometimes the unload works, but the message logged is garbage:

[root@jpd-vmbase02 ~]# modprobe -r amdkfd
[  144.449981] ???????????? hn??蟟??xn??ן??kfd: Removed module


Is this something one of the team members could have already corrected in an
upstream version?

#define KFD_DRIVER_DESC         "Standalone HSA driver for AMD's GPUs"
#define KFD_DRIVER_DATE         "20150421"
#define KFD_DRIVER_MAJOR        0
#define KFD_DRIVER_MINOR        7
#define KFD_DRIVER_PATCHLEVEL   2


Thank you,

John

