From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 62997] New: GPU fault unless R600_DEBUG=nodma
Date: Mon, 01 Apr 2013 15:31:59 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=62997

Priority:       medium
Bug ID:         62997
Assignee:       dri-devel@lists.freedesktop.org
Summary:        GPU fault unless R600_DEBUG=nodma
Severity:       major
Classification: Unclassified
OS:             Linux (All)
Reporter:       udovdh@xs4all.nl
Hardware:       x86-64 (AMD64)
Status:         NEW
Version:        git
Component:      Drivers/Gallium/r600
Product:        Mesa

Ever since booting into kernel.org 3.8.4 on my AMD A10-5800K (ARUBA graphics),
running git Mesa and git xf86-video-ati, I get short uptimes (15 minutes,
around one hour at most) due to crashes.
The kernel log shows messages like:

[ 1332.480233] radeon 0000:00:01.0: GPU fault detected: 146 0x0134710c
[ 1332.480243] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000813
[ 1332.480250] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0407100C

Watching YouTube seems to help trigger the issue. (This is only a correlation
so far; I have not established a cause.)
Having R600_DEBUG=nodma in the environment solves the problem.
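
For anyone wanting to compare behaviour with and without the workaround on a single application, the following is a minimal Python sketch that starts a process with `nodma` added to `R600_DEBUG`. It is not part of the original report: the helper names `env_with_nodma` and `run_with_nodma` are hypothetical, and the sketch assumes that `R600_DEBUG` accepts a comma-separated list of flags. The report sets the variable in the environment at large, so a per-process launch may not cover every client that triggers the fault.

```python
import os
import subprocess


def env_with_nodma(base_env=None):
    """Return a copy of the environment with 'nodma' present in R600_DEBUG."""
    env = dict(os.environ if base_env is None else base_env)
    # Assumption: R600_DEBUG holds a comma-separated list of debug flags.
    flags = [flag for flag in env.get("R600_DEBUG", "").split(",") if flag]
    if "nodma" not in flags:
        flags.append("nodma")
    env["R600_DEBUG"] = ",".join(flags)
    return env


def run_with_nodma(argv):
    """Run argv with the workaround applied to that one process only."""
    return subprocess.call(argv, env=env_with_nodma())
```

A call such as `run_with_nodma(["glxgears"])` would launch one OpenGL client with the flag set; `glxgears` is only a placeholder for whatever program reproduces the fault.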

Occasionally I also see a GPU lockup, in case that is related:

    [29648.098135]  disk 0, wo:0, o:1, dev:sda2
    [29648.098140]  disk 1, wo:0, o:1, dev:sdb2
    [29648.098142]  disk 2, wo:0, o:1, dev:sdc2
    [29648.098145]  disk 3, wo:0, o:1, dev:sdd2
    [68707.166021] radeon 0000:00:01.0: GPU fault detected: 146 0x0d4c2604
    [68707.166030] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000008D4
    [68707.166043] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C026004
    [70621.378798] radeon 0000:00:01.0: GPU fault detected: 146 0x013c710c
    [70621.378808] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000813
    [70621.378815] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C07100C
    [70621.378837] radeon 0000:00:01.0: GPU fault detected: 147 0x0f0c7102
    [70621.378843] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
    [70621.378848] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
    [70621.378854] radeon 0000:00:01.0: GPU fault detected: 147 0x0f1c7102
    [70621.378859] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
    [70621.378864] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
    [70631.857918] radeon 0000:00:01.0: GPU lockup CP stall for more than 10000msec
    [70631.857927] radeon 0000:00:01.0: GPU lockup (waiting for 0x00000000007e1fe5 last fence id 0x00000000007e1fe3)
    [70631.858436] radeon 0000:00:01.0: sa_manager is not empty, clearing anyway
    [70631.859755] radeon 0000:00:01.0: Saved 951 dwords of commands on ring 0.
    [70631.859761] radeon 0000:00:01.0: GPU softreset: 0x00000003
    [70631.859766] radeon 0000:00:01.0:   VM_CONTEXT0_PROTECTION_FAULT_ADDR   0x00000000
    [70631.859770] radeon 0000:00:01.0:   VM_CONTEXT0_PROTECTION_FAULT_STATUS 0x00000000
    [70631.859774] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
    [70631.859778] radeon 0000:00:01.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
    [70631.867299] radeon 0000:00:01.0:   GRBM_STATUS               = 0xA2703828
    [70631.867305] radeon 0000:00:01.0:   GRBM_STATUS_SE0           = 0x1D000007
    [70631.867309] radeon 0000:00:01.0:   GRBM_STATUS_SE1           = 0x00000007
    [70631.867313] radeon 0000:00:01.0:   SRBM_STATUS               = 0x20000040
    [70631.867317] radeon 0000:00:01.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
    [70631.867321] radeon 0000:00:01.0:   R_008678_CP_STALLED_STAT2 = 0x00018000
    [70631.867325] radeon 0000:00:01.0:   R_00867C_CP_BUSY_STAT     = 0x00008006
    [70631.867328] radeon 0000:00:01.0:   R_008680_CP_STAT          = 0x80038647
    [70631.867332] radeon 0000:00:01.0:   GRBM_SOFT_RESET=0x0000DF7B
    [70631.867386] radeon 0000:00:01.0:   GRBM_STATUS               = 0x00003828
    [70631.867390] radeon 0000:00:01.0:   GRBM_STATUS_SE0           = 0x00000007
    [70631.867393] radeon 0000:00:01.0:   GRBM_STATUS_SE1           = 0x00000007
    [70631.867397] radeon 0000:00:01.0:   SRBM_STATUS               = 0x20000040
    [70631.867400] radeon 0000:00:01.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
    [70631.867404] radeon 0000:00:01.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
    [70631.867408] radeon 0000:00:01.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
    [70631.867411] radeon 0000:00:01.0:   R_008680_CP_STAT          = 0x00000000
    [70631.883681] radeon 0000:00:01.0: GPU reset succeeded, trying to resume
    [70631.916445] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
    [70631.916534] radeon 0000:00:01.0: WB enabled
    [70631.916536] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000030000c00 and cpu addr 0xffff880235891c00
    [70631.916538] radeon 0000:00:01.0: fence driver on ring 1 use gpu addr 0x0000000030000c04 and cpu addr 0xffff880235891c04
    [70631.916540] radeon 0000:00:01.0: fence driver on ring 2 use gpu addr 0x0000000030000c08 and cpu addr 0xffff880235891c08
    [70631.916541] radeon 0000:00:01.0: fence driver on ring 3 use gpu addr 0x0000000030000c0c and cpu addr 0xffff880235891c0c
    [70631.916543] radeon 0000:00:01.0: fence driver on ring 4 use gpu addr 0x0000000030000c10 and cpu addr 0xffff880235891c10
    [70631.935206] [drm] ring test on 0 succeeded in 3 usecs
    [70631.935264] [drm] ring test on 3 succeeded in 2 usecs
    [70631.935271] [drm] ring test on 4 succeeded in 1 usecs
    [70631.949531] [drm] ib test on ring 0 succeeded in 0 usecs
    [70631.950057] [drm] ib test on ring 3 succeeded in 0 usecs
    [70631.950576] [drm] ib test on ring 4 succeeded in 1 usecs
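
To see whether the same fault address recurs across a long dmesg capture, the log lines above can be reduced to one record per fault. The following is a rough Python sketch and not part of the original report: the names `parse_faults` and `count_by_addr` are hypothetical, and the patterns only assume the line shapes visible in the excerpts (a "GPU fault detected" line followed by the `VM_CONTEXT1_PROTECTION_FAULT_ADDR` and `VM_CONTEXT1_PROTECTION_FAULT_STATUS` lines). It tolerates register values that a mail client wrapped onto the next line.

```python
import re
from collections import Counter

# e.g. "[ 1332.480233] radeon 0000:00:01.0: GPU fault detected: 146 0x0134710c"
FAULT_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+radeon\s+\S+\s+GPU fault detected:\s+"
    r"(?P<src>\d+)\s+(?P<data>0x[0-9a-fA-F]+)"
)
# Register lines that follow a fault; \s+ also spans a wrapped line break.
REG_RE = re.compile(
    r"VM_CONTEXT1_PROTECTION_FAULT_(?P<reg>ADDR|STATUS)\s+(?P<val>0x[0-9a-fA-F]+)"
)


def parse_faults(log_text):
    """Return one dict per 'GPU fault detected' line found in log_text."""
    faults = list(FAULT_RE.finditer(log_text))
    events = []
    for i, match in enumerate(faults):
        # Only look for register lines up to the next fault (or end of log).
        end = faults[i + 1].start() if i + 1 < len(faults) else len(log_text)
        regs = {}
        for reg in REG_RE.finditer(log_text, match.end(), end):
            # Keep the first ADDR/STATUS after the fault; later ones may
            # belong to an unrelated dump such as the soft-reset output.
            regs.setdefault(reg.group("reg"), int(reg.group("val"), 16))
        events.append({
            "time": float(match.group("ts")),
            "source": int(match.group("src")),
            "data": int(match.group("data"), 16),
            "addr": regs.get("ADDR"),
            "status": regs.get("STATUS"),
        })
    return events


def count_by_addr(events):
    """Count how often each fault address appears."""
    return Counter(event["addr"] for event in events)
```

Feeding it the output of `dmesg` (read from a file or a pipe) gives a list that can be sorted or grouped. In the excerpts in this report, address 0x00000813 shows up in both the fault at 1332 s and the one at 70621 s.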

