From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 86969] _drm_intel_gem_bo_references() function takes half the CPU with Witcher2 game Date: Wed, 03 Dec 2014 11:30:00 +0000 Message-ID: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1590887632==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 80A7D6F7C6 for ; Wed, 3 Dec 2014 03:30:00 -0800 (PST) List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1590887632== Content-Type: multipart/alternative; boundary="1417606200.dC1Fa2d0.22219"; charset="UTF-8" --1417606200.dC1Fa2d0.22219 Date: Wed, 3 Dec 2014 11:30:00 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=86969 Bug ID: 86969 Summary: _drm_intel_gem_bo_references() function takes half the CPU with Witcher2 game Product: DRI Version: unspecified Hardware: Other OS: All Status: NEW Keywords: have-backtrace Severity: normal Priority: medium Component: libdrm Assignee: dri-devel@lists.freedesktop.org Reporter: eero.t.tamminen@intel.com CC: currojerez@riseup.net Created attachment 110404 --> https://bugs.freedesktop.org/attachment.cgi?id=110404&action=edit hack/test for alternate drm_intel_gem_bo_references() semantics Setup: - HSW GT3e in desktop case - Ubuntu 14.10 64-bit (kernel 3.16, Xorg 1.16) - Latest libdrm & Mesa 32-bit builds (2014-11-07) - Witcher2 game from Steam (32-bit) Steps: - Start Witcher2 with latest Mesa - Select FullHD resolution and highest generic gfx option, then disable anti-aliasing & ubersampling from the advanced options - Select "Arena" option from the main menu - After animation stops, click through "discussion" and pan around with mouse Results: - When panning around, some orientations show 100% (single) CPU utilization. - "perf" reports (nearly) *half* of the CPU consumption to happen in (very small & recursive) libdrm "_drm_intel_gem_bo_references" function. Analysis: Only caller of "_drm_intel_gem_bo_references" is the exported "drm_intel_gem_bo_references" function. Tracing the calls to that, reveals it to be called from Mesa gen6_check_query() function. [1] Removing libdrm _drm_intel_gem_bo_references() CPU bottleneck by doing flushes unconditionally in gen6_check_query() removed most of the CPU consumption and verifies the "perf" finding. However, those extra flushes made performance marginally worse. Printing statistics from resolving counts showed that for Witcher2, largest relocation count in _drm_intel_gem_bo_references() was 590, but ~97% of the calls had zero relocation counts. Another test was changing the semantics of "drm_intel_gem_bo_references". This also removed most of the Witcher2 CPU consumption, potentially with speed improvement. *On the test machine*, Witcher2 isn't CPU bound despite ~100% CPU load, so CPU usage doesn't directly affect that. *However*, on a temperature limited machine (e.g. laptop with GT3), this could have clear performance impact as the lowered CPU consumption may allow GPU to run at higher clock speed. Power usage should at least be effected. Attached is patch/hack (by Fransisco Jerez) for testing this. Conclusion: There could be two separate functions, with slightly different semantics. One that is fast and does something similar to what Fransisco proposed and which can be used by (Mesa) functions that don't need more accurate information, and the current "libdrm _drm_intel_gem_bo_references" function for those that do need it. --- [1] In addition to resource usage tracing, functracer can attach to a running process and track calls to specified (exported) function: https://maemo.gitorious.org/maemo-tools/functracer According to it, the callers were: 194154 calls (for the trace period): 0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298 0xf63ca55c gen6_check_query() at gen6_queryobj.c:329 0xf6144e8d _mesa_GetQueryObjectiv() at queryobj.c:620 1133 calls: 0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298 0xf63ca34c gen6_queryobj_get_results() at gen6_queryobj.c:128 0xf63ca583 gen6_check_query() at gen6_queryobj.c:333 0xf6144e8d _mesa_GetQueryObjectiv() at queryobj.c:620 mp.h:17626 62 calls: 0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298 0xf62fe973 brw_map_buffer_range() at intel_buffer_objects.c:390 0xf60754b6 _mesa_MapBufferRange() at bufferobj.c:2178 -- You are receiving this mail because: You are the assignee for the bug. --1417606200.dC1Fa2d0.22219 Date: Wed, 3 Dec 2014 11:30:00 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"
Bug ID 86969
Summary _drm_intel_gem_bo_references() function takes half the CPU with Witcher2 game
Product DRI
Version unspecified
Hardware Other
OS All
Status NEW
Keywords have-backtrace
Severity normal
Priority medium
Component libdrm
Assignee dri-devel@lists.freedesktop.org
Reporter eero.t.tamminen@intel.com
CC currojerez@riseup.net

Created attachment 110404 [details] [review]
hack/test for alternate drm_intel_gem_bo_references() semantics

Setup:

- HSW GT3e in desktop case
- Ubuntu 14.10 64-bit (kernel 3.16, Xorg 1.16)
- Latest libdrm & Mesa 32-bit builds (2014-11-07)
- Witcher2 game from Steam (32-bit)

Steps:

- Start Witcher2 with latest Mesa
- Select FullHD resolution and highest generic gfx option, then disable
anti-aliasing & ubersampling from the advanced options
- Select "Arena" option from the main menu
- After animation stops, click through "discussion" and pan around with mouse

Results:

- When panning around, some orientations show 100% (single) CPU utilization.
- "perf" reports (nearly) *half* of the CPU consumption to happen in (very
small & recursive) libdrm "_drm_intel_gem_bo_references" function.


Analysis:

Only caller of "_drm_intel_gem_bo_references" is the exported
"drm_intel_gem_bo_references" function.  Tracing the calls to that, reveals it
to be called from Mesa gen6_check_query() function. [1]

Removing libdrm _drm_intel_gem_bo_references() CPU bottleneck by doing flushes
unconditionally in gen6_check_query() removed most of the CPU consumption and
verifies the "perf" finding. However, those extra flushes made performance
marginally worse.


Printing statistics from resolving counts showed that for Witcher2, largest
relocation count in _drm_intel_gem_bo_references() was 590, but ~97% of the
calls had zero relocation counts.

Another test was changing the semantics of "drm_intel_gem_bo_references".  This
also removed most of the Witcher2 CPU consumption, potentially with speed
improvement.  *On the test machine*, Witcher2 isn't CPU bound despite ~100% CPU
load, so CPU usage doesn't directly affect that.  *However*, on a temperature
limited machine (e.g. laptop with GT3), this could have clear performance
impact as the lowered CPU consumption may allow GPU to run at higher clock
speed. Power usage should at least be effected.

Attached is patch/hack (by Fransisco Jerez) for testing this.


Conclusion:

There could be two separate functions, with slightly different semantics.  One
that is fast and does something similar to what Fransisco proposed and which
can be used by (Mesa) functions that don't need more accurate information, and
the current "libdrm _drm_intel_gem_bo_references" function for those that do
need it.


---

[1] In addition to resource usage tracing, functracer can attach to a running
process and track calls to specified (exported) function:
  https://maemo.gitorious.org/maemo-tools/functracer

According to it, the callers were:

194154 calls (for the trace period):
0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298
0xf63ca55c gen6_check_query() at gen6_queryobj.c:329
0xf6144e8d _mesa_GetQueryObjectiv() at queryobj.c:620

1133 calls:
0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298
0xf63ca34c gen6_queryobj_get_results() at gen6_queryobj.c:128
0xf63ca583 gen6_check_query() at gen6_queryobj.c:333
0xf6144e8d _mesa_GetQueryObjectiv() at queryobj.c:620
mp.h:17626

62 calls:
0xf601e722 drm_intel_bo_references() at intel_bufmgr.c:298
0xf62fe973 brw_map_buffer_range() at intel_buffer_objects.c:390
0xf60754b6 _mesa_MapBufferRange() at bufferobj.c:2178


You are receiving this mail because:
  • You are the assignee for the bug.
--1417606200.dC1Fa2d0.22219-- --===============1590887632== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1590887632==--