From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 97273] [r600g, bisected] regression: NI/Turks WebGL (FishGL)
massive speed decrease ~33%
Date: Wed, 31 Aug 2016 01:37:46 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============0124198006=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org
[IPv6:2610:10:20:722:a800:ff:fe98:4b55])
by gabe.freedesktop.org (Postfix) with ESMTP id 075B56E179
for ; Wed, 31 Aug 2016 01:37:46 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
--===============0124198006==
Content-Type: multipart/alternative; boundary="14726074653.2854.24769";
charset="UTF-8"
--14726074653.2854.24769
Date: Wed, 31 Aug 2016 01:37:45 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
https://bugs.freedesktop.org/show_bug.cgi?id=3D97273
--- Comment #2 from Dieter N=C3=BCtzel ---
SOLVED
with Mario's commit 2cc880c=20
If I revert this speed is BAD as with Nicolai's
3735a925ef5692c836c4d26d6adee370dae1c2b0
commit.
commit 2cc880cba54d687a122298c8187ecc31b4a0ee2d
Author: Mario Kleiner
Date: Fri Aug 26 18:59:05 2016 +0200
r600: increase performance for DRI PRIME offloading if 2nd GPU is
Evergreen+
This is a direct port of Marek Ol=C5=A1=C3=A1ks patch
"radeonsi: increase performance for DRI PRIME
offloading if 2nd GPU is CIK or VI" to r600.
It uses SDMA for the detiling blit from renderoffload VRAM
to GTT, as SDMA is much faster for tiled->linear blits from
VRAM to GTT.
Testing on a dual Radeon HD-5770 setup reduced the time
for the render offload gpu to get its rendering into
system RAM from approximately 16 msecs for simple rendering
at 1920x1080 pixel 32 bpp to 5 msecs, a > 3x speedup!
This was measured using ftrace to trace the time the radeon kms
driver waited on the dmabuf fence of the renderoffload gpu to
complete.
All in all this brought the time for a flip down from 20 msecs
to 9 msecs, so the prime setup can display at full 60 fps instead
of barely 30 fps vsync'ed.
The current r600 implementation supports SDMA on Evergreen and
later, but not R600/R700 due to some bugs apparently present
in their SDMA implementation.
Signed-off-by: Mario Kleiner
Cc: Marek Ol=C5=A1=C3=A1k
Signed-off-by: Marek Ol=C5=A1=C3=A1k
:040000 040000 16967e652cc0708f670ab8b6d63e5eb629fbd6a0
e62fa916bd1706eb1d61975765d77d76cfae0fd2 Msrc
So I'm somewhat unsure if I should close this.
Mario, Marek, Nicolai could it be that we get another boost if both patches
'work together'?
--=20
You are receiving this mail because:
You are the assignee for the bug.=
--14726074653.2854.24769
Date: Wed, 31 Aug 2016 01:37:45 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
Comment=
# 2
on bug 97273<=
/a>
from Dieter N=C3=BCtzel
SOLVED
with Mario's commit 2cc880c=20
If I revert this speed is BAD as with Nicolai's
3735a925ef5692c836c4d26d6adee370dae1c2b0
commit.
commit 2cc880cba54d687a122298c8187ecc31b4a0ee2d
Author: Mario Kleiner <mario.kleiner.de@gmail.com>
Date: Fri Aug 26 18:59:05 2016 +0200
r600: increase performance for DRI PRIME offloading if 2nd GPU is
Evergreen+
This is a direct port of Marek Ol=C5=A1=C3=A1ks patch
"radeonsi: increase performance for DRI PRIME
offloading if 2nd GPU is CIK or VI" to r600.
It uses SDMA for the detiling blit from renderoffload VRAM
to GTT, as SDMA is much faster for tiled->linear blits from
VRAM to GTT.
Testing on a dual Radeon HD-5770 setup reduced the time
for the render offload gpu to get its rendering into
system RAM from approximately 16 msecs for simple rendering
at 1920x1080 pixel 32 bpp to 5 msecs, a > 3x speedup!
This was measured using ftrace to trace the time the radeon kms
driver waited on the dmabuf fence of the renderoffload gpu to
complete.
All in all this brought the time for a flip down from 20 msecs
to 9 msecs, so the prime setup can display at full 60 fps instead
of barely 30 fps vsync'ed.
The current r600 implementation supports SDMA on Evergreen and
later, but not R600/R700 due to some bugs apparently present
in their SDMA implementation.
Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com>
Cc: Marek Ol=C5=A1=C3=A1k <marek.olsak@amd.com>
Signed-off-by: Marek Ol=C5=A1=C3=A1k <marek.olsak@amd.com>
:040000 040000 16967e652cc0708f670ab8b6d63e5eb629fbd6a0
e62fa916bd1706eb1d61975765d77d76cfae0fd2 Msrc
So I'm somewhat unsure if I should close this.
Mario, Marek, Nicolai could it be that we get another boost if both patches
'work together'?
You are receiving this mail because:
- You are the assignee for the bug.
=
--14726074653.2854.24769--
--===============0124198006==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline
X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg==
--===============0124198006==--