From mboxrd@z Thu Jan 1 00:00:00 1970
From: Tvrtko Ursulin
Subject: Re: [PATCH] drm/i915: Use SSE4.1 movntdqa to accelerate reads from WC memory
Date: Mon, 18 Jul 2016 16:06:24 +0100
Message-ID: <578CF070.50300@linux.intel.com>
References: <20160718100111.GD21839@nuc-i3427.alporthouse.com>
 <1468836434-29107-1-git-send-email-chris@chris-wilson.co.uk>
 <578CBA54.40107@linux.intel.com>
 <20160718113501.GH21839@nuc-i3427.alporthouse.com>
 <578CC415.202@intel.com>
 <578CD214.8070703@linux.intel.com>
 <578CDDA0.3010108@linux.intel.com>
In-Reply-To: <578CDDA0.3010108@linux.intel.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit
Received: from mga04.intel.com (mga04.intel.com [192.55.52.120])
 by gabe.freedesktop.org (Postfix) with ESMTP id 6807B6E446;
 Mon, 18 Jul 2016 15:06:26 +0000 (UTC)
Errors-To: intel-gfx-bounces@lists.freedesktop.org
Sender: "Intel-gfx"
To: Dave Gordon, Chris Wilson, intel-gfx@lists.freedesktop.org,
 Akash Goel, Mika Kuoppala
List-Id: intel-gfx@lists.freedesktop.org

On 18/07/16 14:46, Tvrtko Ursulin wrote:

[snip]

> This version generates the smallest code:
>
> static void __memcpy_ntdqa(struct qw2 *dst, const struct qw2 *src, unsigned long len)
> {
> 	unsigned long l4;
>
> 	kernel_fpu_begin();
>
> 	l4 = len / 4;
> 	while (l4) {
> 		asm("movntdqa   (%0), %%xmm0" :: "r" (src), "m" (src[0]));
> 		asm("movntdqa 16(%0), %%xmm1" :: "r" (src), "m" (src[1]));
> 		asm("movntdqa 32(%0), %%xmm2" :: "r" (src), "m" (src[2]));
> 		asm("movntdqa 48(%0), %%xmm3" :: "r" (src), "m" (src[3]));
> 		asm("movaps %%xmm0,   (%1)" : "=m" (dst[0]) : "r" (dst));
> 		asm("movaps %%xmm1, 16(%1)" : "=m" (dst[1]) : "r" (dst));
> 		asm("movaps %%xmm2, 32(%1)" : "=m" (dst[2]) : "r" (dst));
> 		asm("movaps %%xmm3, 48(%1)" : "=m" (dst[3]) : "r" (dst));
> 		src += 4;
> 		dst += 4;
> 		l4--;
> 	}
>
> 	len %= 4;
> 	while (len) {
> 		asm("movntdqa (%0), %%xmm0" :: "r" (src), "m" (src[0]));
> 		asm("movaps %%xmm0, (%1)" : "=m" (dst[0]) : "r" (dst));
> 		src++;
> 		dst++;
> 		len--;
> 	}
>
> 	kernel_fpu_end();
> }
>
> Although I still haven't figured out a way to convince it to use
> the same registers for src and dest between the two loops.

I remembered one famous interview question, along the lines of, "what
is the code below doing". Translated to this example:

static void __memcpy_ntdqa(struct qw2 *dst, const struct qw2 *src, unsigned long len)
{
	unsigned long n;

	kernel_fpu_begin();

	n = (len + 3) / 4;
	switch (len % 4) {
	case 0: do { asm("movntdqa %1, %%xmm0\n"
			  "movaps %%xmm0, %0\n" : "=m" (*dst): "m" (*src));
		     src++; dst++;
	case 3:	     asm("movntdqa %1, %%xmm1\n"
			  "movaps %%xmm1, %0\n" : "=m" (*dst): "m" (*src));
		     src++; dst++;
	case 2:	     asm("movntdqa %1, %%xmm2\n"
			  "movaps %%xmm2, %0\n" : "=m" (*dst): "m" (*src));
		     src++; dst++;
	case 1:	     asm("movntdqa %1, %%xmm3\n"
			  "movaps %%xmm3, %0\n" : "=m" (*dst): "m" (*src));
		     src++; dst++;
		} while (--n > 0);
	}

	kernel_fpu_end();
}

:D

No idea if loads/stores can run async in this case.

Regards,

Tvrtko
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx
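
The switch-into-do-while construct in the second __memcpy_ntdqa() above is
Duff's device. As a way to check the control flow outside the kernel, here is
a minimal user-space sketch. It is not the code from the thread: copy_block()
and duff_copy_blocks() are hypothetical names, struct qw2 is assumed to be a
16-byte unit, and a plain memcpy() stands in for the movntdqa/movaps pair, so
it says nothing about WC memory performance. It also adds a len == 0 guard,
because without one the switch appears to enter case 0, copy four blocks, and
then wrap the unsigned counter in --n.

```c
#include <stddef.h>
#include <string.h>

/* Assumption: a 16-byte unit, standing in for the thread's struct qw2. */
struct qw2 {
	unsigned long long q[2];
};

/* Hypothetical helper: memcpy replaces the movntdqa load + movaps store. */
static void copy_block(struct qw2 *dst, const struct qw2 *src)
{
	memcpy(dst, src, sizeof(*dst));
}

/* Copy len 16-byte blocks, four per loop iteration, using Duff's device. */
static void duff_copy_blocks(struct qw2 *dst, const struct qw2 *src,
			     unsigned long len)
{
	unsigned long n;

	/* Guard added in this sketch: nothing to copy, and n would wrap. */
	if (len == 0)
		return;

	/* Number of passes through the loop body, rounding up. */
	n = (len + 3) / 4;

	/* Jump into the middle of the loop to absorb the remainder first. */
	switch (len % 4) {
	case 0: do { copy_block(dst++, src++);
		/* fall through */
	case 3:      copy_block(dst++, src++);
		/* fall through */
	case 2:      copy_block(dst++, src++);
		/* fall through */
	case 1:      copy_block(dst++, src++);
		} while (--n > 0);
	}
}
```

For example, with len = 5 the switch jumps to case 1 and copies one block,
then the loop runs once more from case 0 and copies the remaining four.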