From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dave Gordon Subject: Re: [PATCH] drm/i915: Use SSE4.1 movntdqa to accelerate reads from WC memory Date: Mon, 18 Jul 2016 12:57:09 +0100 Message-ID: <578CC415.202@intel.com> References: <20160718100111.GD21839@nuc-i3427.alporthouse.com> <1468836434-29107-1-git-send-email-chris@chris-wilson.co.uk> <578CBA54.40107@linux.intel.com> <20160718113501.GH21839@nuc-i3427.alporthouse.com> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8"; Format="flowed" Content-Transfer-Encoding: base64 Return-path: Received: from mga09.intel.com (mga09.intel.com [134.134.136.24]) by gabe.freedesktop.org (Postfix) with ESMTP id 3805D6E3C2 for ; Mon, 18 Jul 2016 11:57:11 +0000 (UTC) In-Reply-To: <20160718113501.GH21839@nuc-i3427.alporthouse.com> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" To: Chris Wilson , Tvrtko Ursulin , intel-gfx@lists.freedesktop.org, Akash Goel , Mika Kuoppala List-Id: intel-gfx@lists.freedesktop.org T24gMTgvMDcvMTYgMTI6MzUsIENocmlzIFdpbHNvbiB3cm90ZToKPiBPbiBNb24sIEp1bCAxOCwg MjAxNiBhdCAxMjoxNTozMlBNICswMTAwLCBUdnJ0a28gVXJzdWxpbiB3cm90ZToKPj4gSSBhbSBu b3Qgc3VyZSBhYm91dCB0aGlzLCBidXQgbG9va2luZyBhdCB0aGUgcmFpZDYgZm9yIGV4YW1wbGUs IGl0Cj4+IGhhcyBhIGxvdCBtb3JlIGFubm90YXRpb25zIGluIGNhc2VzIGxpa2UgdGhpcy4KPj4K Pj4gSXQgc2VlbXMgdG8gYmUgdGVsbGluZyB0aGUgY29tcGlsZXIgd2hpY2ggbWVtb3J5IHJhbmdl cyBkb2VzIGVhY2gKPj4gaW5zdHJ1Y3Rpb24gYWNjZXNzLCBhbmQgYWxzbyB1c2VzICJhc20gdm9s YXRpbGUiIC0gd2hldGhlciBvciBub3QKPj4gdGhhdCBpcyByZWFsbHkgbmVlZGVkIEkgZG9uJ3Qg a25vdy4KPj4KPj4gRm9yIGV4YW1wbGU6Cj4+ICAgICAgICAgICAgICAgICAgYXNtIHZvbGF0aWxl KCJtb3ZkcWEgJTAsJSV4bW00IiA6OiAibSIgKGRwdHJbejBdW2RdKSk7Cj4+Cj4+IEFuZDoKPj4g ICAgICAgICAgICAgICAgICBhc20gdm9sYXRpbGUoIm1vdmRxYSAlJXhtbTQsJTAiIDogIj1tIiAo cVtkXSkpOwo+Pgo+PiBFYWNoIG9uZSBpcyB0ZWxsaW5nIHRoZSBjb21waWxlciB0aGUgaW5zdHJ1 Y3Rpb24gaXMgZWl0aGVyIHJlYWRpbmcKPj4gb3Igd3JpdGluZyByZXNwZWN0aXZlbHkgZnJvbSBh IGNlcnRhaW4gbWVtb3J5IGFkZHJlc3MuCj4+Cj4+IFlvdSBkb24ndCBoYXZlIGFueSBvZiB0aGF0 LCBhbmQgZG9uJ3QgZXZlbiBzcGVjaWZ5IG5vdGhpbmcgYXMgYW4KPj4gb3V0cHV0IHBhcmFtZXRl ciBzbyBJIGFtIG5vdCBzdXJlIGlmIHlvdXIgY29kZSBpcyBzYWZlLgo+Cj4gVGhlIGFzbSBpcyBj b3JyZWN0LiBXZSBkbyBub3QgbW9kaWZ5IGVpdGhlciBvZiB0aGUgdHdvIHBvaW50ZXJzIHdoaWNo IHdlCj4gcGFzcyBpbiB2aWEgcmVnaXN0ZXIgaW5wdXRzLCBidXQgdGhlIG1lbW9yeSBiZWhpbmQg dGhlbSAtIGhlbmNlIHRoZSBtZW1vcnkKPiBjbG9iYmVyLgoKVGhpcyBpcyBhIGNob2ljZSBvZiBo b3cgbXVjaCB3ZSBsZXQgdGhlIGNvbXBpbGVyIGRlY2lkZSBhYm91dCAKYWRkcmVzc2luZywgYW5k IGhvdyBtdWNoIHdlIHRlbGwgaXQgYWJvdXQgd2hhdCB0aGUgYXNtIGNvZGUgcmVhbGx5IGRvZXMu IApUaGUgZXhhbXBsZXMgYWJvdmUgZ2V0IHRoZSBjb21waWxlciB0byBnZW5lcmF0ZSAqYW55KiBz dWl0YWJsZSAKYWRkcmVzc2luZyBtb2RlIGZvciBlYWNoIHNwZWNpZmljIGxvY2F0aW9uIGludm9s dmVkIGluIHRoZSB0cmFuc2ZlcnMsIHNvIAp0aGUgY29tcGlsZXIga25vd3MgYSBsb3QgYWJvdXQg d2hhdCdzIGhhcHBlbmluZyBhbmQgY2FuIHRyYWNrIHdoZXJlIGVhY2ggCmRhdHVtIGNvbWVzIGZy b20gYW5kIGdvZXMgdG8uCgpPVE9IIENocmlzJyBjb2RlCgorICAgICAgICBhc20oIm1vdm50ZHFh ICAgKCUwKSwgJSV4bW0wXG4iCisgICAgICAgICAgICAibW92bnRkcWEgMTYoJTApLCAlJXhtbTFc biIKKyAgICAgICAgICAgICJtb3ZudGRxYSAzMiglMCksICUleG1tMlxuIgorICAgICAgICAgICAg Im1vdm50ZHFhIDQ4KCUwKSwgJSV4bW0zXG4iCisgICAgICAgICAgICAibW92YXBzICUleG1tMCwg ICAoJTEpXG4iCisgICAgICAgICAgICAibW92YXBzICUleG1tMSwgMTYoJTEpXG4iCisgICAgICAg ICAgICAibW92YXBzICUleG1tMiwgMzIoJTEpXG4iCisgICAgICAgICAgICAibW92YXBzICUleG1t MywgNDgoJTEpXG4iCisgICAgICAgICAgICA6OiAiciIgKHNyYyksICJyIiAoZHN0KSA6ICJtZW1v cnkiKTsKCi0gZG9lc24ndCBuZWVkICJ2b2xhdGlsZSIgYmVjYXVzZSBhc20gc3RhdGVtZW50cyB0 aGF0IGhhdmUgbm8gb3V0cHV0IApvcGVyYW5kcyBhcmUgaW1wbGljaXRseSB2b2xhdGlsZS4KCi0g bWFrZXMgdGhlIGNvbXBpbGVyIGdpdmUgdXMgdGhlIHNvdXJjZSBhbmQgZGVzdGluYXRpb24gKmFk ZHJlc3NlcyogaW4gYSAKcmVnaXN0ZXIgZWFjaDsgYmV5b25kIHRoYXQsIGl0IGRvZXNuJ3Qga25v dyB3aGF0IHdlJ3JlIGRvaW5nIHdpdGggdGhlbSwgCnNvIHRoZSB0aGlyZCAoImNsb2JiZXJzIikg cGFyYW1ldGVyIGhhcyB0byBzYXkgIm1lbW9yeSIgaS5lLiB0cmVhdCAqYWxsKiAKbWVtb3J5IGNv bnRlbnRzIGFzIHVua25vd24gYWZ0ZXIgdGhpcy4KCltbRnJvbSBHQ0MgZG9jczogVGhlICJtZW1v cnkiIGNsb2JiZXIgdGVsbHMgdGhlIGNvbXBpbGVyIHRoYXQgdGhlIAphc3NlbWJseSBjb2RlIHBl cmZvcm1zIG1lbW9yeSByZWFkcyBvciB3cml0ZXMgdG8gaXRlbXMgb3RoZXIgdGhhbiB0aG9zZSAK bGlzdGVkIGluIHRoZSBpbnB1dCBhbmQgb3V0cHV0IG9wZXJhbmRzIChmb3IgZXhhbXBsZSwgYWNj ZXNzaW5nIHRoZSAKbWVtb3J5IHBvaW50ZWQgdG8gYnkgb25lIG9mIHRoZSBpbnB1dCBwYXJhbWV0 ZXJzKS4gVG8gZW5zdXJlIG1lbW9yeSAKY29udGFpbnMgY29ycmVjdCB2YWx1ZXMsIEdDQyBtYXkg bmVlZCB0byBmbHVzaCBzcGVjaWZpYyByZWdpc3RlciB2YWx1ZXMgCnRvIG1lbW9yeSBiZWZvcmUg ZXhlY3V0aW5nIHRoZSBhc20uIEZ1cnRoZXIsIHRoZSBjb21waWxlciBkb2VzIG5vdCAKYXNzdW1l IHRoYXQgYW55IHZhbHVlcyByZWFkIGZyb20gbWVtb3J5IGJlZm9yZSBhbiBhc20gcmVtYWluIHVu Y2hhbmdlZCAKYWZ0ZXIgdGhhdCBhc207IGl0IHJlbG9hZHMgdGhlbSBhcyBuZWVkZWQuIFVzaW5n IHRoZSAibWVtb3J5IiBjbG9iYmVyIAplZmZlY3RpdmVseSBmb3JtcyBhIHJlYWQvd3JpdGUgbWVt b3J5IGJhcnJpZXIgZm9yIHRoZSBjb21waWxlci5dXQoKQlRXLCBzaG91bGQgd2Ugbm90IHRlbGwg aXQgd2UndmUgKmFsc28qIGNsb2JiZXJlZCAleG1tWzAtM10/CgpTbyB0aGV5J3JlIGJvdGggY29y cmVjdCwganVzdCB0YWtpbmcgZGlmZmVyZW50IGFwcHJvYWNoZXMuIEkgZG9uJ3Qga25vdyAKd2hp Y2ggd291bGQgZ2l2ZSB0aGUgYmVzdCBwZXJmb3JtYW5jZSBmb3IgdGhpcyBzcGVjaWZpYyBjYXNl LgoKLkRhdmUuCgpfX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f XwpJbnRlbC1nZnggbWFpbGluZyBsaXN0CkludGVsLWdmeEBsaXN0cy5mcmVlZGVza3RvcC5vcmcK aHR0cHM6Ly9saXN0cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9pbnRlbC1nZngK