From: Paolo Bonzini <pbonzini@redhat.com>
To: qemu-devel@nongnu.org
Cc: rth@twiddle.net
Subject: [Qemu-devel] [PATCH 08/10] cutils: Add SSE4 version
Date: Tue, 13 Sep 2016 18:10:03 +0200
Message-ID: <1473783005-113609-9-git-send-email-pbonzini@redhat.com>
In-Reply-To: <1473783005-113609-1-git-send-email-pbonzini@redhat.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
---
util/bufferiszero.c | 10 ++++++++++
1 file changed, 10 insertions(+)
diff --git a/util/bufferiszero.c b/util/bufferiszero.c
index d21d2af..932b809 100644
--- a/util/bufferiszero.c
+++ b/util/bufferiszero.c
@@ -107,6 +107,13 @@ ACCEL_BUFFER_ZERO(buffer_zero_sse2, 64, __m128i, SSE2_NONZERO)
 #ifdef CONFIG_AVX2_OPT
 #pragma GCC push_options
+#pragma GCC target("sse4")
+#include <smmintrin.h>
+#define SSE4_NONZERO(X) !_mm_testz_si128((X), (X))
+ACCEL_BUFFER_ZERO(buffer_zero_sse4, 64, __m128i, SSE4_NONZERO)
+#pragma GCC pop_options
+
+#pragma GCC push_options
 #pragma GCC target("avx2")
 #include <immintrin.h>
 #define AVX2_NONZERO(X) !_mm256_testz_si256((X), (X))
@@ -173,6 +180,9 @@ static bool select_accel_fn(const void *buf, size_t len)
     if (len % 128 == 0 && ibuf % 32 == 0 && (cpuid_cache & CACHE_AVX2)) {
         return buffer_zero_avx2(buf, len);
     }
+    if (len % 64 == 0 && ibuf % 16 == 0 && (cpuid_cache & CACHE_SSE4)) {
+        return buffer_zero_sse4(buf, len);
+    }
 #endif
     if (len % 64 == 0 && ibuf % 16 == 0 && (cpuid_cache & CACHE_SSE2)) {
         return buffer_zero_sse2(buf, len);
--
1.8.3.1
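
For readers without the ACCEL_BUFFER_ZERO macro at hand, here is a minimal
standalone sketch of the technique the patch relies on: test 64 bytes per
iteration with PTEST (_mm_testz_si128) and dispatch to that routine only when
the length is a multiple of 64, the pointer is 16-byte aligned and the CPU
reports SSE4.1. The names sketch_zero_sse4, sketch_zero_scalar and
sketch_buffer_is_zero are hypothetical, OR-ing four vectors before the test is
an assumption about the loop shape, and __builtin_cpu_supports stands in for
QEMU's cpuid_cache. It is an illustration under those assumptions, not the
code the macro expands to.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__x86_64__) || defined(__i386__)
#include <smmintrin.h>

/* Hypothetical stand-in for the function ACCEL_BUFFER_ZERO would generate.
 * Preconditions mirror the checks in select_accel_fn. */
__attribute__((target("sse4.1")))
static bool sketch_zero_sse4(const void *buf, size_t len)
{
    assert(len % 64 == 0 && (uintptr_t)buf % 16 == 0);

    const __m128i *p = buf;
    const __m128i *end = p + len / sizeof(__m128i);

    for (; p < end; p += 4) {
        /* Combine four aligned 16-byte vectors (64 bytes) into one. */
        __m128i t = _mm_or_si128(_mm_or_si128(p[0], p[1]),
                                 _mm_or_si128(p[2], p[3]));
        /* _mm_testz_si128(t, t) returns 1 only if every bit of t is 0. */
        if (!_mm_testz_si128(t, t)) {
            return false;
        }
    }
    return true;
}
#endif

/* Byte-at-a-time fallback for buffers that do not meet the preconditions. */
static bool sketch_zero_scalar(const void *buf, size_t len)
{
    const unsigned char *p = buf;

    for (size_t i = 0; i < len; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* Hypothetical dispatcher; the runtime CPU check is an assumption that
 * replaces the cpuid_cache & CACHE_SSE4 test from the patch. */
static bool sketch_buffer_is_zero(const void *buf, size_t len)
{
#if defined(__x86_64__) || defined(__i386__)
    if (len > 0 && len % 64 == 0 && (uintptr_t)buf % 16 == 0 &&
        __builtin_cpu_supports("sse4.1")) {
        return sketch_zero_sse4(buf, len);
    }
#endif
    return sketch_zero_scalar(buf, len);
}
```

The target attribute plays the same role as the push_options/target pragma
pair in the patch: only the one function is compiled with SSE4.1 enabled, so
the rest of the file still runs on older CPUs.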
2016-09-13 16:09 [Qemu-devel] [PATCH v4 00/10] Improve buffer_is_zero Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 01/10] cutils: Move buffer_is_zero and subroutines to a new file Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 02/10] cutils: Remove SPLAT macro Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 03/10] cutils: Export only buffer_is_zero Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 04/10] cutils: Rearrange buffer_is_zero acceleration Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 05/10] cutils: Remove aarch64 buffer zero checking Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 06/10] cutils: Remove ppc " Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 07/10] cutils: Add test for buffer_is_zero Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 08/10] cutils: Add SSE4 version Paolo Bonzini [this message]
2016-09-13 16:10 ` [Qemu-devel] [PATCH 09/10] cutils: Add generic prefetch Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 10/10] cutils: Rewrite x86 buffer zero checking Paolo Bonzini
2016-09-13 16:27 ` Richard Henderson
2016-09-13 16:33 ` Paolo Bonzini