qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Paolo Bonzini <pbonzini@redhat.com>
To: qemu-devel@nongnu.org
Cc: rth@twiddle.net
Subject: [Qemu-devel] [PATCH 08/10] cutils: Add SSE4 version
Date: Tue, 13 Sep 2016 18:10:03 +0200	[thread overview]
Message-ID: <1473783005-113609-9-git-send-email-pbonzini@redhat.com> (raw)
In-Reply-To: <1473783005-113609-1-git-send-email-pbonzini@redhat.com>

Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
---
 util/bufferiszero.c | 10 ++++++++++
 1 file changed, 10 insertions(+)

diff --git a/util/bufferiszero.c b/util/bufferiszero.c
index d21d2af..932b809 100644
--- a/util/bufferiszero.c
+++ b/util/bufferiszero.c
@@ -107,6 +107,13 @@ ACCEL_BUFFER_ZERO(buffer_zero_sse2, 64, __m128i, SSE2_NONZERO)
 
 #ifdef CONFIG_AVX2_OPT
 #pragma GCC push_options
+#pragma GCC target("sse4")
+#include <smmintrin.h>
+#define SSE4_NONZERO(X)  !_mm_testz_si128((X), (X))
+ACCEL_BUFFER_ZERO(buffer_zero_sse4, 64, __m128i, SSE4_NONZERO)
+#pragma GCC pop_options
+
+#pragma GCC push_options
 #pragma GCC target("avx2")
 #include <immintrin.h>
 #define AVX2_NONZERO(X)  !_mm256_testz_si256((X), (X))
@@ -173,6 +180,9 @@ static bool select_accel_fn(const void *buf, size_t len)
     if (len % 128 == 0 && ibuf % 32 == 0 && (cpuid_cache & CACHE_AVX2)) {
         return buffer_zero_avx2(buf, len);
     }
+    if (len % 64 == 0 && ibuf % 16 == 0 && (cpuid_cache & CACHE_SSE4)) {
+        return buffer_zero_sse4(buf, len);
+    }
 #endif
     if (len % 64 == 0 && ibuf % 16 == 0 && (cpuid_cache & CACHE_SSE2)) {
         return buffer_zero_sse2(buf, len);
-- 
1.8.3.1

  parent reply	other threads:[~2016-09-13 16:11 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-09-13 16:09 [Qemu-devel] [PATCH v4 00/10] Improve buffer_is_zero Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 01/10] cutils: Move buffer_is_zero and subroutines to a new file Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 02/10] cutils: Remove SPLAT macro Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 03/10] cutils: Export only buffer_is_zero Paolo Bonzini
2016-09-13 16:09 ` [Qemu-devel] [PATCH 04/10] cutils: Rearrange buffer_is_zero acceleration Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 05/10] cutils: Remove aarch64 buffer zero checking Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 06/10] cutils: Remove ppc " Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 07/10] cutils: Add test for buffer_is_zero Paolo Bonzini
2016-09-13 16:10 ` Paolo Bonzini [this message]
2016-09-13 16:10 ` [Qemu-devel] [PATCH 09/10] cutils: Add generic prefetch Paolo Bonzini
2016-09-13 16:10 ` [Qemu-devel] [PATCH 10/10] cutils: Rewrite x86 buffer zero checking Paolo Bonzini
2016-09-13 16:27   ` Richard Henderson
2016-09-13 16:33     ` Paolo Bonzini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1473783005-113609-9-git-send-email-pbonzini@redhat.com \
    --to=pbonzini@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=rth@twiddle.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).