From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-pf0-x242.google.com (mail-pf0-x242.google.com [IPv6:2607:f8b0:400e:c00::242]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by lists.ozlabs.org (Postfix) with ESMTPS id 3xz2Mr03SXzDrJL for ; Fri, 22 Sep 2017 15:38:47 +1000 (AEST) Received: by mail-pf0-x242.google.com with SMTP id g65so63646pfe.1 for ; Thu, 21 Sep 2017 22:38:47 -0700 (PDT) From: wei.guo.simon@gmail.com To: linuxppc-dev@lists.ozlabs.org Cc: Paul Mackerras , Michael Ellerman , "Naveen N. Rao" , David Laight , Christophe LEROY , Simon Guo Subject: [PATCH v2 0/3] powerpc/64: memcmp() optimization Date: Thu, 21 Sep 2017 07:34:37 +0800 Message-Id: <1505950480-14830-1-git-send-email-wei.guo.simon@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , From: Simon Guo There is some room to optimize memcmp() in powerpc 64 bits version for following 2 cases: (1) Even src/dst addresses are not aligned with 8 bytes at the beginning, memcmp() can align them and go with .Llong comparision mode without fallback to .Lshort comparision mode do compare buffer byte by byte. (2) VMX instructions can be used to speed up for large size comparision. This patch set also updates memcmp selftest case to make it compiled and incorporate large size comparison case. v1 -> v2: - update 8bytes unaligned bytes comparison method. - fix a VMX comparision bug. - enhanced the original memcmp() selftest. - add powerpc/64 to subject/commit message. Simon Guo (3): powerpc/64: Align bytes before fall back to .Lshort in powerpc64 memcmp(). powerpc/64: enhance memcmp() with VMX instruction for long bytes comparision powerpc:selftest update memcmp_64 selftest for VMX implementation arch/powerpc/include/asm/asm-prototypes.h | 2 +- arch/powerpc/lib/copypage_power7.S | 2 +- arch/powerpc/lib/memcmp_64.S | 181 ++++++++++++++++++++- arch/powerpc/lib/memcpy_power7.S | 2 +- arch/powerpc/lib/vmx-helper.c | 2 +- .../selftests/powerpc/copyloops/asm/ppc_asm.h | 2 +- .../selftests/powerpc/stringloops/asm/ppc_asm.h | 31 ++++ .../testing/selftests/powerpc/stringloops/memcmp.c | 63 ++++--- 8 files changed, 254 insertions(+), 31 deletions(-) -- 1.8.3.1