From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-8.8 required=3.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id B99DAC2D0F6 for ; Tue, 12 May 2020 16:40:28 +0000 (UTC) Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 8675620714 for ; Tue, 12 May 2020 16:40:28 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=linaro.org header.i=@linaro.org header.b="scwn7P6r" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 8675620714 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=linaro.org Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Received: from localhost ([::1]:39300 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jYXxD-0007yF-Gw for qemu-devel@archiver.kernel.org; Tue, 12 May 2020 12:40:27 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:50492) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jYXvx-00068l-S5 for qemu-devel@nongnu.org; Tue, 12 May 2020 12:39:09 -0400 Received: from mail-wr1-x444.google.com ([2a00:1450:4864:20::444]:37781) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1jYXvw-00060N-DW for qemu-devel@nongnu.org; Tue, 12 May 2020 12:39:09 -0400 Received: by mail-wr1-x444.google.com with SMTP id l17so2743217wrr.4 for ; Tue, 12 May 2020 09:39:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=y9FKmH11fW+PJwlUH1phqvbD/5vZmoJNtxztGQmDla8=; b=scwn7P6rRRugQ9MXOhXHKmbNKPx8pEm7X6VpdnQdVTC4GSSBtD+iejTXQkFT0VCnsE xl61oj39/YQ92/CalhVGOkqZQFvOA/Qc5yt6NU+faIYIqq/faI+edonziiin5dgPiuuZ YzfNoo9YUfLWt+XDkqbJB7jFIS9GGLV711xLGA22iCjtzb2DVy4/XMEzXkIa96IB6Lna EpnoCkP52XKXlsDJiwuqVNTLbbBbC/U8onw+fUiHzzTCyUSlncC2kSRrOb8/lphEGooU BNr32gfyc1XEjM4CSvxjrn8qFXIpGtgQgsqkIJn06/pC2+SiVemsIwyn0o0+2QCOmoSF Us+w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=y9FKmH11fW+PJwlUH1phqvbD/5vZmoJNtxztGQmDla8=; b=WNVfXiy9FEZ9QbjlZNOvR6tk9UBWTQXIxSY0xk0I8T/ToPKiXssKYWLTDYgCuq4NZ5 +xAv7nfyOSdtnWtrZ7NQA6+GwJLVN3MG7a19Yh8EPRctZz9flAbiAWzJLQLsFMTUYnac 4unGIAAJqj/O0iBjWvfxpOXZRxefri2jS7pRvMNmOI299an4gtp5StdUtyyrMK2xPViu am4LaaRxCEkGJIeJbSdddFRNLb0XWQHOCNnhvHjimPFxcyOhIFzLh5ILSlYEgmVxlQo7 Rd1CkuC++ellr0DmyaU5fcqzB/Nn4Oea9yQvu7yqjrCzGLxKhAb2oHgjUNvPbqx1g8TY EIgQ== X-Gm-Message-State: AGi0PuYYmM4gkjY3GN+t07q5mmPIG7ASwWwPBVG+TJ+r61j2BIfYWDGH pyVA54mSzSDOW1UiMwBCod28yA== X-Google-Smtp-Source: APiQypJ42skAKT/YEZO05i1jNXY3SX9MVF5Julz7dRrrbMgA3SVF41bnN0RIYw+52G5VHI+gFm721A== X-Received: by 2002:a5d:694d:: with SMTP id r13mr15447112wrw.238.1589301546779; Tue, 12 May 2020 09:39:06 -0700 (PDT) Received: from orth.archaic.org.uk (orth.archaic.org.uk. [81.2.115.148]) by smtp.gmail.com with ESMTPSA id o205sm18525981wmo.32.2020.05.12.09.39.05 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 May 2020 09:39:06 -0700 (PDT) From: Peter Maydell To: qemu-arm@nongnu.org, qemu-devel@nongnu.org Subject: [PATCH v2 00/17] target/arm: Convert rest of Neon 3-reg-same to decodetree Date: Tue, 12 May 2020 17:38:47 +0100 Message-Id: <20200512163904.10918-1-peter.maydell@linaro.org> X-Mailer: git-send-email 2.20.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2a00:1450:4864:20::444; envelope-from=peter.maydell@linaro.org; helo=mail-wr1-x444.google.com X-detected-operating-system: by eggs.gnu.org: No matching host in p0f cache. That's all we know. X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001 autolearn=_AUTOLEARN X-Spam_action: no action X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Richard Henderson Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" This patchset is v2 of the Neon decodetree conversion. The first half of v1 is in master already, so we're left just with patches converting the rest of the 3-reg-same Neon dp insn group. Based-on: <20200508152200.6547-1-richard.henderson@linaro.org> ("[PATCH v3 00/16] target/arm: partial vector cleanup") Strictly speaking, based on that with a fixup for the VRSRA bug, but I think patchew should be placated by that based-on tag. Git tree available at: https://git.linaro.org/people/peter.maydell/qemu-arm.git neon-decodetree with the whole patchstack including RTH's series. Changes from v1: * the first 19 or so patches have been upstreamed * patch 1 (VQRDMLAH/VQRDMLSH) uses do_3same() now * patch 3 (64-bit elt 3-reg-same): shifts now use a format which swaps Vn and Vm in decode, so we don't need to special case them in the C code. We also use the gvec interface rather than hand-rolling a for-each-pass loop. * patch 4: make DO_3SAME_32() handle just one trans fn rather than doing both _U and _S in one macro invocation. Use gvec rather than hand-rolling the for-each-pass loop. patch 5: (vaba/vabd): new, since rth's patchset rewrote how these were handled and they can now just be handled via DO_3SAME_NO_SZ_3() * patch 7: saturating shift handling rewritten to use the gvec APIs. * patch 10 (vqdmulh/vqrdmulh): rewritten to use gvec * patch 11 (vadd/vsub): rewritten to use gvec; vabd now in an earlier patch with vaba * patch 13 (vmul/vmla/vmls): vmul uses gvec; do_3same_fp() and DO_3S_FP() now added in this patch as it is the first user * new patch 15 making recps_f32 and rsqrts_f32 easier to use with common gvec APIs and macros by moving the 'env' argument to the front * patch 16: updated VRECPS/VRSQRTS code to use gvec NB: I have not attempted to merge VQSHL_S and VQSHL_S64 into one pattern in patch 7 (as suggested in review of v1) -- adding yet another DO_3SAME_FOO for the case of "64 bit and 32 bit can be done with same trans/gen fn" didn't seem worthwhile. thanks -- PMM Peter Maydell (17): target/arm: Convert Neon 3-reg-same VQRDMLAH/VQRDMLSH to decodetree target/arm: Convert Neon 3-reg-same SHA to decodetree target/arm: Convert Neon 64-bit element 3-reg-same insns target/arm: Convert Neon VHADD 3-reg-same insns target/arm: Convert Neon VABA/VABD 3-reg-same to decodetree target/arm: Convert Neon VRHADD, VHSUB 3-reg-same insns to decodetree target/arm: Convert Neon VQSHL, VRSHL, VQRSHL 3-reg-same insns to decodetree target/arm: Convert Neon VPMAX/VPMIN 3-reg-same insns to decodetree target/arm: Convert Neon VPADD 3-reg-same insns to decodetree target/arm: Convert Neon VQDMULH/VQRDMULH 3-reg-same to decodetree target/arm: Convert Neon VADD, VSUB, VABD 3-reg-same insns to decodetree target/arm: Convert Neon VPMIN/VPMAX/VPADD float 3-reg-same insns to decodetree target/arm: Convert Neon fp VMUL, VMLA, VMLS 3-reg-same insns to decodetree target/arm: Convert Neon 3-reg-same compare insns to decodetree target/arm: Move 'env' argument of recps_f32 and rsqrts_f32 helpers to usual place target/arm: Convert Neon fp VMAX/VMIN/VMAXNM/VMINNM/VRECPS/VRSQRTS to decodetree target/arm: Convert NEON VFMA, VFMS 3-reg-same insns to decodetree target/arm/helper.h | 7 +- target/arm/neon-dp.decode | 113 +++++- target/arm/neon_helper.c | 7 - target/arm/translate-neon.inc.c | 639 ++++++++++++++++++++++++++++++++ target/arm/translate.c | 495 +------------------------ target/arm/vec_helper.c | 7 + target/arm/vfp_helper.c | 4 +- 7 files changed, 767 insertions(+), 505 deletions(-) -- 2.20.1