From mboxrd@z Thu Jan 1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:39705) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZvzkO-0004xD-TF for qemu-devel@nongnu.org; Mon, 09 Nov 2015 22:37:29 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZvzkM-0008C9-7s for qemu-devel@nongnu.org; Mon, 09 Nov 2015 22:37:28 -0500
Received: from mga14.intel.com ([192.55.52.115]:42092) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZvzkM-0008Br-2Y for qemu-devel@nongnu.org; Mon, 09 Nov 2015 22:37:26 -0500
From: Liang Li
Date: Tue, 10 Nov 2015 11:31:35 +0800
Message-Id: <1447126297-27239-1-git-send-email-liang.z.li@intel.com>
Subject: [Qemu-devel] [v2 RESEND 0/2] add avx2 instruction optimization
To: qemu-devel@nongnu.org
Cc: quintela@redhat.com, Liang Li , mst@redhat.com, amit.shah@redhat.com, pbonzini@redhat.com

buffer_find_nonzero_offset() is a hot function during live migration.
It currently uses SSE2 instructions for optimization. On platforms that
support AVX2, using AVX2 instructions instead improves its performance
by about 30% compared to SSE2.

The zero page check becomes faster with this optimization. Test results
show that for an idle guest with 8GB of RAM, this series shortens the
total live migration time by about 6%.

This series uses the ifunc mechanism to select the proper function at
run time: on platforms that support AVX2, the AVX2 implementation is
executed; otherwise, the original code is executed. With this series
applied, a QEMU binary built with AVX2 enabled runs on platforms both
with and without AVX2 support. If QEMU is built with AVX2 disabled, or
if the compiler does not support AVX2, the binary contains no AVX2
instructions and likewise runs on both kinds of platform.
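To illustrate the run-time selection described above, here is a minimal sketch. It is not the code from this series. It assumes GCC or Clang, and it swaps ifunc for a plain function pointer that is resolved on first call, so the idea can be shown without relying on ELF ifunc support. With real ifunc, the selector would instead be bound through `__attribute__((ifunc("...")))`. The names find_nonzero_scalar(), find_nonzero_avx2(), select_find_nonzero() and find_nonzero_offset() are hypothetical, and the byte-granular semantics are simplified compared with buffer_find_nonzero_offset().

```c
#include <stddef.h>

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <immintrin.h>
#define HAVE_AVX2_SKETCH 1   /* hypothetical macro, not from the series */
#endif

/* Portable fallback: offset of the first nonzero byte, or len if none. */
static size_t find_nonzero_scalar(const void *buf, size_t len)
{
    const unsigned char *p = buf;
    size_t i;

    for (i = 0; i < len; i++) {
        if (p[i]) {
            return i;
        }
    }
    return len;
}

#ifdef HAVE_AVX2_SKETCH
/* AVX2 variant: skip all-zero 32-byte blocks, then finish bytewise.
 * The target attribute limits AVX2 code generation to this function. */
__attribute__((target("avx2")))
static size_t find_nonzero_avx2(const void *buf, size_t len)
{
    const unsigned char *p = buf;
    size_t i = 0;

    for (; i + 32 <= len; i += 32) {
        __m256i v = _mm256_loadu_si256((const __m256i *)(p + i));
        if (!_mm256_testz_si256(v, v)) {
            break;              /* this block contains a nonzero byte */
        }
    }
    return i + find_nonzero_scalar(p + i, len - i);
}
#endif

typedef size_t (*find_nonzero_fn)(const void *, size_t);

/* Pick an implementation based on what the running CPU supports. */
static find_nonzero_fn select_find_nonzero(void)
{
#ifdef HAVE_AVX2_SKETCH
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx2")) {
        return find_nonzero_avx2;
    }
#endif
    return find_nonzero_scalar;
}

/* Public entry point: resolve once, then call through the pointer. */
size_t find_nonzero_offset(const void *buf, size_t len)
{
    static find_nonzero_fn impl;

    if (!impl) {
        impl = select_find_nonzero();
    }
    return impl(buf, len);
}
```

Callers only ever use find_nonzero_offset(). The same binary therefore runs on hosts with and without AVX2, because the AVX2 code path is entered only after the CPU check succeeds.
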
Liang Li (2):
  cutils: add avx2 instruction optimization
  configure: add options to config avx2

 configure             | 29 ++++++++++++++++++++++
 include/qemu-common.h | 28 +++++++++++++++------
 util/Makefile.objs    |  2 ++
 util/avx2.c           | 68 +++++++++++++++++++++++++++++++++++++++++++++++++++
 util/cutils.c         | 47 +++++++++++++++++++++++++++++++++--
 5 files changed, 165 insertions(+), 9 deletions(-)
 create mode 100644 util/avx2.c

-- 
1.9.1