From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:60057) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bNfLA-0003cI-Af for qemu-devel@nongnu.org; Thu, 14 Jul 2016 08:02:05 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bNfL7-0005fv-6j for qemu-devel@nongnu.org; Thu, 14 Jul 2016 08:02:04 -0400 Received: from mail-wm0-x22a.google.com ([2a00:1450:400c:c09::22a]:35174) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bNfL6-0005fj-MO for qemu-devel@nongnu.org; Thu, 14 Jul 2016 08:02:01 -0400 Received: by mail-wm0-x22a.google.com with SMTP id f65so64009459wmi.0 for ; Thu, 14 Jul 2016 05:02:00 -0700 (PDT) References: <1468354426-837-1-git-send-email-sergey.fedorov@linaro.org> <148a0c48-f58c-6d95-8b97-6449670019ac@redhat.com> From: Alex =?utf-8?Q?Benn=C3=A9e?= In-reply-to: <148a0c48-f58c-6d95-8b97-6449670019ac@redhat.com> Date: Thu, 14 Jul 2016 13:02:01 +0100 Message-ID: <87wpko4edi.fsf@linaro.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Subject: Re: [Qemu-devel] [PATCH v3 00/11] Reduce lock contention on TCG hot-path List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Paolo Bonzini Cc: Sergey Fedorov , qemu-devel@nongnu.org, mttcg@listserver.greensocs.com, fred.konrad@greensocs.com, a.rigo@virtualopensystems.com, serge.fdrv@gmail.com, cota@braap.org, bobby.prani@gmail.com, rth@twiddle.net, peter.maydell@linaro.org, patches@linaro.org, claudio.fontana@huawei.com, mark.burton@greensocs.com, jan.kiszka@siemens.com Paolo Bonzini writes: > On 12/07/2016 22:13, Sergey Fedorov wrote: >> From: Sergey Fedorov > > Have you measured performance with the series? In any case, it's nice > to see MTTCG finally taking shape! Here are some numbers on the multi-threaded pigz test: Before: retry.py called with ['./arm-linux-user/qemu-arm', './pigz.armhf', '-c', '-9', 'linux-4.6.3.tar'] Source code is @ pull-tcg-20160708-104-g9ec3025 or heads/review/hot-patch-v3 run 1: ret=0 (PASS), time=32.285497 (1/1) run 2: ret=0 (PASS), time=32.035293 (2/2) run 3: ret=0 (PASS), time=31.784781 (3/3) run 4: ret=0 (PASS), time=32.035136 (4/4) run 5: ret=0 (PASS), time=32.285612 (5/5) Results summary: 0: 5 times (100.00%), avg time 32.085 (0.04 varience/0.21 deviation) After: Ran command 5 times, 5 passes retry.py called with ['./arm-linux-user/qemu-arm', './pigz.armhf', '-c', '-9', 'linux-4.6.3.tar'] Source code is @ pull-tcg-20160708-115-gf317fa8 or heads/review/hot-patch-v3 run 1: ret=0 (PASS), time=29.281950 (1/1) run 2: ret=0 (PASS), time=29.285588 (2/2) run 3: ret=0 (PASS), time=29.282065 (3/3) run 4: ret=0 (PASS), time=29.282397 (4/4) run 5: ret=0 (PASS), time=29.282045 (5/5) Results summary: 0: 5 times (100.00%), avg time 29.283 (0.00 varience/0.00 deviation) Ran command 5 times, 5 passes Which gives a roughly 10% improvement for heavily threaded code. Looking at the perf data it looks like the hotest part of the code now is cpu_get_tb_cpu_state which is required to get the initial hash to search for the next tb. -- Alex Bennée