From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Ahern Subject: Re: [PATCH v2 00/11] KVM in-guest performance monitoring Date: Wed, 15 Jun 2011 06:40:15 -0600 Message-ID: <4DF8A82F.5090900@cisco.com> References: <1307972106-2468-1-git-send-email-avi@redhat.com> <4DF66B1A.6060606@cisco.com> <4DF71DA3.2080300@redhat.com> <4DF7972F.3040103@cisco.com> <4DF79941.9050705@siemens.com> <4DF79B6F.10102@cisco.com> <4DF79EFE.1050201@cisco.com> <4DF7A436.8090308@cisco.com> <4DF873FD.6040903@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: Marcelo Tosatti , kvm@vger.kernel.org, Ingo Molnar , Peter Zijlstra , Arnaldo Carvalho de Melo To: Avi Kivity Return-path: Received: from sj-iport-2.cisco.com ([171.71.176.71]:48013 "EHLO sj-iport-2.cisco.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751768Ab1FOMkP (ORCPT ); Wed, 15 Jun 2011 08:40:15 -0400 In-Reply-To: <4DF873FD.6040903@redhat.com> Sender: kvm-owner@vger.kernel.org List-ID: On 06/15/2011 02:57 AM, Avi Kivity wrote: > Okay. If you do anything interesting with it, please let us know. I > only tested the watchdog, 'perf top', and 'perf stat'. > For the following I was using the userspace command from latest perf-core branch. cycles H/W event is not working for me, so perf-top did not do much other than start. perf-stat -ddd shows a whole lot of 0's - which is interesting. It means time enabled and time running are non-0, yet the counter value is 0. cycles and instructions events also show as not counted Command I was playing with: taskset -c 1 chrt -r 1 perf stat -ddd openssl speed aes Performance counter stats for 'openssl speed aes': 46111.369065 task-clock # 0.984 CPUs utilized 195 context-switches # 0.000 M/sec 0 CPU-migrations # 0.000 M/sec 650 page-faults # 0.000 M/sec cycles 0 stalled-cycles-frontend # 0.00% frontend cycles idle [ 7.63%] 0 stalled-cycles-backend # 0.00% backend cycles idle [12.70%] instructions 801,002,999 branches # 17.371 M/sec [ 8.15%] 8,491,676 branch-misses # 1.06% of all branches [15.17%] 0 L1-dcache-loads # 0.000 M/sec [ 9.23%] 0 L1-dcache-load-misses # 0.00% of all L1-dcache hits [ 8.48%] 0 LLC-loads # 0.000 M/sec [13.89%] 0 LLC-load-misses # 0.00% of all LL-cache hits [12.47%] 0 L1-icache-loads # 0.000 M/sec [ 9.46%] 0 L1-icache-load-misses # 0.00% of all L1-icache hits [ 9.44%] 0 dTLB-loads # 0.000 M/sec [ 9.59%] 0 dTLB-load-misses # 0.00% of all dTLB cache hits [11.00%] 0 iTLB-loads # 0.000 M/sec [11.13%] 0 iTLB-load-misses # 0.00% of all iTLB cache hits [ 9.73%] 0 L1-dcache-prefetches # 0.000 M/sec [10.98%] 0 L1-dcache-prefetch-misses # 0.000 M/sec [12.51%] 46.851192693 seconds time elapsed Also, the numbers for branches and branch-misses just seem wrong compared to the same command run in the host as well as running perf-stat in the host on the vcpu thread running openssl (with the vcpu pinned to a pcpu). And then reality kicked in and I had to move on to other items. David