linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Some event modifiers missing from output and desire option to compute value based on measurements
@ 2014-03-13 16:58 William Cohen
  2014-03-13 17:30 ` Andi Kleen
  0 siblings, 1 reply; 5+ messages in thread
From: William Cohen @ 2014-03-13 16:58 UTC (permalink / raw)
  To: linux-perf-users

When experimenting with perf I wanted to have separate counts for events in userspace and the kernel.  I used:

$ perf stat   -e instructions:u -e instructions:k -e cycles:u -e cycles:k -e cache-misses:u -e cache-misses:k make

The associated  output below includes the event modifiers for all the events, but the 3.06 and 0.37 insns per cycles look off.  Shouldn't that instructions:u/cycles:u and instructions:k/cycles:k be the values reported for "insns per cycle"?

Performance counter stats for 'make':

   340,034,597,510 instructions:u            #    3.06  insns per cycle         [83.70%]
    40,963,149,231 instructions:k            #    0.37  insns per cycle         [83.67%]
   185,451,244,302 cycles:u                  [83.54%]
    36,901,938,457 cycles:k                  [83.64%]
        34,811,408 cache-misses:u                                               [83.96%]
       102,781,614 cache-misses:k                                               [83.85%]

      66.290033775 seconds time elapsed

For the L1-icache-load-misses and iTLB-load-misses the event modifier appears to be dropped as shown in the output below

$ perf stat   -e instructions:u -e instructions:k -e cycles:u -e cycles:k -e L1-icache-load-misses:u -e L1-icache-load-misses:k make

 Performance counter stats for 'make':

   340,522,617,398 instructions:u            #    3.09  insns per cycle         [83.55%]
    41,045,130,555 instructions:k            #    0.37  insns per cycle         [83.82%]
   184,082,319,783 cycles:u                  [84.12%]
    36,447,301,873 cycles:k                  [84.21%]
       849,438,930 L1-icache-load-misses                                        [82.96%]
       445,141,089 L1-icache-load-misses                                        [83.67%]

      65.611650172 seconds time elapsed




$ perf stat   -e instructions:u -e instructions:k -e cycles:u -e cycles:k -e iTLB-load-misses:u -e iTLB-load-misses:k make


 2,074,873,836,988 instructions:u            #    2.09  insns per cycle         [83.47%]
   128,412,604,104 instructions:k            #    0.13  insns per cycle         [83.51%]
 1,786,252,236,017 cycles:u                  [83.52%]
   202,469,325,995 cycles:k                  [83.46%]
       786,419,505 iTLB-load-misses                                             [83.44%]
        40,548,044 iTLB-load-misses                                             [83.43%]

    3800.440742009 seconds time elapsed



It appears that the output is listing the measurements in the same order they are specified on the command line, but it would be nice if the output was clearer on the events being measured.  If I am reading the output correctly, the L1-icache-load-misses per instruction is pretty poor for kernel-space.  Much of the time I am looking at ratios of events and it would be nice if "perf stat" had a way to have it compute the ratios directly. Maybe a "-m, --math" option allowing algebraic expressions where you could do:

perf stat   -e instructions:u -e instructions:k -e cycles:u -e cycles:k -e L1-icache-load-misses:u -e L1-icache-load-misses:k \
--math instructions:u/icache-load-misses:u \
--math instructions:k/icache-load-misses:k \
make

 Performance counter stats for 'make':

$ perf stat   -e instructions:u -e instructions:k -e cycles:u -e cycles:k -e L1-icache-load-misses:u -e L1-icache-load-misses:k make

 Performance counter stats for 'make':

   340,522,617,398 instructions:u            #    3.09  insns per cycle         [83.55%]
    41,045,130,555 instructions:k            #    0.37  insns per cycle         [83.82%]
   184,082,319,783 cycles:u                  [84.12%]
    36,447,301,873 cycles:k                  [84.21%]
       849,438,930 L1-icache-load-misses                                        [82.96%]
       445,141,089 L1-icache-load-misses                                        [83.67%]
            401.09 instructions:u/icache-load-misses:u
             92.20 instructions:k/icache-load-misses:k

      65.611650172 seconds time elapsed

-Will

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2014-03-21 14:16 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-03-13 16:58 Some event modifiers missing from output and desire option to compute value based on measurements William Cohen
2014-03-13 17:30 ` Andi Kleen
2014-03-13 18:39   ` William Cohen
2014-03-21 12:49     ` Christopher Covington
2014-03-21 14:16       ` David Ahern

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).