From: Ye Xiaolong <xiaolong.ye@intel.com>
To: lkp@lists.01.org
Subject: Re: [lkp-developer] [sched/core] 6b94780e45: unixbench.score -4.5% regression
Date: Tue, 03 Jan 2017 15:13:16 +0800 [thread overview]
Message-ID: <20170103071316.GA2823@yexl-desktop> (raw)
In-Reply-To: <20170102145637.GA8760@linaro.org>
[-- Attachment #1: Type: text/plain, Size: 14870 bytes --]
On 01/02, Vincent Guittot wrote:
>Hi Xiaolong,
>
>Le Monday 19 Dec 2016 à 08:14:53 (+0800), kernel test robot a écrit :
>>
>> Greeting,
>>
>> FYI, we noticed a -4.5% regression of unixbench.score due to commit:
>
>I have been able to restore performance on my platform with the patch below.
>Could you test it ?
>
>---
> kernel/sched/core.c | 1
> 1 file changed, 1 insertion(+)
>
>diff --git a/kernel/sched/core.c b/kernel/sched/core.c
>index 393759b..6e7d45c 100644
>--- a/kernel/sched/core.c
>+++ b/kernel/sched/core.c
>@@ -2578,6 +2578,7 @@ void wake_up_new_task(struct task_struct *p)
> __set_task_cpu(p, select_task_rq(p, task_cpu(p), SD_BALANCE_FORK, 0));
> #endif
> rq = __task_rq_lock(p, &rf);
>+ update_rq_clock(rq);
> post_init_entity_util_avg(&p->se);
>
> activate_task(rq, p, 0);
>--
>2.7.4
>
>Vincent
Hi, Vincent,
I applied your fix patch on top of 6b94780 ("sched/core: Use load_avg for selecting idlest group"),
and here is the comparison. (60df283834fd4def3c11ad2de3 is the fix commit id).
Seems the performance hasn't been restored back.
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
gcc-6/performance/x86_64-rhel-7.2/100%/debian-x86_64-2016-08-31.cgz/300s/lkp-wsm-ep1/shell1/unixbench
commit:
f519a3f1c6b7a990e5aed37a8f853c6ecfdee945
6b94780e45c17b83e3e75f8aaca5a328db583c74
60df283834fd4def3c11ad2de3e6fc9e81b7dff1
f519a3f1c6b7a990 6b94780e45c17b83e3e75f8aac 60df283834fd4def3c11ad2de3
---------------- -------------------------- --------------------------
%stddev %change %stddev %change %stddev
\ | \ | \
25565 ± 0% -4.5% 24414 ± 0% -4.5% 24421 ± 0% unixbench.score
13223805 ± 2% -19.6% 10628072 ± 0% -21.3% 10412818 ± 1% unixbench.time.involuntary_context_switches
9.232e+08 ± 0% -4.3% 8.831e+08 ± 0% -4.3% 8.838e+08 ± 0% unixbench.time.minor_page_faults
1807 ± 0% -5.4% 1709 ± 0% -5.6% 1705 ± 0% unixbench.time.percent_of_cpu_this_job_got
5656 ± 0% -6.8% 5271 ± 0% -7.3% 5243 ± 0% unixbench.time.system_time
5743 ± 0% -4.0% 5514 ± 0% -3.9% 5516 ± 0% unixbench.time.user_time
29557557 ± 0% -2.6% 28781098 ± 0% -2.2% 28919280 ± 0% unixbench.time.voluntary_context_switches
741766 ± 2% -62.4% 279054 ± 1% -61.8% 283034 ± 1% interrupts.CAL:Function_call_interrupts
2912823 ± 0% -9.7% 2630010 ± 0% -8.7% 2660077 ± 0% softirqs.RCU
13223805 ± 2% -19.6% 10628072 ± 0% -21.3% 10412818 ± 1% time.involuntary_context_switches
126250 ± 0% -12.2% 110890 ± 0% -11.5% 111739 ± 0% vmstat.system.cs
31060 ± 1% -9.2% 28214 ± 0% -9.6% 28078 ± 0% vmstat.system.in
454.50 ±150% +164.7% 1203 ±166% +792.3% 4055 ± 18% numa-numastat.node0.numa_foreign
454.50 ±150% +164.7% 1203 ±166% +792.3% 4055 ± 18% numa-numastat.node0.numa_miss
4297 ± 15% -18.1% 3520 ± 57% -84.5% 666.40 ±113% numa-numastat.node1.numa_foreign
4297 ± 15% -18.1% 3520 ± 57% -84.5% 666.40 ±113% numa-numastat.node1.numa_miss
78.58 ± 0% -5.6% 74.20 ± 0% -6.0% 73.90 ± 0% turbostat.%Busy
2507 ± 0% -5.6% 2366 ± 0% -6.0% 2356 ± 0% turbostat.Avg_MHz
3.01 ± 2% +100.4% 6.03 ± 2% +100.1% 6.02 ± 0% turbostat.CPU%c3
2.35 ± 1% +6.8% 2.51 ± 4% +12.1% 2.64 ± 1% turbostat.CPU%c6
1.25 ± 5% -17.1% 1.04 ± 22% -32.3% 0.85 ± 5% perf-profile.children.cycles-pp.__irqentry_text_start
Thanks,
Xiaolong
>
>>
>>
>> commit: 6b94780e45c17b83e3e75f8aaca5a328db583c74 ("sched/core: Use load_avg for selecting idlest group")
>> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
>>
>> in testcase: unixbench
>> on test machine: 24 threads Nehalem-EP with 24G memory
>> with following parameters:
>>
>> runtime: 300s
>> nr_task: 100%
>> test: shell1
>> cpufreq_governor: performance
>>
>> test-description: UnixBench is the original BYTE UNIX benchmark suite aims to test performance of Unix-like system.
>> test-url: https://github.com/kdlucas/byte-unixbench
>>
>> In addition to that, the commit also has significant impact on the following tests:
>>
>> +------------------+-----------------------------------------------------------------------+
>> | testcase: change | unixbench: unixbench.score -2.9% regression |
>> | test machine | 8 threads Intel(R) Core(TM) i7 CPU 870 @ 2.93GHz with 6G memory |
>> | test parameters | nr_task=1 |
>> | | runtime=300s |
>> | | test=shell8 |
>> +------------------+-----------------------------------------------------------------------+
>>
>>
>> Details are as below:
>> -------------------------------------------------------------------------------------------------->
>>
>>
>> To reproduce:
>>
>> git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
>> cd lkp-tests
>> bin/lkp install job.yaml # job file is attached in this email
>> bin/lkp run job.yaml
>>
>> testcase/path_params/tbox_group/run: unixbench/300s-100%-shell1-performance/lkp-wsm-ep1
>>
>> f519a3f1c6b7a990 6b94780e45c17b83e3e75f8aac
>> ---------------- --------------------------
>> 25565 -5% 24414 unixbench.score
>> 29557557 28781098 unixbench.time.voluntary_context_switches
>> 5743 -4% 5514 unixbench.time.user_time
>> 9.232e+08 -4% 8.831e+08 unixbench.time.minor_page_faults
>> 1807 -5% 1709 unixbench.time.percent_of_cpu_this_job_got
>> 5656 -7% 5271 unixbench.time.system_time
>> 13223805 -20% 10628072 unixbench.time.involuntary_context_switches
>> 741766 -62% 279054 interrupts.CAL:Function_call_interrupts
>> 31060 -9% 28214 vmstat.system.in
>> 126250 -12% 110890 vmstat.system.cs
>> 78.58 -6% 74.20 turbostat.%Busy
>> 2507 -6% 2366 turbostat.Avg_MHz
>> 9134 ± 47% -6e+03 2973 ± 36% latency_stats.max.pipe_read.__vfs_read.vfs_read.SyS_read.entry_SYSCALL_64_fastpath
>> 380879 ± 10% 5e+05 887692 ± 49% latency_stats.sum.wait_on_page_bit_killable.__lock_page_or_retry.filemap_fault.__do_fault.handle_mm_fault.__do_page_fault.do_page_fault.page_fault
>> 31710 ± 15% -2e+04 10583 ± 14% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.do_munmap.vm_munmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64.return_from_SYSCALL_64
>> 51796 ± 4% -4e+04 15457 ± 10% latency_stats.sum.call_rwsem_down_write_failed.unlink_file_vma.free_pgtables.unmap_region.do_munmap.vm_munmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64
>> 111998 ± 18% -7e+04 37074 ± 14% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.do_munmap.mmap_region.do_mmap.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
>> 275087 ± 15% -2e+05 81973 ± 3% latency_stats.sum.call_rwsem_down_write_failed.unlink_file_vma.free_pgtables.unmap_region.do_munmap.mmap_region.do_mmap.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
>> 930993 ± 12% -6e+05 320520 ± 4% latency_stats.sum.call_rwsem_down_write_failed.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.vm_mmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64
>> 4755783 ± 9% -3e+06 1619348 ± 4% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.split_vma.mprotect_fixup.do_mprotect_pkey.SyS_mprotect.entry_SYSCALL_64_fastpath
>> 5536067 ± 10% -4e+06 1929338 ± 3% latency_stats.sum.call_rwsem_down_write_failed.copy_process._do_fork.SyS_clone.do_syscall_64.return_from_SYSCALL_64
>> 9.032e+08 -4% 8.64e+08 perf-stat.page-faults
>> 9.032e+08 -4% 8.64e+08 perf-stat.minor-faults
>> 2.329e+09 2.269e+09 perf-stat.node-load-misses
>> 2.2e+09 -9% 2.011e+09 ± 5% perf-stat.dTLB-store-misses
>> 3.278e+10 -9% 2.987e+10 ± 6% perf-stat.dTLB-load-misses
>> 19484819 13% 21974129 perf-stat.cpu-migrations
>> 3.755e+13 -6% 3.54e+13 perf-stat.cpu-cycles
>> 3244 4% 3379 perf-stat.instructions-per-iTLB-miss
>> 4.536e+12 -4% 4.332e+12 perf-stat.branch-instructions
>> 2.303e+13 -4% 2.208e+13 perf-stat.instructions
>> 5.768e+12 -4% 5.517e+12 perf-stat.dTLB-loads
>> 3.567e+11 -4% 3.414e+11 perf-stat.cache-references
>> 2.97 2.93 perf-stat.branch-miss-rate%
>> 2.768e+10 2.699e+10 perf-stat.node-stores
>> 5.446e+10 -3% 5.275e+10 perf-stat.cache-misses
>> 0.03 -4% 0.03 perf-stat.iTLB-load-miss-rate%
>> 9.673e+09 -4% 9.294e+09 perf-stat.node-loads
>> 3.596e+12 -4% 3.442e+12 perf-stat.dTLB-stores
>> 0.61 0.62 perf-stat.ipc
>> 1.347e+11 -6% 1.27e+11 perf-stat.branch-misses
>> 7.098e+09 -8% 6.533e+09 perf-stat.iTLB-load-misses
>> 2.309e+13 -4% 2.206e+13 perf-stat.iTLB-loads
>> 79911173 -12% 70187035 perf-stat.context-switches
>>
>>
>>
>> turbostat._Busy
>>
>> 90 ++-------------------------------------*---*---------------------------+
>> | .. *...*.. |
>> 80 *+..*..*...*..*...*..*...*..*...O...* O O O O O...O..O...O O O
>> 70 O+ O O O O O O O O |
>> | |
>> 60 ++ |
>> 50 ++ |
>> | |
>> 40 ++ |
>> 30 ++ |
>> | |
>> 20 ++ |
>> 10 ++ |
>> | |
>> 0 ++----------------------------------O----------------------------------+
>>
>>
>>
>>
>>
>> unixbench.time.percent_of_cpu_this_job_got
>>
>> 2500 ++-------------------------------------------------------------------+
>> | |
>> | .*... |
>> 2000 ++ .*. *..*... |
>> *..*...*..*...*..*...*..*...*..O...*. O O O O O..O...O..O O O
>> O O O O O O O O O |
>> 1500 ++ |
>> | |
>> 1000 ++ |
>> | |
>> | |
>> 500 ++ |
>> | |
>> | |
>> 0 ++---------------------------------O---------------------------------+
>>
>>
>> vmstat.system.in
>>
>> 40000 ++------------------------------------------------------------------+
>> | .*...*.. |
>> 35000 ++ .*...*. |
>> 30000 *+.*...*..*...*..*..*...*..*...*..*. *..*...*..* |
>> O O O O O O O O O O O O O O O O O O O O
>> 25000 ++ |
>> | |
>> 20000 ++ |
>> | |
>> 15000 ++ |
>> 10000 ++ |
>> | |
>> 5000 ++ |
>> | |
>> 0 ++--------------------------------O---------------------------------+
>>
>> [*] bisect-good sample
>> [O] bisect-bad sample
>>
>>
>> Disclaimer:
>> Results have been estimated based on internal Intel analysis and are provided
>> for informational purposes only. Any difference in system hardware or software
>> design or configuration may affect actual performance.
>>
>>
>> Thanks,
>> Xiaolong
>
WARNING: multiple messages have this Message-ID (diff)
From: Ye Xiaolong <xiaolong.ye@intel.com>
To: Vincent Guittot <vincent.guittot@linaro.org>
Cc: LKML <linux-kernel@vger.kernel.org>, lkp@01.org
Subject: Re: [lkp-developer] [sched/core] 6b94780e45: unixbench.score -4.5% regression
Date: Tue, 3 Jan 2017 15:13:16 +0800 [thread overview]
Message-ID: <20170103071316.GA2823@yexl-desktop> (raw)
In-Reply-To: <20170102145637.GA8760@linaro.org>
On 01/02, Vincent Guittot wrote:
>Hi Xiaolong,
>
>Le Monday 19 Dec 2016 à 08:14:53 (+0800), kernel test robot a écrit :
>>
>> Greeting,
>>
>> FYI, we noticed a -4.5% regression of unixbench.score due to commit:
>
>I have been able to restore performance on my platform with the patch below.
>Could you test it ?
>
>---
> kernel/sched/core.c | 1
> 1 file changed, 1 insertion(+)
>
>diff --git a/kernel/sched/core.c b/kernel/sched/core.c
>index 393759b..6e7d45c 100644
>--- a/kernel/sched/core.c
>+++ b/kernel/sched/core.c
>@@ -2578,6 +2578,7 @@ void wake_up_new_task(struct task_struct *p)
> __set_task_cpu(p, select_task_rq(p, task_cpu(p), SD_BALANCE_FORK, 0));
> #endif
> rq = __task_rq_lock(p, &rf);
>+ update_rq_clock(rq);
> post_init_entity_util_avg(&p->se);
>
> activate_task(rq, p, 0);
>--
>2.7.4
>
>Vincent
Hi, Vincent,
I applied your fix patch on top of 6b94780 ("sched/core: Use load_avg for selecting idlest group"),
and here is the comparison. (60df283834fd4def3c11ad2de3 is the fix commit id).
Seems the performance hasn't been restored back.
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_task/rootfs/runtime/tbox_group/test/testcase:
gcc-6/performance/x86_64-rhel-7.2/100%/debian-x86_64-2016-08-31.cgz/300s/lkp-wsm-ep1/shell1/unixbench
commit:
f519a3f1c6b7a990e5aed37a8f853c6ecfdee945
6b94780e45c17b83e3e75f8aaca5a328db583c74
60df283834fd4def3c11ad2de3e6fc9e81b7dff1
f519a3f1c6b7a990 6b94780e45c17b83e3e75f8aac 60df283834fd4def3c11ad2de3
---------------- -------------------------- --------------------------
%stddev %change %stddev %change %stddev
\ | \ | \
25565 ± 0% -4.5% 24414 ± 0% -4.5% 24421 ± 0% unixbench.score
13223805 ± 2% -19.6% 10628072 ± 0% -21.3% 10412818 ± 1% unixbench.time.involuntary_context_switches
9.232e+08 ± 0% -4.3% 8.831e+08 ± 0% -4.3% 8.838e+08 ± 0% unixbench.time.minor_page_faults
1807 ± 0% -5.4% 1709 ± 0% -5.6% 1705 ± 0% unixbench.time.percent_of_cpu_this_job_got
5656 ± 0% -6.8% 5271 ± 0% -7.3% 5243 ± 0% unixbench.time.system_time
5743 ± 0% -4.0% 5514 ± 0% -3.9% 5516 ± 0% unixbench.time.user_time
29557557 ± 0% -2.6% 28781098 ± 0% -2.2% 28919280 ± 0% unixbench.time.voluntary_context_switches
741766 ± 2% -62.4% 279054 ± 1% -61.8% 283034 ± 1% interrupts.CAL:Function_call_interrupts
2912823 ± 0% -9.7% 2630010 ± 0% -8.7% 2660077 ± 0% softirqs.RCU
13223805 ± 2% -19.6% 10628072 ± 0% -21.3% 10412818 ± 1% time.involuntary_context_switches
126250 ± 0% -12.2% 110890 ± 0% -11.5% 111739 ± 0% vmstat.system.cs
31060 ± 1% -9.2% 28214 ± 0% -9.6% 28078 ± 0% vmstat.system.in
454.50 ±150% +164.7% 1203 ±166% +792.3% 4055 ± 18% numa-numastat.node0.numa_foreign
454.50 ±150% +164.7% 1203 ±166% +792.3% 4055 ± 18% numa-numastat.node0.numa_miss
4297 ± 15% -18.1% 3520 ± 57% -84.5% 666.40 ±113% numa-numastat.node1.numa_foreign
4297 ± 15% -18.1% 3520 ± 57% -84.5% 666.40 ±113% numa-numastat.node1.numa_miss
78.58 ± 0% -5.6% 74.20 ± 0% -6.0% 73.90 ± 0% turbostat.%Busy
2507 ± 0% -5.6% 2366 ± 0% -6.0% 2356 ± 0% turbostat.Avg_MHz
3.01 ± 2% +100.4% 6.03 ± 2% +100.1% 6.02 ± 0% turbostat.CPU%c3
2.35 ± 1% +6.8% 2.51 ± 4% +12.1% 2.64 ± 1% turbostat.CPU%c6
1.25 ± 5% -17.1% 1.04 ± 22% -32.3% 0.85 ± 5% perf-profile.children.cycles-pp.__irqentry_text_start
Thanks,
Xiaolong
>
>>
>>
>> commit: 6b94780e45c17b83e3e75f8aaca5a328db583c74 ("sched/core: Use load_avg for selecting idlest group")
>> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git master
>>
>> in testcase: unixbench
>> on test machine: 24 threads Nehalem-EP with 24G memory
>> with following parameters:
>>
>> runtime: 300s
>> nr_task: 100%
>> test: shell1
>> cpufreq_governor: performance
>>
>> test-description: UnixBench is the original BYTE UNIX benchmark suite aims to test performance of Unix-like system.
>> test-url: https://github.com/kdlucas/byte-unixbench
>>
>> In addition to that, the commit also has significant impact on the following tests:
>>
>> +------------------+-----------------------------------------------------------------------+
>> | testcase: change | unixbench: unixbench.score -2.9% regression |
>> | test machine | 8 threads Intel(R) Core(TM) i7 CPU 870 @ 2.93GHz with 6G memory |
>> | test parameters | nr_task=1 |
>> | | runtime=300s |
>> | | test=shell8 |
>> +------------------+-----------------------------------------------------------------------+
>>
>>
>> Details are as below:
>> -------------------------------------------------------------------------------------------------->
>>
>>
>> To reproduce:
>>
>> git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
>> cd lkp-tests
>> bin/lkp install job.yaml # job file is attached in this email
>> bin/lkp run job.yaml
>>
>> testcase/path_params/tbox_group/run: unixbench/300s-100%-shell1-performance/lkp-wsm-ep1
>>
>> f519a3f1c6b7a990 6b94780e45c17b83e3e75f8aac
>> ---------------- --------------------------
>> 25565 -5% 24414 unixbench.score
>> 29557557 28781098 unixbench.time.voluntary_context_switches
>> 5743 -4% 5514 unixbench.time.user_time
>> 9.232e+08 -4% 8.831e+08 unixbench.time.minor_page_faults
>> 1807 -5% 1709 unixbench.time.percent_of_cpu_this_job_got
>> 5656 -7% 5271 unixbench.time.system_time
>> 13223805 -20% 10628072 unixbench.time.involuntary_context_switches
>> 741766 -62% 279054 interrupts.CAL:Function_call_interrupts
>> 31060 -9% 28214 vmstat.system.in
>> 126250 -12% 110890 vmstat.system.cs
>> 78.58 -6% 74.20 turbostat.%Busy
>> 2507 -6% 2366 turbostat.Avg_MHz
>> 9134 ± 47% -6e+03 2973 ± 36% latency_stats.max.pipe_read.__vfs_read.vfs_read.SyS_read.entry_SYSCALL_64_fastpath
>> 380879 ± 10% 5e+05 887692 ± 49% latency_stats.sum.wait_on_page_bit_killable.__lock_page_or_retry.filemap_fault.__do_fault.handle_mm_fault.__do_page_fault.do_page_fault.page_fault
>> 31710 ± 15% -2e+04 10583 ± 14% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.do_munmap.vm_munmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64.return_from_SYSCALL_64
>> 51796 ± 4% -4e+04 15457 ± 10% latency_stats.sum.call_rwsem_down_write_failed.unlink_file_vma.free_pgtables.unmap_region.do_munmap.vm_munmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64
>> 111998 ± 18% -7e+04 37074 ± 14% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.do_munmap.mmap_region.do_mmap.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
>> 275087 ± 15% -2e+05 81973 ± 3% latency_stats.sum.call_rwsem_down_write_failed.unlink_file_vma.free_pgtables.unmap_region.do_munmap.mmap_region.do_mmap.vm_mmap_pgoff.SyS_mmap_pgoff.SyS_mmap.entry_SYSCALL_64_fastpath
>> 930993 ± 12% -6e+05 320520 ± 4% latency_stats.sum.call_rwsem_down_write_failed.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.vm_mmap.elf_map.load_elf_binary.search_binary_handler.do_execveat_common.SyS_execve.do_syscall_64
>> 4755783 ± 9% -3e+06 1619348 ± 4% latency_stats.sum.call_rwsem_down_write_failed.__vma_adjust.__split_vma.split_vma.mprotect_fixup.do_mprotect_pkey.SyS_mprotect.entry_SYSCALL_64_fastpath
>> 5536067 ± 10% -4e+06 1929338 ± 3% latency_stats.sum.call_rwsem_down_write_failed.copy_process._do_fork.SyS_clone.do_syscall_64.return_from_SYSCALL_64
>> 9.032e+08 -4% 8.64e+08 perf-stat.page-faults
>> 9.032e+08 -4% 8.64e+08 perf-stat.minor-faults
>> 2.329e+09 2.269e+09 perf-stat.node-load-misses
>> 2.2e+09 -9% 2.011e+09 ± 5% perf-stat.dTLB-store-misses
>> 3.278e+10 -9% 2.987e+10 ± 6% perf-stat.dTLB-load-misses
>> 19484819 13% 21974129 perf-stat.cpu-migrations
>> 3.755e+13 -6% 3.54e+13 perf-stat.cpu-cycles
>> 3244 4% 3379 perf-stat.instructions-per-iTLB-miss
>> 4.536e+12 -4% 4.332e+12 perf-stat.branch-instructions
>> 2.303e+13 -4% 2.208e+13 perf-stat.instructions
>> 5.768e+12 -4% 5.517e+12 perf-stat.dTLB-loads
>> 3.567e+11 -4% 3.414e+11 perf-stat.cache-references
>> 2.97 2.93 perf-stat.branch-miss-rate%
>> 2.768e+10 2.699e+10 perf-stat.node-stores
>> 5.446e+10 -3% 5.275e+10 perf-stat.cache-misses
>> 0.03 -4% 0.03 perf-stat.iTLB-load-miss-rate%
>> 9.673e+09 -4% 9.294e+09 perf-stat.node-loads
>> 3.596e+12 -4% 3.442e+12 perf-stat.dTLB-stores
>> 0.61 0.62 perf-stat.ipc
>> 1.347e+11 -6% 1.27e+11 perf-stat.branch-misses
>> 7.098e+09 -8% 6.533e+09 perf-stat.iTLB-load-misses
>> 2.309e+13 -4% 2.206e+13 perf-stat.iTLB-loads
>> 79911173 -12% 70187035 perf-stat.context-switches
>>
>>
>>
>> turbostat._Busy
>>
>> 90 ++-------------------------------------*---*---------------------------+
>> | .. *...*.. |
>> 80 *+..*..*...*..*...*..*...*..*...O...* O O O O O...O..O...O O O
>> 70 O+ O O O O O O O O |
>> | |
>> 60 ++ |
>> 50 ++ |
>> | |
>> 40 ++ |
>> 30 ++ |
>> | |
>> 20 ++ |
>> 10 ++ |
>> | |
>> 0 ++----------------------------------O----------------------------------+
>>
>>
>>
>>
>>
>> unixbench.time.percent_of_cpu_this_job_got
>>
>> 2500 ++-------------------------------------------------------------------+
>> | |
>> | .*... |
>> 2000 ++ .*. *..*... |
>> *..*...*..*...*..*...*..*...*..O...*. O O O O O..O...O..O O O
>> O O O O O O O O O |
>> 1500 ++ |
>> | |
>> 1000 ++ |
>> | |
>> | |
>> 500 ++ |
>> | |
>> | |
>> 0 ++---------------------------------O---------------------------------+
>>
>>
>> vmstat.system.in
>>
>> 40000 ++------------------------------------------------------------------+
>> | .*...*.. |
>> 35000 ++ .*...*. |
>> 30000 *+.*...*..*...*..*..*...*..*...*..*. *..*...*..* |
>> O O O O O O O O O O O O O O O O O O O O
>> 25000 ++ |
>> | |
>> 20000 ++ |
>> | |
>> 15000 ++ |
>> 10000 ++ |
>> | |
>> 5000 ++ |
>> | |
>> 0 ++--------------------------------O---------------------------------+
>>
>> [*] bisect-good sample
>> [O] bisect-bad sample
>>
>>
>> Disclaimer:
>> Results have been estimated based on internal Intel analysis and are provided
>> for informational purposes only. Any difference in system hardware or software
>> design or configuration may affect actual performance.
>>
>>
>> Thanks,
>> Xiaolong
>
next prev parent reply other threads:[~2017-01-03 7:13 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-12-19 0:14 [lkp-developer] [sched/core] 6b94780e45: unixbench.score -4.5% regression kernel test robot
2016-12-19 0:14 ` kernel test robot
2017-01-02 14:56 ` Vincent Guittot
2017-01-02 14:56 ` Vincent Guittot
2017-01-03 7:13 ` Ye Xiaolong [this message]
2017-01-03 7:13 ` Ye Xiaolong
2017-01-03 9:01 ` Vincent Guittot
2017-01-03 9:01 ` Vincent Guittot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170103071316.GA2823@yexl-desktop \
--to=xiaolong.ye@intel.com \
--cc=lkp@lists.01.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.