From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S933681AbbBCHpN (ORCPT ); Tue, 3 Feb 2015 02:45:13 -0500
Received: from mga03.intel.com ([134.134.136.65]:12438 "EHLO mga03.intel.com"
        rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
        id S1754951AbbBCHpJ (ORCPT ); Tue, 3 Feb 2015 02:45:09 -0500
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="5.09,511,1418112000";
   d="yaml'?scan'208";a="680072460"
Message-ID: <1422949504.13951.5.camel@linux.intel.com>
Subject: [LKP] [mm] 721c21c17ab: +11.7% will-it-scale.per_thread_ops
From: Huang Ying
To: Will Deacon
Cc: Linus Torvalds, LKML, LKP ML
Date: Tue, 03 Feb 2015 15:45:04 +0800
Content-Type: multipart/mixed; boundary="=-bStCTqPnXEls3LO9eWQP"
X-Mailer: Evolution 3.12.9-1+b1
Mime-Version: 1.0
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

--=-bStCTqPnXEls3LO9eWQP
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 8bit

FYI, we noticed the below changes on

commit 721c21c17ab958abf19a8fc611c3bd4743680e38 ("mm: mmu_gather: use tlb->end != 0 only for TLB invalidation")


testbox/testcase/testparams: nhm4/will-it-scale/performance-readseek1

       v3.19-rc4  721c21c17ab958abf19a8fc611
----------------  --------------------------
         %stddev    %change          %stddev
             \          |                \
      0.56 ±  1%      +5.2%       0.59 ±  1%  will-it-scale.scalability
   1807741 ±  0%      +2.3%    1848641 ±  0%  will-it-scale.per_thread_ops
       740 ± 30%     +40.9%       1043 ± 26%  sched_debug.cpu#4.ttwu_local
      1335 ± 20%     +23.7%       1651 ± 17%  sched_debug.cpu#4.ttwu_count
       506 ±  9%     +40.8%        712 ±  1%  cpuidle.C1-NHM.usage
       120 ±  9%     +33.1%        160 ± 11%  sched_debug.cpu#7.load
       120 ±  9%     +26.2%        151 ± 10%  sched_debug.cfs_rq[7]:/.load
        90 ±  5%     -16.2%         75 ± 16%  sched_debug.cpu#6.cpu_load[4]
        96 ±  7%     +16.7%        112 ± 10%  sched_debug.cfs_rq[2]:/.runnable_load_avg

testbox/testcase/testparams: nhm4/will-it-scale/performance-pread2

       v3.19-rc4  721c21c17ab958abf19a8fc611
----------------  --------------------------
    900692 ±  1%     +11.7%    1005724 ±  0%  will-it-scale.per_thread_ops
  28033529 ±  0%      -1.2%   27698665 ±  0%  will-it-scale.time.voluntary_context_switches
       671 ± 22%     +40.4%        942 ± 27%  sched_debug.cfs_rq[7]:/.blocked_load_avg
       802 ± 19%     +30.9%       1049 ± 25%  sched_debug.cfs_rq[7]:/.tg_load_contrib
     44840 ±  6%     +15.6%      51846 ±  6%  meminfo.DirectMap4k
     18284 ±  1%      -7.4%      16926 ±  2%  vmstat.system.in
    378463 ±  0%      -1.2%     373746 ±  0%  vmstat.system.cs

testbox/testcase/testparams: nhm4/will-it-scale/performance-readseek3

       v3.19-rc4  721c21c17ab958abf19a8fc611
----------------  --------------------------
      0.55 ±  0%      +9.9%       0.60 ±  5%  will-it-scale.scalability
   1791707 ±  0%      +2.9%    1843202 ±  0%  will-it-scale.per_thread_ops
       187 ± 41%    +167.3%        501 ± 23%  sched_debug.cfs_rq[0]:/.blocked_load_avg
       281 ± 29%    +121.3%        622 ± 18%  sched_debug.cfs_rq[0]:/.tg_load_contrib
       110 ±  9%     +25.5%        138 ± 13%  sched_debug.cfs_rq[5]:/.load
       110 ±  9%     +25.9%        138 ± 13%  sched_debug.cpu#5.load
       178 ±  6%     -19.5%        144 ± 16%  sched_debug.cpu#4.cpu_load[1]
        94 ±  6%     +12.9%        107 ±  8%  sched_debug.cfs_rq[3]:/.runnable_load_avg
      1.78 ±  7%     +17.4%       2.09 ±  0%  perf-profile.cpu-cycles.put_page.shmem_file_read_iter.new_sync_read.__vfs_read.vfs_read
       187 ±  9%     -19.1%        152 ± 16%  sched_debug.cpu#4.cpu_load[2]
       757 ±  5%     +10.6%        838 ±  2%  slabinfo.kmalloc-2048.active_objs
      3064 ±  7%      +7.8%       3302 ±  6%  sched_debug.cpu#1.curr->pid
      5.23 ±  2%      +8.8%       5.69 ±  4%  perf-profile.cpu-cycles.security_file_permission.rw_verify_area.vfs_read.sys_read.system_call_fastpath
      3.23 ±  4%      +8.0%       3.48 ±  5%  perf-profile.cpu-cycles.copy_page_to_iter_iovec.copy_page_to_iter.shmem_file_read_iter.new_sync_read.__vfs_read
      4216 ±  7%      +7.5%       4531 ±  5%  slabinfo.kmalloc-192.active_objs

nhm4: Nehalem
Memory: 4G

lkp-sbx04: Sandy Bridge-EX
Memory: 64G


[ASCII plot: will-it-scale.per_thread_ops per bisect sample, y-axis from 880000 to 1.04e+06. The bisect-good samples lie roughly between 880000 and 920000; the bisect-bad samples lie roughly between 980000 and 1.04e+06.]

        [*] bisect-good sample
        [O] bisect-bad  sample

To reproduce:

        apt-get install ruby ruby-oj
        git clone git://git.kernel.org/pub/scm/linux/kernel/git/wfg/lkp-tests.git
        cd lkp-tests
        bin/setup-local job.yaml # the job file attached in this email
        bin/run-local   job.yaml


Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Huang, Ying


--=-bStCTqPnXEls3LO9eWQP
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="job.yaml"
Content-Transfer-Encoding: 7bit

---
testcase: will-it-scale
default-monitors:
  wait: pre-test
  uptime:
  iostat:
  vmstat:
  numa-numastat:
  numa-vmstat:
  numa-meminfo:
  proc-vmstat:
  proc-stat:
  meminfo:
  slabinfo:
  interrupts:
  lock_stat:
  latency_stats:
  softirqs:
  bdi_dev_mapping:
  diskstats:
  cpuidle:
  cpufreq:
  turbostat:
  sched_debug:
    interval: 10
  pmeter:
default_watchdogs:
  watch-oom:
  watchdog:
cpufreq_governor:
- performance
commit: 634b0bd490b7ebd7a054cea4f7e0d25748bde678
model: Nehalem
nr_cpu: 8
memory: 4G
hdd_partitions: "/dev/disk/by-id/ata-WDC_WD1003FBYZ-010FB0_WD-WCAW36812041-part1"
swap_partitions: "/dev/disk/by-id/ata-WDC_WD1003FBYZ-010FB0_WD-WCAW36812041-part2"
rootfs_partition: "/dev/disk/by-id/ata-WDC_WD1003FBYZ-010FB0_WD-WCAW36812041-part3"
netconsole_port: 6649
perf-profile:
  freq: 800
will-it-scale:
  test:
  - pread2
testbox: nhm4
tbox_group: nhm4
kconfig: x86_64-rhel
enqueue_time: 2015-01-16 19:39:12.848821511 +08:00
head_commit: 634b0bd490b7ebd7a054cea4f7e0d25748bde678
base_commit: eaa27f34e91a14cdceed26ed6c6793ec1d186115
branch: next/master
kernel: "/kernel/x86_64-rhel/634b0bd490b7ebd7a054cea4f7e0d25748bde678/vmlinuz-3.19.0-rc4-next-20150116-g634b0bd"
user: lkp
queue: cyclic
rootfs: debian-x86_64-2015-01-13.cgz
result_root: "/result/nhm4/will-it-scale/performance-pread2/debian-x86_64-2015-01-13.cgz/x86_64-rhel/634b0bd490b7ebd7a054cea4f7e0d25748bde678/0"
job_file: "/lkp/scheduled/nhm4/cyclic_will-it-scale-performance-pread2-x86_64-rhel-HEAD-634b0bd490b7ebd7a054cea4f7e0d25748bde678-0.yaml"
dequeue_time: 2015-01-17 02:30:35.332402855 +08:00
job_state: finished
loadavg: 5.12 3.21 1.34 1/123 6702
start_time: '1421433067'
end_time: '1421433372'
version: "/lkp/lkp/.src-20150116-113525"

--=-bStCTqPnXEls3LO9eWQP
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="reproduce"
Content-Transfer-Encoding: 7bit

./runtest.py pread2 32 1 4 6 8

--=-bStCTqPnXEls3LO9eWQP
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

_______________________________________________
LKP mailing list
LKP@linux.intel.com

--=-bStCTqPnXEls3LO9eWQP--