From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mike Galbraith Subject: Re: [RFC/RFT][PATCH v3 0/6] sched/cpuidle: Idle loop rework Date: Sat, 10 Mar 2018 06:01:31 +0100 Message-ID: <1520658091.15339.4.camel@suse.de> References: <2450532.XN8DODrtDf@aspire.rjw.lan> Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-15" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <2450532.XN8DODrtDf@aspire.rjw.lan> Sender: linux-kernel-owner@vger.kernel.org To: "Rafael J. Wysocki" , Peter Zijlstra , Linux PM , Frederic Weisbecker Cc: Thomas Gleixner , Paul McKenney , Thomas Ilsche , Doug Smythies , Rik van Riel , Aubrey Li , LKML List-Id: linux-pm@vger.kernel.org On Fri, 2018-03-09 at 10:34 +0100, Rafael J. Wysocki wrote: > Hi All, > > Thanks a lot for the discussion and testing so far! > > This is a total respin of the whole series, so please look at it afresh. > Patches 2 and 3 are the most similar to their previous versions, but > still they are different enough. Respin of testdrive... i4790 booted nopti nospectre_v2 30 sec tbench 4.16.0.g1b88acc-master (virgin) Throughput 559.279 MB/sec 1 clients 1 procs max_latency=0.046 ms Throughput 997.119 MB/sec 2 clients 2 procs max_latency=0.246 ms Throughput 1693.04 MB/sec 4 clients 4 procs max_latency=4.309 ms Throughput 3597.2 MB/sec 8 clients 8 procs max_latency=6.760 ms Throughput 3474.55 MB/sec 16 clients 16 procs max_latency=6.743 ms 4.16.0.g1b88acc-master (+ v2) Throughput 588.929 MB/sec 1 clients 1 procs max_latency=0.291 ms Throughput 1080.93 MB/sec 2 clients 2 procs max_latency=0.639 ms Throughput 1826.3 MB/sec 4 clients 4 procs max_latency=0.647 ms Throughput 3561.01 MB/sec 8 clients 8 procs max_latency=1.279 ms Throughput 3382.98 MB/sec 16 clients 16 procs max_latency=4.817 ms 4.16.0.g1b88acc-master (+ v3) Throughput 588.711 MB/sec 1 clients 1 procs max_latency=0.067 ms Throughput 1077.71 MB/sec 2 clients 2 procs max_latency=0.298 ms Throughput 1803.47 MB/sec 4 clients 4 procs max_latency=0.667 ms Throughput 3591.4 MB/sec 8 clients 8 procs max_latency=4.999 ms Throughput 3444.74 MB/sec 16 clients 16 procs max_latency=1.995 ms 4.16.0.g1b88acc-master (+ my local patches) Throughput 722.559 MB/sec 1 clients 1 procs max_latency=0.087 ms Throughput 1208.59 MB/sec 2 clients 2 procs max_latency=0.289 ms Throughput 2071.94 MB/sec 4 clients 4 procs max_latency=0.654 ms Throughput 3784.91 MB/sec 8 clients 8 procs max_latency=0.974 ms Throughput 3644.4 MB/sec 16 clients 16 procs max_latency=5.620 ms turbostat -q -- firefox /root/tmp/video/BigBuckBunny-DivXPlusHD.mkv & sleep 300;killall firefox PkgWatt 1 2 3 4.16.0.g1b88acc-master 6.95 7.03 6.91 (virgin) 4.16.0.g1b88acc-master 7.20 7.25 7.26 (+v2) 4.16.0.g1b88acc-master 7.04 6.97 7.07 (+v3) 4.16.0.g1b88acc-master 6.90 7.06 6.95 (+my patches) No change wrt nohz high frequency cross core scheduling overhead, but the light load power consumption oddity did go away. (btw, don't read anything into max_latency numbers, that's GUI noise) -Mike