From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.3 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 711E3C433E6 for ; Tue, 26 Jan 2021 04:52:02 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 3621622B3F for ; Tue, 26 Jan 2021 04:52:02 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728636AbhAZEvV (ORCPT ); Mon, 25 Jan 2021 23:51:21 -0500 Received: from outbound-smtp25.blacknight.com ([81.17.249.193]:42788 "EHLO outbound-smtp25.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726160AbhAYJRR (ORCPT ); Mon, 25 Jan 2021 04:17:17 -0500 Received: from mail.blacknight.com (pemlinmail04.blacknight.ie [81.17.254.17]) by outbound-smtp25.blacknight.com (Postfix) with ESMTPS id E0F5F42091 for ; Mon, 25 Jan 2021 09:04:20 +0000 (GMT) Received: (qmail 4005 invoked from network); 25 Jan 2021 09:04:20 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.22.4]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 25 Jan 2021 09:04:20 -0000 Date: Mon, 25 Jan 2021 09:04:19 +0000 From: Mel Gorman To: "Li, Aubrey" Cc: Vincent Guittot , Peter Zijlstra , Ingo Molnar , Qais Yousef , LKML Subject: Re: [PATCH v3 0/5] Scan for an idle sibling in a single pass Message-ID: <20210125090419.GW3592@techsingularity.net> References: <20210119112211.3196-1-mgorman@techsingularity.net> <20210119120220.GS3592@techsingularity.net> <20210122101451.GV3592@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jan 25, 2021 at 12:29:47PM +0800, Li, Aubrey wrote: > >>> hackbench -l 2560 -g 1 on 8 cores arm64 > >>> v5.11-rc4 : 1.355 (+/- 7.96) > >>> + sis improvement : 1.923 (+/- 25%) > >>> + the patch below : 1.332 (+/- 4.95) > >>> > >>> hackbench -l 2560 -g 256 on 8 cores arm64 > >>> v5.11-rc4 : 2.116 (+/- 4.62%) > >>> + sis improvement : 2.216 (+/- 3.84%) > >>> + the patch below : 2.113 (+/- 3.01%) > >>> > > 4 benchmarks reported out during weekend, with patch 3 on a x86 4s system > with 24 cores per socket and 2 HT per core, total 192 CPUs. > > It looks like mid-load has notable changes on my side: > - netperf 50% num of threads in TCP mode has 27.25% improved > - tbench 50% num of threads has 9.52% regression > It's interesting that patch 3 would make any difference on x64 given that it's SMT2. The scan depth should have been similar. It's somewhat expected that it will not be a universal win, particularly once the utilisation is high enough to spill over in sched domains (25%, 50%, 75% utilisation being interesting on 4-socket systems). In such cases, double scanning can still show improvements for workloads that idle rapidly like tbench and hackbench even though it's expensive. The extra scanning gives more time for a CPU to go idle enough to be selected which can improve throughput but at the cost of wake-up latency, Hopefully v4 can be tested as well which is now just a single scan. -- Mel Gorman SUSE Labs