From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-yw1-f182.google.com (mail-yw1-f182.google.com [209.85.128.182]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 1019F1EB5C2 for ; Tue, 27 Jan 2026 04:15:59 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.128.182 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1769487361; cv=none; b=lAI8um+o/vK5SvEk2cOLM5gErkjF+l07a3kascsinURD69KIIPBkNLFkqYtlDaN0SC+ABozzruJNNI4+oGQb9qdwgq1JJsQeAc0L4TaqBMFiJby07RtqI8MrUZY6kEqsEncIa/fN+GClxGqI9LPEEGXEX8Aj8Yz3wuDWgUdr5jg= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1769487361; c=relaxed/simple; bh=xEpnxlMXA4bzrHlGDKzI4T15VybUDNs4WCq1l0JvMMs=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=WocQlOcPSBHbbGfhphCRSWOTUdS3/Ke07LF/N2gB/y958SAiI+PRK8/Hq5P4WWPIZA5b0yaV6mOX1kwrXDaOcOQogg9wAltpoaBWYe8UQRslrEjdUU4UO4SbnSGURr0BPYBRa2DbV4m7k14NZ+1LXdsc6hpHJUBXm44W5GRGp8I= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=hrdkBGSA; arc=none smtp.client-ip=209.85.128.182 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="hrdkBGSA" Received: by mail-yw1-f182.google.com with SMTP id 00721157ae682-79088484065so46472707b3.1 for ; Mon, 26 Jan 2026 20:15:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1769487359; x=1770092159; darn=vger.kernel.org; h=content-transfer-encoding:in-reply-to:from:content-language :references:cc:to:subject:user-agent:mime-version:date:message-id :from:to:cc:subject:date:message-id:reply-to; bh=ckNxND7enecfdafnwE+W6ZVBCg0yOV2ungQtuLYNJRY=; b=hrdkBGSAIi/Kth45aWKiNKwvhZjGsXPC9uhMbyUahVLpAhFCkzJ87OY+KwxCeZLw1V 2pwuLCs3etNafX4YtXg6ompogIcrg54q4iZt1IJHqWpr8AcDE1rZHik64UWATQNsI8ly SPizxqTuPwldXEkfEUzGAh3hhbMmemjXbksYiWqusdym4dQZMtx1Ls6fLzsqfhwVrnEa KU/RR430rFd3FeISzMCO5/OS0wjZcRZUrs6Sm9MYS2gcC4X82ne5fZlif45D/18pjRjM M8foz+1RfGWmmmC+Ufr0OchhWGBYv9EOm+j+XO4G57uo/gBvcAM+gvGq8InqtOnlnYE7 AvBA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1769487359; x=1770092159; h=content-transfer-encoding:in-reply-to:from:content-language :references:cc:to:subject:user-agent:mime-version:date:message-id :x-gm-gg:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=ckNxND7enecfdafnwE+W6ZVBCg0yOV2ungQtuLYNJRY=; b=fPY7LT6dpzip+S0Rk7D4Fi7vkvfenBKDfAoO4QdycNsMe5BkBeqCkz+v5h0tuAoFNJ YTG1yIfP5EFeBOn5n9VazSZMbIfp55LaCYY4eBG1v7e5vh7z3obBP1sAJ8ifYxlZmj3/ Ao5I5tFPzioCeol/iJam06vgT/65mnR+hQHV7v36JNcu9Pgkw4UzhQuxc7g6qAZpVaYX 1UaDIVpL10AsINxwt+pJbyXtzEQBzeO15riFCUyWXoCv9NMstcJwrnC1AsrpaOAqWDdQ q7Y/miLL4z1g532Vf287sLsXjye2YelC0BaNm/sHJDnOq8cPO+8ndVSxyROcanEUg/T0 1QJw== X-Forwarded-Encrypted: i=1; AJvYcCU77Si5Wh9SqCbfnIbFv9QDZG4epWuM/msfXuFTxX1ZXX8iKTo9OudKMHSYmt7cI1SXpnRKDJwq1owph8s=@vger.kernel.org X-Gm-Message-State: AOJu0YwqY2s6GsoxVx2k6PidQNn68NSOx5hj2zRcehT1Z9zMutEhQDsP z0Tw3DAPJ7/MI464mMkIaYJwXyHVj/2izZ2xc+raJbHYsiv0mNlGjidA X-Gm-Gg: AZuq6aJRuNeboMZm7a1Efxh+nVoeuzS4w9YbgNfZ3ncYr0I7UIM5kqbsH/EQcJVGotm wLXVoAT0gsqmNDTpq/oZAyJbPyV6mrpmEohFkGE3ZaaHZs1MxB5vJdXx+dcFDVO5hhHN7C1o+qq JedEar899eI+N3GQp/I2T0qLeiFinPPvRzqTIS6JdFCSeL0ssnFQZZ/blJ/xCHwloB+mkzrm8U6 JPGsz2aWw0nNRLkRwsifhJznu6NqtQM6BLJd8C34cECERzlSPJkK5kyLgX6/5e0jTlLcrveQ1aP mamwoeHjewIzlTKuYthRdiskXy15BvHmZYdM/CZoIjGvEuGuO7I1UQRg8iEdOzB02A8yUKpaJbw 0BfmKzUT3OiE9xQnqzpox/L3eQ13BCFfQQWt14caP8SI7K5C63xTo19OAhcXo8+zwGI18bRYGXv CvqDt/dGgMleYTCfw= X-Received: by 2002:a05:690c:312:b0:78f:a81a:6f6b with SMTP id 00721157ae682-7947ac01f24mr5357457b3.32.1769487359000; Mon, 26 Jan 2026 20:15:59 -0800 (PST) Received: from [192.168.1.64] ([173.92.131.131]) by smtp.gmail.com with ESMTPSA id 956f58d0204a3-649619e09d2sm5463317d50.2.2026.01.26.20.15.57 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 26 Jan 2026 20:15:58 -0800 (PST) Message-ID: Date: Mon, 26 Jan 2026 23:15:57 -0500 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH 4/4] sched/fair: Proportional newidle balance To: Peter Zijlstra Cc: Chris Mason , Joseph Salisbury , Adam Li , Hazem Mohamed Abuelfotoh , Josh Don , mingo@redhat.com, juri.lelli@redhat.com, vincent.guittot@linaro.org, dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, mgorman@suse.de, vschneid@redhat.com, linux-kernel@vger.kernel.org, kprateek.nayak@amd.com References: <20251107160645.929564468@infradead.org> <20251107161739.770122091@infradead.org> <8760001e-0274-454c-a4e4-1f38a9695b88@gmail.com> <20260123105046.GM171111@noisy.programming.kicks-ass.net> <20260123110306.GA217302@noisy.programming.kicks-ass.net> Content-Language: en-US From: Mario Roy In-Reply-To: <20260123110306.GA217302@noisy.programming.kicks-ass.net> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit I tried the Stress-NG socket activity test. Plus prefer-idle-core patch. The patch is about mindfulness for limited CPU saturation testing. AMD Ryzen Threadripper 9960X CPU (24/48)                     Bogo operations/second, More is better                 A        B        C        D        E        F SocketAct    12128.7  13907.6  12377.7  10551.7  12158.7  11842.2 SocketAct24  64553.3  20072.0  67018.7  62182.3  18133.5  66756.6 SocketAct15  49206.3  22170.7  57038.7  44077.6  19884.1  56727.5 SocketAct10  35263.5  20140.3  40092.1  33040.3  19701.6  41346.3 The kernels are built with clang without LTO/AutoFDO A. 6.19-rc7 next_buddy ena with sched/fair: Proportional newidle balance B. 6.19-rc7 next_buddy ena without sched/fair: Proportional newidle balance C. 6.19-rc7 next_buddy ena without sched regression; with prefer-idle-core D. 6.19-rc7 next_buddy dis with sched/fair: Proportional newidle balance E. 6.19-rc7 next_buddy dis without sched/fair: Proportional newidle balance F. 6.19-rc7 next_buddy dis without sched regression; with prefer-idle-core Without sched regression:   this is without sched/fair: Proportional newidle balance With prefer-idle-core: https://github.com/marioroy/cachymod/blob/main/linux-cachymod-6.18/   0280-prefer-prevcpu-for-wakeup.patch Stress-NG 0.20.00: SocketAct, SocketAct24, SocketAct15, SocketAct10   stress-ng -t 30 --metrics-brief --sock -1 --no-rand-seed --sock-zerocopy   stress-ng -t 30 --metrics-brief --sock 24 --no-rand-seed --sock-zerocopy   stress-ng -t 30 --metrics-brief --sock 15 --no-rand-seed --sock-zerocopy   stress-ng -t 30 --metrics-brief --sock 10 --no-rand-seed --sock-zerocopy Basically 100%, 50%, and 31.25% times 2 (writer, reader) I ran also, --sock 10 because 10 x 2 is less than 50% (24 threads) Linux 6.18.7 results: granted, both are built with LTO + AutoFDO profile              CachyOS 6.18.7-2  CachyMod 6.18.7-2 [1] SocketAct    40799.2           46784.3 SocketAct24  61057.6           71414.5 SocketAct15  45056.4           61772.3 SocketAct10  32691.6           44244.6 [1] https://github.com/marioroy/cachymod     the sched regression reverted (0040 patch)     prefer-idle-core (0280 patch) On 1/23/26 6:03 AM, Peter Zijlstra wrote: > On Fri, Jan 23, 2026 at 11:50:46AM +0100, Peter Zijlstra wrote: >> On Sun, Jan 18, 2026 at 03:46:22PM -0500, Mario Roy wrote: >>> The patch "Proportional newidle balance" introduced a regression >>> with Linux 6.12.65 and 6.18.5. There is noticeable regression with >>> easyWave testing. [1] >>> >>> The CPU is AMD Threadripper 9960X CPU (24/48). I followed the source >>> to install easyWave [2]. That is fetching the two tar.gz archives. >> What is the actual configuration of that chip? Is it like 3*8 or 4*6 >> (CCX wise). A quick google couldn't find me the answer :/ > Obviously I found it right after sending this. It's a 4x6 config. > Meaning it needs newidle to balance between those 4 domains. > > Pratheek -- are you guys still considering that SIS_NODE thing? That > worked really well for workstation chips, but there were some issues on > Epyc or so. > >>> #!/bin/bash >>> # CXXFLAGS="-O3 $CXXFLAGS" ./configure >>> # make -j8 >>> >>> trap 'rm -f *.ssh *.idx *.log *.sshmax *.time' EXIT >>> >>> OMP_NUM_THREADS=48 ./src/easywave \ >>>   -grid examples/e2Asean.grd -source examples/BengkuluSept2007.flt \ >>>   -time 1200 >>> >>> >>> Before results with CachyOS 6.12.63-2 and 6.18.3-2 kernels. >> So the problem is that 6.12 -> 6.18 is an enormous amount of kernel >> releases :/ This patch in particular was an effort to fix a regression >> caused by: >> >> 155213a2aed4 ("sched/fair: Bump sd->max_newidle_lb_cost when newidle balance fails") >> >> I'm thinking that if you revert all 4 patches of this series your >> performance will be even worse? >> >> Anyway, my guess is that somehow this benchmark likes doing newidle even >> if it is often not successful. I'll see if I can reproduce this on one >> of my machine, but that might take a little while.