From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757821AbaEKBjF (ORCPT ); Sat, 10 May 2014 21:39:05 -0400 Received: from e38.co.us.ibm.com ([32.97.110.159]:35169 "EHLO e38.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751550AbaEKBjB (ORCPT ); Sat, 10 May 2014 21:39:01 -0400 Date: Sat, 10 May 2014 18:38:54 -0700 From: "Paul E. McKenney" To: Frederic Weisbecker Cc: Thomas Gleixner , Christoph Lameter , Andrew Morton , Gilad Ben-Yossef , Tejun Heo , Mike Frysinger , Minchan Kim , Hakan Akkan , Max Krasnyansky , LKML , linux-mm@kvack.org, hughd@google.com, viresh.kumar@linaro.org, Ingo Molnar , "H. Peter Anvin" , Peter Zijlstra , John Stultz Subject: Re: vmstat: On demand vmstat workers V4 Message-ID: <20140511013854.GL4827@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <20140509234745.GB8754@linux.vnet.ibm.com> <20140510004843.GB32393@localhost.localdomain> <20140510131422.GA13660@localhost.localdomain> <20140511011708.GD4827@linux.vnet.ibm.com> <20140511013029.GC13660@localhost.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20140511013029.GC13660@localhost.localdomain> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 14051101-1344-0000-0000-000001790880 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, May 11, 2014 at 03:30:31AM +0200, Frederic Weisbecker wrote: > On Sat, May 10, 2014 at 06:17:08PM -0700, Paul E. McKenney wrote: > > On Sat, May 10, 2014 at 03:14:25PM +0200, Frederic Weisbecker wrote: > > > On Sat, May 10, 2014 at 02:31:28PM +0200, Thomas Gleixner wrote: > > > > On Sat, 10 May 2014, Frederic Weisbecker wrote: > > > > > But I still have the plan to make the timekeeper use the full sysidle > > > > > facility in order to adaptively get to dynticks idle. > > > > > > > > > > Reminder for others: in NO_HZ_FULL, the timekeeper (always CPU 0) stays > > > > > completely periodic. It can't enter in dynticks idle mode because it > > > > > must maintain timekeeping on behalf of full dynticks CPUs. So that's > > > > > a power issue. > > > > > > > > > > But Paul has a feature in RCU that lets us know when all CPUs are idle > > > > > and the timekeeper can finally sleep. Then when a full nohz CPU wakes > > > > > up from idle, it sends an IPI to the timekeeper if needed so the latter > > > > > restarts timekeeping maintainance. > > > > > > > > > > It's not complicated to add to the timer code. > > > > > Most of the code is already there, in RCU, for a while already. > > > > > > > > > > Are we keeping that direction? > > > > > > > > So the idea is that the timekeeper stays on cpu0, but if everything is > > > > idle it is allowed to take a long nap as well. So if some other cpu > > > > wakes up it updates timekeeping without taking over the time keeper > > > > duty and if it has work to do, it kicks cpu0 into gear. If it just > > > > goes back to sleep, then nothing to do. > > > > Hmmm... If RCU is supposed to ignore the fact that one of the other > > CPUs woke up momentarily, we will need to adjust things a bit. > > Maybe not that much actually. > > > > > > Exactly! Except perhaps the last sentence "If it just goes back to sleep, > > > then nothing to do.", I didn't think about that although this special case > > > is quite frequent indeed when an interrupt fires on idle but no task is woken up. > > > > > > Maybe I should move the code that fires the IPI to cpu0, if it is sleeping, > > > on irq exit (the plan was to do it right away on irq enter) and fire it > > > only if need_resched(). > > > > And of course if that code path contains any RCU read-side critical > > sections, RCU absolutely cannot ignore that CPU's momentary wakeup. > > Sure the core RCU still needs to know that the CPU went out of dynticks the > time of the irq, so we keep the rcu_irq_enter/rcu_irq_exit calls. > > But if the CPU only wakes up to serve an IRQ, it doesn't need to tell the RCU > sysidle detection about it. The irq entry fixup jiffies on dynticks idle mode, > this should be enough. As long as you pass me in a hint so that RCU knows which case it is dealing with. ;-) Thanx, Paul