From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751292AbcGMVmb (ORCPT ); Wed, 13 Jul 2016 17:42:31 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:29629 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751067AbcGMVmY (ORCPT ); Wed, 13 Jul 2016 17:42:24 -0400 X-IBM-Helo: d03dlp01.boulder.ibm.com X-IBM-MailFrom: paulmck@linux.vnet.ibm.com Date: Wed, 13 Jul 2016 14:42:38 -0700 From: "Paul E. McKenney" To: Tejun Heo Cc: Peter Zijlstra , John Stultz , Ingo Molnar , lkml , Dmitry Shmidt , Rom Lemarchand , Colin Cross , Todd Kjos , Oleg Nesterov Subject: Re: Severe performance regression w/ 4.4+ on Android due to cgroup locking changes Reply-To: paulmck@linux.vnet.ibm.com References: <20160713182102.GJ4065@mtj.duckdns.org> <20160713183347.GK4065@mtj.duckdns.org> <20160713201823.GB29670@mtj.duckdns.org> <20160713202657.GW30154@twins.programming.kicks-ass.net> <20160713203944.GC29670@mtj.duckdns.org> <20160713205102.GZ30909@twins.programming.kicks-ass.net> <20160713210315.GO7094@linux.vnet.ibm.com> <20160713210526.GF29670@mtj.duckdns.org> <20160713211841.GQ7094@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160713211841.GQ7094@linux.vnet.ibm.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 16071321-0008-0000-0000-0000050D69DC X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 16071321-0009-0000-0000-0000394AB4BA Message-Id: <20160713214238.GA15996@linux.vnet.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2016-07-13_11:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 malwarescore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1604210000 definitions=main-1607130238 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jul 13, 2016 at 02:18:41PM -0700, Paul E. McKenney wrote: > On Wed, Jul 13, 2016 at 05:05:26PM -0400, Tejun Heo wrote: > > On Wed, Jul 13, 2016 at 02:03:15PM -0700, Paul E. McKenney wrote: > > > Take the patch that I just sent out and make the choice of normal > > > vs. expedited depend on CONFIG_PREEMPT_RT or whatever the -rt guys are > > > calling it these days. Is there a low-latency Kconfig option other > > > than CONFIG_NO_HZ_FULL? > > > > Sounds like a plan to me. > > I like the way we like each other's idea. Mutually assured laziness? ;-) But here is what mine might look like. Untested, probably does not even build. Note that the default is -no- expediting, use the rcusync.expedited kernel parameter to enable it. Thanx, Paul ------------------------------------------------------------------------ diff --git a/Documentation/kernel-parameters.txt b/Documentation/kernel-parameters.txt index 82b42c958d1c..b8bc9854e548 100644 --- a/Documentation/kernel-parameters.txt +++ b/Documentation/kernel-parameters.txt @@ -3229,6 +3229,11 @@ bytes respectively. Such letter suffixes can also be entirely omitted. energy efficiency by requiring that the kthreads periodically wake up to do the polling. + rcusync.expedited [KNL] + Specify that the rcusync mechanism use expedited + grace periods. As of mid-2016, this affects + per-CPU rwsems. + rcutree.blimit= [KNL] Set maximum number of finished RCU callbacks to process in one batch. diff --git a/kernel/rcu/sync.c b/kernel/rcu/sync.c index be922c9f3d37..5bc5bef2e00a 100644 --- a/kernel/rcu/sync.c +++ b/kernel/rcu/sync.c @@ -22,6 +22,14 @@ #include #include +#include +#include + +MODULE_ALIAS("rcusync"); +#ifdef MODULE_PARAM_PREFIX +#undef MODULE_PARAM_PREFIX +#endif +#define MODULE_PARAM_PREFIX "rcusync." #ifdef CONFIG_PROVE_RCU #define __INIT_HELD(func) .held = func, @@ -29,7 +37,7 @@ #define __INIT_HELD(func) #endif -static const struct { +static struct { void (*sync)(void); void (*call)(struct rcu_head *, void (*)(struct rcu_head *)); void (*wait)(void); @@ -62,6 +70,20 @@ enum { CB_IDLE = 0, CB_PENDING, CB_REPLAY }; #define rss_lock gp_wait.lock +static bool expedited; +module_param(expedited, bool, 0444); + +static int __init rcu_sync_early_init(void) +{ + if (expedited) { + gp_ops[RCU_SYNC].sync = synchronize_rcu_expedited; + gp_ops[RCU_SCHED_SYNC].sync = synchronize_sched_expedited; + gp_ops[RCU_BH_SYNC].sync = synchronize_rcu_bh_expedited; + } + return 0; +} +early_initcall(rcu_sync_early_init); + #ifdef CONFIG_PROVE_RCU void rcu_sync_lockdep_assert(struct rcu_sync *rsp) {