From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751895AbeEVAR4 (ORCPT ); Mon, 21 May 2018 20:17:56 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:36938 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751019AbeEVARy (ORCPT ); Mon, 21 May 2018 20:17:54 -0400 Date: Mon, 21 May 2018 17:19:25 -0700 From: "Paul E. McKenney" To: Joel Fernandes Cc: Joel Fernandes , linux-kernel@vger.kernel.org, Josh Triplett , Steven Rostedt , Mathieu Desnoyers , Lai Jiangshan , byungchul.park@lge.com, kernel-team@android.com Subject: Re: [PATCH v3 3/4] rcu: Use better variable names in funnel locking loop Reply-To: paulmck@linux.vnet.ibm.com References: <20180521044220.123933-1-joel@joelfernandes.org> <20180521044220.123933-4-joel@joelfernandes.org> <20180521231357.GI3803@linux.vnet.ibm.com> <20180522000016.GC40541@joelaf.mtv.corp.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180522000016.GC40541@joelaf.mtv.corp.google.com> User-Agent: Mutt/1.5.21 (2010-09-15) X-TM-AS-GCONF: 00 x-cbid: 18052200-2213-0000-0000-000002A95AA3 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00009063; HX=3.00000241; KW=3.00000007; PH=3.00000004; SC=3.00000261; SDB=6.01035758; UDB=6.00529803; IPR=6.00814887; MB=3.00021229; MTD=3.00000008; XFM=3.00000015; UTC=2018-05-22 00:17:51 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18052200-2214-0000-0000-00005A33822E Message-Id: <20180522001925.GO3803@linux.vnet.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2018-05-21_10:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 impostorscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1709140000 definitions=main-1805220001 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, May 21, 2018 at 05:00:16PM -0700, Joel Fernandes wrote: > On Mon, May 21, 2018 at 04:13:57PM -0700, Paul E. McKenney wrote: > > > --- > > > kernel/rcu/tree.c | 48 ++++++++++++++++++++++++----------------------- > > > 1 file changed, 25 insertions(+), 23 deletions(-) > > > > > > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c > > > index 0ffd41ba304f..879c67a31116 100644 > > > --- a/kernel/rcu/tree.c > > > +++ b/kernel/rcu/tree.c > > > @@ -1526,7 +1526,7 @@ static void trace_rcu_this_gp(struct rcu_node *rnp, struct rcu_data *rdp, > > > > > > /* > > > * rcu_start_this_gp - Request the start of a particular grace period > > > - * @rnp: The leaf node of the CPU from which to start. > > > + * @rnp_start: The leaf node of the CPU from which to start. > > > * @rdp: The rcu_data corresponding to the CPU from which to start. > > > * @gp_seq_req: The gp_seq of the grace period to start. > > > * > > > @@ -1540,12 +1540,12 @@ static void trace_rcu_this_gp(struct rcu_node *rnp, struct rcu_data *rdp, > > > * > > > * Returns true if the GP thread needs to be awakened else false. > > > */ > > > -static bool rcu_start_this_gp(struct rcu_node *rnp, struct rcu_data *rdp, > > > +static bool rcu_start_this_gp(struct rcu_node *rnp_start, struct rcu_data *rdp, > > > unsigned long gp_seq_req) > > > { > > > bool ret = false; > > > struct rcu_state *rsp = rdp->rsp; > > > - struct rcu_node *rnp_root; > > > + struct rcu_node *rnp, *rnp_root = NULL; > > > > Unless I am going blind, this patch really isn't using rnp_root. It > > could be removed. > > Its just limitation of the diff tools. Your eyes are just fine and doing > great based on your review comments ;) > > The rnp_root is used after we break out of the loop. > > > > > > > /* > > > * Use funnel locking to either acquire the root rcu_node > > > @@ -1556,34 +1556,36 @@ static bool rcu_start_this_gp(struct rcu_node *rnp, struct rcu_data *rdp, > > > * scan the leaf rcu_node structures. Note that rnp->lock must > > > * not be released. > > > */ > > > - raw_lockdep_assert_held_rcu_node(rnp); > > > - trace_rcu_this_gp(rnp, rdp, gp_seq_req, TPS("Startleaf")); > > > - for (rnp_root = rnp; 1; rnp_root = rnp_root->parent) { > > > - if (rnp_root != rnp) > > > - raw_spin_lock_rcu_node(rnp_root); > > > - if (ULONG_CMP_GE(rnp_root->gp_seq_needed, gp_seq_req) || > > > - rcu_seq_started(&rnp_root->gp_seq, gp_seq_req) || > > > - (rnp != rnp_root && > > > - rcu_seq_state(rcu_seq_current(&rnp_root->gp_seq)))) { > > > - trace_rcu_this_gp(rnp_root, rdp, gp_seq_req, > > > + raw_lockdep_assert_held_rcu_node(rnp_start); > > > + trace_rcu_this_gp(rnp_start, rdp, gp_seq_req, TPS("Startleaf")); > > > + for (rnp = rnp_start; 1; rnp = rnp->parent) { > > > + if (rnp != rnp_start) > > > + raw_spin_lock_rcu_node(rnp); > > > + if (ULONG_CMP_GE(rnp->gp_seq_needed, gp_seq_req) || > > > + rcu_seq_started(&rnp->gp_seq, gp_seq_req) || > > > + (rnp != rnp_start && > > > + rcu_seq_state(rcu_seq_current(&rnp->gp_seq)))) { > > > + trace_rcu_this_gp(rnp, rdp, gp_seq_req, > > > TPS("Prestarted")); > > > goto unlock_out; > > > } > > > - rnp_root->gp_seq_needed = gp_seq_req; > > > - if (rcu_seq_state(rcu_seq_current(&rnp->gp_seq))) { > > > + rnp->gp_seq_needed = gp_seq_req; > > > + if (rcu_seq_state(rcu_seq_current(&rnp_start->gp_seq))) { > > > > The original had a performance bug, which is quite a bit more obvious > > given the new names, so thank you for that! The above statement should > > instead be as follows: > > > > if (rcu_seq_state(rcu_seq_current(&rnp->gp_seq))) { > > > > It does not make sense to keep checking the starting rcu_node because > > changes to ->gp_seq happen first at the top of the tree. So we might > > take an earlier exit by checking the current rnp instead of rechecking > > rnp_start over and over. > > > > Please feel free to make this change, which is probably best as a separate > > patch. That way this rename patch can remain a straightforward rename patch. > > Yes, sounds like a nice optimization and I'm glad my variable renaming helped > ;) I feel I should have seen it too. I can make this change and send out > with my next series as you suggest. > > > > /* > > > * We just marked the leaf, and a grace period > > > * is in progress, which means that rcu_gp_cleanup() > > > * will see the marking. Bail to reduce contention. > > > */ > > > - trace_rcu_this_gp(rnp, rdp, gp_seq_req, > > > + trace_rcu_this_gp(rnp_start, rdp, gp_seq_req, > > > TPS("Startedleaf")); > > > goto unlock_out; > > > } > > > - if (rnp_root != rnp && rnp_root->parent != NULL) > > > - raw_spin_unlock_rcu_node(rnp_root); > > > - if (!rnp_root->parent) > > > + if (rnp != rnp_start && rnp->parent != NULL) > > > + raw_spin_unlock_rcu_node(rnp); > > > + if (!rnp->parent) { > > > + rnp_root = rnp; > > > > Since rnp_root is otherwise unused in the new version, the above statement > > can be dropped along with the "if" statement's braces and the declaration. > > Actually rnp_root is needed for tracing calls after we breakout of the loop. But at that point, rnp_root is equal to rnp, so rnp_root still isn't really needed, correct? Thanx, Paul