From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756770AbZCRDS0 (ORCPT ); Tue, 17 Mar 2009 23:18:26 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755472AbZCRDSS (ORCPT ); Tue, 17 Mar 2009 23:18:18 -0400 Received: from e1.ny.us.ibm.com ([32.97.182.141]:41327 "EHLO e1.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755051AbZCRDSR (ORCPT ); Tue, 17 Mar 2009 23:18:17 -0400 Date: Wed, 18 Mar 2009 08:48:32 +0530 From: Bharata B Rao To: Balbir Singh Cc: Li Zefan , linux-kernel@vger.kernel.org, Dhaval Giani , Paul Menage , Ingo Molnar , Peter Zijlstra , KAMEZAWA Hiroyuki Subject: Re: [PATCH -tip] cpuacct: Make cpuacct hierarchy walk in cpuacct_charge() safe when rcupreempt is used. Message-ID: <20090318031832.GA3960@in.ibm.com> Reply-To: bharata@linux.vnet.ibm.com References: <20090317061754.GD3314@in.ibm.com> <49BF42FB.4030103@cn.fujitsu.com> <20090317073649.GH3314@in.ibm.com> <20090317131251.GU16897@balbir.in.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090317131251.GU16897@balbir.in.ibm.com> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Mar 17, 2009 at 06:42:51PM +0530, Balbir Singh wrote: > * Bharata B Rao [2009-03-17 13:06:49]: > > > On Tue, Mar 17, 2009 at 02:28:11PM +0800, Li Zefan wrote: > > > Bharata B Rao wrote: > > > > cpuacct: Make cpuacct hierarchy walk in cpuacct_charge() safe when > > > > rcupreempt is used. > > > > > > > > cpuacct_charge() obtains task's ca and does a hierarchy walk upwards. > > > > This can race with the task's movement between cgroups. This race > > > > can cause an access to freed ca pointer in cpuacct_charge(). This will not > > > > > > Actually it can also end up access invalid tsk->cgroups. ;) > > > > > > get tsk->cgroups (cg) > > > (move tsk to another cgroup) or (tsk exiting) > > > -> kfree(tsk->cgroups) > > > get cg->subsys[..] > > > > Ok :) Here is the patch again with updated description. > > > > cpuacct: Make cpuacct hierarchy walk in cpuacct_charge() safe when > > rcupreempt is used. > > > > cpuacct_charge() obtains task's ca and does a hierarchy walk upwards. > > This can race with the task's movement between cgroups. This race > > can cause an access to freed ca pointer in cpuacct_charge() or access > > to invalid cgroups pointer of the task. This will not happen with rcu or > > tree rcu as cpuacct_charge() is called with preemption disabled. However if > > rcupreempt is used, the race is seen. Thanks to Li Zefan for explaining this. > > > > Fix this race by explicitly protecting ca and the hierarchy walk with > > rcu_read_lock(). > > > > Looks good and works very well (except for the batch issue that you > pointed out, it takes up to batch values before updates are seen). > > I'd like to get the patches in -tip and see the results, I would > recommend using percpu_counter_sum() while reading the data as an > enhancement to this patch. If user space does not overwhelm with a lot > of reads, sum would work out better. > > > Tested-by: Balbir Singh > Acked-by: Balbir Singh So I guess this ack is not for this patch but for the per-cgroup stime/utime cpuacct controller statistics patch. Regards, Bharata.