From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755939AbYHVWk3 (ORCPT ); Fri, 22 Aug 2008 18:40:29 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751958AbYHVWkV (ORCPT ); Fri, 22 Aug 2008 18:40:21 -0400 Received: from e34.co.us.ibm.com ([32.97.110.152]:38902 "EHLO e34.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751816AbYHVWkT (ORCPT ); Fri, 22 Aug 2008 18:40:19 -0400 Date: Fri, 22 Aug 2008 15:40:16 -0700 From: "Paul E. McKenney" To: Andi Kleen Cc: Christoph Lameter , Pekka Enberg , Ingo Molnar , Jeremy Fitzhardinge , Nick Piggin , "Pallipadi, Venkatesh" , Suresh Siddha , Jens Axboe , Rusty Russell , Linux Kernel Mailing List Subject: Re: [PATCH 2/2] smp_call_function: use rwlocks on queues rather than rcu Message-ID: <20080822224016.GN6744@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <48AE0883.6050701@goop.org> <20080822062800.GQ14110@elte.hu> <84144f020808220006n25d684b1n9db306ddc4f58c4c@mail.gmail.com> <48AEC6B2.1080701@linux-foundation.org> <20080822151156.GA6744@linux.vnet.ibm.com> <48AEF3FD.70906@linux-foundation.org> <20080822182915.GG6744@linux.vnet.ibm.com> <20080822183346.GS23334@one.firstfloor.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20080822183346.GS23334@one.firstfloor.org> User-Agent: Mutt/1.5.15+20070412 (2007-04-11) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Aug 22, 2008 at 08:33:46PM +0200, Andi Kleen wrote: > > Would it make sense to push the freed-by-RCU memory further up the > > hierarchy, so that such memory is not mistaken for recently freed > > hot-in-cache memory? > > Right now my impression is that it is not well understood why > the kmalloc makes the IPI that much slower. In theory a kmalloc > shouldn't be all that slow, it's essentially just a > "disable interrupts; unlink object from cpu cache; enable interrupts" > with some window dressing. kfree() is similar. > > Does it bounce a cache line on freeing perhaps? It is indeed easy to focus on the wrong area when attempting to improve performance. Done it many times myself... :-/ Thanx, Paul