From mboxrd@z Thu Jan 1 00:00:00 1970 From: Andi Kleen Subject: Re: [patch] measurements, numbers about CONFIG_OPTIMIZE_INLINING=y impact Date: Fri, 9 Jan 2009 18:20:11 +0100 Message-ID: <20090109172011.GD26290@one.firstfloor.org> References: <1231434515.14304.27.camel@think.oraclecorp.com> <20090108183306.GA22916@elte.hu> <496648C7.5050700@zytor.com> <20090109130057.GA31845@elte.hu> <49675920.4050205@hp.com> <20090109153508.GA4671@elte.hu> <49677CB1.3030701@zytor.com> <20090109084620.3c711aad@infradead.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: "H. Peter Anvin" , Ingo Molnar , jim owens , Linus Torvalds , Chris Mason , Peter Zijlstra , Steven Rostedt , paulmck@linux.vnet.ibm.com, Gregory Haskins , Matthew Wilcox , Andi Kleen , Andrew Morton , Linux Kernel Mailing List , linux-fsdevel , linux-btrfs , Thomas Gleixner , Nick Piggin , Peter Morreale , Sven Dietrich , jh@suse.cz To: Dirk Hohndel Return-path: Content-Disposition: inline In-Reply-To: <20090109084620.3c711aad@infradead.org> Sender: linux-btrfs-owner@vger.kernel.org List-Id: linux-fsdevel.vger.kernel.org On Fri, Jan 09, 2009 at 08:46:20AM -0800, Dirk Hohndel wrote: > On Fri, 09 Jan 2009 08:34:57 -0800 > "H. Peter Anvin" wrote: > > > > As far as naming is concerned, gcc effectively supports four levels, > > which *currently* map onto macros as follows: > > > > __always_inline Inline unconditionally > > inline Inlining hint > > Standard heuristics > > noinline Uninline unconditionally > > > > A lot of noise is being made about the naming of the levels (and I > > personally believe we should have a different annotation for "inline > > unconditionally for correctness" and "inline unconditionally for > > performance", as a documentation issue), but those are the four we > > get. > > Does gcc actually follow the "promise"? If that's the case (and if it's > considered a bug when it doesn't), then we can get what Linus wants by > annotating EVERY function with either __always_inline or noinline. There's also one alternative: gcc's inlining algorithms are extensibly tunable with --param. We might be able to find a set of numbers that make it roughly work like we want it by default. Disadvantage: the whole thing will be compiler version dependent so we might need to have different numbers for different compiler versions and it will be an area that will need constant maintenance in the future. I'm not sure that's really a good path to walk down to. Also cc Honza in case he has comments (you might want to review more of the thread in the archives) -Andi -- ak@linux.intel.com