From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754227AbYIZHTW (ORCPT ); Fri, 26 Sep 2008 03:19:22 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753029AbYIZHTN (ORCPT ); Fri, 26 Sep 2008 03:19:13 -0400 Received: from casper.infradead.org ([85.118.1.10]:41424 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752986AbYIZHTM (ORCPT ); Fri, 26 Sep 2008 03:19:12 -0400 Subject: Re: [RFC PATCH v4] Unified trace buffer From: Peter Zijlstra To: Mathieu Desnoyers Cc: Steven Rostedt , Masami Hiramatsu , LKML , Ingo Molnar , Thomas Gleixner , Andrew Morton , prasad@linux.vnet.ibm.com, Linus Torvalds , "Frank Ch. Eigler" , David Wilder , hch@lst.de, Martin Bligh , Christoph Hellwig , Steven Rostedt In-Reply-To: <20080926032045.GA28771@Krystal> References: <20080925185154.230259579@goodmis.org> <20080925185236.244343232@goodmis.org> <48DC406D.1050508@redhat.com> <20080926032045.GA28771@Krystal> Content-Type: text/plain Date: Fri, 26 Sep 2008 09:18:27 +0200 Message-Id: <1222413507.16700.235.camel@lappy.programming.kicks-ass.net> Mime-Version: 1.0 X-Mailer: Evolution 2.22.3.1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 2008-09-25 at 23:20 -0400, Mathieu Desnoyers wrote: > * Steven Rostedt (rostedt@goodmis.org) wrote: > > > > On Thu, 25 Sep 2008, Masami Hiramatsu wrote: > > > > > Hi Steven, > > > > > > Steven Rostedt wrote: > > > > This version has been cleaned up a bit. I've been running it as > > > > a back end to ftrace, and it has been handling pretty well. > > > > > > Thank you for your great work. > > > It seems good to me(especially, encapsulating events :)). > > > > Thanks! > > > > > > > > I have one request of enhancement. > > > > > > > +static struct ring_buffer_per_cpu * > > > > +ring_buffer_allocate_cpu_buffer(struct ring_buffer *buffer, int cpu) > > > > +{ > > > [...] > > > > + cpu_buffer->pages = kzalloc_node(ALIGN(sizeof(void *) * pages, > > > > + cache_line_size()), GFP_KERNEL, > > > > + cpu_to_node(cpu)); > > > > > > Here, you are using a slab object for page managing array, > > > the largest object size is 128KB(x86-64), so it can contain > > > 16K pages = 64MB. > > > > > > As I had improved relayfs, in some rare case(on 64bit arch), > > > we'd like to use larger buffer than 64MB. > > > > > > http://sourceware.org/ml/systemtap/2008-q2/msg00103.html > > > > > > So, I think similar hack can be applicable. > > > > > > Would it be acceptable for the next version? > > > > I would like to avoid using vmalloc as much as possible, but I do see the > > limitation here. Here's my compromise. > > > > Instead of using vmalloc if the page array is greater than one page, > > how about using vmalloc if the page array is greater than > > KMALLOC_MAX_SIZE? > > > > This would let us keep the vmap area free unless we have no choice. > > > > -- Steve > > > > You could also fallback on a 2-level page array when buffer size is > > 64MB. The cost is mainly a supplementary pointer dereference, but one > more should not make sure a big difference overall. I'm still not sure why we don't just link the pages using the page frames, we don't need the random access, do we?