From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752108Ab0KZLgv (ORCPT ); Fri, 26 Nov 2010 06:36:51 -0500
Received: from casper.infradead.org ([85.118.1.10]:44448 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751151Ab0KZLgu convert rfc822-to-8bit (ORCPT ); Fri, 26 Nov 2010 06:36:50 -0500
Subject: Re: [RFC PATCH 2/3 v2] perf: Implement Nehalem uncore pmu
From: Peter Zijlstra
To: Stephane Eranian
Cc: Lin Ming, Lin Ming, Ingo Molnar, Andi Kleen, lkml, Frederic Weisbecker, Arjan van de Ven
In-Reply-To:
References: <1290340877.2245.124.camel@localhost> <1290770652.2145.128.camel@laptop>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 8BIT
Date: Fri, 26 Nov 2010 12:36:59 +0100
Message-ID: <1290771419.2145.137.camel@laptop>
Mime-Version: 1.0
X-Mailer: Evolution 2.30.3
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, 2010-11-26 at 12:25 +0100, Stephane Eranian wrote:
> On Fri, Nov 26, 2010 at 12:24 PM, Peter Zijlstra wrote:
> > On Fri, 2010-11-26 at 09:18 +0100, Stephane Eranian wrote:
> >
> >> In the perf_event model, given that any one of the 4 cores can be used
> >> to program uncore events, you have no choice but to broadcast to all
> >> 4 cores. Each has to demultiplex and figure out which of its counters
> >> have overflowed.
> >
> > Not really, you can redirect all these events to the first online cpu of
> > the node.
> >
> > You can re-write event->cpu in pmu::event_init(), and register cpu
> > hotplug notifiers to migrate the state around.
> >
> I am sure you could. But then the user thinks the event is controlled
> from CPUx when it's actually from CPUz. I am sure it can work but
> that's confusing, especially interrupt-wise.

Well, it's either that or keeping a node-wide state like we do for AMD
and serializing everything from there.

And I'm not sure which is more expensive: steering the interrupt to one
core only, or broadcasting every interrupt. I'd favour the first approach.

The whole thing is a node-wide resource, so the user needs to think in
nodes anyway; we already do a cpu->node mapping for identifying the thing.
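
To make the redirect idea concrete, here is a minimal userspace sketch, not kernel code. It models the two pieces mentioned above: rewriting the event's cpu to the first online cpu of its node at init time, and migrating events when that cpu goes offline. All names (toy_event, toy_event_init, toy_cpu_offline, the cpu_online array) and the 8-cpu, 4-per-node layout are assumptions made up for illustration; the real thing would sit in pmu::event_init() and the cpu hotplug notifiers.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical topology for illustration only: 8 CPUs, 4 per node. */
#define NR_CPUS        8
#define CPUS_PER_NODE  4

/* Toy online mask; every CPU starts online. */
static bool cpu_online[NR_CPUS] = {
	true, true, true, true, true, true, true, true
};

/* Toy stand-in for a perf event; only the field of interest. */
struct toy_event {
	int cpu;	/* CPU that actually programs and services the event */
};

/* The cpu->node mapping, here a plain division. */
static int toy_cpu_to_node(int cpu)
{
	assert(cpu >= 0 && cpu < NR_CPUS);
	return cpu / CPUS_PER_NODE;
}

/* First online CPU of a node, or -1 if the whole node is offline. */
static int toy_first_online_cpu(int node)
{
	int first = node * CPUS_PER_NODE;
	int cpu;

	for (cpu = first; cpu < first + CPUS_PER_NODE; cpu++)
		if (cpu_online[cpu])
			return cpu;
	return -1;
}

/* Models the event->cpu rewrite at init time: whatever CPU the user
 * asked for, the event lands on the node's first online CPU. */
static int toy_event_init(struct toy_event *ev, int requested_cpu)
{
	int owner = toy_first_online_cpu(toy_cpu_to_node(requested_cpu));

	if (owner < 0)
		return -1;
	ev->cpu = owner;
	return 0;
}

/* Models the hotplug notifier: when a CPU goes offline, move its
 * events to the node's new first online CPU (-1 if none is left). */
static void toy_cpu_offline(int cpu, struct toy_event *evs, int n)
{
	int node = toy_cpu_to_node(cpu);
	int owner, i;

	cpu_online[cpu] = false;
	owner = toy_first_online_cpu(node);
	for (i = 0; i < n; i++)
		if (evs[i].cpu == cpu)
			evs[i].cpu = owner;
}
```

With this layout, an event requested on cpu 2 ends up owned by cpu 0, and taking cpu 0 offline moves it to cpu 1, while events on the other node stay put. That is the "user thinks CPUx, it is actually CPUz" effect from the quoted objection, but the interrupt only ever needs to go to one cpu per node.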