From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755216Ab0AUKpX (ORCPT ); Thu, 21 Jan 2010 05:45:23 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752674Ab0AUKpV (ORCPT ); Thu, 21 Jan 2010 05:45:21 -0500 Received: from mail-fx0-f220.google.com ([209.85.220.220]:34878 "EHLO mail-fx0-f220.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751999Ab0AUKpU (ORCPT ); Thu, 21 Jan 2010 05:45:20 -0500 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=date:from:to:cc:subject:message-id:references:mime-version :content-type:content-disposition:content-transfer-encoding :in-reply-to:user-agent; b=YPWqReatl+cM/2UugR6iVR7RMcfLNRtRRlHOVfNvzOrvnOtmaQw1h0MaiKe6d19x4N p57/+8OLohFKy5MZefKMnAY30sr6h4kE7JTlBAQT9W4W71vcQhT/kOBAOji1rE8ObYNT Oj85cIT5XC2FiATPbFxfrFzXzRvIzy7tmQx+g= Date: Thu, 21 Jan 2010 11:45:15 +0100 From: Frederic Weisbecker To: Stephane Eranian Cc: Peter Zijlstra , linux-kernel@vger.kernel.org, mingo@elte.hu, paulus@samba.org, davem@davemloft.net, perfmon2-devel@lists.sf.net, eranian@gmail.com Subject: Re: [PATCH] perf_events: improve x86 event scheduling (v5) Message-ID: <20100121104513.GA5017@nowhere> References: <4b5430c6.0f975e0a.1bf9.ffff85fe@mx.google.com> <20100118134324.GB10364@nowhere> <1263822898.4283.558.camel@laptop> <20100118142004.GD10364@nowhere> <1263825158.4283.590.camel@laptop> <20100118144556.GE10364@nowhere> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jan 21, 2010 at 11:08:12AM +0100, Stephane Eranian wrote: > >> > Do you mean this: > >> > > >> > hw_perf_group_sched_in_begin(&x86_pmu); > >> > > >> > for_each_event(event, group) { > >> >         event->enable();        //do the collection here > >> > } > >> > > >> > > >> > if (hw_perf_group_sched_in_end(&x86_pmu)) { > >> >         rollback... > >> > } > >> > > >> > That requires to know in advance if we have hardware pmu > >> > in the list though (can be a flag in the group). > >> > > I don't think this model can work without scheduling for each event. > > Imagine the situation where you have more events than you have > counters. At each tick you: > - disable all events > - rotate the list > - collect events from the list > - schedule events > - activate > > Collection is the accumulation of events until you have as many as you > have counters > given you defer scheduling until the end (see loop above). > > But that does not mean you can schedule what you have accumulated. And then what > do you do, i.e., rollback to what? If the scheduling validation fails, then you just need to rollback the whole group. That's sensibly what you did in your patch, right? Except the loop is now handled by the core code. > > With incremental, you can skip a group that is conflicting with the > groups already > accumulated. What hw_perf_group_sched_in() gives you is simply a way to do > incremental on a whole event group at once. I don't understand why that can't be done with the above model. In your patch we iterate through the whole group, collect events, and schedule them. With the above, the collection is just done on enable(), and the scheduling is done with the new pmu callbacks. The thing is sensibly the same, where is the obstacle? > > Given the perf_event model, I believe you have no other way but to do > incremental > scheduling of events. That is the only way you guarantee you maximize the use of > the PMU. Regardless of that, the scheduling model has a bias towards smaller > and less constrained event groups. But the incremental is still the purpose of the above model. I feel confused.