From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933458Ab3GRSfu (ORCPT ); Thu, 18 Jul 2013 14:35:50 -0400 Received: from mx1.redhat.com ([209.132.183.28]:36498 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752779Ab3GRSft (ORCPT ); Thu, 18 Jul 2013 14:35:49 -0400 Date: Thu, 18 Jul 2013 20:30:18 +0200 From: Oleg Nesterov To: Frederic Weisbecker , Peter Zijlstra , Steven Rostedt Cc: David Ahern , Ingo Molnar , Masami Hiramatsu , "zhangwei(Jovi)" , linux-kernel@vger.kernel.org Subject: [PATCH RESEND 0/3] Teach perf_trace_##call() to check hlist_empty(perf_events) Message-ID: <20130718183018.GA4043@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello. The patches are the same, I only tried to update the changelogs a bit. I am also quoting my old email below, to explain what this hack tries to do. Say, "perf record -e sched:sched_switch -p1". Every task except /sbin/init will do perf_trace_sched_switch() and perf_trace_buf_prepare() + perf_trace_buf_submit for no reason(), it doesn't have a counter. So it makes sense to add the fast-path check at the start of perf_trace_##call(), if (hlist_empty(event_call->perf_events)) return; The problem is, we should not do this if __task != NULL (iow, if DECLARE_EVENT_CLASS() uses __perf_task()), perf_tp_event() has the additional code for this case. So we should do if (!__task && hlist_empty(event_call->perf_events)) return; But __task is changed by "{ assign; }" block right before perf_trace_buf_submit(). Too late for the fast-path check, we already called perf_trace_buf_prepare/fetch_regs. So. After 2/3 __perf_task() (and __perf_count/addr) is called when ftrace_get_offsets_##call(args) evaluates the arguments, and we can check !__task && hlist_empty() right after that. Oleg.