From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754762Ab1AUMLQ (ORCPT ); Fri, 21 Jan 2011 07:11:16 -0500 Received: from canuck.infradead.org ([134.117.69.58]:58470 "EHLO canuck.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754403Ab1AUMLP convert rfc822-to-8bit (ORCPT ); Fri, 21 Jan 2011 07:11:15 -0500 Subject: Re: Q: perf_install_in_context/perf_event_enable are racy? From: Peter Zijlstra To: Oleg Nesterov Cc: Alan Stern , Arnaldo Carvalho de Melo , Frederic Weisbecker , Ingo Molnar , Paul Mackerras , Prasad , Roland McGrath , linux-kernel@vger.kernel.org In-Reply-To: <20110120193033.GA13924@redhat.com> References: <20101108145647.GA3426@redhat.com> <20101108145725.GA3434@redhat.com> <20110119182141.GA12183@redhat.com> <20110120193033.GA13924@redhat.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT Date: Fri, 21 Jan 2011 13:11:45 +0100 Message-ID: <1295611905.28776.269.camel@laptop> Mime-Version: 1.0 X-Mailer: Evolution 2.30.3 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 2011-01-20 at 20:30 +0100, Oleg Nesterov wrote: > On 01/19, Oleg Nesterov wrote: > > > > Also. I believe there are more problems in perf_install_event(), but > > I need to recheck. > > Help! I can't believe it can be so trivially wrong, but otoh I can't > understand how this can be correct. > > So, ignoring details and !task case, __perf_install_in_context() does: > > if (cpuctx->task_ctx || ctx->task != current) > return; > > cpuctx->task_ctx = ctx; > event_sched_in(event); > > Stupid question, what if this task has already passed > perf_event_exit_task() and thus it doesn't have ->perf_event_ctxp[] ? > Given that perf_event_context_sched_out() does nothing if !ctx, who > will event_sched_out() this event? > > OK, even if I am right this is trivial, we just need the additional > check. Indeed (or do the cleanup from put_ctx(), but that's too complex a change I think). > But, it seems, there is another problem. Forget about the exiting, > I can't understand why we can trust current in the code above. > With __ARCH_WANT_INTERRUPTS_ON_CTXSW schedule() does: > > // sets cpuctx->task_ctx = NULL > perf_event_task_sched_out(); > > // enables irqs > prepare_lock_switch(); > > > // updates current_task > switch_to(); > > What if IPI comes in the window before switch_to() ? > > (the same questions for __perf_event_enable). Ingo, do you have any insights in that, I think you wrote all that initially?