From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752539Ab1AQUwg (ORCPT ); Mon, 17 Jan 2011 15:52:36 -0500 Received: from canuck.infradead.org ([134.117.69.58]:47764 "EHLO canuck.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751092Ab1AQUwf convert rfc822-to-8bit (ORCPT ); Mon, 17 Jan 2011 15:52:35 -0500 Subject: Re: Q: perf_event && task->ptrace_bps[] From: Peter Zijlstra To: Oleg Nesterov Cc: Alan Stern , Arnaldo Carvalho de Melo , Frederic Weisbecker , Ingo Molnar , Paul Mackerras , Prasad , Roland McGrath , linux-kernel@vger.kernel.org In-Reply-To: <20110117203459.GA32700@redhat.com> References: <20101108145647.GA3426@redhat.com> <20110117203459.GA32700@redhat.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT Date: Mon, 17 Jan 2011 21:52:56 +0100 Message-ID: <1295297576.30950.382.camel@laptop> Mime-Version: 1.0 X-Mailer: Evolution 2.30.3 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 2011-01-17 at 21:34 +0100, Oleg Nesterov wrote: > On 11/08, Oleg Nesterov wrote: > > > > I am trying to understand the usage of hw-breakpoints in arch_ptrace(). > > ptrace_set_debugreg() and related code looks obviously racy. Nothing > > protects us against flush_ptrace_hw_breakpoint() called by the dying > > tracee. Afaics we can leak perf_event or use the already freed memory > > or both. > > > > Am I missed something? > > > > Looking into the git history, I don't even know which patch should be > > blamed (if I am right), there were too many changes. I noticed that > > 2ebd4ffb6d0cb877787b1e42be8485820158857e "perf events: Split out task > > search into helper" moved the PF_EXITING check from find_get_context(). > > This check coould help if sys_ptrace() races with SIGKILL, but it was > > racy anyway. > > Ping. > > Any idea how to fix this cleanly? May be we can reuse perf_event_mutex, > but this looks soooo ugly. And do_exit()->flush_ptrace_hw_breakpoint() > has the strange "FIXME:" comment which doesn't help me to understand > what can we do. > > Probably the best fix is to change this code so that the tracer owns > ->ptrace_bps[], not the tracee. But this is not trivial, and needs a > lot of changes in ptrace code. Wasn't this sorted by: 8882135bcd332f294df5455747ea43ba9e6f77ad? Or is this purely related to the ptrace muck? in which case I guess Frederic is you man, I never looked at the hw_breakpoint stuff in general and the ptrace bits in particular. > I am reading perf_event.c, but all I found so far is a couple of trivial > methods to crash the kernel via sys_perf_event_open(), will report > tomorrow... Ow, that's not too pretty..