From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from multi.imgtec.com (multi.imgtec.com [194.200.65.239]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "*.imgtec.com", Issuer "DigiCert High Assurance CA-3" (not verified)) by ozlabs.org (Postfix) with ESMTPS id 994182C0388 for ; Wed, 30 Oct 2013 23:05:57 +1100 (EST) Message-ID: <5270F21C.3080805@imgtec.com> Date: Wed, 30 Oct 2013 11:48:44 +0000 From: James Hogan MIME-Version: 1.0 To: Peter Zijlstra Subject: Re: perf events ring buffer memory barrier on powerpc References: <20131025173749.GG19466@laptop.lan> <20131028132634.GO19466@laptop.lan> <20131028163418.GD4126@linux.vnet.ibm.com> <20131028201735.GA15629@redhat.com> <20131029102131.GA16117@laptop.programming.kicks-ass.net> <20131029103057.GN2490@laptop.programming.kicks-ass.net> <20131030104246.GH16117@laptop.programming.kicks-ass.net> In-Reply-To: <20131030104246.GH16117@laptop.programming.kicks-ass.net> Content-Type: text/plain; charset="ISO-8859-1" Cc: Michael Neuling , Mathieu Desnoyers , Vince Weaver , Oleg Nesterov , Linux PPC dev , Anton Blanchard , Frederic Weisbecker , Victor Kaplansky , "Paul E. McKenney" , linux-metag , LKML List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Hi Peter, On 30/10/13 10:42, Peter Zijlstra wrote: > Subject: perf, tool: Add required memory barriers > > To match patch bf378d341e48 ("perf: Fix perf ring buffer memory > ordering") change userspace to also adhere to the ordering outlined. > > Most barrier implementations were gleaned from > arch/*/include/asm/barrier.h and with the exception of metag I'm fairly > sure they're correct. Yeh... Short answer: For Meta you're probably best off assuming CONFIG_METAG_SMP_WRITE_REORDERING=n and just using compiler barriers. Long answer: The issue with write reordering between Meta's hardware threads beyond the cache is only with a particular SoC, and SMP is not used in production on it. It is possible to make the LINSYSEVENT_WR_COMBINE_FLUSH register writable to userspace (it's in a non-mappable region already) but even then the write to that register needs odd placement to be effective (before the shmem write rather than after - which isn't a place any existing barriers are guaranteed to be placed). I'm fairly confident we get away with it in the kernel, and userland normally just uses linked load/store instructions for atomicity which works fine. Cheers James