From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from foss.arm.com ([217.140.101.70]:32844 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751337AbeFTI0b (ORCPT ); Wed, 20 Jun 2018 04:26:31 -0400 Date: Wed, 20 Jun 2018 09:17:16 +0100 From: Will Deacon To: =?utf-8?B?6ZmI5Y2O5omN?= Cc: Peter Zijlstra , Paul Burton , Ralf Baechle , James Hogan , linux-mips , Fuxin Zhang , wuzhangjin , Huacai Chen , stable , Alan Stern , AndreaParri , Boqun Feng , Nicholas Piggin , David Howells , Jade Alglave , Luc Maranget , "Paul E. McKenney" , Akira Yokosawa , linux-kernel Subject: Re: [PATCH] MIPS: implement smp_cond_load_acquire() for Loongson-3 Message-ID: <20180620081715.GA27776@arm.com> References: <1529042858-9483-1-git-send-email-chenhc@lemote.com> <20180618185141.yvkrsbdi2gbxjxj7@pburton-laptop> <20180619072242.GC2494@hirez.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: Sender: stable-owner@vger.kernel.org List-ID: On Wed, Jun 20, 2018 at 11:31:55AM +0800, 陈华才 wrote: > Loongson-3's Store Fill Buffer is nearly the same as your "Store Buffer", > and it increases the memory ordering weakness. So, smp_cond_load_acquire() > only need a __smp_mb() before the loop, not after every READ_ONCE(). In > other word, the following code is just OK: > > #define smp_cond_load_acquire(ptr, cond_expr) \ > ({ \ > typeof(ptr) __PTR = (ptr); \ > typeof(*ptr) VAL; \ > __smp_mb(); \ > for (;;) { \ > VAL = READ_ONCE(*__PTR); \ > if (cond_expr) \ > break; \ > cpu_relax(); \ > } \ > __smp_mb(); \ > VAL; \ > }) > > the __smp_mb() before loop is used to avoid "reads prioritised over > writes", which is caused by SFB's weak ordering and similar to ARM11MPCore > (mentioned by Will Deacon). Sure, but smp_cond_load_acquire() isn't the only place you'll see this sort of pattern in the kernel. In other places, the only existing arch hook is cpu_relax(), so unless you want to audit all loops and add a special MIPs-specific smp_mb() to those that are affected, I think your only option is to stick it in cpu_relax(). I assume you don't have a control register that can disable this prioritisation in the SFB? Will