From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753866AbbCIRhW (ORCPT ); Mon, 9 Mar 2015 13:37:22 -0400 Received: from g1t5424.austin.hp.com ([15.216.225.54]:50205 "EHLO g1t5424.austin.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751899AbbCIRhQ (ORCPT ); Mon, 9 Mar 2015 13:37:16 -0400 Message-ID: <1425922630.2475.390.camel@j-VirtualBox> Subject: Re: [PATCH] locking/rwsem: Fix lock optimistic spinning when owner is not running From: Jason Low To: Sasha Levin Cc: Linus Torvalds , Ingo Molnar , Peter Zijlstra , Davidlohr Bueso , Tim Chen , "Paul E. McKenney" , Michel Lespinasse , LKML , Dave Jones , Ming Lei , jason.low2@hp.com Date: Mon, 09 Mar 2015 10:37:10 -0700 In-Reply-To: <54FB40BE.1070309@oracle.com> References: <1425714331.2475.388.camel@j-VirtualBox> <54FB40BE.1070309@oracle.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.2.3-0ubuntu6 Content-Transfer-Encoding: 7bit Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, 2015-03-07 at 13:17 -0500, Sasha Levin wrote: > On 03/07/2015 02:45 AM, Jason Low wrote: > > Fixes tip commit b3fd4f03ca0b (locking/rwsem: Avoid deceiving lock spinners). > > > > Ming reported soft lockups occurring when running xfstest due to > > commit b3fd4f03ca0b. > > > > When doing optimistic spinning in rwsem, threads should stop spinning when > > the lock owner is not running. While a thread is spinning on owner, if > > the owner reschedules, owner->on_cpu returns false and we stop spinning. > > > > However, commit b3fd4f03ca0b essentially caused the check to get ignored > > because when we break out of the spin loop due to !on_cpu, we continue > > spinning if sem->owner != NULL. > > > > This patch fixes this by making sure we stop spinning if the owner is not > > running. Furthermore, just like with mutexes, refactor the code such that > > we don't have separate checks for owner_running(). This makes it more > > straightforward in terms of why we exit the spin on owner loop and we > > would also avoid needing to "guess" why we broke out of the loop to make > > this more readable. > > That seems to solve the hangs I'm seeing as well. Great, thanks for confirming this.