From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756121AbZETNXS (ORCPT ); Wed, 20 May 2009 09:23:18 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753730AbZETNXH (ORCPT ); Wed, 20 May 2009 09:23:07 -0400 Received: from mx2.redhat.com ([66.187.237.31]:43907 "EHLO mx2.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753360AbZETNXG (ORCPT ); Wed, 20 May 2009 09:23:06 -0400 Date: Wed, 20 May 2009 15:18:23 +0200 From: Oleg Nesterov To: Peter Zijlstra Cc: Ingo Molnar , Zdenek Kabelac , "Rafael J. Wysocki" , Linux Kernel Mailing List Subject: Re: INFO: possible circular locking dependency at cleanup_workqueue_thread Message-ID: <20090520131823.GA14933@redhat.com> References: <20090517071834.GA8507@elte.hu> <1242821938.26820.586.camel@twins> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1242821938.26820.586.camel@twins> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 05/20, Peter Zijlstra wrote: > > > > ======================================================= > > > [ INFO: possible circular locking dependency detected ] > > > 2.6.30-rc5-00097-gd665355 #59 > > > ------------------------------------------------------- > > > pm-suspend/12129 is trying to acquire lock: > > > (events){+.+.+.}, at: [] cleanup_workqueue_thread+0x26/0xd0 > > > > > > but task is already holding lock: > > > (cpu_add_remove_lock){+.+.+.}, at: [] > > > cpu_maps_update_begin+0x17/0x20 > > > > > > which lock already depends on the new lock. > > > > > > > > > the existing dependency chain (in reverse order) is: > > > > > > -> #5 (cpu_add_remove_lock){+.+.+.}: > > > [] __lock_acquire+0xc64/0x10a0 > > > [] lock_acquire+0x98/0x140 > > > [] __mutex_lock_common+0x4c/0x3b0 > > > [] mutex_lock_nested+0x46/0x60 > > > [] cpu_maps_update_begin+0x17/0x20 > > > [] __create_workqueue_key+0xc3/0x250 > > > [] stop_machine_create+0x40/0xb0 > > > [] sys_delete_module+0x84/0x270 > > > [] system_call_fastpath+0x16/0x1b > > > [] 0xffffffffffffffff > > Oleg, why does __create_workqueue_key() require cpu_maps_update_begin()? > Wouldn't get_online_cpus() be enough to freeze the online cpus? Yes, get_online_cpus() pins online CPUs. But CPU_POST_DEAD calls cleanup_workqueue_thread() without cpu_hotplug.lock, this means that create/destroy can race with cpu_down(). We can avoid cpu_add_remove_lock, but then we have to add another lock to protect workqueues, cpu_populated_map, etc. > Breaking the setup_lock -> cpu_add_remove_lock dependency seems > sufficient. Hmm. What do you mean? Afaics setup_lock -> cpu_add_remove_lock is not a problem? Oleg.