From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756596AbZBFBUy (ORCPT ); Thu, 5 Feb 2009 20:20:54 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753150AbZBFBUq (ORCPT ); Thu, 5 Feb 2009 20:20:46 -0500 Received: from cn.fujitsu.com ([222.73.24.84]:62869 "EHLO song.cn.fujitsu.com" rhost-flags-OK-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1753024AbZBFBUp (ORCPT ); Thu, 5 Feb 2009 20:20:45 -0500 Message-ID: <498B9055.8020407@cn.fujitsu.com> Date: Fri, 06 Feb 2009 09:20:21 +0800 From: Lai Jiangshan User-Agent: Thunderbird 2.0.0.19 (Windows/20081209) MIME-Version: 1.0 To: Oleg Nesterov CC: Peter Zijlstra , =?UTF-8?B?RnLDqWTDqXJpYyBXZWlz?= =?UTF-8?B?YmVja2Vy?= , Ingo Molnar , Andrew Morton , Eric Dumazet , Linux Kernel Mailing List Subject: Re: [PATCH 2/3] workqueue: not allow recursion run_workqueue References: <497838F0.7020408@cn.fujitsu.com> <20090122093046.GC5891@nowhere> <20090122093649.GD24758@elte.hu> <1232622615.4890.114.camel@laptop> <498AA0F1.2030003@cn.fujitsu.com> <20090205170156.GA25517@redhat.com> In-Reply-To: <20090205170156.GA25517@redhat.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Oleg Nesterov wrote: > On 02/05, Lai Jiangshan wrote: >> DEADLOCK EXAMPLE for explain my above option: >> >> (work_func0() and work_func1() are work callback, and they >> calls flush_workqueue()) >> >> CPU#0 CPU#1 >> run_workqueue() run_workqueue() >> work_func0() work_func1() >> flush_workqueue() flush_workqueue() >> flush_cpu_workqueue(0) . >> flush_cpu_workqueue(cpu#1) flush_cpu_workqueue(cpu#0) >> waiting work_func1() in cpu#1 waiting work_func0 in cpu#0 >> >> DEADLOCK! > > I am not sure. Note that when work_func0() calls run_workqueue(), > it will clear cwq->current_work, so another flush_ on CPU#1 will > not wait for work_func0, no? cwq->current_work is changed only when !list_empty(&cwq->worklist) in run_workqueue(). so cwq->current_work may not be changed. > > But anyway. Nobody argues, "if (cwq->thread == current) {...}" code in > flush_cpu_workqueue() is bad and should die. Otrherwise, we should > fix the lockdep warning ;) > > The only problem: if we still have the users of this hack, they will > deadlock. But perhaps it is time to fix them. > > And, if it was not clear, I do agree with this change. And Peter > seems to agree as well. > > Oleg. > > > >