From: Lai Jiangshan <laijs@cn.fujitsu.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Oleg Nesterov <oleg@tv-sign.ru>, Ingo Molnar <mingo@elte.hu>,
	Andrew Morton <akpm@linux-foundation.org>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH] workqueue: not allow recursion run_workqueue
Date: Thu, 22 Jan 2009 14:03:38 +0800
Message-ID: <49780C3A.1050601@cn.fujitsu.com>
In-Reply-To: <1232536373.4847.115.camel@laptop>
Peter Zijlstra wrote:
> On Wed, 2009-01-21 at 17:42 +0800, Lai Jiangshan wrote:
>> 1) lockdep will complain when recursion run_workqueue
>> 2) works is not run orderly when recursion run_workqueue
>>
>> 3) BUG!
>> We use recursion run_workqueue to hidden deadlock when
>> keventd trying to flush its own queue.
>>
>> It's bug. When flush_workqueue()(nested in a work callback)returns,
>> the workqueue is not really flushed, the sequence statement of
>> this work callback will do some thing bad.
>>
>> So we should not allow workqueue trying to flush its own queue.
>
> The patch looks good, but I'm utterly failing to comprehend this
> changelog. What exactly can go wrong (other than the obvious too deep
> nest and the fact that lockdep will complain)?
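
void do_some_cleanup(void)
{
	find_all_queued_work_struct_and_mark_it_old();
	flush_workqueue(workqueue);
	/* we can destroy the old work_structs because we have flushed them */
	destroy_old_work_structs();
}

If work->func() calls do_some_cleanup(), it is very probably a bug:
the nested flush_workqueue() returns before the workqueue is really
flushed, so the statements that follow it (here
destroy_old_work_structs()) act on work that has not actually finished.
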
>
>> Signed-off-by: Lai Jiangshan <laijs@cn.fujitsu.com>
>> ---
>> diff --git a/kernel/workqueue.c b/kernel/workqueue.c
>> index 2f44583..1129cde 100644
>> --- a/kernel/workqueue.c
>> +++ b/kernel/workqueue.c
>> @@ -48,8 +48,6 @@ struct cpu_workqueue_struct {
>>
>>  	struct workqueue_struct *wq;
>>  	struct task_struct *thread;
>> -
>> -	int run_depth;		/* Detect run_workqueue() recursion depth */
>>  } ____cacheline_aligned;
>>
>>  /*
>> @@ -262,13 +260,6 @@ EXPORT_SYMBOL_GPL(queue_delayed_work_on);
>>  static void run_workqueue(struct cpu_workqueue_struct *cwq)
>>  {
>>  	spin_lock_irq(&cwq->lock);
>> -	cwq->run_depth++;
>> -	if (cwq->run_depth > 3) {
>> -		/* morton gets to eat his hat */
>> -		printk("%s: recursion depth exceeded: %d\n",
>> -			__func__, cwq->run_depth);
>> -		dump_stack();
>> -	}
>>  	while (!list_empty(&cwq->worklist)) {
>>  		struct work_struct *work = list_entry(cwq->worklist.next,
>>  						struct work_struct, entry);
>> @@ -311,7 +302,6 @@ static void run_workqueue(struct cpu_workqueue_struct *cwq)
>>  		spin_lock_irq(&cwq->lock);
>>  		cwq->current_work = NULL;
>>  	}
>> -	cwq->run_depth--;
>>  	spin_unlock_irq(&cwq->lock);
>>  }
>>
>> @@ -368,29 +358,20 @@ static void insert_wq_barrier(struct cpu_workqueue_struct *cwq,
>>
>>  static int flush_cpu_workqueue(struct cpu_workqueue_struct *cwq)
>>  {
>> -	int active;
>> +	int active = 0;
>> +	struct wq_barrier barr;
>>
>> -	if (cwq->thread == current) {
>> -		/*
>> -		 * Probably keventd trying to flush its own queue. So simply run
>> -		 * it by hand rather than deadlocking.
>> -		 */
>> -		run_workqueue(cwq);
>> -		active = 1;
>> -	} else {
>> -		struct wq_barrier barr;
>> +	WARN_ON(cwq->thread == current);
>>
>> -		active = 0;
>> -		spin_lock_irq(&cwq->lock);
>> -		if (!list_empty(&cwq->worklist) || cwq->current_work != NULL) {
>> -			insert_wq_barrier(cwq, &barr, &cwq->worklist);
>> -			active = 1;
>> -		}
>> -		spin_unlock_irq(&cwq->lock);
>> -
>> -		if (active)
>> -			wait_for_completion(&barr.done);
>> +	spin_lock_irq(&cwq->lock);
>> +	if (!list_empty(&cwq->worklist) || cwq->current_work != NULL) {
>> +		insert_wq_barrier(cwq, &barr, &cwq->worklist);
>> +		active = 1;
>>  	}
>> +	spin_unlock_irq(&cwq->lock);
>> +
>> +	if (active)
>> +		wait_for_completion(&barr.done);
>>
>>  	return active;
>>  }
>>