From: Andrew Morton <akpm@linux-foundation.org>
To: Ingo Molnar <mingo@elte.hu>
Cc: rusty@rustcorp.com.au, travis@sgi.com, mingo@redhat.com,
davej@redhat.com, cpufreq@vger.kernel.org,
linux-kernel@vger.kernel.org, Oleg Nesterov <oleg@redhat.com>
Subject: Re: [PATCH 2/3] work_on_cpu: Use our own workqueue.
Date: Mon, 26 Jan 2009 13:00:46 -0800
Message-ID: <20090126130046.37b8f34e.akpm@linux-foundation.org>
In-Reply-To: <20090126202022.GA8867@elte.hu>

On Mon, 26 Jan 2009 21:20:22 +0100
Ingo Molnar <mingo@elte.hu> wrote:
>
> * Andrew Morton <akpm@linux-foundation.org> wrote:
>
> > On Mon, 26 Jan 2009 18:16:18 +0100
> > Ingo Molnar <mingo@elte.hu> wrote:
> >
> > >
> > > * Andrew Morton <akpm@linux-foundation.org> wrote:
> > >
> > > > > > Yet another kernel thread for each CPU. All because of some dung
> > > > > > way down in arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c.
> > > > > >
> > > > > > Is there no other way?
> > > > >
> > > > > Perhaps, but this works. Trying to be clever got me into this mess in
> > > > > the first place.
> > > > >
> > > > > We could stop using workqueues and change work_on_cpu to create a
> > > > > thread every time, which would give it a new failure mode so I don't
> > > > > know that everyone could use it any more. Or we could keep a single
> > > > > thread around to do all the cpus, and duplicate much of the workqueue
> > > > > code.
> > > > >
> > > > > None of these options are appealing...
> > > >
> > > > Can we try harder please? 10 screenfuls of kernel threads in the ps
> > > > output is just irritating.
> > > >
> > > > How about banning the use of work_on_cpu() from schedule_work() handlers
> > > > and then fixing that driver somehow?
> > >
> > > Yes, but that's fundamentally fragile: anyone who happens to stick the
> > > wrong thing into keventd (and it's dead easy because schedule_work() is
> > > easy to use) will lock up work_on_cpu() users.
> > >
> >
> > --- a/kernel/workqueue.c~a
> > +++ a/kernel/workqueue.c
> > @@ -998,6 +998,8 @@ long work_on_cpu(unsigned int cpu, long
> > {
> > struct work_for_cpu wfc;
> >
> > + BUG_ON(current_is_keventd());
> > +
> > INIT_WORK(&wfc.work, do_work_for_cpu);
> > wfc.fn = fn;
> > wfc.arg = arg;
> > _
> >
> >
> > That wasn't so hard.
>
> What is the purpose of your change? I'm not sure you understood the
> problem.
Well. That's because I was forced to resort to guesswork.
> The problem is not with work_on_cpu() usage. The problem is:
>
> 1) holding locks while calling work_on_cpu()
>
> 2) same locks being taken by a worklet used by some other code
>
> work_on_cpu() really wants to serialize on its own workload only, not on
> the other stuff that might sometimes be queued up in the keventd
> workqueue.
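Spelled out as code, that failure mode looks something like this
hypothetical sketch (the lock, worklet, and callback names are all
invented for illustration):

#include <linux/mutex.h>
#include <linux/workqueue.h>

static DEFINE_MUTEX(shared_lock);	/* hypothetical lock */

/* Some other subsystem's worklet, queued on keventd via schedule_work(). */
static void unrelated_worklet(struct work_struct *w)
{
	mutex_lock(&shared_lock);	/* sleeps: the caller below holds it */
	/* ... */
	mutex_unlock(&shared_lock);
}

static long my_fn(void *arg)
{
	return 0;			/* the work we want run on one CPU */
}

static void caller(void)
{
	mutex_lock(&shared_lock);
	/*
	 * work_on_cpu() queues onto keventd and waits for completion.
	 * If unrelated_worklet was queued ahead of us, keventd is stuck
	 * on shared_lock, which we hold: an AB-BA deadlock.
	 */
	work_on_cpu(1, my_fn, NULL);
	mutex_unlock(&shared_lock);
}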
But but but, we fixed that ages ago, I think. Yet I don't see the code
there.
If we want to wait on a *particular* keventd work item then we
shouldn't wait on all the other queued ones:
- If it's currently running, wait on it
- If it isn't yet running, detach it from the queue and run it directly.
Maybe I'm thinking of a different subsystem, but I don't think so.
Maybe Oleg recalls what happened to that?
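The shape of it would be something like this rough sketch (both
helpers here are hypothetical, not the real workqueue API):

#include <linux/workqueue.h>

/* Flush one work item without draining the whole keventd queue. */
static void flush_one_work(struct work_struct *work)
{
	if (steal_pending_work(work)) {		/* hypothetical helper */
		/* Not running yet: we detached it from the queue,
		 * so just run it right here. */
		work->func(work);
	} else {
		/* Already running (or finished): wait for this one
		 * item only, not for everything queued behind it. */
		wait_for_single_work(work);	/* hypothetical helper */
	}
}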
> > > work_on_cpu() is an important (and lowlevel enough) facility to be
> > > isolated from casual interaction like that.
> >
> > We have one single (known) caller in the whole kernel. This is not
> > worth adding another great pile of kernel threads for!
>
> I'd expect there to be more as part of the cpumask stack reduction
> patches that Rusty and Mike are working on.
>
> in any case it's a correctness issue: work_on_cpu() is just as generic
> a facility as on_each_cpu() - with the difference that it can handle
> blocking contexts too.
Well, on_each_cpu() has restrictions. Can't call it with local
interrupts disabled. Can't call it (synchronously) while holding locks
which the callback takes.
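A hypothetical example of those restrictions (the lock, counter and
functions are invented for illustration):

#include <linux/smp.h>
#include <linux/spinlock.h>

static DEFINE_SPINLOCK(stats_lock);	/* hypothetical */
static unsigned long hits;

/* Runs on every CPU in IPI context: interrupts off, must not sleep. */
static void count_hits(void *info)
{
	spin_lock(&stats_lock);
	hits++;
	spin_unlock(&stats_lock);
}

static void collect(void)
{
	/*
	 * The caller must have local interrupts enabled, and must NOT
	 * already hold stats_lock: with wait=1 we spin until every
	 * CPU's callback completes, and each callback takes the lock.
	 */
	on_each_cpu(count_hits, NULL, 1);
}

work_on_cpu() avoids the no-sleeping restriction by running fn in
process context - which is exactly why it needs a workqueue (or a
thread) underneath.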
> So if it's generic it ought to be implemented in a generic way - not a
> "dont use from any codepath that has a lock held that might occasionally
> also be held in a keventd worklet". (which is a totally unmaintainable
> proposition and which would just cause repeat bugs again and again.)
That's different. The core fault here lies in the keventd workqueue
handling code. If we're flushing work A then we shouldn't go and block
behind unrelated work B.