From: Jens Axboe <axboe@suse.de>
To: Ingo Molnar <mingo@elte.hu>
Cc: Lee Revell <rlrevell@joe-job.com>,
linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: CFQ + 2.6.13-rc4-RT-V0.7.52-02 = BUG: scheduling with irqs disabled
Date: Thu, 25 Aug 2005 08:22:08 +0200 [thread overview]
Message-ID: <20050825062207.GO28272@suse.de> (raw)
In-Reply-To: <20050825060958.GB26398@elte.hu>
On Thu, Aug 25 2005, Ingo Molnar wrote:
>
> * Jens Axboe <axboe@suse.de> wrote:
>
> > > BUG: scheduling with irqs disabled: libc6.postinst/0x20000000/13229
> > > caller is ___down_mutex+0xe9/0x1a0
> > > [<c029c1f9>] schedule+0x59/0xf0 (8)
> > > [<c029ced9>] ___down_mutex+0xe9/0x1a0 (28)
> > > [<c0221832>] cfq_exit_single_io_context+0x22/0xa0 (84)
> > > [<c02218ea>] cfq_exit_io_context+0x3a/0x50 (16)
> > > [<c021db84>] exit_io_context+0x64/0x70 (16)
> > > [<c011efda>] do_exit+0x5a/0x3e0 (20)
> > > [<c011f3ca>] do_group_exit+0x2a/0xb0 (24)
> > > [<c0103039>] syscall_call+0x7/0xb (20)
> >
> > Hmm, Ingo I seem to remember you saying that the following construct:
> >
> > local_irq_save(flags);
> > spin_lock(lock);
> >
> > which is equivelant to spin_lock_irqsave() in mainline being illegal
> > in -RT, is that correct? This is what cfq uses right now for an
> > exiting task, as the above trace indicates.
>
> yes, that's correct. Mainline's exit_io_contexts() uses the above
> construct because there's no task_lock_irqsave(current, flags) API.
>
> note that recent -RT kernels are a lot less drastic about these cases
> and print a once-per-bootup warning, not a scary message like above.
> This message should not happen in recent -RT kernels.
>
> The problem was this piece of code in exit_io_contexts():
>
> local_irq_save(flags);
> task_lock(current);
> ioc = current->io_context;
> current->io_context = NULL;
> ioc->task = NULL;
> task_unlock(current);
> local_irq_restore(flags);
>
> i understand the detached use of flags, but i'm also wondering why irqs
> have to be disabled here in the first place? At this point in do_exit()
> we should normally not have any pending IO attached to our io_context.
> What am i missing?
There can quite easily be lots of pending IO for the io_context (and, in
CFQ's case, below cfq_io_contexts), task exiting is completely decoupled
from any pending io.
Then there's the cfq_exit_io_context() locking. I have to ponder this a
bit, I cannot even convince myself that it is currently safe right now.
--
Jens Axboe
next prev parent reply other threads:[~2005-08-25 6:22 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-08-24 16:02 CFQ + 2.6.13-rc4-RT-V0.7.52-02 = BUG: scheduling with irqs disabled Lee Revell
2005-08-24 17:47 ` Jens Axboe
2005-08-24 21:35 ` Esben Nielsen
2005-08-25 6:10 ` Jens Axboe
2005-08-25 7:40 ` Ingo Molnar
2005-08-25 6:09 ` Ingo Molnar
2005-08-25 6:22 ` Jens Axboe [this message]
2005-08-25 7:52 ` Ingo Molnar
2005-08-25 11:17 ` Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20050825062207.GO28272@suse.de \
--to=axboe@suse.de \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=rlrevell@joe-job.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox