public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Steven Rostedt <rostedt@goodmis.org>
To: Robert Crocombe <rcrocomb@gmail.com>
Cc: linux-kernel@vger.kernel.org, Ingo Molnar <mingo@elte.hu>,
	Thomas Gleixner <tglx@linutronix.de>,
	Bill Huey <billh@gnuppy.monkey.org>
Subject: Re: Problems with 2.6.17-rt8
Date: Thu, 03 Aug 2006 10:27:41 -0400	[thread overview]
Message-ID: <1154615261.32264.6.camel@localhost.localdomain> (raw)
In-Reply-To: <e6babb600608030448y7bb0cd34i74f5f632e4caf1b1@mail.gmail.com>

Please don't trim CC lines.  LKML is too big to read all emails.

On Thu, 2006-08-03 at 04:48 -0700, Robert Crocombe wrote: 
> On 8/2/06, Steven Rostedt <rostedt@goodmis.org> wrote:
> > You mention problems but I don't see you listing what exactly the
> > problems are.  Just saying "the problems exist" doesn't tell us
> > anything.
> >
> > Don't assume that we will go to some web site to figure out what you're
> > talking about. Please list the problems you are facing.
> 
> The machine dies (no alt-sysrq, no keyboard LEDs of any kind: dead in
> the water).  I thought the log would provide more useful information
> without potentially erroneous editorialization by myself.  Here are
> some highlights:
> 
> kjournald/1105[CPU#3]: BUG in debug_rt_mutex_unlock at kernel/rtmutex-debug.c:47
> 1

Ouch, that looks like kjournald is unlocking a lock that it doesn't own?

> 
> Call Trace:
>        <ffffffff8047655a>{_raw_spin_lock_irqsave+24}
>        <ffffffff8022b272>{__WARN_ON+100}
>        <ffffffff802457e4>{debug_rt_mutex_unlock+199}
>        <ffffffff804757b7>{rt_lock_slowunlock+25}
>        <ffffffff80476301>{__lock_text_start+9}

hmm, here we are probably having trouble with the percpu slab locks,
that is somewhat of a hack to get slabs working on a per cpu basis.

>        <ffffffff80271e93>{kmem_cache_alloc+202}

It would also be nice to know exactly where ffffffff80271e93 is.

>        <ffffffff8025493b>{mempool_alloc_slab+17}
>        <ffffffff80254d07>{mempool_alloc+75}
>        <ffffffff802f2f8c>{generic_make_request+375}
>        <ffffffff8027b914>{bio_alloc_bioset+35}
>        <ffffffff8027ba2a>{bio_alloc+16}
>        <ffffffff802781d1>{submit_bh+137}
>        <ffffffff80279377>{ll_rw_block+122}
>        <ffffffff8027939e>{ll_rw_block+161}
>        <ffffffff802c85dc>{journal_commit_transaction+1011}
>        <ffffffff80476a5f>{_raw_spin_unlock_irqrestore+56}
>        <ffffffff804769ac>{_raw_spin_unlock+46}
>        <ffffffff804757df>{rt_lock_slowunlock+65}
>        <ffffffff80476301>{__lock_text_start+9}
>        <ffffffff802339b0>{try_to_del_timer_sync+85}
>        <ffffffff802cca63>{kjournald+202}
>        <ffffffff8023db60>{autoremove_wake_function+0}
>        <ffffffff802cc999>{kjournald+0}
>        <ffffffff8023d739>{keventd_create_kthread+0}
>        <ffffffff8023da2f>{kthread+219}
>        <ffffffff80225a23>{schedule_tail+188}
>        <ffffffff8020aaca>{child_rip+8}
>        <ffffffff8023d739>{keventd_create_kthread+0}
>        <ffffffff8023d954>{kthread+0}
>        <ffffffff8020aac2>{child_rip+0}
> ---------------------------
> | preempt count: 00000002 ]
> | 2-level deep critical section nesting:
> ----------------------------------------
> .. [<ffffffff80476499>] .... _raw_spin_lock+0x16/0x23
> .....[<ffffffff804757af>] ..   ( <= rt_lock_slowunlock+0x11/0x6b)
> .. [<ffffffff8047655a>] .... _raw_spin_lock_irqsave+0x18/0x29
> .....[<ffffffff8022b22d>] ..   ( <= __WARN_ON+0x1f/0x82)
> 
> 
> Somewhat later:
> 
> Kernel BUG at kernel/rtmutex.c:639

The rest was probably caused as a side effect from above.  The above is
already broken!

You have NUMA configured too, so this is also something to look at.

I still wouldn't ignore the first bug message you got:

----
BUG: scheduling while atomic: udev_run_devd/0x00000001/1568

Call Trace:
       <ffffffff8045c693>{__schedule+155}
       <ffffffff8045f156>{_raw_spin_unlock_irqrestore+53}
       <ffffffff80242241>{task_blocks_on_rt_mutex+518}
       <ffffffff80252da0>{free_pages_bulk+39}
       <ffffffff80252da0>{free_pages_bulk+39}
...
----

This could also have a side effect that messes things up.

Unfortunately, right now I'm assigned to other tasks and I cant spend
much more time on this at the moment.  So hopefully, Ingo, Thomas or
Bill, or someone else can help you find the reason for this problem.

-- Steve



  reply	other threads:[~2006-08-03 14:27 UTC|newest]

Thread overview: 58+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <e6babb600608012231r74470b77x6e7eaeab222ee160@mail.gmail.com>
2006-08-02  5:37 ` Problems with 2.6.17-rt8 Robert Crocombe
2006-08-02 17:51   ` Steven Rostedt
2006-08-03 11:48     ` Robert Crocombe
2006-08-03 14:27       ` Steven Rostedt [this message]
2006-08-03 15:08         ` Robert Crocombe
2006-08-03 15:27           ` Steven Rostedt
2006-08-03 15:48             ` Robert Crocombe
2006-08-03 16:04               ` Steven Rostedt
2006-08-03 17:16                 ` Robert Crocombe
2006-08-03 20:22         ` Bill Huey
2006-08-03 20:54           ` Steven Rostedt
2006-08-03 21:18             ` Bill Huey
2006-08-08  2:56         ` [Patch] restore the RCU callback to defer put_task_struct() " Bill Huey
2006-08-08  3:05           ` Bill Huey
2006-08-08 18:46             ` Robert Crocombe
2006-08-08 19:06               ` Steven Rostedt
2006-08-08 21:35                 ` Robert Crocombe
2006-08-08 21:44                   ` Steven Rostedt
2006-08-08 22:10                     ` Robert Crocombe
2006-08-09 17:19                   ` Robert Crocombe
2006-08-09  0:35               ` Bill Huey
2006-08-11  7:47               ` Bill Huey
2006-08-11 14:52                 ` Robert Crocombe
2006-08-09 22:05             ` Esben Nielsen
2006-08-10  0:00               ` Steven Rostedt
2006-08-10  2:18               ` Bill Huey
2006-08-11  1:06                 ` Bill Huey
2006-08-11  8:16                   ` Esben Nielsen
2006-08-11  8:46                     ` Bill Huey
2006-08-11 15:00                   ` Robert Crocombe
2006-08-11 21:18                     ` Bill Huey
     [not found]                       ` <20060811221054.GA32459@gnuppy.monkey.org>
2006-08-14 17:56                         ` Robert Crocombe
2006-08-14 23:44                           ` Bill Huey
2006-08-15 10:43                             ` Bill Huey
2006-08-15 17:53                             ` Robert Crocombe
2006-08-18 11:59                               ` Bill Huey
2006-08-22  0:21                                 ` Robert Crocombe
2006-08-22  1:37                                   ` rtmutex assert failure (was [Patch] restore the RCU callback...) Bill Huey
2006-08-22 23:20                                     ` Bill Huey
2006-08-22 23:21                                       ` Bill Huey
2006-08-23 17:14                                       ` Robert Crocombe
2006-08-23 17:24                                         ` Robert Crocombe
2006-08-23 20:20                                         ` Bill Huey
2006-08-23 21:05                                           ` Bill Huey
2006-08-23 21:08                                             ` Bill Huey
2006-08-24  1:22                                               ` Robert Crocombe
2006-08-24  1:46                                                 ` Bill Huey
2006-08-25  7:19                                                   ` Bill Huey
2006-08-26  1:24                                                     ` Robert Crocombe
2006-08-26  1:28                                                       ` Robert Crocombe
2006-08-26  2:37                                                         ` Robert Crocombe
2006-08-26 10:28                                                       ` Bill Huey
2006-08-26 10:49                                                       ` Bill Huey
2006-08-28 18:33                                                         ` Robert Crocombe
2006-08-28 20:28                                                           ` Bill Huey
2006-08-29  4:05                                                             ` Robert Crocombe
2006-08-29 17:11                                                               ` Bill Huey
2006-08-29 17:19                                                                 ` Robert Crocombe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1154615261.32264.6.camel@localhost.localdomain \
    --to=rostedt@goodmis.org \
    --cc=billh@gnuppy.monkey.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=rcrocomb@gmail.com \
    --cc=tglx@linutronix.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox