* 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open
@ 2006-10-31 17:19 Mark Lord
2006-10-31 17:53 ` Linus Torvalds
0 siblings, 1 reply; 4+ messages in thread
From: Mark Lord @ 2006-10-31 17:19 UTC (permalink / raw)
To: Linux Kernel; +Cc: Linus Torvalds, Andrew Morton
I don't know if the kernel is getting better at detecting its own bugs,
or if 2.6.19 is shaping up to be the flakiest kernel in recent memory.
I found this in my syslog just now:
kernel: BUG: warning at kernel/mutex.c:132/__mutex_lock_common()
kernel: [<c0104088>] dump_trace+0x1c8/0x200
kernel: [<c01040da>] show_trace_log_lvl+0x1a/0x30
kernel: [<c0104832>] show_trace+0x12/0x20
kernel: [<c0104999>] dump_stack+0x19/0x20
kernel: [<c035d8ef>] __mutex_lock_slowpath+0x20f/0x260
kernel: [<c035d961>] mutex_lock+0x21/0x30
kernel: [<c01a0286>] sysfs_dir_open+0x26/0x60
kernel: [<c0164925>] __dentry_open+0xb5/0x1e0
kernel: [<c0164af5>] nameidata_to_filp+0x35/0x40
kernel: [<c0164b49>] do_filp_open+0x49/0x50
kernel: [<c0164b97>] do_sys_open+0x47/0xe0
kernel: [<c0164c6c>] sys_open+0x1c/0x20
kernel: [<c0102f4d>] sysenter_past_esp+0x56/0x8d
kernel: [<b7fc5410>] 0xb7fc5410
kernel: =======================
kernel: BUG: unable to handle kernel paging request at virtual address 6e726574
kernel: printing eip:
kernel: c022a8fc
kernel: *pde = 00000000
kernel: Oops: 0000 [#1]
kernel: SMP
kernel: Modules linked in: button battery ac dm_mod snd_hda_intel snd_hda_codec snd_pcm snd_timer snd soundcore snd_page_alloc r1000 ahci edd fan thermal processor
kernel: CPU: 0
kernel: EIP: 0060:[<c022a8fc>] Not tainted VLI
kernel: EFLAGS: 00010046 (2.6.19-rc3-git7-ml #9)
kernel: EIP is at __list_add+0x1c/0x80
kernel: eax: 6e726574 ebx: dfe39120 ecx: dfe39120 edx: 6e726574
kernel: esi: 6e726574 edi: f7861ea4 ebp: f7861e88 esp: f7861e70
kernel: ds: 007b es: 007b ss: 0068
kernel: Process hald (pid: 3360, ti=f7860000 task=f7974af0 task.ti=f7860000)
kernel: Stack: 00000000 00000002 00000001 dfe39100 00000246 f7974af0 f7861ec4 c035d76e
kernel: 00000000 00000002 c035d961 c0362479 dfe39120 f7861ea4 f7861ea4 11111111
kernel: dfe39100 f7861ea4 dfe39100 f753ac00 dfe3a8a4 f7861ed0 c035d961 f78c0b34
kernel: Call Trace:
kernel: [<c035d76e>] __mutex_lock_slowpath+0x8e/0x260
kernel: [<c035d961>] mutex_lock+0x21/0x30
kernel: [<c01a0286>] sysfs_dir_open+0x26/0x60
kernel: [<c0164925>] __dentry_open+0xb5/0x1e0
kernel: [<c0164af5>] nameidata_to_filp+0x35/0x40
kernel: [<c0164b49>] do_filp_open+0x49/0x50
kernel: [<c0164b97>] do_sys_open+0x47/0xe0
kernel: [<c0164c6c>] sys_open+0x1c/0x20
kernel: [<c0102f4d>] sysenter_past_esp+0x56/0x8d
kernel: [<b7fc5410>] 0xb7fc5410
kernel: =======================
kernel: Code: 13 eb a7 8d b6 00 00 00 00 8d bf 00 00 00 00 55 89 e5 83 ec 18 89 5d f4 89 cb 89 75 f8 89 d6 89 7d fc 89 c7 8b 41 04 39 d0 75 1d <8b> 06 39 c3 75 35 89 7b 04 89 1f 8b 5d f4 89 77 04 89 3e 8b 75
kernel: EIP: [<c022a8fc>] __list_add+0x1c/0x80 SS:ESP 0068:f7861e70
kernel: <3>BUG: sleeping function called from invalid context at kernel/rwsem.c:20
kernel: in_atomic():0, irqs_disabled():1
kernel: [<c0104088>] dump_trace+0x1c8/0x200
kernel: [<c01040da>] show_trace_log_lvl+0x1a/0x30
kernel: [<c0104832>] show_trace+0x12/0x20
kernel: [<c0104999>] dump_stack+0x19/0x20
kernel: [<c01166f8>] __might_sleep+0xa8/0xb0
kernel: [<c0134468>] down_read+0x18/0x60
kernel: [<c0143869>] acct_collect+0x39/0x150
kernel: [<c0120715>] do_exit+0xf5/0x8a0
kernel: [<c0104725>] die+0x2f5/0x300
kernel: [<c0114c0f>] do_page_fault+0x1ff/0x5f0
kernel: [<c035f0a9>] error_code+0x39/0x40
kernel: [<c022a8fc>] __list_add+0x1c/0x80
kernel: [<c035d76e>] __mutex_lock_slowpath+0x8e/0x260
kernel: [<c035d961>] mutex_lock+0x21/0x30
kernel: [<c01a0286>] sysfs_dir_open+0x26/0x60
kernel: [<c0164925>] __dentry_open+0xb5/0x1e0
kernel: [<c0164af5>] nameidata_to_filp+0x35/0x40
kernel: [<c0164b49>] do_filp_open+0x49/0x50
kernel: [<c0164b97>] do_sys_open+0x47/0xe0
kernel: [<c0164c6c>] sys_open+0x1c/0x20
kernel: [<c0102f4d>] sysenter_past_esp+0x56/0x8d
kernel: [<b7fc5410>] 0xb7fc5410
kernel: =======================
silvy:~#
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open
2006-10-31 17:19 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open Mark Lord
@ 2006-10-31 17:53 ` Linus Torvalds
2006-10-31 17:57 ` Mark Lord
2006-11-06 15:13 ` Takashi Iwai
0 siblings, 2 replies; 4+ messages in thread
From: Linus Torvalds @ 2006-10-31 17:53 UTC (permalink / raw)
To: Mark Lord; +Cc: Linux Kernel, Andrew Morton, Takashi Iwai, Frederik Deweerdt
On Tue, 31 Oct 2006, Mark Lord wrote:
>
> I found this in my syslog just now:
>
> kernel: BUG: warning at kernel/mutex.c:132/__mutex_lock_common()
That's the "spin_lock_mutex()" debugging (kernel/mutex-debug.h), and it's
one of either
DEBUG_LOCKS_WARN_ON(in_interrupt());
or
DEBUG_LOCKS_WARN_ON(l->magic != l);
where the much more likely one is the "l->magic" one triggering, because
your bug that follows is:
> kernel: BUG: unable to handle kernel paging request at virtual address 6e726574
64726574 is "tern", for whatever that's worth, and that EIP is in
"__list_add()", which comes from list_add_tail(), and so it looks like the
mutex "lock->lock_list" is also totally buggered (which would certainly
fit with the lock "magic" field being crud too).
> kernel: Call Trace:
> kernel: [<c035d76e>] __mutex_lock_slowpath+0x8e/0x260
> kernel: [<c035d961>] mutex_lock+0x21/0x30
> kernel: [<c01a0286>] sysfs_dir_open+0x26/0x60
This would be
mutex_lock(&dentry->d_inode->i_mutex);
in sysfs_dir_open(), so you would seem to have an inode that is
overwritten by crud (or it could be a dentry with a bogus d_inode
pointer, of course).
There's a ton of stuff with "tern" in it, mostly "Internal", so sadly even
the nice string pattern doesn't seem to be all that helpful.
Is this somewhat reproducible?
HOWEVER, google does find a clue. We've had reports of something somewhat
similar before: googling for "__mutex_lock_slowpath" and "sysfs_dir_open"
shows for example
http://lkml.org/lkml/2006/8/18/72
where the thread ended up first blaming sound, but then deciding that
maybe it was DRM-related. However, you don't seem to have any AGP or DRM
support at all, so maybe it really _is_ a sound problem.
Mark - can you verify that you don't have any DRM support in your kernel?
Frederic, Takashi, did that bug-report ever get resolved?
Linus
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open
2006-10-31 17:53 ` Linus Torvalds
@ 2006-10-31 17:57 ` Mark Lord
2006-11-06 15:13 ` Takashi Iwai
1 sibling, 0 replies; 4+ messages in thread
From: Mark Lord @ 2006-10-31 17:57 UTC (permalink / raw)
To: Linus Torvalds
Cc: Linux Kernel, Andrew Morton, Takashi Iwai, Frederik Deweerdt
Linus Torvalds wrote:
>
> Is this somewhat reproducible?
Just the one occurance of it, with rc3-git7.
I've now moved on to -rc4.
> HOWEVER, google does find a clue. We've had reports of something somewhat
> similar before: googling for "__mutex_lock_slowpath" and "sysfs_dir_open"
> shows for example
>
> http://lkml.org/lkml/2006/8/18/72
>
> where the thread ended up first blaming sound, but then deciding that
> maybe it was DRM-related. However, you don't seem to have any AGP or DRM
> support at all, so maybe it really _is_ a sound problem.
>
> Mark - can you verify that you don't have any DRM support in your kernel?
# CONFIG_DRM is not set
(but I'm about to change that!).
Cheers
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open
2006-10-31 17:53 ` Linus Torvalds
2006-10-31 17:57 ` Mark Lord
@ 2006-11-06 15:13 ` Takashi Iwai
1 sibling, 0 replies; 4+ messages in thread
From: Takashi Iwai @ 2006-11-06 15:13 UTC (permalink / raw)
To: Linus Torvalds; +Cc: Mark Lord, Linux Kernel, Andrew Morton, Frederik Deweerdt
At Tue, 31 Oct 2006 09:53:35 -0800 (PST),
Linus Torvalds wrote:
>
>
>
> On Tue, 31 Oct 2006, Mark Lord wrote:
> >
> > I found this in my syslog just now:
> >
> > kernel: BUG: warning at kernel/mutex.c:132/__mutex_lock_common()
>
> That's the "spin_lock_mutex()" debugging (kernel/mutex-debug.h), and it's
> one of either
>
> DEBUG_LOCKS_WARN_ON(in_interrupt());
> or
> DEBUG_LOCKS_WARN_ON(l->magic != l);
>
> where the much more likely one is the "l->magic" one triggering, because
> your bug that follows is:
>
> > kernel: BUG: unable to handle kernel paging request at virtual address 6e726574
>
> 64726574 is "tern", for whatever that's worth, and that EIP is in
> "__list_add()", which comes from list_add_tail(), and so it looks like the
> mutex "lock->lock_list" is also totally buggered (which would certainly
> fit with the lock "magic" field being crud too).
>
> > kernel: Call Trace:
> > kernel: [<c035d76e>] __mutex_lock_slowpath+0x8e/0x260
> > kernel: [<c035d961>] mutex_lock+0x21/0x30
> > kernel: [<c01a0286>] sysfs_dir_open+0x26/0x60
>
> This would be
>
> mutex_lock(&dentry->d_inode->i_mutex);
>
> in sysfs_dir_open(), so you would seem to have an inode that is
> overwritten by crud (or it could be a dentry with a bogus d_inode
> pointer, of course).
>
> There's a ton of stuff with "tern" in it, mostly "Internal", so sadly even
> the nice string pattern doesn't seem to be all that helpful.
>
> Is this somewhat reproducible?
>
> HOWEVER, google does find a clue. We've had reports of something somewhat
> similar before: googling for "__mutex_lock_slowpath" and "sysfs_dir_open"
> shows for example
>
> http://lkml.org/lkml/2006/8/18/72
>
> where the thread ended up first blaming sound, but then deciding that
> maybe it was DRM-related. However, you don't seem to have any AGP or DRM
> support at all, so maybe it really _is_ a sound problem.
>
> Mark - can you verify that you don't have any DRM support in your kernel?
>
> Frederic, Takashi, did that bug-report ever get resolved?
IIRC, in Frederik's case, it turned out to be irrelevant from the
sound stuff since the bug occured even before loading ALSA modules.
Of course, it still _might_ be a sound problem :)
It'd be nice if we can see what sysfs entry triggers bugs...
Takashi
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2006-11-06 15:13 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-10-31 17:19 2.6.19-rc3-git7: BUG: mutex warning sysfs_dir_open Mark Lord
2006-10-31 17:53 ` Linus Torvalds
2006-10-31 17:57 ` Mark Lord
2006-11-06 15:13 ` Takashi Iwai
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox