* Re: jffs2 filesystem: possible circular locking dependency detected
[not found] <D61182AC8012EA4EBC531B3AF23BE109B3C6C6@tranzeo-mail2.12stewart.tranzeo.com>
@ 2012-02-08 20:09 ` Thomas Gleixner
2012-02-11 6:57 ` Brian Norris
0 siblings, 1 reply; 3+ messages in thread
From: Thomas Gleixner @ 2012-02-08 20:09 UTC (permalink / raw)
To: Darcy Watkins; +Cc: Peter Zijlstra, David Woodhouse, linux-mtd, linux-rt-users
On Wed, 8 Feb 2012, Darcy Watkins wrote:
> [ 0.000000] Linux version 3.0.18-rt34 (darcy@tr-pentomino) (gcc version 4.4.6 (crosstool-NG 1.12.4) ) #41 PREEMPT RT Wed Feb 8 10:04:00 PST 2012
> [ 20.932000] =======================================================
> [ 20.932000] [ INFO: possible circular locking dependency detected ]
> [ 20.932000] 3.0.18-rt34 #41
> [ 20.932000] -------------------------------------------------------
> [ 20.932000] depmod/734 is trying to acquire lock:
> [ 20.932000] (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
> [ 20.932000]
> [ 20.932000] but task is already holding lock:
> [ 20.932000] (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> [ 20.932000]
> [ 20.932000] which lock already depends on the new lock.
> [ 20.932000]
> [ 20.932000]
> [ 20.932000] the existing dependency chain (in reverse order) is:
> [ 20.932000]
> [ 20.932000] -> #1 (&f->sem){+.+.+.}:
> [ 20.932000] [<800bae14>] lock_acquire+0x60/0x88
> [ 20.932000] [<802d3a84>] _mutex_lock+0x34/0x48
> [ 20.932000] [<80185754>] jffs2_readpage+0x24/0x54
> [ 20.932000] [<800d91e8>] __do_page_cache_readahead+0x274/0x2dc
> [ 20.932000] [<800d9278>] ra_submit+0x28/0x34
> [ 20.932000] [<800d1320>] filemap_fault+0x1a8/0x48c
> [ 20.932000] [<800e898c>] __do_fault+0x70/0x468
> [ 20.932000] [<800e9df8>] handle_pte_fault+0x388/0xd28
> [ 20.932000] [<800eaa44>] handle_mm_fault+0xf4/0x11c
> [ 20.932000] [<8006c230>] do_page_fault+0x110/0x300
> [ 20.932000] [<80062c04>] ret_from_exception+0x0/0x10
> [ 20.932000] [<801c3cb4>] __bzero+0x38/0x164
> [ 20.932000] [<8014121c>] padzero+0x58/0x84
> [ 20.932000] [<80142b18>] load_elf_binary+0x774/0x12fc
> [ 20.932000] [<80102c60>] search_binary_handler+0xec/0x318
> [ 20.932000] [<801044c4>] do_execve+0x158/0x264
> [ 20.932000] [<800670a8>] sys_execve+0x44/0x6c
> [ 20.932000] [<8006a3b8>] stack_done+0x20/0x40
> [ 20.932000]
> [ 20.932000] -> #0 (&mm->mmap_sem){++++++}:
> [ 20.932000] [<800ba1f4>] __lock_acquire+0x1228/0x1934
> [ 20.932000] [<800bae14>] lock_acquire+0x60/0x88
> [ 20.932000] [<800e82f8>] might_fault+0x74/0xa4
> [ 20.932000] [<8010dd80>] filldir64+0xe0/0x144
> [ 20.932000] [<80184fe4>] jffs2_readdir+0x164/0x1c0
> [ 20.932000] [<8010e070>] vfs_readdir+0x74/0xcc
> [ 20.932000] [<8010e13c>] sys_getdents64+0x74/0xd8
> [ 20.932000] [<8006a3b8>] stack_done+0x20/0x40
> [ 20.932000]
> [ 20.932000] other info that might help us debug this:
> [ 20.932000]
> [ 20.932000] Possible unsafe locking scenario:
> [ 20.932000]
> [ 20.932000] CPU0 CPU1
> [ 20.932000] ---- ----
> [ 20.932000] lock(&f->sem);
> [ 20.932000] lock(&mm->mmap_sem);
> [ 20.932000] lock(&f->sem);
> [ 20.932000] lock(&mm->mmap_sem);
> [ 20.932000]
> [ 20.932000] *** DEADLOCK ***
> [ 20.932000]
> [ 20.932000] 2 locks held by depmod/734:
> [ 20.932000] #0: (&sb->s_type->i_mutex_key#3){+.+.+.}, at: [<8010e044>] vfs_readdir+0x48/0xcc
> [ 20.932000] #1: (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> [ 20.932000]
> [ 20.932000] stack backtrace:
> [ 20.932000] Call Trace:
> [ 20.932000] [<802d103c>] dump_stack+0x8/0x34
> [ 20.932000] [<800b8960>] print_circular_bug+0x2bc/0x2e8
> [ 20.932000] [<800ba1f4>] __lock_acquire+0x1228/0x1934
> [ 20.932000] [<800bae14>] lock_acquire+0x60/0x88
> [ 20.932000] [<800e82f8>] might_fault+0x74/0xa4
> [ 20.932000] [<8010dd80>] filldir64+0xe0/0x144
> [ 20.932000] [<80184fe4>] jffs2_readdir+0x164/0x1c0
> [ 20.932000] [<8010e070>] vfs_readdir+0x74/0xcc
> [ 20.932000] [<8010e13c>] sys_getdents64+0x74/0xd8
> [ 20.932000] [<8006a3b8>] stack_done+0x20/0x40
Classic ABBA deadlock. I don't think it's RT specific, but I might be
wrong as usual. Will have a look later this week, when noone beats me.
Thanks,
tglx
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: jffs2 filesystem: possible circular locking dependency detected
2012-02-08 20:09 ` jffs2 filesystem: possible circular locking dependency detected Thomas Gleixner
@ 2012-02-11 6:57 ` Brian Norris
2012-02-13 20:02 ` Josh Cartwright
0 siblings, 1 reply; 3+ messages in thread
From: Brian Norris @ 2012-02-11 6:57 UTC (permalink / raw)
To: Thomas Gleixner
Cc: linux-rt-users, Darcy Watkins, Peter Zijlstra, linux-mtd,
Josh Cartwright, David Woodhouse
On Wed, Feb 8, 2012 at 12:09 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> On Wed, 8 Feb 2012, Darcy Watkins wrote:
>> [ 20.932000] =======================================================
>> [ 20.932000] [ INFO: possible circular locking dependency detected ]
>> [ 20.932000] 3.0.18-rt34 #41
>> [ 20.932000] -------------------------------------------------------
>> [ 20.932000] depmod/734 is trying to acquire lock:
>> [ 20.932000] (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
>> [ 20.932000]
>> [ 20.932000] but task is already holding lock:
>> [ 20.932000] (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
>> [ 20.932000]
>> [ 20.932000] which lock already depends on the new lock.
>
> Classic ABBA deadlock. I don't think it's RT specific, but I might be
> wrong as usual. Will have a look later this week, when noone beats me.
Looks like someone beat you :) Josh Cartwright has a patch here:
http://lists.infradead.org/pipermail/linux-mtd/2012-February/039787.html
Testing is always helpful to the MTD maintainers!
Brian
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: jffs2 filesystem: possible circular locking dependency detected
2012-02-11 6:57 ` Brian Norris
@ 2012-02-13 20:02 ` Josh Cartwright
0 siblings, 0 replies; 3+ messages in thread
From: Josh Cartwright @ 2012-02-13 20:02 UTC (permalink / raw)
To: Brian Norris
Cc: linux-rt-users, Darcy Watkins, Peter Zijlstra, linux-mtd,
Thomas Gleixner, David Woodhouse
On Fri, Feb 10, 2012 at 10:57:54PM -0800, Brian Norris wrote:
> On Wed, Feb 8, 2012 at 12:09 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> > On Wed, 8 Feb 2012, Darcy Watkins wrote:
> >> [ 20.932000] =======================================================
> >> [ 20.932000] [ INFO: possible circular locking dependency detected ]
> >> [ 20.932000] 3.0.18-rt34 #41
> >> [ 20.932000] -------------------------------------------------------
> >> [ 20.932000] depmod/734 is trying to acquire lock:
> >> [ 20.932000] (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
> >> [ 20.932000]
> >> [ 20.932000] but task is already holding lock:
> >> [ 20.932000] (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> >> [ 20.932000]
> >> [ 20.932000] which lock already depends on the new lock.
> >
> > Classic ABBA deadlock. I don't think it's RT specific, but I might be
> > wrong as usual. Will have a look later this week, when noone beats me.
>
> Looks like someone beat you :) Josh Cartwright has a patch here:
> http://lists.infradead.org/pipermail/linux-mtd/2012-February/039787.html
Unfortunately, Darcy's lockdep splat implicates a different set of
locks, so I think it is a different issue then I resolved in the linked
patch.
Looking into this one, however, I think I convinced myself that the
lockdep warning is bogus. Here are two stack snippets that lockdep
claims would be problematic if interleaved:
do_page_fault()
down_read(¤t->mm->mmap_sem)
/* readahead... */
jffs2_readpage()
mutex_lock(&JFFS2_INODE_INFO(inode)->sem)
vfs_readdir()
/* ... */
jffs2_readdir()
mutex_lock(&JFFS2_INODE_INFO(inode)->sem)
filldir()
__put_user()
/* fault ... */
do_page_fault()
down_read(¤t->mm->mmap_sem)
In Darcy's case, the validator saw the do_page_fault() segment first,
and decided the lock order should be [mmap_sem, &JFFS2_INODE_INFO(inode)->sem].
It complained when it then saw the vfs_readdir() codepath reverse the
order [1].
This would be problematic, if it wasn't for the guarantee that the
jffs2_inode_info::sem in both paths will be different. In the readdir()
path, the inode is the directory inode, whose i_fops doesn't even
support mmap(), and so couldn't possibly be involved in a fault().
1: Well, not exactly the same codepath, since a fault was not generated.
put_user() includes a might_fault() which hints to lockdep that
mmap_sem _could_ be acquired if a fault occurs.
--
joshc
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2012-02-13 20:03 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <D61182AC8012EA4EBC531B3AF23BE109B3C6C6@tranzeo-mail2.12stewart.tranzeo.com>
2012-02-08 20:09 ` jffs2 filesystem: possible circular locking dependency detected Thomas Gleixner
2012-02-11 6:57 ` Brian Norris
2012-02-13 20:02 ` Josh Cartwright
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox