Re: jffs2 filesystem: possible circular locking dependency detected

public inbox for linux-mtd@lists.infradead.org
 help / color / mirror / Atom feed

* Re: jffs2 filesystem: possible circular locking dependency detected
       [not found] <D61182AC8012EA4EBC531B3AF23BE109B3C6C6@tranzeo-mail2.12stewart.tranzeo.com>
@ 2012-02-08 20:09 ` Thomas Gleixner
  2012-02-11  6:57   ` Brian Norris
  0 siblings, 1 reply; 3+ messages in thread
From: Thomas Gleixner @ 2012-02-08 20:09 UTC (permalink / raw)
  To: Darcy Watkins; +Cc: Peter Zijlstra, David Woodhouse, linux-mtd, linux-rt-users

On Wed, 8 Feb 2012, Darcy Watkins wrote:
> [    0.000000] Linux version 3.0.18-rt34 (darcy@tr-pentomino) (gcc version 4.4.6 (crosstool-NG 1.12.4) ) #41 PREEMPT RT Wed Feb 8 10:04:00 PST 2012
> [   20.932000] =======================================================
> [   20.932000] [ INFO: possible circular locking dependency detected ]
> [   20.932000] 3.0.18-rt34 #41
> [   20.932000] -------------------------------------------------------
> [   20.932000] depmod/734 is trying to acquire lock:
> [   20.932000]  (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
> [   20.932000]
> [   20.932000] but task is already holding lock:
> [   20.932000]  (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> [   20.932000]
> [   20.932000] which lock already depends on the new lock.
> [   20.932000]
> [   20.932000]
> [   20.932000] the existing dependency chain (in reverse order) is:
> [   20.932000]
> [   20.932000] -> #1 (&f->sem){+.+.+.}:
> [   20.932000]        [<800bae14>] lock_acquire+0x60/0x88
> [   20.932000]        [<802d3a84>] _mutex_lock+0x34/0x48
> [   20.932000]        [<80185754>] jffs2_readpage+0x24/0x54
> [   20.932000]        [<800d91e8>] __do_page_cache_readahead+0x274/0x2dc
> [   20.932000]        [<800d9278>] ra_submit+0x28/0x34
> [   20.932000]        [<800d1320>] filemap_fault+0x1a8/0x48c
> [   20.932000]        [<800e898c>] __do_fault+0x70/0x468
> [   20.932000]        [<800e9df8>] handle_pte_fault+0x388/0xd28
> [   20.932000]        [<800eaa44>] handle_mm_fault+0xf4/0x11c
> [   20.932000]        [<8006c230>] do_page_fault+0x110/0x300
> [   20.932000]        [<80062c04>] ret_from_exception+0x0/0x10
> [   20.932000]        [<801c3cb4>] __bzero+0x38/0x164
> [   20.932000]        [<8014121c>] padzero+0x58/0x84
> [   20.932000]        [<80142b18>] load_elf_binary+0x774/0x12fc
> [   20.932000]        [<80102c60>] search_binary_handler+0xec/0x318
> [   20.932000]        [<801044c4>] do_execve+0x158/0x264
> [   20.932000]        [<800670a8>] sys_execve+0x44/0x6c
> [   20.932000]        [<8006a3b8>] stack_done+0x20/0x40
> [   20.932000]
> [   20.932000] -> #0 (&mm->mmap_sem){++++++}:
> [   20.932000]        [<800ba1f4>] __lock_acquire+0x1228/0x1934
> [   20.932000]        [<800bae14>] lock_acquire+0x60/0x88
> [   20.932000]        [<800e82f8>] might_fault+0x74/0xa4
> [   20.932000]        [<8010dd80>] filldir64+0xe0/0x144
> [   20.932000]        [<80184fe4>] jffs2_readdir+0x164/0x1c0
> [   20.932000]        [<8010e070>] vfs_readdir+0x74/0xcc
> [   20.932000]        [<8010e13c>] sys_getdents64+0x74/0xd8
> [   20.932000]        [<8006a3b8>] stack_done+0x20/0x40
> [   20.932000]
> [   20.932000] other info that might help us debug this:
> [   20.932000]
> [   20.932000]  Possible unsafe locking scenario:
> [   20.932000]
> [   20.932000]        CPU0                    CPU1
> [   20.932000]        ----                    ----
> [   20.932000]   lock(&f->sem);
> [   20.932000]                                lock(&mm->mmap_sem);
> [   20.932000]                                lock(&f->sem);
> [   20.932000]   lock(&mm->mmap_sem);
> [   20.932000]
> [   20.932000]  *** DEADLOCK ***
> [   20.932000]
> [   20.932000] 2 locks held by depmod/734:
> [   20.932000]  #0:  (&sb->s_type->i_mutex_key#3){+.+.+.}, at: [<8010e044>] vfs_readdir+0x48/0xcc
> [   20.932000]  #1:  (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> [   20.932000]
> [   20.932000] stack backtrace:
> [   20.932000] Call Trace:
> [   20.932000] [<802d103c>] dump_stack+0x8/0x34
> [   20.932000] [<800b8960>] print_circular_bug+0x2bc/0x2e8
> [   20.932000] [<800ba1f4>] __lock_acquire+0x1228/0x1934
> [   20.932000] [<800bae14>] lock_acquire+0x60/0x88
> [   20.932000] [<800e82f8>] might_fault+0x74/0xa4
> [   20.932000] [<8010dd80>] filldir64+0xe0/0x144
> [   20.932000] [<80184fe4>] jffs2_readdir+0x164/0x1c0
> [   20.932000] [<8010e070>] vfs_readdir+0x74/0xcc
> [   20.932000] [<8010e13c>] sys_getdents64+0x74/0xd8
> [   20.932000] [<8006a3b8>] stack_done+0x20/0x40

Classic ABBA deadlock. I don't think it's RT specific, but I might be
wrong as usual. Will have a look later this week, when noone beats me.

Thanks,

	tglx

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: jffs2 filesystem: possible circular locking dependency detected
  2012-02-08 20:09 ` jffs2 filesystem: possible circular locking dependency detected Thomas Gleixner
@ 2012-02-11  6:57   ` Brian Norris
  2012-02-13 20:02     ` Josh Cartwright
  0 siblings, 1 reply; 3+ messages in thread
From: Brian Norris @ 2012-02-11  6:57 UTC (permalink / raw)
  To: Thomas Gleixner
  Cc: linux-rt-users, Darcy Watkins, Peter Zijlstra, linux-mtd,
	Josh Cartwright, David Woodhouse

On Wed, Feb 8, 2012 at 12:09 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> On Wed, 8 Feb 2012, Darcy Watkins wrote:
>> [   20.932000] =======================================================
>> [   20.932000] [ INFO: possible circular locking dependency detected ]
>> [   20.932000] 3.0.18-rt34 #41
>> [   20.932000] -------------------------------------------------------
>> [   20.932000] depmod/734 is trying to acquire lock:
>> [   20.932000]  (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
>> [   20.932000]
>> [   20.932000] but task is already holding lock:
>> [   20.932000]  (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
>> [   20.932000]
>> [   20.932000] which lock already depends on the new lock.
>
> Classic ABBA deadlock. I don't think it's RT specific, but I might be
> wrong as usual. Will have a look later this week, when noone beats me.

Looks like someone beat you :) Josh Cartwright has a patch here:
http://lists.infradead.org/pipermail/linux-mtd/2012-February/039787.html

Testing is always helpful to the MTD maintainers!

Brian

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: jffs2 filesystem: possible circular locking dependency detected
  2012-02-11  6:57   ` Brian Norris
@ 2012-02-13 20:02     ` Josh Cartwright
  0 siblings, 0 replies; 3+ messages in thread
From: Josh Cartwright @ 2012-02-13 20:02 UTC (permalink / raw)
  To: Brian Norris
  Cc: linux-rt-users, Darcy Watkins, Peter Zijlstra, linux-mtd,
	Thomas Gleixner, David Woodhouse

On Fri, Feb 10, 2012 at 10:57:54PM -0800, Brian Norris wrote:
> On Wed, Feb 8, 2012 at 12:09 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> > On Wed, 8 Feb 2012, Darcy Watkins wrote:
> >> [   20.932000] =======================================================
> >> [   20.932000] [ INFO: possible circular locking dependency detected ]
> >> [   20.932000] 3.0.18-rt34 #41
> >> [   20.932000] -------------------------------------------------------
> >> [   20.932000] depmod/734 is trying to acquire lock:
> >> [   20.932000]  (&mm->mmap_sem){++++++}, at: [<800e82d0>] might_fault+0x4c/0xa4
> >> [   20.932000]
> >> [   20.932000] but task is already holding lock:
> >> [   20.932000]  (&f->sem){+.+.+.}, at: [<80184f88>] jffs2_readdir+0x108/0x1c0
> >> [   20.932000]
> >> [   20.932000] which lock already depends on the new lock.
> >
> > Classic ABBA deadlock. I don't think it's RT specific, but I might be
> > wrong as usual. Will have a look later this week, when noone beats me.
> 
> Looks like someone beat you :) Josh Cartwright has a patch here:
> http://lists.infradead.org/pipermail/linux-mtd/2012-February/039787.html

Unfortunately, Darcy's lockdep splat implicates a different set of
locks, so I think it is a different issue then I resolved in the linked
patch.

Looking into this one, however, I think I convinced myself that the
lockdep warning is bogus.  Here are two stack snippets that lockdep
claims would be problematic if interleaved:

do_page_fault()
   down_read(&current->mm->mmap_sem)
   /* readahead... */
      jffs2_readpage()
         mutex_lock(&JFFS2_INODE_INFO(inode)->sem)

vfs_readdir()
  /* ... */
  jffs2_readdir()
     mutex_lock(&JFFS2_INODE_INFO(inode)->sem)
     filldir()
        __put_user()
        /* fault ... */
           do_page_fault()
              down_read(&current->mm->mmap_sem)

In Darcy's case, the validator saw the do_page_fault() segment first,
and decided the lock order should be [mmap_sem, &JFFS2_INODE_INFO(inode)->sem].
It complained when it then saw the vfs_readdir() codepath reverse the
order [1].

This would be problematic, if it wasn't for the guarantee that the
jffs2_inode_info::sem in both paths will be different.  In the readdir()
path, the inode is the directory inode, whose i_fops doesn't even
support mmap(), and so couldn't possibly be involved in a fault().

1: Well, not exactly the same codepath, since a fault was not generated.
   put_user() includes a might_fault() which hints to lockdep that
   mmap_sem _could_ be acquired if a fault occurs.

-- 
                                           joshc

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2012-02-13 20:03 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <D61182AC8012EA4EBC531B3AF23BE109B3C6C6@tranzeo-mail2.12stewart.tranzeo.com>
2012-02-08 20:09 ` jffs2 filesystem: possible circular locking dependency detected Thomas Gleixner
2012-02-11  6:57   ` Brian Norris
2012-02-13 20:02     ` Josh Cartwright

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox