[BUG] kernel 3.1.0 possible circular locking dependency detected

reiserfs-devel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [BUG] kernel 3.1.0 possible circular locking dependency detected
@ 2011-10-31  8:35 Knut Petersen
  2011-10-31 15:08 ` Linus Torvalds
  0 siblings, 1 reply; 6+ messages in thread
From: Knut Petersen @ 2011-10-31  8:35 UTC (permalink / raw)
  To: linux-kernel; +Cc: reiserfs-devel, Linus Torvalds, Greg KH

After a " rm -r /verybigdir" (about 12G on a 25G reiserfs 3.6partition)
I found the following report about a circular locking dependency in
kernel 3.1.0

[  337.064044]
[  337.064046] =======================================================
[  337.064059] [ INFO: possible circular locking dependency detected ]
[  337.064069] 3.1.0-main #18
[  337.064074] -------------------------------------------------------
[  337.064083] rm/4340 is trying to acquire lock:
[  337.064090]  (&sb->s_type->i_mutex_key#12/2){+.+.+.}, at: [<c01f73d7>] xattr_unlink+0x3a/0x6d
[  337.064114]
[  337.064115] but task is already holding lock:
[  337.064123]  (&sb->s_type->i_mutex_key#12/3){+.+.+.}, at: [<c01f7884>] reiserfs_for_each_xattr+0x9e/0x224
[  337.064143]
[  337.064144] which lock already depends on the new lock.
[  337.064146]
[  337.064156]
[  337.064157] the existing dependency chain (in reverse order) is:
[  337.064167]
[  337.064168] -> #1 (&sb->s_type->i_mutex_key#12/3){+.+.+.}:
[  337.064184]        [<c0149c32>] lock_acquire+0x47/0x5e
[  337.064195]        [<c04360df>] mutex_lock_nested+0x35/0x26f
[  337.064208]        [<c01f7603>] open_xa_dir+0x3d/0x150
[  337.064218]        [<c01f772a>] xattr_lookup+0x14/0xd0
[  337.064228]        [<c01f7f50>] reiserfs_xattr_get+0x4c/0x21b
[  337.064238]        [<c01f88f4>] security_get+0x3e/0x46
[  337.064248]        [<c01f817c>] reiserfs_getxattr+0x5d/0x7a
[  337.064258]        [<c02414e8>] cap_inode_need_killpriv+0x1e/0x2d
[  337.064272]        [<c024264c>] security_inode_need_killpriv+0xf/0x11
[  337.064284]        [<c016dc16>] file_remove_suid+0x27/0x71
[  337.064296]        [<c01ae38b>] generic_file_splice_write+0x86/0x121
[  337.064310]        [<c01adfa9>] do_splice_from+0x58/0x62
[  337.064320]        [<c01adfca>] direct_splice_actor+0x17/0x1c
[  337.064331]        [<c01ae256>] splice_direct_to_actor+0xbf/0x16e
[  337.064342]        [<c01af16c>] do_splice_direct+0x4b/0x62
[  337.064353]        [<c0192e02>] do_sendfile+0x159/0x1f1
[  337.064365]        [<c01938d7>] sys_sendfile64+0x3f/0x80
[  337.064375]        [<c043b10c>] sysenter_do_call+0x12/0x32
[  337.064388]
[  337.064389] -> #0 (&sb->s_type->i_mutex_key#12/2){+.+.+.}:
[  337.064404]        [<c01492ca>] __lock_acquire+0xee6/0x1472
[  337.064416]        [<c0149c32>] lock_acquire+0x47/0x5e
[  337.064426]        [<c04360df>] mutex_lock_nested+0x35/0x26f
[  337.064436]        [<c01f73d7>] xattr_unlink+0x3a/0x6d
[  337.064446]        [<c01f7499>] delete_one_xattr+0x8f/0x99
[  337.064456]        [<c01f78cf>] reiserfs_for_each_xattr+0xe9/0x224
[  337.064467]        [<c01f7a1f>] reiserfs_delete_xattrs+0x15/0x3f
[  337.064477]        [<c01dfba5>] reiserfs_evict_inode+0x7f/0x115
[  337.064490]        [<c01a4f27>] evict+0x85/0x126
[  337.064500]        [<c01a5109>] iput+0x141/0x146
[  337.064509]        [<c019d6d9>] do_unlinkat+0xf1/0x136
[  337.064520]        [<c019df1f>] sys_unlinkat+0x2b/0x32
[  337.064530]        [<c043b10c>] sysenter_do_call+0x12/0x32
[  337.064541]
[  337.064542] other info that might help us debug this:
[  337.064544]
[  337.064570]  Possible unsafe locking scenario:
[  337.064572]
[  337.064588]        CPU0                    CPU1
[  337.064599]        ----                    ----
[  337.064609]   lock(&sb->s_type->i_mutex_key);
[  337.064623]                                lock(&sb->s_type->i_mutex_key);
[  337.064638]                                lock(&sb->s_type->i_mutex_key);
[  337.064654]   lock(&sb->s_type->i_mutex_key);
[  337.064667]
[  337.064668]  *** DEADLOCK ***
[  337.064669]
[  337.064690] 1 lock held by rm/4340:
[  337.064700]  #0:  (&sb->s_type->i_mutex_key#12/3){+.+.+.}, at: [<c01f7884>] reiserfs_for_each_xattr+0x9e/0x224
[  337.064725]
[  337.064726] stack backtrace:
[  337.064743] Pid: 4340, comm: rm Not tainted 3.1.0-main #18
[  337.064755] Call Trace:
[  337.064767]  [<c0434964>] ? printk+0xf/0x13
[  337.064781]  [<c014732b>] print_circular_bug+0x215/0x222
[  337.064796]  [<c01492ca>] __lock_acquire+0xee6/0x1472
[  337.064811]  [<c0144a54>] ? tick_dev_program_event+0x24/0x105
[  337.064826]  [<c0149c32>] lock_acquire+0x47/0x5e
[  337.064839]  [<c01f73d7>] ? xattr_unlink+0x3a/0x6d
[  337.064853]  [<c01f73d7>] ? xattr_unlink+0x3a/0x6d
[  337.064867]  [<c04360df>] mutex_lock_nested+0x35/0x26f
[  337.064881]  [<c01f73d7>] ? xattr_unlink+0x3a/0x6d
[  337.064894]  [<c01f73d7>] xattr_unlink+0x3a/0x6d
[  337.064908]  [<c01f7499>] delete_one_xattr+0x8f/0x99
[  337.064921]  [<c01f78cf>] reiserfs_for_each_xattr+0xe9/0x224
[  337.064936]  [<c01f740a>] ? xattr_unlink+0x6d/0x6d
[  337.064950]  [<c0439a6f>] ? sub_preempt_count+0x81/0x8e
[  337.064965]  [<c04362fe>] ? mutex_lock_nested+0x254/0x26f
[  337.064980]  [<c01f7a1f>] reiserfs_delete_xattrs+0x15/0x3f
[  337.064994]  [<c01dfba5>] reiserfs_evict_inode+0x7f/0x115
[  337.065009]  [<c0120da4>] ? get_parent_ip+0xb/0x31
[  337.065023]  [<c0439a6f>] ? sub_preempt_count+0x81/0x8e
[  337.065037]  [<c01a4f27>] evict+0x85/0x126
[  337.065050]  [<c01a5109>] iput+0x141/0x146
[  337.065063]  [<c019d6d9>] do_unlinkat+0xf1/0x136
[  337.065078]  [<c01bb261>] ? dnotify_flush+0x2c/0xa6
[  337.065092]  [<c043b13b>] ? sysenter_exit+0xf/0x16
[  337.065107]  [<c019df1f>] sys_unlinkat+0x2b/0x32
[  337.065120]  [<c043b10c>] sysenter_do_call+0x12/0x32


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG] kernel 3.1.0 possible circular locking dependency detected
  2011-10-31  8:35 [BUG] kernel 3.1.0 possible circular locking dependency detected Knut Petersen
@ 2011-10-31 15:08 ` Linus Torvalds
  2011-10-31 15:59   ` Knut Petersen
                     ` (2 more replies)
  0 siblings, 3 replies; 6+ messages in thread
From: Linus Torvalds @ 2011-10-31 15:08 UTC (permalink / raw)
  To: Knut Petersen
  Cc: linux-kernel, reiserfs-devel, Greg KH, Al Viro, Christoph Hellwig,
	Frederic Weisbecker, Peter Zijlstra

[ Added a few more people to the cc ]

On Mon, Oct 31, 2011 at 1:35 AM, Knut Petersen
<Knut_Petersen@t-online.de> wrote:
> After a " rm -r /verybigdir" (about 12G on a 25G reiserfs 3.6partition)
> I found the following report about a circular locking dependency in
> kernel 3.1.0

Heh. There is even a comment about the ordering violation:

/* We use I_MUTEX_CHILD here to silence lockdep. It's safe because xattr
 * mutation ops aren't called during rename or splace, which are the
 * only other users of I_MUTEX_CHILD. It violates the ordering, but that's
 * better than allocating another subclass just for this code. */

and apparently the comment is wrong: we *do* end up looking up xattrs
during splice, due to the security_inode_need_killpriv() thing.

So I think this needs a suid (or sgid) file that has xattrs and is removed.

That said, I suspect this is a false positive, because the actual
unlink can never happen while somebody is splicing to/from the same
file at the same time (because then the iput wouldn't be the last one
for the inode, and the file removal would be delayed until the file
has been closed for the last time).

But the hacky use of "I_MUTEX_CHILD" is basically not the proper way
to silence the lockdep splat.

Anybody?

                  Linus

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG] kernel 3.1.0 possible circular locking dependency detected
  2011-10-31 15:08 ` Linus Torvalds
@ 2011-10-31 15:59   ` Knut Petersen
  2011-11-07 17:18   ` Peter Zijlstra
  2011-11-15 13:59   ` kernel 3.1.1 / 3.1.0 reiserfs locking problems Knut Petersen
  2 siblings, 0 replies; 6+ messages in thread
From: Knut Petersen @ 2011-10-31 15:59 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: linux-kernel, reiserfs-devel, Greg KH, Al Viro, Christoph Hellwig,
	Frederic Weisbecker, Peter Zijlstra

Am 31.10.2011 16:08, schrieb Linus Torvalds:
> [ Added a few more people to the cc ]
>
> On Mon, Oct 31, 2011 at 1:35 AM, Knut Petersen
> <Knut_Petersen@t-online.de>  wrote:
>> After a " rm -r /verybigdir" (about 12G on a 25G reiserfs 3.6partition)
>> I found the following report about a circular locking dependency in
>> kernel 3.1.0
> Heh. There is even a comment about the ordering violation:
>
> /* We use I_MUTEX_CHILD here to silence lockdep. It's safe because xattr
>   * mutation ops aren't called during rename or splace, which are the
>   * only other users of I_MUTEX_CHILD. It violates the ordering, but that's
>   * better than allocating another subclass just for this code. */
>
> and apparently the comment is wrong: we *do* end up looking up xattrs
> during splice, due to the security_inode_need_killpriv() thing.
>
> So I think this needs a suid (or sgid) file that has xattrs and is removed.

Well, after rm -r  /some_small_dir_with_suid_and_sgid_files
there was no warning in dmesg.

I restored a copy of /verybigdir and searched for sgid/suid files with
find /test -type f -perm +6000 -exec ls -l {} \;

Result: not a singe suid/sgid file in /verybigdir

But  rm -r /verybigdir triggered the warning again ...

knut

> That said, I suspect this is a false positive, because the actual
> unlink can never happen while somebody is splicing to/from the same
> file at the same time (because then the iput wouldn't be the last one
> for the inode, and the file removal would be delayed until the file
> has been closed for the last time).
>
> But the hacky use of "I_MUTEX_CHILD" is basically not the proper way
> to silence the lockdep splat.
>
> Anybody?
>
>                    Linus
>


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [BUG] kernel 3.1.0 possible circular locking dependency detected
  2011-10-31 15:08 ` Linus Torvalds
  2011-10-31 15:59   ` Knut Petersen
@ 2011-11-07 17:18   ` Peter Zijlstra
  2011-11-15 13:59   ` kernel 3.1.1 / 3.1.0 reiserfs locking problems Knut Petersen
  2 siblings, 0 replies; 6+ messages in thread
From: Peter Zijlstra @ 2011-11-07 17:18 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Knut Petersen, linux-kernel, reiserfs-devel, Greg KH, Al Viro,
	Christoph Hellwig, Frederic Weisbecker, Jeff Mahoney

On Mon, 2011-10-31 at 08:08 -0700, Linus Torvalds wrote:
> [ Added a few more people to the cc ]
> 
> On Mon, Oct 31, 2011 at 1:35 AM, Knut Petersen
> <Knut_Petersen@t-online.de> wrote:
> > After a " rm -r /verybigdir" (about 12G on a 25G reiserfs 3.6partition)
> > I found the following report about a circular locking dependency in
> > kernel 3.1.0
> 
> Heh. There is even a comment about the ordering violation:
> 
> /* We use I_MUTEX_CHILD here to silence lockdep. It's safe because xattr
>  * mutation ops aren't called during rename or splace, which are the
>  * only other users of I_MUTEX_CHILD. It violates the ordering, but that's
>  * better than allocating another subclass just for this code. */
> 
> and apparently the comment is wrong: we *do* end up looking up xattrs
> during splice, due to the security_inode_need_killpriv() thing.
> 
> So I think this needs a suid (or sgid) file that has xattrs and is removed.
> 
> That said, I suspect this is a false positive, because the actual
> unlink can never happen while somebody is splicing to/from the same
> file at the same time (because then the iput wouldn't be the last one
> for the inode, and the file removal would be delayed until the file
> has been closed for the last time).
> 
> But the hacky use of "I_MUTEX_CHILD" is basically not the proper way
> to silence the lockdep splat.
> 
> Anybody?

I_MUTEX_XATTR sounds like the right nesting for something called
xattr_*() but then, what do I know about filesystems.. Jeff Mahoney
wrote this, Jeff any clue?

^ permalink raw reply	[flat|nested] 6+ messages in thread

* kernel 3.1.1 / 3.1.0 reiserfs locking problems
  2011-10-31 15:08 ` Linus Torvalds
  2011-10-31 15:59   ` Knut Petersen
  2011-11-07 17:18   ` Peter Zijlstra
@ 2011-11-15 13:59   ` Knut Petersen
  2011-11-15 18:15     ` Frederic Weisbecker
  2 siblings, 1 reply; 6+ messages in thread
From: Knut Petersen @ 2011-11-15 13:59 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: linux-kernel, reiserfs-devel, Greg KH, Al Viro, Christoph Hellwig,
	Frederic Weisbecker, Peter Zijlstra, Jeff Mahoney

Am 31.10.2011 16:08, schrieb Linus Torvalds:

With kernel 3.1.1 there is another reiserfs related lock probleme:

Nov 15 11:37:27 golem kernel: [ 1986.896976]
Nov 15 11:37:27 golem kernel: [ 1986.896979] =================================
Nov 15 11:37:27 golem kernel: [ 1986.896990] [ INFO: inconsistent lock state ]
Nov 15 11:37:27 golem kernel: [ 1986.896997] 3.1.1-main #8
Nov 15 11:37:27 golem kernel: [ 1986.897001] ---------------------------------
Nov 15 11:37:27 golem kernel: [ 1986.897007] inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage.
Nov 15 11:37:27 golem kernel: [ 1986.897016] kswapd0/16 [HC0[0]:SC0[0]:HE1:SE1] takes:
Nov 15 11:37:27 golem kernel: [ 1986.897023]  (&REISERFS_SB(s)->lock){+.+.?.}, at: [<c01f8bd4>] reiserfs_write_lock+0x20/0x2a
Nov 15 11:37:27 golem kernel: [ 1986.897044] {RECLAIM_FS-ON-W} state was registered at:
Nov 15 11:37:27 golem kernel: [ 1986.897050]   [<c014a5b9>] mark_held_locks+0xae/0xd0
Nov 15 11:37:27 golem kernel: [ 1986.897060]   [<c014aab3>] lockdep_trace_alloc+0x7d/0x91
Nov 15 11:37:27 golem kernel: [ 1986.897068]   [<c0190ee0>] kmem_cache_alloc+0x1a/0x93
Nov 15 11:37:27 golem kernel: [ 1986.897078]   [<c01e7728>] reiserfs_alloc_inode+0x13/0x3d
Nov 15 11:37:27 golem kernel: [ 1986.897088]   [<c01a5b06>] alloc_inode+0x14/0x5f
Nov 15 11:37:27 golem kernel: [ 1986.897097]   [<c01a5cb9>] iget5_locked+0x62/0x13a
Nov 15 11:37:27 golem kernel: [ 1986.897106]   [<c01e99e0>] reiserfs_fill_super+0x410/0x8b9
Nov 15 11:37:27 golem kernel: [ 1986.897114]   [<c01953da>] mount_bdev+0x10b/0x159
Nov 15 11:37:27 golem kernel: [ 1986.897123]   [<c01e764d>] get_super_block+0x10/0x12
Nov 15 11:37:27 golem kernel: [ 1986.897131]   [<c0195b38>] mount_fs+0x59/0x12d
Nov 15 11:37:27 golem kernel: [ 1986.897138]   [<c01a80d1>] vfs_kern_mount+0x45/0x7a
Nov 15 11:37:27 golem kernel: [ 1986.897147]   [<c01a83e3>] do_kern_mount+0x2f/0xb0
Nov 15 11:37:27 golem kernel: [ 1986.897155]   [<c01a987a>] do_mount+0x5c2/0x612
Nov 15 11:37:27 golem kernel: [ 1986.897163]   [<c01a9a72>] sys_mount+0x61/0x8f
Nov 15 11:37:27 golem kernel: [ 1986.897170]   [<c044060c>] sysenter_do_call+0x12/0x32
Nov 15 11:37:27 golem kernel: [ 1986.897181] irq event stamp: 7509691
Nov 15 11:37:27 golem kernel: [ 1986.897186] hardirqs last  enabled at (7509691): [<c0190f34>] kmem_cache_alloc+0x6e/0x93
Nov 15 11:37:27 golem kernel: [ 1986.897197] hardirqs last disabled at (7509690): [<c0190eea>] kmem_cache_alloc+0x24/0x93
Nov 15 11:37:27 golem kernel: [ 1986.897209] softirqs last  enabled at (7508896): [<c01294bd>] __do_softirq+0xee/0xfd
Nov 15 11:37:27 golem kernel: [ 1986.897222] softirqs last disabled at (7508859): [<c01030ed>] do_softirq+0x50/0x9d
Nov 15 11:37:27 golem kernel: [ 1986.897234]
Nov 15 11:37:27 golem kernel: [ 1986.897235] other info that might help us debug this:
Nov 15 11:37:27 golem kernel: [ 1986.897242]  Possible unsafe locking scenario:
Nov 15 11:37:27 golem kernel: [ 1986.897244]
Nov 15 11:37:27 golem kernel: [ 1986.897250]        CPU0
Nov 15 11:37:27 golem kernel: [ 1986.897254]        ----
Nov 15 11:37:27 golem kernel: [ 1986.897257]   lock(&REISERFS_SB(s)->lock);
Nov 15 11:37:27 golem kernel: [ 1986.897265] <Interrupt>
Nov 15 11:37:27 golem kernel: [ 1986.897269]     lock(&REISERFS_SB(s)->lock);
Nov 15 11:37:27 golem kernel: [ 1986.897276]
Nov 15 11:37:27 golem kernel: [ 1986.897277]  *** DEADLOCK ***
Nov 15 11:37:27 golem kernel: [ 1986.897278]
Nov 15 11:37:27 golem kernel: [ 1986.897286] no locks held by kswapd0/16.
Nov 15 11:37:27 golem kernel: [ 1986.897291]
Nov 15 11:37:27 golem kernel: [ 1986.897292] stack backtrace:
Nov 15 11:37:27 golem kernel: [ 1986.897299] Pid: 16, comm: kswapd0 Not tainted 3.1.1-main #8
Nov 15 11:37:27 golem kernel: [ 1986.897306] Call Trace:
Nov 15 11:37:27 golem kernel: [ 1986.897314]  [<c0439e76>] ? printk+0xf/0x11
Nov 15 11:37:27 golem kernel: [ 1986.897324]  [<c01482d1>] print_usage_bug+0x20e/0x21a
Nov 15 11:37:27 golem kernel: [ 1986.897332]  [<c01479b8>] ? print_irq_inversion_bug+0x172/0x172
Nov 15 11:37:27 golem kernel: [ 1986.897341]  [<c014855c>] mark_lock+0x27f/0x483
Nov 15 11:37:27 golem kernel: [ 1986.897349]  [<c0148d88>] __lock_acquire+0x628/0x1472
Nov 15 11:37:27 golem kernel: [ 1986.897358]  [<c0149fae>] lock_acquire+0x47/0x5e
Nov 15 11:37:27 golem kernel: [ 1986.897366]  [<c01f8bd4>] ? reiserfs_write_lock+0x20/0x2a
Nov 15 11:37:27 golem kernel: [ 1986.897384]  [<c01f8bd4>] ? reiserfs_write_lock+0x20/0x2a
Nov 15 11:37:27 golem kernel: [ 1986.897397]  [<c043b5ef>] mutex_lock_nested+0x35/0x26f
Nov 15 11:37:27 golem kernel: [ 1986.897409]  [<c01f8bd4>] ? reiserfs_write_lock+0x20/0x2a
Nov 15 11:37:27 golem kernel: [ 1986.897421]  [<c01f8bd4>] reiserfs_write_lock+0x20/0x2a
Nov 15 11:37:27 golem kernel: [ 1986.897433]  [<c01e2edd>] map_block_for_writepage+0xc9/0x590
Nov 15 11:37:27 golem kernel: [ 1986.897448]  [<c01b1706>] ? create_empty_buffers+0x33/0x8f
Nov 15 11:37:27 golem kernel: [ 1986.897461]  [<c0121124>] ? get_parent_ip+0xb/0x31
Nov 15 11:37:27 golem kernel: [ 1986.897472]  [<c043ef7f>] ? sub_preempt_count+0x81/0x8e
Nov 15 11:37:27 golem kernel: [ 1986.897485]  [<c043cae0>] ? _raw_spin_unlock+0x27/0x3d
Nov 15 11:37:27 golem kernel: [ 1986.897496]  [<c0121124>] ? get_parent_ip+0xb/0x31
Nov 15 11:37:27 golem kernel: [ 1986.897508]  [<c01e355d>] reiserfs_writepage+0x1b9/0x3e7
Nov 15 11:37:27 golem kernel: [ 1986.897521]  [<c0173b40>] ? clear_page_dirty_for_io+0xcb/0xde
Nov 15 11:37:27 golem kernel: [ 1986.897533]  [<c014a6e3>] ? trace_hardirqs_on_caller+0x108/0x138
Nov 15 11:37:27 golem kernel: [ 1986.897546]  [<c014a71e>] ? trace_hardirqs_on+0xb/0xd
Nov 15 11:37:27 golem kernel: [ 1986.897559]  [<c0177b38>] shrink_page_list+0x34f/0x5e2
Nov 15 11:37:27 golem kernel: [ 1986.897572]  [<c01780a7>] shrink_inactive_list+0x172/0x22c
Nov 15 11:37:27 golem kernel: [ 1986.897585]  [<c0178464>] shrink_zone+0x303/0x3b1
Nov 15 11:37:27 golem kernel: [ 1986.897597]  [<c043cae0>] ? _raw_spin_unlock+0x27/0x3d
Nov 15 11:37:27 golem kernel: [ 1986.897611]  [<c01788c9>] kswapd+0x3b7/0x5f2
Nov 15 11:37:27 golem kernel: [ 1986.897622]  [<c01788c9>] ? kswapd+0x3b7/0x5f2
Nov 15 11:37:27 golem kernel: [ 1986.897637]  [<c0138cde>] ? wake_up_bit+0x1b/0x1b
Nov 15 11:37:27 golem kernel: [ 1986.897649]  [<c0178512>] ? shrink_zone+0x3b1/0x3b1
Nov 15 11:37:27 golem kernel: [ 1986.897661]  [<c0138a87>] kthread+0x61/0x66
Nov 15 11:37:27 golem kernel: [ 1986.897673]  [<c0138a26>] ? __init_kthread_worker+0x42/0x42
Nov 15 11:37:27 golem kernel: [ 1986.897686]  [<c0440bba>] kernel_thread_helper+0x6/0xd

I don´t know exactly what I was doing at that time  - probably I edited a _huge_ image gimp.


> [ Added a few more people to the cc ]
>
> On Mon, Oct 31, 2011 at 1:35 AM, Knut Petersen
> <Knut_Petersen@t-online.de>  wrote:
>> After a " rm -r /verybigdir" (about 12G on a 25G reiserfs 3.6partition)
>> I found the following report about a circular locking dependency in
>> kernel 3.1.0
> Heh. There is even a comment about the ordering violation:
>
> /* We use I_MUTEX_CHILD here to silence lockdep. It's safe because xattr
>   * mutation ops aren't called during rename or splace, which are the
>   * only other users of I_MUTEX_CHILD. It violates the ordering, but that's
>   * better than allocating another subclass just for this code. */
>
> and apparently the comment is wrong: we *do* end up looking up xattrs
> during splice, due to the security_inode_need_killpriv() thing.
>
> So I think this needs a suid (or sgid) file that has xattrs and is removed.
>
> That said, I suspect this is a false positive, because the actual
> unlink can never happen while somebody is splicing to/from the same
> file at the same time (because then the iput wouldn't be the last one
> for the inode, and the file removal would be delayed until the file
> has been closed for the last time).
>
> But the hacky use of "I_MUTEX_CHILD" is basically not the proper way
> to silence the lockdep splat.
>
> Anybody?
>
>                    Linus
>

--
To unsubscribe from this list: send the line "unsubscribe reiserfs-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: kernel 3.1.1 / 3.1.0 reiserfs locking problems
  2011-11-15 13:59   ` kernel 3.1.1 / 3.1.0 reiserfs locking problems Knut Petersen
@ 2011-11-15 18:15     ` Frederic Weisbecker
  0 siblings, 0 replies; 6+ messages in thread
From: Frederic Weisbecker @ 2011-11-15 18:15 UTC (permalink / raw)
  To: Knut Petersen
  Cc: Linus Torvalds, linux-kernel, reiserfs-devel, Greg KH, Al Viro,
	Christoph Hellwig, Peter Zijlstra, Jeff Mahoney

On Tue, Nov 15, 2011 at 02:59:21PM +0100, Knut Petersen wrote:
> Am 31.10.2011 16:08, schrieb Linus Torvalds:
> 
> With kernel 3.1.1 there is another reiserfs related lock probleme:
> 
> Nov 15 11:37:27 golem kernel: [ 1986.896976]
> Nov 15 11:37:27 golem kernel: [ 1986.896979] =================================
> Nov 15 11:37:27 golem kernel: [ 1986.896990] [ INFO: inconsistent lock state ]
> Nov 15 11:37:27 golem kernel: [ 1986.896997] 3.1.1-main #8
> Nov 15 11:37:27 golem kernel: [ 1986.897001] ---------------------------------
> Nov 15 11:37:27 golem kernel: [ 1986.897007] inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage.
> Nov 15 11:37:27 golem kernel: [ 1986.897016] kswapd0/16 [HC0[0]:SC0[0]:HE1:SE1] takes:
> Nov 15 11:37:27 golem kernel: [ 1986.897023]  (&REISERFS_SB(s)->lock){+.+.?.}, at: [<c01f8bd4>] reiserfs_write_lock+0x20/0x2a
> Nov 15 11:37:27 golem kernel: [ 1986.897044] {RECLAIM_FS-ON-W} state was registered at:
> Nov 15 11:37:27 golem kernel: [ 1986.897050]   [<c014a5b9>] mark_held_locks+0xae/0xd0
> Nov 15 11:37:27 golem kernel: [ 1986.897060]   [<c014aab3>] lockdep_trace_alloc+0x7d/0x91
> Nov 15 11:37:27 golem kernel: [ 1986.897068]   [<c0190ee0>] kmem_cache_alloc+0x1a/0x93
> Nov 15 11:37:27 golem kernel: [ 1986.897078]   [<c01e7728>] reiserfs_alloc_inode+0x13/0x3d
> Nov 15 11:37:27 golem kernel: [ 1986.897088]   [<c01a5b06>] alloc_inode+0x14/0x5f
> Nov 15 11:37:27 golem kernel: [ 1986.897097]   [<c01a5cb9>] iget5_locked+0x62/0x13a
> Nov 15 11:37:27 golem kernel: [ 1986.897106]   [<c01e99e0>] reiserfs_fill_super+0x410/0x8b9
> Nov 15 11:37:27 golem kernel: [ 1986.897114]   [<c01953da>] mount_bdev+0x10b/0x159
> Nov 15 11:37:27 golem kernel: [ 1986.897123]   [<c01e764d>] get_super_block+0x10/0x12
> Nov 15 11:37:27 golem kernel: [ 1986.897131]   [<c0195b38>] mount_fs+0x59/0x12d
> Nov 15 11:37:27 golem kernel: [ 1986.897138]   [<c01a80d1>] vfs_kern_mount+0x45/0x7a
> Nov 15 11:37:27 golem kernel: [ 1986.897147]   [<c01a83e3>] do_kern_mount+0x2f/0xb0
> Nov 15 11:37:27 golem kernel: [ 1986.897155]   [<c01a987a>] do_mount+0x5c2/0x612
> Nov 15 11:37:27 golem kernel: [ 1986.897163]   [<c01a9a72>] sys_mount+0x61/0x8f
> Nov 15 11:37:27 golem kernel: [ 1986.897170]   [<c044060c>] sysenter_do_call+0x12/0x32

The most straightforward way to solve this is to unlock before getting the superblock
inode:

diff --git a/fs/reiserfs/super.c b/fs/reiserfs/super.c
index 14363b9..463c203 100644
--- a/fs/reiserfs/super.c
+++ b/fs/reiserfs/super.c
@@ -1762,9 +1762,11 @@ static int reiserfs_fill_super(struct super_block *s, void *data, int silent)
 	}
 	args.objectid = REISERFS_ROOT_OBJECTID;
 	args.dirid = REISERFS_ROOT_PARENT_OBJECTID;
+	reiserfs_write_unlock(s);
 	root_inode =
 	    iget5_locked(s, REISERFS_ROOT_OBJECTID, reiserfs_find_actor,
 			 reiserfs_init_locked_inode, (void *)(&args));
+	reiserfs_write_lock(s);
 	if (!root_inode) {
 		SWARN(silent, s, "jmacd-10", "get root inode failed");
 		goto error;

But may be there are other iget5_locked in the path that I missed.

Note this should be a harmless warning because I guess the filesystem can't be
used for memory reclaim before it is actually mounted. But still this is
annoying and the above fix is one more hack to cope with the fact we need to
hold the lock from the mount path early even if we don't need it. That's
because many helpers used there assume the lock is always taken when they
are called. This assumption come from the bkl times.

So probably I should try to clean up this a bit to solve it in a less
dirty way. I'll try to come with something soon.

^ permalink raw reply related	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2011-11-15 18:15 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-10-31  8:35 [BUG] kernel 3.1.0 possible circular locking dependency detected Knut Petersen
2011-10-31 15:08 ` Linus Torvalds
2011-10-31 15:59   ` Knut Petersen
2011-11-07 17:18   ` Peter Zijlstra
2011-11-15 13:59   ` kernel 3.1.1 / 3.1.0 reiserfs locking problems Knut Petersen
2011-11-15 18:15     ` Frederic Weisbecker

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).