[GIT PULL] Btrfs bug fixes

linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [GIT PULL] Btrfs bug fixes
@ 2015-06-30 22:20 fdmanana
  2015-07-01 17:24 ` Chris Mason
  0 siblings, 1 reply; 9+ messages in thread
From: fdmanana @ 2015-06-30 22:20 UTC (permalink / raw)
  To: linux-btrfs, clm; +Cc: Filipe Manana

From: Filipe Manana <fdmanana@suse.com>

Hi Chris,

Please consider the following changes (or a subset at your will in case
they are too many or too large) for the kernel 4.2 release. All these
patches have been available in the mailing list for at least 2 weeks.

They are all bug fixes and deal with races, crashes, memory corruption,
hangs and data loss on fsync. No cleanups, refactorings or other cosmetic
changes that don't affect users, except for a change that updates a
comment added in a change that was already sent to Linus (but that can
hardly introduce any regression).

I have included 2 fixes from Liu related to error handling of direct IO
writes because not only they seem important to me, I also tested and
reviewed them. My recent patch for direct IO error handling (not included
in this pull request because it's too fresh) does not depend on Liu's
fixes, it's for a different error case (when we fail to submit the bios),
and can be tested/applied with or without his fixes.

Finally I have included a fix from Shilong that has been around since
April and didn't got any attention, despite being correct, trivial and
useful.

I have added Reviewed-By and Tested-by tags to all the patches that got
such tags from people through the mailing list. Some patches are also
tagged for stable.

I have rebased them on top of your current integration-4.2 branch and
re-tested them with xfstests, LTP and some custom tests. Some of these
fixes have new test cases for xfstest submitted recently as well (but
not yet merged as of today, only reviewed).

Thanks.

The following changes since commit 5a5003df98d5a7f6834227885b7c9728f767cc27:

  btrfs: delayed-ref: double free in btrfs_add_delayed_tree_ref() (2015-06-24 12:28:03 -0700)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/fdmanana/linux.git integration-4.2

for you to fetch changes up to 9ac2b7cb4755cb3311bb7d1ccf0eb51d0e006fba:

  Btrfs: fix wrong check for btrfs_force_chunk_alloc() (2015-06-30 05:00:49 +0100)

----------------------------------------------------------------
Filipe Manana (7):
      Btrfs: fix a comment in inode.c:evict_inode_truncate_pages()
      Btrfs: fix race between balance and unused block group deletion
      Btrfs: use kmem_cache_free when freeing entry in inode cache
      Btrfs: fix race between caching kthread and returning inode to inode cache
      Btrfs: fix crash on close_ctree() if cleaner starts new transaction
      Btrfs: fix fsync data loss after append write
      Btrfs: fix fsync xattr loss in the fast fsync path

Liu Bo (2):
      Btrfs: fix hang when failing to submit bio of directIO
      Btrfs: fix warning of bytes_may_use

Shilong Wang (1):
      Btrfs: fix wrong check for btrfs_force_chunk_alloc()

 fs/btrfs/btrfs_inode.h |   2 ++
 fs/btrfs/ctree.h       |   1 +
 fs/btrfs/disk-io.c     |  41 ++++++++++++++++++++++++++++++++++++++++-
 fs/btrfs/extent-tree.c |   3 +++
 fs/btrfs/inode-map.c   |  17 ++++++++++++-----
 fs/btrfs/inode.c       |  24 ++++++++++++++++--------
 fs/btrfs/relocation.c  |   2 +-
 fs/btrfs/tree-log.c    | 118 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++-----
 fs/btrfs/volumes.c     |  48 +++++++++++++++++++++++++++++++++++++++++++-----
 9 files changed, 231 insertions(+), 25 deletions(-)

-- 
2.1.3

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-06-30 22:20 [GIT PULL] Btrfs bug fixes fdmanana
@ 2015-07-01 17:24 ` Chris Mason
  2015-07-01 21:00   ` Ed Tomlinson
  0 siblings, 1 reply; 9+ messages in thread
From: Chris Mason @ 2015-07-01 17:24 UTC (permalink / raw)
  To: fdmanana; +Cc: linux-btrfs, Filipe Manana

On Tue, Jun 30, 2015 at 11:20:31PM +0100, fdmanana@kernel.org wrote:
> From: Filipe Manana <fdmanana@suse.com>
> 
> Hi Chris,
> 
> Please consider the following changes (or a subset at your will in case
> they are too many or too large) for the kernel 4.2 release. All these
> patches have been available in the mailing list for at least 2 weeks.
> 
> They are all bug fixes and deal with races, crashes, memory corruption,
> hangs and data loss on fsync. No cleanups, refactorings or other cosmetic
> changes that don't affect users, except for a change that updates a
> comment added in a change that was already sent to Linus (but that can
> hardly introduce any regression).

Thanks! These are queued along with a few others I pulled from the list,
Mark's dedup fixes, and the ones Qu mentioned.  Once it passes tests
here I'll push out.

-chris

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-07-01 17:24 ` Chris Mason
@ 2015-07-01 21:00   ` Ed Tomlinson
  0 siblings, 0 replies; 9+ messages in thread
From: Ed Tomlinson @ 2015-07-01 21:00 UTC (permalink / raw)
  To: Chris Mason; +Cc: fdmanana, linux-btrfs, Filipe Manana

Chris,

Have you looked at Omar Sandoval's

[PATCH v2 0/5] Btrfs: RAID 5/6 missing device scrub+replace

It would be really nice to have these when/if a disk dies...  I
been running with them since v1 without issue.

Thanks
Ed Tomlinson

On Wednesday, July 1, 2015 1:24:41 PM EDT, Chris Mason wrote:
> On Tue, Jun 30, 2015 at 11:20:31PM +0100, fdmanana@kernel.org wrote:
>> From: Filipe Manana <fdmanana@suse.com>
>> 
>> Hi Chris,
>> 
>> Please consider the following changes (or a subset at your will in case
>> they are too many or too large) for the kernel 4.2 release. All these
>> patches have been available in the mailing list for at least 2 weeks. ...
>
> Thanks! These are queued along with a few others I pulled from the list,
> Mark's dedup fixes, and the ones Qu mentioned.  Once it passes tests
> here I'll push out.
>
> -chris
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
>
>


^ permalink raw reply	[flat|nested] 9+ messages in thread

* [GIT PULL] Btrfs bug fixes
@ 2015-09-15  2:22 fdmanana
  2015-09-15 11:49 ` Holger Hoffstätte
  0 siblings, 1 reply; 9+ messages in thread
From: fdmanana @ 2015-09-15  2:22 UTC (permalink / raw)
  To: linux-btrfs; +Cc: clm, Filipe Manana

From: Filipe Manana <fdmanana@suse.com>

Hi Chris,

Please consider the following fixes for the 4.3 kernel release candidates.
One of them addresses a deadlock introduced in 4.3, another is for a false
enospc condition (which I introduced in a 4.2 commit) that can happen either
on empty filesystems or filesystems with files having only inlined extents
after mounting with "-o nospace_cache", a fix for read corruption of
compressed and shared (cloned, deduped) extents that has been present
possibly since the compression feature exists or since the clone/dedup
ioctls exist, and finally, from Jeff, a fix for a long standing null
pointer dereference during inode eviction that some users were running
into very often.

They are all short and simple, the read corruption one is bigger only
because it adds a big fat comment, and just like Jeff's fix, it was
tagged for backporting to stable releases too.

I included any Reviewed-by tags people added through the mailing list and
two of these fixes have corresponding test cases for fstests that were
submitted, along with the respective fixes, to the mailing list.

Thanks.

The following changes since commit 527afb4493c2892ce89fb74648e72a30b68ba120:

  Btrfs: cleanup: remove unnecessary check before btrfs_free_path is called (2015-08-31 11:46:41 -0700)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/fdmanana/linux.git integration-4.3

for you to fetch changes up to a30e577c96f59b1e1678ea5462432b09bf7d5cbc:

  btrfs: skip waiting on ordered range for special files (2015-09-15 02:21:08 +0100)

----------------------------------------------------------------
Filipe Manana (3):
      Btrfs: don't initialize a space info as full to prevent ENOSPC
      Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock
      Btrfs: fix read corruption of compressed and shared extents

Jeff Mahoney (1):
      btrfs: skip waiting on ordered range for special files

 fs/btrfs/disk-io.c     |  2 --
 fs/btrfs/extent-tree.c |  5 +----
 fs/btrfs/extent_io.c   | 65 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++--------
 fs/btrfs/inode.c       |  3 ++-
 fs/btrfs/super.c       |  2 --
 5 files changed, 60 insertions(+), 17 deletions(-)

-- 
2.1.3

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-09-15  2:22 fdmanana
@ 2015-09-15 11:49 ` Holger Hoffstätte
  2015-09-15 12:58   ` Filipe Manana
  0 siblings, 1 reply; 9+ messages in thread
From: Holger Hoffstätte @ 2015-09-15 11:49 UTC (permalink / raw)
  To: fdmanana, linux-btrfs; +Cc: clm, Filipe Manana

Hello Filipe,

your mail comes just in time as I was typing a mail about this patch:

On 09/15/15 04:22, fdmanana@kernel.org wrote:
>       Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock

While it might seem to fix this particular problem, it seems there is either a new one or a bad interaction with either both or one of these from Josef, which came through around the same time:

Btrfs: use btrfs_get_fs_root in resolve_indirect_ref
Btrfs: keep dropped roots in cache until transaciton commit

With all three patches I have now seen something like this when deleting snapshots that contain enough data to also kick off the cleaner:

[34851.791035] BTRFS info (device sdc1): leaf 865062912000 total ptrs 104 free space 1151
(..lots of stuff that was supposed to be deleted..)
[34851.791210] BTRFS critical (device sdc1): unable to update root key 1328 132 5341
[34851.791795] ------------[ cut here ]------------
[34851.792343] kernel BUG at fs/btrfs/root-tree.c:160!
[34851.792889] invalid opcode: 0000 [#1] SMP 
[34851.793442] Modules linked in: nfsd auth_rpcgss oid_registry lockd grace sunrpc sch_fq_codel btrfs xor snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crc32_pclmul crc32c_intel aesni_intel i915 raid6_pq aes_x86_64 glue_helper snd_hda_intel lrw gf128mul snd_hda_controller ablk_helper cryptd snd_hda_codec intel_gtt snd_hda_core i2c_algo_bit drm_kms_helper snd_pcm i2c_i801 usbhid drm snd_timer r8169 snd soundcore i2c_core mii video
[34851.795328] CPU: 0 PID: 8807 Comm: sync Not tainted 4.1.7 #1
[34851.795942] Hardware name: System manufacturer System Product Name/P8Z68-V LX, BIOS 4105 07/01/2013
[34851.796571] task: ffff88040cc91760 ti: ffff880352454000 task.ti: ffff880352454000
[34851.797204] RIP: 0010:[<ffffffffa0435496>]  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
[34851.797869] RSP: 0018:ffff880352457cf8  EFLAGS: 00010292
[34851.798521] RAX: 0000000000000045 RBX: ffff8802f0e59120 RCX: 0000000000000006
[34851.799188] RDX: 0000000000000007 RSI: 0000000000000246 RDI: ffff88041f20d520
[34851.799858] RBP: ffff880352457d58 R08: 0000000000000000 R09: 0000000000000410
[34851.800534] R10: 00000000ffffffff R11: 0000000000000410 R12: 0000000000000001
[34851.801208] R13: ffff8803fdf191df R14: ffff8803fdf19028 R15: ffff8800c96f4800
[34851.801879] FS:  00007fa61e9bd700(0000) GS:ffff88041f200000(0000) knlGS:0000000000000000
[34851.802566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[34851.803257] CR2: 00007fadc28e4000 CR3: 000000035247f000 CR4: 00000000000406f0
[34851.803962] Stack:
[34851.804661]  00003a6552457d88 00000000000000b0 ffff8803fdf191df ffff88030c3a8f18
[34851.805389]  000000000000053a 0000000000000000 ffff880352457d58 ffff88030c3a8f18
[34851.806107]  ffff88040cba6000 ffff88040cba6070 0000000000000002 ffff8803fdf19000
[34851.806825] Call Trace:
[34851.807536]  [<ffffffffa04bd900>] commit_fs_roots.isra.7+0x11b/0x14d [btrfs]
[34851.808261]  [<ffffffffa0444325>] btrfs_commit_transaction+0x6b5/0xc10 [btrfs]
[34851.808992]  [<ffffffff8107aec0>] ? wake_atomic_t_function+0x60/0x60
[34851.809714]  [<ffffffffa04125b9>] btrfs_sync_fs+0x59/0x110 [btrfs]
[34851.810438]  [<ffffffff811a3d70>] ? do_fsync+0x70/0x70
[34851.811170]  [<ffffffff811a3d90>] sync_fs_one_sb+0x20/0x30
[34851.811904]  [<ffffffff81176a56>] iterate_supers+0xb6/0x120
[34851.812636]  [<ffffffff811a3eb5>] sys_sync+0x55/0x90
[34851.813371]  [<ffffffff81573b97>] system_call_fastpath+0x12/0x6a
[34851.814106] Code: 89 ff e8 de f9 ff ff 48 8b 45 b0 48 c7 c6 88 78 4c a0 49 8b bf f0 01 00 00 0f b6 48 08 4c 8b 40 09 48 8b 10 31 c0 e8 5a e8 fd ff <0f> 0b 49 8b 87 f0 01 00 00 f0 0f ba a8 78 0d 00 00 02 72 1d 48 
[34851.815868] RIP  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
[34851.816713]  RSP <ffff880352457cf8>
[34851.817575] ---[ end trace cc3e428ae50170a3 ]---

While this happened when issuing sync after a delayed snapshot delete, I've seen it also happen in the background. It's however not easily repeatable. Until now I've not been able to pinpoint whether the problem is only in the lock removal patch or an interaction with Josef's patches (or only there), but so far I have never seen this with only those two.

Just thought you wanted to know - maybe you have an idea. Maybe there is a bad interaction (larger race window?) with keeping the snapshots pinned longer and the cleaner lock removal.

Holger


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-09-15 11:49 ` Holger Hoffstätte
@ 2015-09-15 12:58   ` Filipe Manana
  2015-09-15 13:43     ` Holger Hoffstätte
  0 siblings, 1 reply; 9+ messages in thread
From: Filipe Manana @ 2015-09-15 12:58 UTC (permalink / raw)
  To: Holger Hoffstätte
  Cc: linux-btrfs@vger.kernel.org, Chris Mason, Filipe Manana

On Tue, Sep 15, 2015 at 12:49 PM, Holger Hoffstätte
<holger.hoffstaette@googlemail.com> wrote:
> Hello Filipe,
>
> your mail comes just in time as I was typing a mail about this patch:
>
> On 09/15/15 04:22, fdmanana@kernel.org wrote:
>>       Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock
>
> While it might seem to fix this particular problem, it seems there is either a new one or a bad interaction with either both or one of these from Josef, which came through around the same time:
>
> Btrfs: use btrfs_get_fs_root in resolve_indirect_ref
> Btrfs: keep dropped roots in cache until transaciton commit
>
> With all three patches I have now seen something like this when deleting snapshots that contain enough data to also kick off the cleaner:
>
> [34851.791035] BTRFS info (device sdc1): leaf 865062912000 total ptrs 104 free space 1151
> (..lots of stuff that was supposed to be deleted..)
> [34851.791210] BTRFS critical (device sdc1): unable to update root key 1328 132 5341
> [34851.791795] ------------[ cut here ]------------
> [34851.792343] kernel BUG at fs/btrfs/root-tree.c:160!
> [34851.792889] invalid opcode: 0000 [#1] SMP
> [34851.793442] Modules linked in: nfsd auth_rpcgss oid_registry lockd grace sunrpc sch_fq_codel btrfs xor snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crc32_pclmul crc32c_intel aesni_intel i915 raid6_pq aes_x86_64 glue_helper snd_hda_intel lrw gf128mul snd_hda_controller ablk_helper cryptd snd_hda_codec intel_gtt snd_hda_core i2c_algo_bit drm_kms_helper snd_pcm i2c_i801 usbhid drm snd_timer r8169 snd soundcore i2c_core mii video
> [34851.795328] CPU: 0 PID: 8807 Comm: sync Not tainted 4.1.7 #1
> [34851.795942] Hardware name: System manufacturer System Product Name/P8Z68-V LX, BIOS 4105 07/01/2013
> [34851.796571] task: ffff88040cc91760 ti: ffff880352454000 task.ti: ffff880352454000
> [34851.797204] RIP: 0010:[<ffffffffa0435496>]  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
> [34851.797869] RSP: 0018:ffff880352457cf8  EFLAGS: 00010292
> [34851.798521] RAX: 0000000000000045 RBX: ffff8802f0e59120 RCX: 0000000000000006
> [34851.799188] RDX: 0000000000000007 RSI: 0000000000000246 RDI: ffff88041f20d520
> [34851.799858] RBP: ffff880352457d58 R08: 0000000000000000 R09: 0000000000000410
> [34851.800534] R10: 00000000ffffffff R11: 0000000000000410 R12: 0000000000000001
> [34851.801208] R13: ffff8803fdf191df R14: ffff8803fdf19028 R15: ffff8800c96f4800
> [34851.801879] FS:  00007fa61e9bd700(0000) GS:ffff88041f200000(0000) knlGS:0000000000000000
> [34851.802566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [34851.803257] CR2: 00007fadc28e4000 CR3: 000000035247f000 CR4: 00000000000406f0
> [34851.803962] Stack:
> [34851.804661]  00003a6552457d88 00000000000000b0 ffff8803fdf191df ffff88030c3a8f18
> [34851.805389]  000000000000053a 0000000000000000 ffff880352457d58 ffff88030c3a8f18
> [34851.806107]  ffff88040cba6000 ffff88040cba6070 0000000000000002 ffff8803fdf19000
> [34851.806825] Call Trace:
> [34851.807536]  [<ffffffffa04bd900>] commit_fs_roots.isra.7+0x11b/0x14d [btrfs]
> [34851.808261]  [<ffffffffa0444325>] btrfs_commit_transaction+0x6b5/0xc10 [btrfs]
> [34851.808992]  [<ffffffff8107aec0>] ? wake_atomic_t_function+0x60/0x60
> [34851.809714]  [<ffffffffa04125b9>] btrfs_sync_fs+0x59/0x110 [btrfs]
> [34851.810438]  [<ffffffff811a3d70>] ? do_fsync+0x70/0x70
> [34851.811170]  [<ffffffff811a3d90>] sync_fs_one_sb+0x20/0x30
> [34851.811904]  [<ffffffff81176a56>] iterate_supers+0xb6/0x120
> [34851.812636]  [<ffffffff811a3eb5>] sys_sync+0x55/0x90
> [34851.813371]  [<ffffffff81573b97>] system_call_fastpath+0x12/0x6a
> [34851.814106] Code: 89 ff e8 de f9 ff ff 48 8b 45 b0 48 c7 c6 88 78 4c a0 49 8b bf f0 01 00 00 0f b6 48 08 4c 8b 40 09 48 8b 10 31 c0 e8 5a e8 fd ff <0f> 0b 49 8b 87 f0 01 00 00 f0 0f ba a8 78 0d 00 00 02 72 1d 48
> [34851.815868] RIP  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
> [34851.816713]  RSP <ffff880352457cf8>
> [34851.817575] ---[ end trace cc3e428ae50170a3 ]---
>
> While this happened when issuing sync after a delayed snapshot delete, I've seen it also happen in the background. It's however not easily repeatable. Until now I've not been able to pinpoint whether the problem is only in the lock removal patch or an interaction with Josef's patches (or only there), but so far I have never seen this with only those two.
>
> Just thought you wanted to know - maybe you have an idea. Maybe there is a bad interaction (larger race window?) with keeping the snapshots pinned longer and the cleaner lock removal.

Hi Holger,

So at a first glance I don't think the patch that removes the locking
of cleaner mutex is the cause here. First it removed the locking in
the unmount and remount paths only (before attempts to delete unused
block groups), second the cleaner mutex is used only to serialize
snapshot removal and block group relocation/balance. Deleting unused
block groups is safe without the cleaner mutex and safe against a
transaction commit too, since before deleting any block group it will
start/join the transaction and block until the current transaction
ends its commit (if it's currently committing, such as in your trace).

I haven't tried Josef's patches, but I'll leave them running here and
see if I can reproduce that.

When you get such error, are you doing an unmount or remount? If not,
I really don't see how the patch introduces any problems, either with
or without Josef's changes.

thanks

>
> Holger
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-09-15 12:58   ` Filipe Manana
@ 2015-09-15 13:43     ` Holger Hoffstätte
  2015-09-15 13:53       ` Josef Bacik
  2015-09-15 14:07       ` Josef Bacik
  0 siblings, 2 replies; 9+ messages in thread
From: Holger Hoffstätte @ 2015-09-15 13:43 UTC (permalink / raw)
  To: Filipe Manana; +Cc: linux-btrfs@vger.kernel.org, Chris Mason, Filipe Manana

On 09/15/15 14:58, Filipe Manana wrote:
> On Tue, Sep 15, 2015 at 12:49 PM, Holger Hoffstätte
> <holger.hoffstaette@googlemail.com> wrote:
>> Hello Filipe,
>>
>> your mail comes just in time as I was typing a mail about this patch:
>>
>> On 09/15/15 04:22, fdmanana@kernel.org wrote:
>>>       Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock
>>
>> While it might seem to fix this particular problem, it seems there is either a new one or a bad interaction with either both or one of these from Josef, which came through around the same time:
>>
>> Btrfs: use btrfs_get_fs_root in resolve_indirect_ref
>> Btrfs: keep dropped roots in cache until transaciton commit
>>
>> With all three patches I have now seen something like this when deleting snapshots that contain enough data to also kick off the cleaner:
>>
>> [34851.791035] BTRFS info (device sdc1): leaf 865062912000 total ptrs 104 free space 1151
>> (..lots of stuff that was supposed to be deleted..)
>> [34851.791210] BTRFS critical (device sdc1): unable to update root key 1328 132 5341
>> [34851.791795] ------------[ cut here ]------------
>> [34851.792343] kernel BUG at fs/btrfs/root-tree.c:160!
>> [34851.792889] invalid opcode: 0000 [#1] SMP
>> [34851.793442] Modules linked in: nfsd auth_rpcgss oid_registry lockd grace sunrpc sch_fq_codel btrfs xor snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crc32_pclmul crc32c_intel aesni_intel i915 raid6_pq aes_x86_64 glue_helper snd_hda_intel lrw gf128mul snd_hda_controller ablk_helper cryptd snd_hda_codec intel_gtt snd_hda_core i2c_algo_bit drm_kms_helper snd_pcm i2c_i801 usbhid drm snd_timer r8169 snd soundcore i2c_core mii video
>> [34851.795328] CPU: 0 PID: 8807 Comm: sync Not tainted 4.1.7 #1
>> [34851.795942] Hardware name: System manufacturer System Product Name/P8Z68-V LX, BIOS 4105 07/01/2013
>> [34851.796571] task: ffff88040cc91760 ti: ffff880352454000 task.ti: ffff880352454000
>> [34851.797204] RIP: 0010:[<ffffffffa0435496>]  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>> [34851.797869] RSP: 0018:ffff880352457cf8  EFLAGS: 00010292
>> [34851.798521] RAX: 0000000000000045 RBX: ffff8802f0e59120 RCX: 0000000000000006
>> [34851.799188] RDX: 0000000000000007 RSI: 0000000000000246 RDI: ffff88041f20d520
>> [34851.799858] RBP: ffff880352457d58 R08: 0000000000000000 R09: 0000000000000410
>> [34851.800534] R10: 00000000ffffffff R11: 0000000000000410 R12: 0000000000000001
>> [34851.801208] R13: ffff8803fdf191df R14: ffff8803fdf19028 R15: ffff8800c96f4800
>> [34851.801879] FS:  00007fa61e9bd700(0000) GS:ffff88041f200000(0000) knlGS:0000000000000000
>> [34851.802566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>> [34851.803257] CR2: 00007fadc28e4000 CR3: 000000035247f000 CR4: 00000000000406f0
>> [34851.803962] Stack:
>> [34851.804661]  00003a6552457d88 00000000000000b0 ffff8803fdf191df ffff88030c3a8f18
>> [34851.805389]  000000000000053a 0000000000000000 ffff880352457d58 ffff88030c3a8f18
>> [34851.806107]  ffff88040cba6000 ffff88040cba6070 0000000000000002 ffff8803fdf19000
>> [34851.806825] Call Trace:
>> [34851.807536]  [<ffffffffa04bd900>] commit_fs_roots.isra.7+0x11b/0x14d [btrfs]
>> [34851.808261]  [<ffffffffa0444325>] btrfs_commit_transaction+0x6b5/0xc10 [btrfs]
>> [34851.808992]  [<ffffffff8107aec0>] ? wake_atomic_t_function+0x60/0x60
>> [34851.809714]  [<ffffffffa04125b9>] btrfs_sync_fs+0x59/0x110 [btrfs]
>> [34851.810438]  [<ffffffff811a3d70>] ? do_fsync+0x70/0x70
>> [34851.811170]  [<ffffffff811a3d90>] sync_fs_one_sb+0x20/0x30
>> [34851.811904]  [<ffffffff81176a56>] iterate_supers+0xb6/0x120
>> [34851.812636]  [<ffffffff811a3eb5>] sys_sync+0x55/0x90
>> [34851.813371]  [<ffffffff81573b97>] system_call_fastpath+0x12/0x6a
>> [34851.814106] Code: 89 ff e8 de f9 ff ff 48 8b 45 b0 48 c7 c6 88 78 4c a0 49 8b bf f0 01 00 00 0f b6 48 08 4c 8b 40 09 48 8b 10 31 c0 e8 5a e8 fd ff <0f> 0b 49 8b 87 f0 01 00 00 f0 0f ba a8 78 0d 00 00 02 72 1d 48
>> [34851.815868] RIP  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>> [34851.816713]  RSP <ffff880352457cf8>
>> [34851.817575] ---[ end trace cc3e428ae50170a3 ]---
>>
>> While this happened when issuing sync after a delayed snapshot delete, I've seen it also happen in the background. It's however not easily repeatable. Until now I've not been able to pinpoint whether the problem is only in the lock removal patch or an interaction with Josef's patches (or only there), but so far I have never seen this with only those two.
>>
>> Just thought you wanted to know - maybe you have an idea. Maybe there is a bad interaction (larger race window?) with keeping the snapshots pinned longer and the cleaner lock removal.
> 
> Hi Holger,
> 
> So at a first glance I don't think the patch that removes the locking
> of cleaner mutex is the cause here. First it removed the locking in
> the unmount and remount paths only (before attempts to delete unused
> block groups), second the cleaner mutex is used only to serialize
> snapshot removal and block group relocation/balance. Deleting unused
> block groups is safe without the cleaner mutex and safe against a
> transaction commit too, since before deleting any block group it will
> start/join the transaction and block until the current transaction
> ends its commit (if it's currently committing, such as in your trace).

OK, thanks for the explanation.

> I haven't tried Josef's patches, but I'll leave them running here and
> see if I can reproduce that.

Thanks.

> When you get such error, are you doing an unmount or remount? If not,

no, just snapshot delete & manual sync.

> I really don't see how the patch introduces any problems, either with
> or without Josef's changes.

The interesting thing is that I ran with only your patch for a while
without problems, but saw this error after merging Josef's patches
and trying a quick snapshot delete *right away* (I remember that I also
issued a sync then, so..). I chalked it up to stale module info - despite
clean unmounting/rmmod/depmod/reload, which usually works fine for simple
patches - and since it worked fine for several days in casual use after
a clean reboot I let it slide. However it just happened again out of
the blue while deleting a snapshot. I don't know whether issuing the
sync was a deciding factor; I was more interested in seeing reclaimed
space immediately.

I've now reverted Josef's patches and have been running with only yours
for a good hour, creating/deleting multiple/nested/chained snapshots
with lots of blocks changed between them, with and without issuing sync
as hard as I can - nothing. Also verified that the cleaner still does
its thing (even deleting the very last data chunk) - no problems.

Let me know what you think and how/when I can help or test.

Thanks,
Holger


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-09-15 13:43     ` Holger Hoffstätte
@ 2015-09-15 13:53       ` Josef Bacik
  2015-09-15 14:07       ` Josef Bacik
  1 sibling, 0 replies; 9+ messages in thread
From: Josef Bacik @ 2015-09-15 13:53 UTC (permalink / raw)
  To: Holger Hoffstätte, Filipe Manana
  Cc: linux-btrfs@vger.kernel.org, Chris Mason, Filipe Manana

On 09/15/2015 09:43 AM, Holger Hoffstätte wrote:
> On 09/15/15 14:58, Filipe Manana wrote:
>> On Tue, Sep 15, 2015 at 12:49 PM, Holger Hoffstätte
>> <holger.hoffstaette@googlemail.com> wrote:
>>> Hello Filipe,
>>>
>>> your mail comes just in time as I was typing a mail about this patch:
>>>
>>> On 09/15/15 04:22, fdmanana@kernel.org wrote:
>>>>        Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock
>>>
>>> While it might seem to fix this particular problem, it seems there is either a new one or a bad interaction with either both or one of these from Josef, which came through around the same time:
>>>
>>> Btrfs: use btrfs_get_fs_root in resolve_indirect_ref
>>> Btrfs: keep dropped roots in cache until transaciton commit
>>>
>>> With all three patches I have now seen something like this when deleting snapshots that contain enough data to also kick off the cleaner:
>>>
>>> [34851.791035] BTRFS info (device sdc1): leaf 865062912000 total ptrs 104 free space 1151
>>> (..lots of stuff that was supposed to be deleted..)
>>> [34851.791210] BTRFS critical (device sdc1): unable to update root key 1328 132 5341
>>> [34851.791795] ------------[ cut here ]------------
>>> [34851.792343] kernel BUG at fs/btrfs/root-tree.c:160!
>>> [34851.792889] invalid opcode: 0000 [#1] SMP
>>> [34851.793442] Modules linked in: nfsd auth_rpcgss oid_registry lockd grace sunrpc sch_fq_codel btrfs xor snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crc32_pclmul crc32c_intel aesni_intel i915 raid6_pq aes_x86_64 glue_helper snd_hda_intel lrw gf128mul snd_hda_controller ablk_helper cryptd snd_hda_codec intel_gtt snd_hda_core i2c_algo_bit drm_kms_helper snd_pcm i2c_i801 usbhid drm snd_timer r8169 snd soundcore i2c_core mii video
>>> [34851.795328] CPU: 0 PID: 8807 Comm: sync Not tainted 4.1.7 #1
>>> [34851.795942] Hardware name: System manufacturer System Product Name/P8Z68-V LX, BIOS 4105 07/01/2013
>>> [34851.796571] task: ffff88040cc91760 ti: ffff880352454000 task.ti: ffff880352454000
>>> [34851.797204] RIP: 0010:[<ffffffffa0435496>]  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>>> [34851.797869] RSP: 0018:ffff880352457cf8  EFLAGS: 00010292
>>> [34851.798521] RAX: 0000000000000045 RBX: ffff8802f0e59120 RCX: 0000000000000006
>>> [34851.799188] RDX: 0000000000000007 RSI: 0000000000000246 RDI: ffff88041f20d520
>>> [34851.799858] RBP: ffff880352457d58 R08: 0000000000000000 R09: 0000000000000410
>>> [34851.800534] R10: 00000000ffffffff R11: 0000000000000410 R12: 0000000000000001
>>> [34851.801208] R13: ffff8803fdf191df R14: ffff8803fdf19028 R15: ffff8800c96f4800
>>> [34851.801879] FS:  00007fa61e9bd700(0000) GS:ffff88041f200000(0000) knlGS:0000000000000000
>>> [34851.802566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>> [34851.803257] CR2: 00007fadc28e4000 CR3: 000000035247f000 CR4: 00000000000406f0
>>> [34851.803962] Stack:
>>> [34851.804661]  00003a6552457d88 00000000000000b0 ffff8803fdf191df ffff88030c3a8f18
>>> [34851.805389]  000000000000053a 0000000000000000 ffff880352457d58 ffff88030c3a8f18
>>> [34851.806107]  ffff88040cba6000 ffff88040cba6070 0000000000000002 ffff8803fdf19000
>>> [34851.806825] Call Trace:
>>> [34851.807536]  [<ffffffffa04bd900>] commit_fs_roots.isra.7+0x11b/0x14d [btrfs]
>>> [34851.808261]  [<ffffffffa0444325>] btrfs_commit_transaction+0x6b5/0xc10 [btrfs]
>>> [34851.808992]  [<ffffffff8107aec0>] ? wake_atomic_t_function+0x60/0x60
>>> [34851.809714]  [<ffffffffa04125b9>] btrfs_sync_fs+0x59/0x110 [btrfs]
>>> [34851.810438]  [<ffffffff811a3d70>] ? do_fsync+0x70/0x70
>>> [34851.811170]  [<ffffffff811a3d90>] sync_fs_one_sb+0x20/0x30
>>> [34851.811904]  [<ffffffff81176a56>] iterate_supers+0xb6/0x120
>>> [34851.812636]  [<ffffffff811a3eb5>] sys_sync+0x55/0x90
>>> [34851.813371]  [<ffffffff81573b97>] system_call_fastpath+0x12/0x6a
>>> [34851.814106] Code: 89 ff e8 de f9 ff ff 48 8b 45 b0 48 c7 c6 88 78 4c a0 49 8b bf f0 01 00 00 0f b6 48 08 4c 8b 40 09 48 8b 10 31 c0 e8 5a e8 fd ff <0f> 0b 49 8b 87 f0 01 00 00 f0 0f ba a8 78 0d 00 00 02 72 1d 48
>>> [34851.815868] RIP  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>>> [34851.816713]  RSP <ffff880352457cf8>
>>> [34851.817575] ---[ end trace cc3e428ae50170a3 ]---
>>>
>>> While this happened when issuing sync after a delayed snapshot delete, I've seen it also happen in the background. It's however not easily repeatable. Until now I've not been able to pinpoint whether the problem is only in the lock removal patch or an interaction with Josef's patches (or only there), but so far I have never seen this with only those two.
>>>
>>> Just thought you wanted to know - maybe you have an idea. Maybe there is a bad interaction (larger race window?) with keeping the snapshots pinned longer and the cleaner lock removal.
>>
>> Hi Holger,
>>
>> So at a first glance I don't think the patch that removes the locking
>> of cleaner mutex is the cause here. First it removed the locking in
>> the unmount and remount paths only (before attempts to delete unused
>> block groups), second the cleaner mutex is used only to serialize
>> snapshot removal and block group relocation/balance. Deleting unused
>> block groups is safe without the cleaner mutex and safe against a
>> transaction commit too, since before deleting any block group it will
>> start/join the transaction and block until the current transaction
>> ends its commit (if it's currently committing, such as in your trace).
>
> OK, thanks for the explanation.
>
>> I haven't tried Josef's patches, but I'll leave them running here and
>> see if I can reproduce that.
>
> Thanks.
>
>> When you get such error, are you doing an unmount or remount? If not,
>
> no, just snapshot delete & manual sync.
>
>> I really don't see how the patch introduces any problems, either with
>> or without Josef's changes.
>
> The interesting thing is that I ran with only your patch for a while
> without problems, but saw this error after merging Josef's patches
> and trying a quick snapshot delete *right away* (I remember that I also
> issued a sync then, so..). I chalked it up to stale module info - despite
> clean unmounting/rmmod/depmod/reload, which usually works fine for simple
> patches - and since it worked fine for several days in casual use after
> a clean reboot I let it slide. However it just happened again out of
> the blue while deleting a snapshot. I don't know whether issuing the
> sync was a deciding factor; I was more interested in seeing reclaimed
> space immediately.
>
> I've now reverted Josef's patches and have been running with only yours
> for a good hour, creating/deleting multiple/nested/chained snapshots
> with lots of blocks changed between them, with and without issuing sync
> as hard as I can - nothing. Also verified that the cleaner still does
> its thing (even deleting the very last data chunk) - no problems.
>
> Let me know what you think and how/when I can help or test.
>

This is my bug, sorry about that, I'll send something out to fix it 
right away.  Thanks,

Josef


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [GIT PULL] Btrfs bug fixes
  2015-09-15 13:43     ` Holger Hoffstätte
  2015-09-15 13:53       ` Josef Bacik
@ 2015-09-15 14:07       ` Josef Bacik
  1 sibling, 0 replies; 9+ messages in thread
From: Josef Bacik @ 2015-09-15 14:07 UTC (permalink / raw)
  To: Holger Hoffstätte, Filipe Manana
  Cc: linux-btrfs@vger.kernel.org, Chris Mason, Filipe Manana

On 09/15/2015 09:43 AM, Holger Hoffstätte wrote:
> On 09/15/15 14:58, Filipe Manana wrote:
>> On Tue, Sep 15, 2015 at 12:49 PM, Holger Hoffstätte
>> <holger.hoffstaette@googlemail.com> wrote:
>>> Hello Filipe,
>>>
>>> your mail comes just in time as I was typing a mail about this patch:
>>>
>>> On 09/15/15 04:22, fdmanana@kernel.org wrote:
>>>>        Btrfs: remove unnecessary locking of cleaner_mutex to avoid deadlock
>>>
>>> While it might seem to fix this particular problem, it seems there is either a new one or a bad interaction with either both or one of these from Josef, which came through around the same time:
>>>
>>> Btrfs: use btrfs_get_fs_root in resolve_indirect_ref
>>> Btrfs: keep dropped roots in cache until transaciton commit
>>>
>>> With all three patches I have now seen something like this when deleting snapshots that contain enough data to also kick off the cleaner:
>>>
>>> [34851.791035] BTRFS info (device sdc1): leaf 865062912000 total ptrs 104 free space 1151
>>> (..lots of stuff that was supposed to be deleted..)
>>> [34851.791210] BTRFS critical (device sdc1): unable to update root key 1328 132 5341
>>> [34851.791795] ------------[ cut here ]------------
>>> [34851.792343] kernel BUG at fs/btrfs/root-tree.c:160!
>>> [34851.792889] invalid opcode: 0000 [#1] SMP
>>> [34851.793442] Modules linked in: nfsd auth_rpcgss oid_registry lockd grace sunrpc sch_fq_codel btrfs xor snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic x86_pkg_temp_thermal coretemp crc32_pclmul crc32c_intel aesni_intel i915 raid6_pq aes_x86_64 glue_helper snd_hda_intel lrw gf128mul snd_hda_controller ablk_helper cryptd snd_hda_codec intel_gtt snd_hda_core i2c_algo_bit drm_kms_helper snd_pcm i2c_i801 usbhid drm snd_timer r8169 snd soundcore i2c_core mii video
>>> [34851.795328] CPU: 0 PID: 8807 Comm: sync Not tainted 4.1.7 #1
>>> [34851.795942] Hardware name: System manufacturer System Product Name/P8Z68-V LX, BIOS 4105 07/01/2013
>>> [34851.796571] task: ffff88040cc91760 ti: ffff880352454000 task.ti: ffff880352454000
>>> [34851.797204] RIP: 0010:[<ffffffffa0435496>]  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>>> [34851.797869] RSP: 0018:ffff880352457cf8  EFLAGS: 00010292
>>> [34851.798521] RAX: 0000000000000045 RBX: ffff8802f0e59120 RCX: 0000000000000006
>>> [34851.799188] RDX: 0000000000000007 RSI: 0000000000000246 RDI: ffff88041f20d520
>>> [34851.799858] RBP: ffff880352457d58 R08: 0000000000000000 R09: 0000000000000410
>>> [34851.800534] R10: 00000000ffffffff R11: 0000000000000410 R12: 0000000000000001
>>> [34851.801208] R13: ffff8803fdf191df R14: ffff8803fdf19028 R15: ffff8800c96f4800
>>> [34851.801879] FS:  00007fa61e9bd700(0000) GS:ffff88041f200000(0000) knlGS:0000000000000000
>>> [34851.802566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>> [34851.803257] CR2: 00007fadc28e4000 CR3: 000000035247f000 CR4: 00000000000406f0
>>> [34851.803962] Stack:
>>> [34851.804661]  00003a6552457d88 00000000000000b0 ffff8803fdf191df ffff88030c3a8f18
>>> [34851.805389]  000000000000053a 0000000000000000 ffff880352457d58 ffff88030c3a8f18
>>> [34851.806107]  ffff88040cba6000 ffff88040cba6070 0000000000000002 ffff8803fdf19000
>>> [34851.806825] Call Trace:
>>> [34851.807536]  [<ffffffffa04bd900>] commit_fs_roots.isra.7+0x11b/0x14d [btrfs]
>>> [34851.808261]  [<ffffffffa0444325>] btrfs_commit_transaction+0x6b5/0xc10 [btrfs]
>>> [34851.808992]  [<ffffffff8107aec0>] ? wake_atomic_t_function+0x60/0x60
>>> [34851.809714]  [<ffffffffa04125b9>] btrfs_sync_fs+0x59/0x110 [btrfs]
>>> [34851.810438]  [<ffffffff811a3d70>] ? do_fsync+0x70/0x70
>>> [34851.811170]  [<ffffffff811a3d90>] sync_fs_one_sb+0x20/0x30
>>> [34851.811904]  [<ffffffff81176a56>] iterate_supers+0xb6/0x120
>>> [34851.812636]  [<ffffffff811a3eb5>] sys_sync+0x55/0x90
>>> [34851.813371]  [<ffffffff81573b97>] system_call_fastpath+0x12/0x6a
>>> [34851.814106] Code: 89 ff e8 de f9 ff ff 48 8b 45 b0 48 c7 c6 88 78 4c a0 49 8b bf f0 01 00 00 0f b6 48 08 4c 8b 40 09 48 8b 10 31 c0 e8 5a e8 fd ff <0f> 0b 49 8b 87 f0 01 00 00 f0 0f ba a8 78 0d 00 00 02 72 1d 48
>>> [34851.815868] RIP  [<ffffffffa0435496>] btrfs_update_root+0x1e6/0x330 [btrfs]
>>> [34851.816713]  RSP <ffff880352457cf8>
>>> [34851.817575] ---[ end trace cc3e428ae50170a3 ]---
>>>
>>> While this happened when issuing sync after a delayed snapshot delete, I've seen it also happen in the background. It's however not easily repeatable. Until now I've not been able to pinpoint whether the problem is only in the lock removal patch or an interaction with Josef's patches (or only there), but so far I have never seen this with only those two.
>>>
>>> Just thought you wanted to know - maybe you have an idea. Maybe there is a bad interaction (larger race window?) with keeping the snapshots pinned longer and the cleaner lock removal.
>>
>> Hi Holger,
>>
>> So at a first glance I don't think the patch that removes the locking
>> of cleaner mutex is the cause here. First it removed the locking in
>> the unmount and remount paths only (before attempts to delete unused
>> block groups), second the cleaner mutex is used only to serialize
>> snapshot removal and block group relocation/balance. Deleting unused
>> block groups is safe without the cleaner mutex and safe against a
>> transaction commit too, since before deleting any block group it will
>> start/join the transaction and block until the current transaction
>> ends its commit (if it's currently committing, such as in your trace).
>
> OK, thanks for the explanation.
>
>> I haven't tried Josef's patches, but I'll leave them running here and
>> see if I can reproduce that.
>
> Thanks.
>
>> When you get such error, are you doing an unmount or remount? If not,
>
> no, just snapshot delete & manual sync.
>
>> I really don't see how the patch introduces any problems, either with
>> or without Josef's changes.
>
> The interesting thing is that I ran with only your patch for a while
> without problems, but saw this error after merging Josef's patches
> and trying a quick snapshot delete *right away* (I remember that I also
> issued a sync then, so..). I chalked it up to stale module info - despite
> clean unmounting/rmmod/depmod/reload, which usually works fine for simple
> patches - and since it worked fine for several days in casual use after
> a clean reboot I let it slide. However it just happened again out of
> the blue while deleting a snapshot. I don't know whether issuing the
> sync was a deciding factor; I was more interested in seeing reclaimed
> space immediately.
>
> I've now reverted Josef's patches and have been running with only yours
> for a good hour, creating/deleting multiple/nested/chained snapshots
> with lots of blocks changed between them, with and without issuing sync
> as hard as I can - nothing. Also verified that the cleaner still does
> its thing (even deleting the very last data chunk) - no problems.
>
> Let me know what you think and how/when I can help or test.
>

Please give V2 a whirl.  Thanks,

Josef


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2015-09-15 14:07 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-06-30 22:20 [GIT PULL] Btrfs bug fixes fdmanana
2015-07-01 17:24 ` Chris Mason
2015-07-01 21:00   ` Ed Tomlinson
  -- strict thread matches above, loose matches on Subject: below --
2015-09-15  2:22 fdmanana
2015-09-15 11:49 ` Holger Hoffstätte
2015-09-15 12:58   ` Filipe Manana
2015-09-15 13:43     ` Holger Hoffstätte
2015-09-15 13:53       ` Josef Bacik
2015-09-15 14:07       ` Josef Bacik

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).