From: bugzilla-daemon@bugzilla.kernel.org
To: linux-xfs@vger.kernel.org
Subject: [Bug 214767] xfs seems to hang due to race condition? maybe related to (gratuitous) thaw.
Date: Fri, 29 Oct 2021 02:21:22 +0000
Message-ID: <bug-214767-201763-2S5R4pWO3a@https.bugzilla.kernel.org/>
In-Reply-To: <bug-214767-201763@https.bugzilla.kernel.org/>
https://bugzilla.kernel.org/show_bug.cgi?id=214767
Pedram Fard (pedram.fard@appian.com) changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |pedram.fard@appian.com
--- Comment #13 from Pedram Fard (pedram.fard@appian.com) ---
We have run into a similar issue. We use xfs_freeze to take backup snapshots of
our filesystems, and occasionally the xfs_freeze/thaw processes turn into
zombies during the backup.
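For readers unfamiliar with the pattern, a freeze/snapshot/thaw cycle can be sketched as below. This is a minimal illustration, not the reporter's backup script: the function name `snapshot_while_frozen`, the `take_snapshot` callback and the injectable `run` parameter are all hypothetical. Only the `xfs_freeze -f` (freeze) and `xfs_freeze -u` (unfreeze) invocations come from the real tool. The sketch only guarantees that a thaw is attempted after a successful freeze; it does not work around the hang reported in this bug.

```python
import subprocess
from typing import Callable, Sequence

# Hypothetical type for the command runner, injectable so the sketch can be
# exercised without root privileges or a real XFS mount.
Runner = Callable[[Sequence[str]], None]


def _run(cmd: Sequence[str]) -> None:
    # check=True raises CalledProcessError on a non-zero exit status.
    subprocess.run(list(cmd), check=True)


def snapshot_while_frozen(
    mountpoint: str,
    take_snapshot: Callable[[], None],
    run: Runner = _run,
) -> None:
    """Freeze `mountpoint`, call `take_snapshot`, then always try to thaw."""
    run(["xfs_freeze", "-f", mountpoint])  # suspend writes to the filesystem
    try:
        take_snapshot()  # assumption: the snapshot itself is taken elsewhere
    finally:
        # Attempt the thaw even if the snapshot step raised, so a failed
        # backup does not leave the filesystem frozen.
        run(["xfs_freeze", "-u", mountpoint])
```

The `try`/`finally` is the point of the sketch: a filesystem left frozen blocks every later writer in uninterruptible sleep.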
We used echo w > /proc/sysrq-trigger to dump the stack traces of the blocked
tasks:
[736826.666940] task:kswapd0 state:D stack: 0 pid: 576 ppid: 2
flags:0x00004000
[736826.674439] Call Trace:
[736826.677686] __schedule+0x1f9/0x660
[736826.681593] schedule+0x46/0xb0
[736826.685141] percpu_rwsem_wait+0xa9/0x180
[736826.688867] ? percpu_free_rwsem+0x30/0x30
[736826.692764] __percpu_down_read+0x49/0x60
[736826.696549] xfs_trans_alloc+0x15d/0x170
[736826.700367] xfs_free_eofblocks+0x130/0x1e0
[736826.807546] xfs_fs_destroy_inode+0xa8/0x1b0
[736826.811610] destroy_inode+0x3b/0x70
[736826.815150] dispose_list+0x48/0x60
[736826.818796] prune_icache_sb+0x54/0x80
[736826.822476] super_cache_scan+0x161/0x1e0
[736826.826341] do_shrink_slab+0x145/0x240
[736826.830294] shrink_slab_memcg+0xcd/0x1e0
[736826.834371] shrink_node_memcgs+0x186/0x1c0
[736826.838491] shrink_node+0x14f/0x570
[736826.842088] balance_pgdat+0x232/0x510
[736826.845820] kswapd+0xe2/0x170
[736826.849154] ? balance_pgdat+0x510/0x510
[736826.852944] kthread+0x11b/0x140
[736826.856326] ? __kthread_bind_mask+0x60/0x60
[736826.860313] ret_from_fork+0x22/0x30
[736826.863839] task:systemd-logind state:D stack: 0 pid: 2073 ppid: 1
flags:0x00000084
[736826.871453] Call Trace:
[736826.874898] __schedule+0x1f9/0x660
[736826.878898] ? kvm_sched_clock_read+0xd/0x20
[736826.883265] schedule+0x46/0xb0
[736826.887003] rwsem_down_write_slowpath+0x234/0x4b0
[736826.891114] ? __kmalloc+0x154/0x2b0
[736826.894794] prealloc_shrinker+0x4c/0xf0
[736826.898526] alloc_super+0x29a/0x2f0
[736826.902210] ? set_anon_super+0x40/0x40
[736826.905867] sget_fc+0x6c/0x240
[736826.909206] ? shmem_create+0x20/0x20
[736826.912912] get_tree_nodev+0x23/0xb0
[736826.916533] ? bpf_lsm_capset+0x10/0x10
[736826.920310] vfs_get_tree+0x25/0xb0
[736826.923848] do_new_mount+0x152/0x1b0
[736826.927548] __x64_sys_mount+0x103/0x140
[736826.931531] do_syscall_64+0x33/0x40
[736826.935597] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736826.940323] RIP: 0033:0x7f49f8efcdba
[736826.944298] RSP: 002b:00007ffdb98f2658 EFLAGS: 00000202 ORIG_RAX:
00000000000000a5
[736826.952238] RAX: ffffffffffffffda RBX: 000055f21a7113a0 RCX:
00007f49f8efcdba
[736826.959940] RDX: 000055f218672af0 RSI: 000055f21a721de0 RDI:
000055f218672af0
[736826.967721] RBP: 00007ffdb98f2690 R08: 000055f21a7a51e0 R09:
0000000000000000
[736826.975503] R10: 0000000000000006 R11: 0000000000000202 R12:
0000000000000000
[736826.983253] R13: 00007ffdb98f2740 R14: 00007ffdb98f2820 R15:
0000000000000000
[736826.990979] task:dockerd state:D stack: 0 pid: 6639 ppid: 1
flags:0x00000080
[736826.999289] Call Trace:
[736827.002695] __schedule+0x1f9/0x660
[736827.006708] schedule+0x46/0xb0
[736827.010512] rwsem_down_write_slowpath+0x234/0x4b0
[736827.015214] ? __kmalloc+0x154/0x2b0
[736827.019291] prealloc_shrinker+0x4c/0xf0
[736827.023495] alloc_super+0x29a/0x2f0
[736827.027505] sget+0x91/0x220
[736827.031089] ? destroy_super_rcu+0x40/0x40
[736827.035268] ? ovl_get_lowerstack+0x190/0x190 [overlay]
[736827.040172] mount_nodev+0x26/0x90
[736827.044070] legacy_get_tree+0x27/0x40
[736827.048169] vfs_get_tree+0x25/0xb0
[736827.052088] do_new_mount+0x152/0x1b0
[736827.056076] __x64_sys_mount+0x103/0x140
[736827.060176] do_syscall_64+0x33/0x40
[736827.064158] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736827.068950] RIP: 0033:0x55c5416cc2aa
[736827.072915] RSP: 002b:000000c0015b2768 EFLAGS: 00000216 ORIG_RAX:
00000000000000a5
[736827.080676] RAX: ffffffffffffffda RBX: 000000c00005e800 RCX:
000055c5416cc2aa
[736827.088375] RDX: 000000c000df5320 RSI: 000000c0012db140 RDI:
000000c000df52e8
[736827.095311] RBP: 000000c0015b2800 R08: 000000c0015a0600 R09:
0000000000000000
[736827.102300] R10: 0000000000000000 R11: 0000000000000216 R12:
0000000000000133
[736827.109100] R13: 0000000000000132 R14: 0000000000000200 R15:
0000000000000055
[737030.525416] task:sh state:D stack: 0 pid:27526 ppid: 27517
flags:0x00000080
[737030.532905] Call Trace:
[737030.536188] __schedule+0x1f9/0x660
[737030.540060] schedule+0x46/0xb0
[737030.543385] percpu_rwsem_wait+0xa9/0x180
[737030.547220] ? percpu_free_rwsem+0x30/0x30
[737030.551093] __percpu_down_read+0x49/0x60
[737030.555220] mnt_want_write+0x66/0x90
[737030.558966] open_last_lookups+0x30d/0x3f0
[737030.562914] ? path_init+0x2bd/0x3e0
[737030.681735] path_openat+0x88/0x1d0
[737030.685600] ? xfs_iunlock+0x94/0xe0
[737030.689575] do_filp_open+0x88/0x130
[737030.693496] ? __check_object_size.part.0+0x11f/0x140
[737030.698201] do_sys_openat2+0x97/0x150
[737030.702232] __x64_sys_openat+0x54/0x90
[737030.706301] do_syscall_64+0x33/0x40
[737030.710248] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[737030.714945] RIP: 0033:0x7f4502c0d14e
[737030.718849] RSP: 002b:00007ffd066be7a0 EFLAGS: 00000246 ORIG_RAX:
0000000000000101
[737030.726628] RAX: ffffffffffffffda RBX: 00000000009ba360 RCX:
00007f4502c0d14e
[737030.734177] RDX: 0000000000000241 RSI: 00000000009bb8c0 RDI:
ffffffffffffff9c
[737030.741759] RBP: 00007ffd066be8a0 R08: 0000000000000020 R09:
00000000009bb8c0
[737030.749452] R10: 00000000000001b6 R11: 0000000000000246 R12:
0000000000000000
[737030.757142] R13: 0000000000000001 R14: 0000000000000001 R15:
00000000009bad60
[737030.764836] task:bash state:D stack: 0 pid:27580 ppid: 1
flags:0x00000084
[737030.773068] Call Trace:
[737030.776437] __schedule+0x1f9/0x660
[737030.780354] schedule+0x46/0xb0
[737030.784080] percpu_rwsem_wait+0xa9/0x180
[737030.788264] ? percpu_free_rwsem+0x30/0x30
[737030.792476] __percpu_down_read+0x49/0x60
[737030.796642] mnt_want_write+0x66/0x90
[737030.800660] open_last_lookups+0x30d/0x3f0
[737030.804923] ? path_init+0x2bd/0x3e0
[737030.808895] path_openat+0x88/0x1d0
[737030.812800] ? xfs_iunlock+0x94/0xe0
[737030.816745] do_filp_open+0x88/0x130
[737030.820694] ? getname_flags.part.0+0x29/0x1a0
[737030.825097] ? __check_object_size.part.0+0x11f/0x140
[737030.829846] ? audit_alloc_name+0x8c/0xe0
[737030.834010] do_sys_openat2+0x97/0x150
[737030.838056] __x64_sys_openat+0x54/0x90
[737030.842164] do_syscall_64+0x33/0x40
[737030.846146] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[737030.850855] RIP: 0033:0x7f2072a9414e
[737030.854822] RSP: 002b:00007ffd855832e0 EFLAGS: 00000246 ORIG_RAX:
0000000000000101
[737030.862655] RAX: ffffffffffffffda RBX: 000000000178aab0 RCX:
00007f2072a9414e
[737030.870259] RDX: 0000000000000241 RSI: 000000000178af20 RDI:
ffffffffffffff9c
[737030.877820] RBP: 00007ffd855833e0 R08: 0000000000000020 R09:
000000000178af20
[737030.885463] R10: 00000000000001b6 R11: 0000000000000246 R12:
0000000000000000
[737030.893083] R13: 0000000000000003 R14: 0000000000000001 R15:
000000000178aa60
[736992.593491] task:k state:D stack: 0 pid:11166 ppid: 29014
flags:0x00000184
[736992.600911] Call Trace:
[736992.604234] __schedule+0x1f9/0x660
[736992.607842] schedule+0x46/0xb0
[736992.611295] percpu_rwsem_wait+0xa9/0x180
[736992.614999] ? percpu_free_rwsem+0x30/0x30
[736992.618901] __percpu_down_read+0x49/0x60
[736992.622614] mnt_want_write+0x66/0x90
[736992.626323] open_last_lookups+0x30d/0x3f0
[736992.630064] ? path_init+0x2bd/0x3e0
[736992.633795] path_openat+0x88/0x1d0
[736992.637360] do_filp_open+0x88/0x130
[736992.640911] ? getname_flags.part.0+0x29/0x1a0
[736992.645006] ? __check_object_size.part.0+0x11f/0x140
[736992.649268] ? audit_alloc_name+0x8c/0xe0
[736992.653173] do_sys_openat2+0x97/0x150
[736992.656832] __x64_sys_openat+0x54/0x90
[736992.660619] do_syscall_64+0x33/0x40
[736992.664181] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736992.668543] RIP: 0033:0x7f81dc2d11ae
[736992.672208] RSP: 002b:00007ffcff615c00 EFLAGS: 00000246 ORIG_RAX:
0000000000000101
[736992.679509] RAX: ffffffffffffffda RBX: 0000000000000002 RCX:
00007f81dc2d11ae
[736992.686481] RDX: 0000000000000042 RSI: 00007f81d757e6b8 RDI:
00000000ffffff9c
[736992.693723] RBP: 00007f81d7ba14c4 R08: 000000000000003a R09:
0000000000000000
[736992.700684] R10: 00000000000001b6 R11: 0000000000000246 R12:
00007f81d757e6b8
[736992.708136] R13: 00007f81d7ba14c4 R14: 00007f81dbfb45b8 R15:
00007f81d7f3f8fc
[736992.715824] task:k state:D stack: 0 pid:11273 ppid: 29014
flags:0x00000184
[736992.724256] Call Trace:
[736992.727698] __schedule+0x1f9/0x660
[736992.731616] schedule+0x46/0xb0
[736992.735367] percpu_rwsem_wait+0xa9/0x180
[736992.739512] ? percpu_free_rwsem+0x30/0x30
[736992.743681] __percpu_down_read+0x49/0x60
[736992.747837] mnt_want_write+0x66/0x90
[736992.751896] open_last_lookups+0x30d/0x3f0
[736992.756191] ? path_init+0x2bd/0x3e0
[736992.760175] path_openat+0x88/0x1d0
[736992.764055] do_filp_open+0x88/0x130
[736992.768115] ? getname_flags.part.0+0x29/0x1a0
[736992.772563] ? __check_object_size.part.0+0x11f/0x140
[736992.777391] ? audit_alloc_name+0x8c/0xe0
[736992.781658] do_sys_openat2+0x97/0x150
[736992.785794] __x64_sys_openat+0x54/0x90
[736992.789856] do_syscall_64+0x33/0x40
[736992.793913] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736992.798746] RIP: 0033:0x7fb7d337f1ae
[736992.802878] RSP: 002b:00007ffd9b2dbda0 EFLAGS: 00000246 ORIG_RAX:
0000000000000101
[736992.810824] RAX: ffffffffffffffda RBX: 0000000000000002 RCX:
00007fb7d337f1ae
[736992.818496] RDX: 0000000000000042 RSI: 00007fb7d2e2aa88 RDI:
00000000ffffff9c
[736992.826359] RBP: 00007fb7cecdc7dc R08: 0000000000000008 R09:
0000000000000000
[736992.833753] R10: 00000000000001b6 R11: 0000000000000246 R12:
00007fb7d2e2aa88
[736992.840727] R13: 00007fb7d2e2aa7c R14: 00007fb7d3150920 R15:
0000000000000004
[736992.847580] task:tail state:D stack: 0 pid:11608 ppid: 9603
flags:0x00004186
[736992.855146] Call Trace:
[736992.858238] __schedule+0x1f9/0x660
[736992.861943] schedule+0x46/0xb0
[736992.865387] percpu_rwsem_wait+0xa9/0x180
[736992.869138] ? percpu_free_rwsem+0x30/0x30
[736992.873134] __percpu_down_read+0x49/0x60
[736992.876948] xfs_trans_alloc+0x15d/0x170
[736992.880854] xfs_free_eofblocks+0x130/0x1e0
[736992.885085] xfs_release+0x13d/0x160
[736992.888807] __fput+0x96/0x240
[736992.892138] task_work_run+0x5f/0x90
[736992.895834] do_exit+0x22c/0x3b0
[736992.899189] ? timerqueue_del+0x1e/0x40
[736992.902967] do_group_exit+0x33/0xa0
[736992.906469] get_signal+0x15d/0x5b0
[736993.013638] arch_do_signal+0x25/0xf0
[736993.017223] exit_to_user_mode_loop+0x8d/0xc0
[736993.021148] exit_to_user_mode_prepare+0x6a/0x70
[736993.025371] syscall_exit_to_user_mode+0x22/0x140
[736993.029417] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736993.033852] RIP: 0033:0x7f907725e6f4
[736993.037391] RSP: 002b:00007ffcf7722f38 EFLAGS: 00000246 ORIG_RAX:
0000000000000023
[736993.044403] RAX: fffffffffffffdfc RBX: 0000000000000001 RCX:
00007f907725e6f4
[736993.051384] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
00007ffcf7722f40
[736993.058190] RBP: 0000000000000000 R08: 00007ffcf7723070 R09:
00007f9077312e80
[736993.065109] R10: 0000000000000000 R11: 0000000000000246 R12:
00007ffcf7722f40
[736993.071934] R13: 00007ffcf77256bf R14: 0000560396137440 R15:
0000000000000000
[736996.659414] task:process-exporte state:D stack: 0 pid:30396 ppid: 2204
flags:0x00004082
[736996.666720] Call Trace:
[736996.669864] __schedule+0x1f9/0x660
[736996.673371] schedule+0x46/0xb0
[736996.676730] rwsem_down_write_slowpath+0x234/0x4b0
[736996.680991] unregister_memcg_shrinker.isra.0+0x18/0x40
[736996.685377] unregister_shrinker+0x7b/0x80
[736996.689295] deactivate_locked_super+0x29/0xa0
[736996.693241] cleanup_mnt+0x12d/0x190
[736996.696912] task_work_run+0x5f/0x90
[736996.700444] do_exit+0x22c/0x3b0
[736996.703996] do_group_exit+0x33/0xa0
[736996.707581] get_signal+0x15d/0x5b0
[736996.711191] arch_do_signal+0x25/0xf0
[736996.714771] exit_to_user_mode_loop+0x8d/0xc0
[736996.718759] exit_to_user_mode_prepare+0x6a/0x70
[736996.722740] syscall_exit_to_user_mode+0x22/0x140
[736996.726869] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[736996.835174] RIP: 0033:0x4a2c50
[736996.838687] RSP: 002b:000000c000213798 EFLAGS: 00000216 ORIG_RAX:
0000000000000000
[736996.845651] RAX: 0000000000000200 RBX: 000000c000032500 RCX:
00000000004a2c50
[736996.852636] RDX: 0000000000000200 RSI: 000000c0008ef600 RDI:
0000000000000005
[736996.859446] RBP: 000000c0002137e8 R08: 0000000000000000 R09:
0000000000000000
[736996.866347] R10: 0000000000000000 R11: 0000000000000216 R12:
ffffffffffffffff
[736996.873124] R13: 000000000000000c R14: 000000000000000b R15:
0000000000000010
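A dump of this length is easier to read once each blocked task is reduced to the function that tried to take the lock. The following is a rough sketch of such a summary, assuming the log format shown above. The name `blocked_callers` and the contents of `WAIT_FRAMES` are illustrative choices, not part of any kernel tool. Frames marked with `?` are skipped because the unwinder flags them as unreliable.

```python
import re

# "task:NAME state:D stack: 0 pid: 576 ppid: 2" header lines.
TASK_RE = re.compile(r"task:(\S+)\s+state:(\S+)\s+stack:\s*\d+\s+pid:\s*(\d+)")
# "[736826.677686] symbol+0xOFF/0xLEN" frames, optionally prefixed with "? ".
FRAME_RE = re.compile(
    r"^\[\s*\d+\.\d+\]\s+(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
)
# Scheduler and lock-wait plumbing seen in this dump (illustrative, not exhaustive).
WAIT_FRAMES = {
    "__schedule",
    "schedule",
    "percpu_rwsem_wait",
    "__percpu_down_read",
    "rwsem_down_write_slowpath",
}


def blocked_callers(log: str) -> list:
    """Return one dict per task with the first reliable non-wait frame."""
    results = []
    current = None
    for line in log.splitlines():
        task = TASK_RE.search(line)
        if task:
            current = {"task": task.group(1), "pid": int(task.group(3)), "caller": None}
            results.append(current)
            continue
        frame = FRAME_RE.match(line.strip())
        # Ignore non-frame lines (registers, wrapped flags) and "?" frames.
        if current is None or frame is None or frame.group(1):
            continue
        if current["caller"] is None and frame.group(2) not in WAIT_FRAMES:
            current["caller"] = frame.group(2)
    return results
```

Run over the traces above, this should show that the tasks fall into two groups: those waiting on a per-CPU rwsem read lock (via xfs_trans_alloc or mnt_want_write) and those waiting on an rwsem write lock in the shrinker code (via prealloc_shrinker or unregister_memcg_shrinker).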