linux-xfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* xfs blocks (blocked for more than 120 seconds)
@ 2014-03-24  9:24 Michel Verbraak(1st-Setup)
  2014-03-24 13:27 ` Mark Tinguely
  2014-03-24 17:36 ` Dave Chinner
  0 siblings, 2 replies; 7+ messages in thread
From: Michel Verbraak(1st-Setup) @ 2014-03-24  9:24 UTC (permalink / raw)
  To: xfs


[-- Attachment #1.1: Type: text/plain, Size: 75778 bytes --]

Hi,

We have a problem with one of our systems which is using XFS but we are
unable to find the problem. Recently we had two moments, Tuesday 4th of
March and Friday the 21st of March, where we had to reboot the system to
get it up and running again.

What happens:
- The programs handling files on the XFS disc stop working when
creating, deleting or writing files. They do not error they are just
waiting on the command to complete.
- One of our programs, a java application, goes into very high cpu usage
(50%) which normally is at 1%. This could be something in our java
application but it happens at the moment handling files gets stuck.
- A nice restart of the programs does not succeed as wel a kill -9 does
not work.
- Trying to reboot the servers in a normal fashion does not work. As it
is a virtual machine we have to do a shutdown (unplug power) and start
it up again to get it up and running.

Following details I have for you:

System OS: Ubuntu 12.04 LTS
Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
x86_64 x86_64 x86_64 GNU/Linux
Server: Virtual machine in a VMWare setup.
Disc: 300GB direct attached LUN

We have an exact clone of this system for our acceptance environment. In
this environment we are unable to reproduce this problem/situation.

Differences between the two days is that our services on 2014-03-21 were
quit busy with a lot of file changes on the xfs disc and on 2014-03-04
the system was very quiet on the moment the kernel traces appear and the
services get stuck.

Any help is appreciated.

Regards Michel Verbraak.

Following we see in the syslog on both moments (2014-03-04 and 2014-03-21):

Mar 21 06:32:20 ealxs00169 kernel: [1412280.930340] INFO: task
flush-8:16:13864 blocked for more than 120 seconds.
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930462] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930543] flush-8:16      D
0000000000000000     0 13864      2 0x00000000
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930549]  ffff88000676f880
0000000000000046 0000000001000000 2500000000000000
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930557]  ffff88000676ffd8
ffff88000676ffd8 ffff88000676ffd8 00000000000137c0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930560]  ffff880002218000
ffff88001b5d4500 ffff88000676f850 ffff88001fc14080
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930564] Call Trace:
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930584] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930602] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930605] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930608] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930617] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930620] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930626] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930634] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930638] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930641] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930644] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930701] 
[<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930708] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930719] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930722] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930725] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930728] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930739] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930744] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930747] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930751] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930757] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930762] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930765] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930768] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930775] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930778] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar 21 06:32:20 ealxs00169 kernel: [1412280.930781] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891012] INFO: task
archiver.pl:4005 blocked for more than 120 seconds.
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891118] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891181] archiver.pl     D
0000000000000000     0  4005   4004 0x00000000
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891186]  ffff88001bb41718
0000000000000086 ffffffff8142bff0 0000000000000002
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891189]  ffff88001bb41fd8
ffff88001bb41fd8 ffff88001bb41fd8 00000000000137c0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891192]  ffff88001bad8000
ffff88000602dc00 ffff88001bb41798 7fffffffffffffff
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891195] Call Trace:
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891223] 
[<ffffffff8142bff0>] ? scsi_mode_select+0x200/0x200
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891227] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891229] 
[<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891234] 
[<ffffffff8165c5f0>] __down_common+0xa5/0xf5
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891291] 
[<ffffffffa007c295>] ? _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891294] 
[<ffffffff8165c6b3>] __down+0x1d/0x1f
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891301] 
[<ffffffff810912e1>] down+0x41/0x50
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891314] 
[<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891324] 
[<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891335] 
[<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891345] 
[<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891373] 
[<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891386] 
[<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891399] 
[<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891411] 
[<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891418] 
[<ffffffff8116371b>] ? kfree+0x3b/0x140
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891421] 
[<ffffffff8116355f>] ? kmem_cache_free+0x2f/0x110
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891432] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891447] 
[<ffffffffa00cb2f8>] ? xfs_perag_get+0x38/0xd0 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891460] 
[<ffffffffa00967f1>] ? xfs_free_extent+0x91/0x120 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891473] 
[<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891486] 
[<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891501] 
[<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891516] 
[<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891529] 
[<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891532] 
[<ffffffff8165d545>] ? _raw_spin_lock_irq+0x15/0x20
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891543] 
[<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891547] 
[<ffffffff81193e98>] evict+0xb8/0x1c0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891550] 
[<ffffffff81194088>] iput_final+0xe8/0x210
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891552] 
[<ffffffff811941ee>] iput+0x3e/0x50
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891555] 
[<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891561] 
[<ffffffff8117974d>] ? vfs_read+0x10d/0x180
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891564] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891568] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891571] INFO: task
flush-8:16:13864 blocked for more than 120 seconds.
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891626] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891686] flush-8:16      D
0000000000000000     0 13864      2 0x00000000
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891689]  ffff88000676f880
0000000000000046 0000000001000000 2500000000000000
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891692]  ffff88000676ffd8
ffff88000676ffd8 ffff88000676ffd8 00000000000137c0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891694]  ffff880002218000
ffff88001b5d4500 ffff88000676f850 ffff88001fc14080
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891697] Call Trace:
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891702] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891704] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891710] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891712] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891714] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891717] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891720] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891725] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891727] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891730] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891732] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891742] 
[<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891745] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891749] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891751] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891754] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891756] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891760] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891763] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891765] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891768] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891772] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891774] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891777] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891779] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891782] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891784] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar 21 06:34:20 ealxs00169 kernel: [1412400.891787] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861122] INFO: task
java:14629 blocked for more than 120 seconds.
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861204] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861471] java            D
0000000000000000     0 14629   1788 0x00000000
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861476]  ffff8800028e9328
0000000000000086 0000000000000100 0000000000000100
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861481]  ffff8800028e9fd8
ffff8800028e9fd8 ffff8800028e9fd8 00000000000137c0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861483]  ffff88001a9a4500
ffff88001b5e8000 ffff8800028e9308 7fffffffffffffff
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861487] Call Trace:
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861500] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861502] 
[<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861514] 
[<ffffffff8101aef2>] ? nommu_map_sg+0x72/0xe0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861522] 
[<ffffffff8142bff0>] ? scsi_mode_select+0x200/0x200
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861526] 
[<ffffffff8165c5f0>] __down_common+0xa5/0xf5
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861581] 
[<ffffffffa007c295>] ? _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861583] 
[<ffffffff8165c6b3>] __down+0x1d/0x1f
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861588] 
[<ffffffff810912e1>] down+0x41/0x50
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861598] 
[<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861608] 
[<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861618] 
[<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861626] 
[<ffffffff8105702d>] ? set_next_entity+0xad/0xd0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861637] 
[<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861656] 
[<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861670] 
[<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861684] 
[<ffffffffa00a0c93>] ? xfs_bmap_add_extent_hole_delay+0xd3/0x6b0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861697] 
[<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861711] 
[<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861725] 
[<ffffffffa00aa368>] ? xfs_bmbt_get_all+0x18/0x20 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861730] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861732] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861748] 
[<ffffffffa00cb2f8>] ? xfs_perag_get+0x38/0xd0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861762] 
[<ffffffffa009fe23>] ? xfs_bmap_btalloc_nullfb+0x193/0x380 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861776] 
[<ffffffffa0096b6c>] ? xfs_alloc_vextent+0x2ec/0x690 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861792] 
[<ffffffffa0096a4b>] xfs_alloc_vextent+0x1cb/0x690 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861806] 
[<ffffffffa00a02c1>] xfs_bmap_btalloc+0x2b1/0x770 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861821] 
[<ffffffffa00c21dc>] ? xfs_iext_bno_to_ext+0x8c/0x160 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861836] 
[<ffffffffa00a07a4>] xfs_bmap_alloc+0x24/0x40 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861849] 
[<ffffffffa00a594e>] xfs_bmapi_allocate+0xce/0x2d0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861864] 
[<ffffffffa00a85c3>] xfs_bmapi_write+0x4c3/0x720 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861876] 
[<ffffffffa0086988>] xfs_iomap_write_allocate+0x178/0x330 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861886] 
[<ffffffffa00799c3>] xfs_map_blocks+0x183/0x250 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861895] 
[<ffffffffa007aa56>] xfs_vm_writepage+0x196/0x510 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861901] 
[<ffffffff81122217>] __writepage+0x17/0x40
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861903] 
[<ffffffff81122a0d>] write_cache_pages+0x20d/0x460
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861922] 
[<ffffffff8152ba7d>] ? sock_recvmsg+0x11d/0x140
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861925] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861935] 
[<ffffffffa00804bc>] ? xfs_file_aio_write+0x25c/0x2a0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861938] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861948] 
[<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861951] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861955] 
[<ffffffff81119f2b>] __filemap_fdatawrite_range+0x5b/0x60
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861958] 
[<ffffffff81119f82>] filemap_write_and_wait_range+0x52/0x80
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861968] 
[<ffffffffa007f05d>] xfs_file_fsync+0x5d/0x300 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861972] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861977] 
[<ffffffff811a7eb6>] do_fsync+0x56/0x80
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861979] 
[<ffffffff811a81e0>] sys_fsync+0x10/0x20
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861983] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:36:20 ealxs00169 kernel: [1412520.861989] INFO: task
archiver.pl:4005 blocked for more than 120 seconds.
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862047] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862112] archiver.pl     D
0000000000000000     0  4005   4004 0x00000000
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862115]  ffff88001bb41718
0000000000000086 ffffffff8142bff0 0000000000000002
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862118]  ffff88001bb41fd8
ffff88001bb41fd8 ffff88001bb41fd8 00000000000137c0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862121]  ffff88001bad8000
ffff88000602dc00 ffff88001bb41798 7fffffffffffffff
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862123] Call Trace:
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862126] 
[<ffffffff8142bff0>] ? scsi_mode_select+0x200/0x200
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862133] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862135] 
[<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862138] 
[<ffffffff8165c5f0>] __down_common+0xa5/0xf5
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862149] 
[<ffffffffa007c295>] ? _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862151] 
[<ffffffff8165c6b3>] __down+0x1d/0x1f
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862154] 
[<ffffffff810912e1>] down+0x41/0x50
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862164] 
[<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862173] 
[<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862183] 
[<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862193] 
[<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862210] 
[<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862223] 
[<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862236] 
[<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862250] 
[<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862255] 
[<ffffffff8116371b>] ? kfree+0x3b/0x140
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862258] 
[<ffffffff8116355f>] ? kmem_cache_free+0x2f/0x110
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862260] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862277] 
[<ffffffffa00cb2f8>] ? xfs_perag_get+0x38/0xd0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862290] 
[<ffffffffa00967f1>] ? xfs_free_extent+0x91/0x120 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862303] 
[<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862317] 
[<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862333] 
[<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862349] 
[<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862365] 
[<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862368] 
[<ffffffff8165d545>] ? _raw_spin_lock_irq+0x15/0x20
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862380] 
[<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862385] 
[<ffffffff81193e98>] evict+0xb8/0x1c0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862387] 
[<ffffffff81194088>] iput_final+0xe8/0x210
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862389] 
[<ffffffff811941ee>] iput+0x3e/0x50
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862392] 
[<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862396] 
[<ffffffff8117974d>] ? vfs_read+0x10d/0x180
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862399] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:36:20 ealxs00169 kernel: [1412520.862401] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:38:20 ealxs00169 kernel: [1412640.820901] INFO: task java:1804
blocked for more than 120 seconds.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.820972] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821037] java            D
0000000000000000     0  1804   1788 0x00000000
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821042]  ffff88001b4b3de8
0000000000000086 ffff88001b4b3da8 ffffffffa008d944
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821045]  ffff88001b4b3fd8
ffff88001b4b3fd8 ffff88001b4b3fd8 00000000000137c0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821048]  ffff88001a749700
ffff88001c61ae00 0000000000000000 ffff8800002f38a0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821051] Call Trace:
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821116] 
[<ffffffffa008d944>] ? xfs_inode_set_reclaim_tag+0x84/0xa0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821122] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821126] 
[<ffffffff8165c157>] __mutex_lock_slowpath+0xd7/0x150
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821131] 
[<ffffffff81188731>] ? do_path_lookup+0x31/0xc0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821134] 
[<ffffffff8165bd6a>] mutex_lock+0x2a/0x50
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821136] 
[<ffffffff81188b1e>] do_unlinkat+0x8e/0x1d0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821140] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821145] 
[<ffffffff81015721>] ? math_state_restore+0x51/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821148] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821152] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821154] INFO: task java:1805
blocked for more than 120 seconds.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821207] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821301] java            D
0000000000000000     0  1805   1788 0x00000000
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821309]  ffff88001f4a17e8
0000000000000086 0000000000000000 ffff88001f5f1410
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821312]  ffff88001f4a1fd8
ffff88001f4a1fd8 ffff88001f4a1fd8 00000000000137c0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821314]  ffff88001bad8000
ffff88001c61dc00 000000000000003a 7fffffffffffffff
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821317] Call Trace:
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821320] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821322] 
[<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821327] 
[<ffffffff81165a1d>] ? kmem_cache_alloc+0x11d/0x140
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821342] 
[<ffffffffa00927b7>] ? kmem_zone_alloc+0x67/0xe0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821345] 
[<ffffffff8165c5f0>] __down_common+0xa5/0xf5
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821355] 
[<ffffffffa007c295>] ? _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821358] 
[<ffffffff8165c6b3>] __down+0x1d/0x1f
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821362] 
[<ffffffff810912e1>] down+0x41/0x50
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821372] 
[<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821382] 
[<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821392] 
[<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821402] 
[<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821420] 
[<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821434] 
[<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821446] 
[<ffffffffa00925e7>] ? kmem_alloc+0x67/0xe0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821459] 
[<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821473] 
[<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821483] 
[<ffffffffa007c42d>] ? xfs_buf_unlock+0x2d/0xa0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821487] 
[<ffffffff8116355f>] ? kmem_cache_free+0x2f/0x110
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821491] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821508] 
[<ffffffffa00cb2f8>] ? xfs_perag_get+0x38/0xd0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821521] 
[<ffffffffa00967f1>] ? xfs_free_extent+0x91/0x120 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821539] 
[<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821553] 
[<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821566] 
[<ffffffffa00914c1>] xfs_remove+0x351/0x400 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821569] 
[<ffffffff8165d25e>] ? _raw_spin_lock+0xe/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821580] 
[<ffffffffa0087378>] xfs_vn_unlink+0x48/0x90 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821583] 
[<ffffffff81186520>] vfs_unlink.part.26+0x80/0xf0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821585] 
[<ffffffff811865cc>] vfs_unlink+0x3c/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821587] 
[<ffffffff81188c3a>] do_unlinkat+0x1aa/0x1d0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821590] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821593] 
[<ffffffff81015721>] ? math_state_restore+0x51/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821595] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821598] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821600] INFO: task java:1806
blocked for more than 120 seconds.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821668] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821760] java            D
0000000000000000     0  1806   1788 0x00000000
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821767]  ffff88001f741de8
0000000000000086 ffff88001f741da8 ffffffffa008d944
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821770]  ffff88001f741fd8
ffff88001f741fd8 ffff88001f741fd8 00000000000137c0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821772]  ffff88001c61ae00
ffff88001b9adc00 0000000000000000 ffff8800002f38a0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821775] Call Trace:
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821788] 
[<ffffffffa008d944>] ? xfs_inode_set_reclaim_tag+0x84/0xa0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821790] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821793] 
[<ffffffff8165c157>] __mutex_lock_slowpath+0xd7/0x150
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821795] 
[<ffffffff81188731>] ? do_path_lookup+0x31/0xc0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821797] 
[<ffffffff8165bd6a>] mutex_lock+0x2a/0x50
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821799] 
[<ffffffff81188b1e>] do_unlinkat+0x8e/0x1d0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821802] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821805] 
[<ffffffff81015721>] ? math_state_restore+0x51/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821807] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821809] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821812] INFO: task java:1807
blocked for more than 120 seconds.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821878] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821971] java            D
0000000000000000     0  1807   1788 0x00000000
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821977]  ffff88001f7fdde8
0000000000000086 ffff88001f7fdda8 ffffffffa008d944
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821980]  ffff88001f7fdfd8
ffff88001f7fdfd8 ffff88001f7fdfd8 00000000000137c0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821983]  ffff88001b9adc00
ffff88001b9a8000 0000000000000000 ffff8800002f38a0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821985] Call Trace:
Mar 21 06:38:20 ealxs00169 kernel: [1412640.821998] 
[<ffffffffa008d944>] ? xfs_inode_set_reclaim_tag+0x84/0xa0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822001] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822003] 
[<ffffffff8165c157>] __mutex_lock_slowpath+0xd7/0x150
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822005] 
[<ffffffff81188731>] ? do_path_lookup+0x31/0xc0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822007] 
[<ffffffff8165bd6a>] mutex_lock+0x2a/0x50
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822010] 
[<ffffffff81188b1e>] do_unlinkat+0x8e/0x1d0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822012] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822015] 
[<ffffffff81015721>] ? math_state_restore+0x51/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822017] 
[<ffffffff811897b6>] sys_unlink+0x16/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822020] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822022] INFO: task
java:14629 blocked for more than 120 seconds.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822090] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822183] java            D
0000000000000000     0 14629   1788 0x00000000
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822186]  ffff8800028e9328
0000000000000086 0000000000000100 0000000000000100
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822189]  ffff8800028e9fd8
ffff8800028e9fd8 ffff8800028e9fd8 00000000000137c0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822191]  ffff88001a9a4500
ffff88001b5e8000 ffff8800028e9308 7fffffffffffffff
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822194] Call Trace:
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822196] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822199] 
[<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822203] 
[<ffffffff8101aef2>] ? nommu_map_sg+0x72/0xe0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822216] 
[<ffffffff8142bff0>] ? scsi_mode_select+0x200/0x200
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822220] 
[<ffffffff8165c5f0>] __down_common+0xa5/0xf5
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822234] 
[<ffffffffa007c295>] ? _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822237] 
[<ffffffff8165c6b3>] __down+0x1d/0x1f
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822241] 
[<ffffffff810912e1>] down+0x41/0x50
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822252] 
[<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822262] 
[<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822272] 
[<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822276] 
[<ffffffff8105702d>] ? set_next_entity+0xad/0xd0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822285] 
[<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822302] 
[<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822315] 
[<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822329] 
[<ffffffffa00a0c93>] ? xfs_bmap_add_extent_hole_delay+0xd3/0x6b0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822342] 
[<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822355] 
[<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822369] 
[<ffffffffa00aa368>] ? xfs_bmbt_get_all+0x18/0x20 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822372] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822374] 
[<ffffffff8131214b>] ? radix_tree_lookup+0xb/0x10
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822390] 
[<ffffffffa00cb2f8>] ? xfs_perag_get+0x38/0xd0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822403] 
[<ffffffffa009fe23>] ? xfs_bmap_btalloc_nullfb+0x193/0x380 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822417] 
[<ffffffffa0096b6c>] ? xfs_alloc_vextent+0x2ec/0x690 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822430] 
[<ffffffffa0096a4b>] xfs_alloc_vextent+0x1cb/0x690 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822443] 
[<ffffffffa00a02c1>] xfs_bmap_btalloc+0x2b1/0x770 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822459] 
[<ffffffffa00c21dc>] ? xfs_iext_bno_to_ext+0x8c/0x160 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822473] 
[<ffffffffa00a07a4>] xfs_bmap_alloc+0x24/0x40 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822486] 
[<ffffffffa00a594e>] xfs_bmapi_allocate+0xce/0x2d0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822500] 
[<ffffffffa00a85c3>] xfs_bmapi_write+0x4c3/0x720 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822512] 
[<ffffffffa0086988>] xfs_iomap_write_allocate+0x178/0x330 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822522] 
[<ffffffffa00799c3>] xfs_map_blocks+0x183/0x250 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822531] 
[<ffffffffa007aa56>] xfs_vm_writepage+0x196/0x510 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822536] 
[<ffffffff81122217>] __writepage+0x17/0x40
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822538] 
[<ffffffff81122a0d>] write_cache_pages+0x20d/0x460
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822542] 
[<ffffffff8152ba7d>] ? sock_recvmsg+0x11d/0x140
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822544] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822554] 
[<ffffffffa00804bc>] ? xfs_file_aio_write+0x25c/0x2a0 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822557] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822566] 
[<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822569] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822573] 
[<ffffffff81119f2b>] __filemap_fdatawrite_range+0x5b/0x60
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822575] 
[<ffffffff81119f82>] filemap_write_and_wait_range+0x52/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822585] 
[<ffffffffa007f05d>] xfs_file_fsync+0x5d/0x300 [xfs]
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822588] 
[<ffffffff8118c985>] ? poll_select_set_timeout+0x85/0xa0
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822593] 
[<ffffffff811a7eb6>] do_fsync+0x56/0x80
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822595] 
[<ffffffff811a81e0>] sys_fsync+0x10/0x20
Mar 21 06:38:20 ealxs00169 kernel: [1412640.822597] 
[<ffffffff81665842>] system_call_fastpath+0x16/0x1b

>From this moment on the services are not working anymore. Complete stuck.

Mar  4 19:39:25 ealxs00169 kernel: [34229278.392177] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392400] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392474] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392478]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392487]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392489]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392492] Call Trace:
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392510] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392523] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392526] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392528] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392536] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392539] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392544] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392551] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392554] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392556] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392559] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392621] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392627] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392636] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392639] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392641] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392644] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392654] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392658] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392661] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392664] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392669] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392671] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392674] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392676] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392682] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392685] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:39:25 ealxs00169 kernel: [34229278.392687] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350796] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350923] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350987] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350991]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350995]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.350998]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351001] Call Trace:
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351017] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351022] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351024] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351027] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351030] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351033] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351039] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351044] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351047] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351049] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351052] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351093] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351096] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351101] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351104] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351106] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351109] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351113] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351117] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351120] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351123] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351128] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351130] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351133] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351135] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351140] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351142] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:41:25 ealxs00169 kernel: [34229398.351145] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317264] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317367] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317436] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317443]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317447]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317450]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317453] Call Trace:
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317467] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317472] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317474] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317477] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317480] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317483] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317488] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317493] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317496] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317498] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317501] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317596] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317600] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317606] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317609] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317611] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317614] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317620] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317624] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317627] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317630] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317648] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317651] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317654] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317657] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317661] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317664] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:43:25 ealxs00169 kernel: [34229518.317666] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275799] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275894] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275957] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275962]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275969]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275971]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275975] Call Trace:
Mar  4 19:45:25 ealxs00169 kernel: [34229638.275993] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276006] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276008] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276011] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276018] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276021] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276029] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276036] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276039] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276042] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276044] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276084] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276090] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276099] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276102] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276105] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276107] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276117] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276121] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276124] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276127] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276132] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276135] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276137] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276140] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276146] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276148] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:45:25 ealxs00169 kernel: [34229638.276151] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246364] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246439] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246501] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246505]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246509]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246512]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246515] Call Trace:
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246527] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246532] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246534] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246537] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246540] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246543] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246548] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246554] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246556] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246559] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246561] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246622] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246626] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246631] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246633] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246636] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246638] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246644] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246648] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246650] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246653] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246658] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246661] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246663] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246665] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246670] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246672] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:47:25 ealxs00169 kernel: [34229758.246675] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204689] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204764] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204827] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204834]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204838]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204841]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204844] Call Trace:
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204865] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204870] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204872] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204875] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204878] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204880] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204886] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204891] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204893] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204896] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204898] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204956] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204960] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204965] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204967] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204970] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204972] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204976] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204980] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204982] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204985] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204990] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204993] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204995] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.204997] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.205002] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:49:25 ealxs00169 kernel: [34229878.205004] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:49:25 ealxs00169 kernel: [34229878.205007] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176219] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176491] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176614] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176618]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176622]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176625]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176628] Call Trace:
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176640] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176645] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176647] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176649] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176652] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176655] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176661] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176666] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176668] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176671] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176673] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176746] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176750] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176756] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176759] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176762] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176764] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176769] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176773] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176776] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176779] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176784] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176786] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176789] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176791] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176796] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176798] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:51:25 ealxs00169 kernel: [34229998.176801] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139419] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139541] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139635] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139639]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139643]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139646]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139649] Call Trace:
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139665] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139671] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139673] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139676] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139679] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139681] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139687] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139693] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139695] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139698] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139700] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139759] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139762] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139767] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139770] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139773] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139775] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139781] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139785] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139787] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139791] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139796] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139798] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139801] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139803] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139808] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139810] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:53:25 ealxs00169 kernel: [34230118.139813] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105607] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105721] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105812] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105817]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105823]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105826]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105829] Call Trace:
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105848] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105860] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105862] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105865] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105872] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105875] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105881] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105887] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105890] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105892] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105895] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105965] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105971] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105980] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105983] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105986] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105988] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:55:25 ealxs00169 kernel: [34230238.105998] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106003] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106005] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106009] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106014] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106016] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106019] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106021] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106027] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106030] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:55:25 ealxs00169 kernel: [34230238.106032] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082371] INFO: task
flush-8:16:20760 blocked for more than 120 seconds.
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082468] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082560] flush-8:16      D
0000000000000000     0 20760      2 0x00000000
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082564]  ffff8800148af880
0000000000000046 ffff8800148afbf0 0000000000000000
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082568]  ffff8800148affd8
ffff8800148affd8 ffff8800148affd8 00000000000137c0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082571]  ffff8800022a4500
ffff8800022a1700 ffff8800148af850 ffff88001fc14080
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082573] Call Trace:
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082594] 
[<ffffffff81118360>] ? __lock_page+0x70/0x70
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082599] 
[<ffffffff8165b34f>] schedule+0x3f/0x60
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082601] 
[<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082604] 
[<ffffffff8111836e>] sleep_on_page+0xe/0x20
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082607] 
[<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082609] 
[<ffffffff81118357>] __lock_page+0x67/0x70
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082615] 
[<ffffffff8108bd60>] ? autoremove_wake_function+0x40/0x40
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082620] 
[<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082623] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082625] 
[<ffffffff81122200>] ? set_page_dirty_lock+0x60/0x60
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082627] 
[<ffffffff81122caa>] generic_writepages+0x4a/0x70
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082677] 
[<ffffffffa008d80d>] xfs_vm_writepages+0x4d/0x60 [xfs]
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082681] 
[<ffffffff81123b71>] do_writepages+0x21/0x40
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082687] 
[<ffffffff811a2990>] writeback_single_inode+0x180/0x430
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082689] 
[<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082692] 
[<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082694] 
[<ffffffff811a345b>] wb_writeback+0x27b/0x330
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082700] 
[<ffffffff810136e5>] ? __switch_to+0xf5/0x360
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082705] 
[<ffffffff81194ad2>] ? get_nr_dirty_inodes+0x52/0x80
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082707] 
[<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082713] 
[<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082719] 
[<ffffffff810778b0>] ? usleep_range+0x50/0x50
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082721] 
[<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082723] 
[<ffffffff811a4500>] ? wb_do_writeback+0x1d0/0x1d0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082726] 
[<ffffffff8108b27c>] kthread+0x8c/0xa0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082731] 
[<ffffffff816679b4>] kernel_thread_helper+0x4/0x10
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082733] 
[<ffffffff8108b1f0>] ? flush_kthread_worker+0xa0/0xa0
Mar  4 19:57:25 ealxs00169 kernel: [34230358.082736] 
[<ffffffff816679b0>] ? gs_change+0x13/0x13


[-- Attachment #1.2: Type: text/html, Size: 101107 bytes --]

[-- Attachment #2: Type: text/plain, Size: 121 bytes --]

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24  9:24 xfs blocks (blocked for more than 120 seconds) Michel Verbraak(1st-Setup)
@ 2014-03-24 13:27 ` Mark Tinguely
  2014-03-24 13:40   ` Michel Verbraak(1st-Setup)
  2014-03-24 17:36 ` Dave Chinner
  1 sibling, 1 reply; 7+ messages in thread
From: Mark Tinguely @ 2014-03-24 13:27 UTC (permalink / raw)
  To: Michel Verbraak(1st-Setup); +Cc: xfs

On 03/24/14 04:24, Michel Verbraak(1st-Setup) wrote:
> Hi,
>
> We have a problem with one of our systems which is using XFS but we are
> unable to find the problem. Recently we had two moments, Tuesday 4th of
> March and Friday the 21st of March, where we had to reboot the system to
> get it up and running again.
>
> What happens:
> - The programs handling files on the XFS disc stop working when
> creating, deleting or writing files. They do not error they are just
> waiting on the command to complete.
> - One of our programs, a java application, goes into very high cpu usage
> (50%) which normally is at 1%. This could be something in our java
> application but it happens at the moment handling files gets stuck.
> - A nice restart of the programs does not succeed as wel a kill -9 does
> not work.
> - Trying to reboot the servers in a normal fashion does not work. As it
> is a virtual machine we have to do a shutdown (unplug power) and start
> it up again to get it up and running.
>
> Following details I have for you:
>
> System OS: Ubuntu 12.04 LTS
> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
> x86_64 x86_64 x86_64 GNU/Linux
> Server: Virtual machine in a VMWare setup.
> Disc: 300GB direct attached LUN
>
> We have an exact clone of this system for our acceptance environment. In
> this environment we are unable to reproduce this problem/situation.
>
> Differences between the two days is that our services on 2014-03-21 were
> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
> the system was very quiet on the moment the kernel traces appear and the
> services get stuck.
>
> Any help is appreciated.
>
> Regards Michel Verbraak.


Could you set up kdump and take a core dump next time it hangs?
There is a couple suspicious items in the syslog entries

--Mark.

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24 13:27 ` Mark Tinguely
@ 2014-03-24 13:40   ` Michel Verbraak(1st-Setup)
  2014-03-24 16:12     ` Mark Tinguely
  0 siblings, 1 reply; 7+ messages in thread
From: Michel Verbraak(1st-Setup) @ 2014-03-24 13:40 UTC (permalink / raw)
  To: Mark Tinguely; +Cc: xfs


[-- Attachment #1.1: Type: text/plain, Size: 2135 bytes --]

op 24-03-14 14:27, Mark Tinguely schreef:
> On 03/24/14 04:24, Michel Verbraak(1st-Setup) wrote:
>> Hi,
>>
>> We have a problem with one of our systems which is using XFS but we are
>> unable to find the problem. Recently we had two moments, Tuesday 4th of
>> March and Friday the 21st of March, where we had to reboot the system to
>> get it up and running again.
>>
>> What happens:
>> - The programs handling files on the XFS disc stop working when
>> creating, deleting or writing files. They do not error they are just
>> waiting on the command to complete.
>> - One of our programs, a java application, goes into very high cpu usage
>> (50%) which normally is at 1%. This could be something in our java
>> application but it happens at the moment handling files gets stuck.
>> - A nice restart of the programs does not succeed as wel a kill -9 does
>> not work.
>> - Trying to reboot the servers in a normal fashion does not work. As it
>> is a virtual machine we have to do a shutdown (unplug power) and start
>> it up again to get it up and running.
>>
>> Following details I have for you:
>>
>> System OS: Ubuntu 12.04 LTS
>> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
>> x86_64 x86_64 x86_64 GNU/Linux
>> Server: Virtual machine in a VMWare setup.
>> Disc: 300GB direct attached LUN
>>
>> We have an exact clone of this system for our acceptance environment. In
>> this environment we are unable to reproduce this problem/situation.
>>
>> Differences between the two days is that our services on 2014-03-21 were
>> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
>> the system was very quiet on the moment the kernel traces appear and the
>> services get stuck.
>>
>> Any help is appreciated.
>>
>> Regards Michel Verbraak.
>
>
> Could you set up kdump and take a core dump next time it hangs?
> There is a couple suspicious items in the syslog entries
>
> --Mark.
>
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs
Mark,

We will setup the kdump. Can you elloborate on your suspicions?

Michel.

[-- Attachment #1.2: Type: text/html, Size: 3676 bytes --]

[-- Attachment #2: Type: text/plain, Size: 121 bytes --]

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24 13:40   ` Michel Verbraak(1st-Setup)
@ 2014-03-24 16:12     ` Mark Tinguely
  0 siblings, 0 replies; 7+ messages in thread
From: Mark Tinguely @ 2014-03-24 16:12 UTC (permalink / raw)
  To: Michel Verbraak(1st-Setup); +Cc: xfs

On 03/24/14 08:40, Michel Verbraak(1st-Setup) wrote:
> op 24-03-14 14:27, Mark Tinguely schreef:
>> On 03/24/14 04:24, Michel Verbraak(1st-Setup) wrote:
>>> Hi,
>>>
>>> We have a problem with one of our systems which is using XFS but we are
>>> unable to find the problem. Recently we had two moments, Tuesday 4th of
>>> March and Friday the 21st of March, where we had to reboot the system to
>>> get it up and running again.
>>>
>>> What happens:
>>> - The programs handling files on the XFS disc stop working when
>>> creating, deleting or writing files. They do not error they are just
>>> waiting on the command to complete.
>>> - One of our programs, a java application, goes into very high cpu usage
>>> (50%) which normally is at 1%. This could be something in our java
>>> application but it happens at the moment handling files gets stuck.
>>> - A nice restart of the programs does not succeed as wel a kill -9 does
>>> not work.
>>> - Trying to reboot the servers in a normal fashion does not work. As it
>>> is a virtual machine we have to do a shutdown (unplug power) and start
>>> it up again to get it up and running.
>>>
>>> Following details I have for you:
>>>
>>> System OS: Ubuntu 12.04 LTS
>>> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
>>> x86_64 x86_64 x86_64 GNU/Linux
>>> Server: Virtual machine in a VMWare setup.
>>> Disc: 300GB direct attached LUN
>>>
>>> We have an exact clone of this system for our acceptance environment. In
>>> this environment we are unable to reproduce this problem/situation.
>>>
>>> Differences between the two days is that our services on 2014-03-21 were
>>> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
>>> the system was very quiet on the moment the kernel traces appear and the
>>> services get stuck.
>>>
>>> Any help is appreciated.
>>>
>>> Regards Michel Verbraak.
>>
>>
>> Could you set up kdump and take a core dump next time it hangs?
>> There is a couple suspicious items in the syslog entries
>>
>> --Mark.
>>
>> _______________________________________________
>> xfs mailing list
>> xfs@oss.sgi.com
>> http://oss.sgi.com/mailman/listinfo/xfs
> Mark,
>
> We will setup the kdump. Can you elloborate on your suspicions?
>
> Michel.

I am interested in the AGF buffer locks and it would be nice to know who 
hold what locks. The flush workers are interesting too. We have seen 
process block worker's writes from completing, so looking at what is 
running and scheduled would be interesting. So a vmcore would tell a lot.

--Mark.

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24  9:24 xfs blocks (blocked for more than 120 seconds) Michel Verbraak(1st-Setup)
  2014-03-24 13:27 ` Mark Tinguely
@ 2014-03-24 17:36 ` Dave Chinner
  2014-03-25  8:59   ` Michel Verbraak(1st-Setup)
  2014-03-25  9:39   ` Michel Verbraak(1st-Setup)
  1 sibling, 2 replies; 7+ messages in thread
From: Dave Chinner @ 2014-03-24 17:36 UTC (permalink / raw)
  To: Michel Verbraak(1st-Setup); +Cc: xfs

On Mon, Mar 24, 2014 at 10:24:45AM +0100, Michel Verbraak(1st-Setup) wrote:
> Hi,
> 
> We have a problem with one of our systems which is using XFS but we are
> unable to find the problem. Recently we had two moments, Tuesday 4th of
> March and Friday the 21st of March, where we had to reboot the system to
> get it up and running again.
> 
> What happens:
> - The programs handling files on the XFS disc stop working when
> creating, deleting or writing files. They do not error they are just
> waiting on the command to complete.
> - One of our programs, a java application, goes into very high cpu usage
> (50%) which normally is at 1%. This could be something in our java
> application but it happens at the moment handling files gets stuck.
> - A nice restart of the programs does not succeed as wel a kill -9 does
> not work.
> - Trying to reboot the servers in a normal fashion does not work. As it
> is a virtual machine we have to do a shutdown (unplug power) and start
> it up again to get it up and running.
......
> 
> Following details I have for you:
> 
> System OS: Ubuntu 12.04 LTS
> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
> x86_64 x86_64 x86_64 GNU/Linux
> Server: Virtual machine in a VMWare setup.
> Disc: 300GB direct attached LUN
> 
> We have an exact clone of this system for our acceptance environment. In
> this environment we are unable to reproduce this problem/situation.
> 
> Differences between the two days is that our services on 2014-03-21 were
> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
> the system was very quiet on the moment the kernel traces appear and the
> services get stuck.
> 
> Any help is appreciated.
> 
> Regards Michel Verbraak.
> 
> Following we see in the syslog on both moments (2014-03-04 and 2014-03-21):
> 
....
> Mar 21 06:32:20 ealxs00169 kernel: [1412280.930543] flush-8:16      D
> 0000000000000000     0 13864      2 0x00000000
> [<ffffffff8165b34f>] schedule+0x3f/0x60
> [<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
> [<ffffffff8111836e>] sleep_on_page+0xe/0x20
> [<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
> [<ffffffff81118357>] __lock_page+0x67/0x70
> [<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
> [<ffffffff81122caa>] generic_writepages+0x4a/0x70
> [<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
> [<ffffffff81123b71>] do_writepages+0x21/0x40
> [<ffffffff811a2990>] writeback_single_inode+0x180/0x430
> [<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
> [<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
> [<ffffffff811a345b>] wb_writeback+0x27b/0x330
> [<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
> [<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
> [<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
> [<ffffffff8108b27c>] kthread+0x8c/0xa0

Writeback is blocked on a locked page, and is waiting for IO
completion.


> Mar 21 06:34:20 ealxs00169 kernel: [1412400.891181] archiver.pl     D
> [<ffffffff8165b34f>] schedule+0x3f/0x60
> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
> [<ffffffff8165c6b3>] __down+0x1d/0x1f
> [<ffffffff810912e1>] down+0x41/0x50
> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
> [<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
> [<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
> [<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
> [<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
> [<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
> [<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
> [<ffffffff81193e98>] evict+0xb8/0x1c0
> [<ffffffff81194088>] iput_final+0xe8/0x210
> [<ffffffff811941ee>] iput+0x3e/0x50
> [<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
> [<ffffffff811897b6>] sys_unlink+0x16/0x20
> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b

120s later, unlinking a file, blocked waiting on an AGF lock.

> Mar 21 06:34:20 ealxs00169 kernel: [1412400.891686] flush-8:16      D

writeback again, 120s later.

> Mar 21 06:36:20 ealxs00169 kernel: [1412520.861471] java            D
> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
> [<ffffffff8165c6b3>] __down+0x1d/0x1f
> [<ffffffff810912e1>] down+0x41/0x50
> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
> [<ffffffffa0096a4b>] xfs_alloc_vextent+0x1cb/0x690 [xfs]
> [<ffffffffa00a02c1>] xfs_bmap_btalloc+0x2b1/0x770 [xfs]
> [<ffffffffa00a07a4>] xfs_bmap_alloc+0x24/0x40 [xfs]
> [<ffffffffa00a594e>] xfs_bmapi_allocate+0xce/0x2d0 [xfs]
> [<ffffffffa00a85c3>] xfs_bmapi_write+0x4c3/0x720 [xfs]
> [<ffffffffa0086988>] xfs_iomap_write_allocate+0x178/0x330 [xfs]
> [<ffffffffa00799c3>] xfs_map_blocks+0x183/0x250 [xfs]
> [<ffffffffa007aa56>] xfs_vm_writepage+0x196/0x510 [xfs]
> [<ffffffff81122217>] __writepage+0x17/0x40
> [<ffffffff81122a0d>] write_cache_pages+0x20d/0x460
> [<ffffffff81122caa>] generic_writepages+0x4a/0x70
> [<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
> [<ffffffff81123b71>] do_writepages+0x21/0x40
> [<ffffffff81119f2b>] __filemap_fdatawrite_range+0x5b/0x60
> [<ffffffff81119f82>] filemap_write_and_wait_range+0x52/0x80
> [<ffffffffa007f05d>] xfs_file_fsync+0x5d/0x300 [xfs]
> [<ffffffff811a7eb6>] do_fsync+0x56/0x80
> [<ffffffff811a81e0>] sys_fsync+0x10/0x20
> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b

120s later, fsync() blocks and AGF lock trying to do allocation.

> Mar 21 06:36:20 ealxs00169 kernel: [1412520.862112] archiver.pl     D
> [<ffffffff8165b34f>] schedule+0x3f/0x60
> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
> [<ffffffff8165c6b3>] __down+0x1d/0x1f
> [<ffffffff810912e1>] down+0x41/0x50
> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
> [<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
> [<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
> [<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
> [<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
> [<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
> [<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
> [<ffffffff81193e98>] evict+0xb8/0x1c0
> [<ffffffff81194088>] iput_final+0xe8/0x210
> [<ffffffff811941ee>] iput+0x3e/0x50
> [<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
> [<ffffffff811897b6>] sys_unlink+0x16/0x20
> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b

Still blocked on AGF in unlink.


> Mar 21 06:38:20 ealxs00169 kernel: [1412640.821037] java            D
> [<ffffffff8165b34f>] schedule+0x3f/0x60
> [<ffffffff8165c157>] __mutex_lock_slowpath+0xd7/0x150
> [<ffffffff8165bd6a>] mutex_lock+0x2a/0x50
> [<ffffffff81188b1e>] do_unlinkat+0x8e/0x1d0
> [<ffffffff811897b6>] sys_unlink+0x16/0x20
> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b

120s later, unlink blocked on a directory mutex.

(others stuck on AGF lock).

> From this moment on the services are not working anymore. Complete stuck.

Yup, everything is stuck waiting for writeback to complete.

can you reproduce this? does the IO subsystem go idle when this
occurs? or is it running flat out?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24 17:36 ` Dave Chinner
@ 2014-03-25  8:59   ` Michel Verbraak(1st-Setup)
  2014-03-25  9:39   ` Michel Verbraak(1st-Setup)
  1 sibling, 0 replies; 7+ messages in thread
From: Michel Verbraak(1st-Setup) @ 2014-03-25  8:59 UTC (permalink / raw)
  To: Dave Chinner; +Cc: xfs


[-- Attachment #1.1: Type: text/plain, Size: 9483 bytes --]

op 24-03-14 18:36, Dave Chinner schreef:
> On Mon, Mar 24, 2014 at 10:24:45AM +0100, Michel Verbraak(1st-Setup) wrote:
>> Hi,
>>
>> We have a problem with one of our systems which is using XFS but we are
>> unable to find the problem. Recently we had two moments, Tuesday 4th of
>> March and Friday the 21st of March, where we had to reboot the system to
>> get it up and running again.
>>
>> What happens:
>> - The programs handling files on the XFS disc stop working when
>> creating, deleting or writing files. They do not error they are just
>> waiting on the command to complete.
>> - One of our programs, a java application, goes into very high cpu usage
>> (50%) which normally is at 1%. This could be something in our java
>> application but it happens at the moment handling files gets stuck.
>> - A nice restart of the programs does not succeed as wel a kill -9 does
>> not work.
>> - Trying to reboot the servers in a normal fashion does not work. As it
>> is a virtual machine we have to do a shutdown (unplug power) and start
>> it up again to get it up and running.
> ......
>> Following details I have for you:
>>
>> System OS: Ubuntu 12.04 LTS
>> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
>> x86_64 x86_64 x86_64 GNU/Linux
>> Server: Virtual machine in a VMWare setup.
>> Disc: 300GB direct attached LUN
>>
>> We have an exact clone of this system for our acceptance environment. In
>> this environment we are unable to reproduce this problem/situation.
>>
>> Differences between the two days is that our services on 2014-03-21 were
>> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
>> the system was very quiet on the moment the kernel traces appear and the
>> services get stuck.
>>
>> Any help is appreciated.
>>
>> Regards Michel Verbraak.
>>
>> Following we see in the syslog on both moments (2014-03-04 and 2014-03-21):
>>
> ....
>> Mar 21 06:32:20 ealxs00169 kernel: [1412280.930543] flush-8:16      D
>> 0000000000000000     0 13864      2 0x00000000
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
>> [<ffffffff8111836e>] sleep_on_page+0xe/0x20
>> [<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
>> [<ffffffff81118357>] __lock_page+0x67/0x70
>> [<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
>> [<ffffffff81122caa>] generic_writepages+0x4a/0x70
>> [<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
>> [<ffffffff81123b71>] do_writepages+0x21/0x40
>> [<ffffffff811a2990>] writeback_single_inode+0x180/0x430
>> [<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
>> [<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
>> [<ffffffff811a345b>] wb_writeback+0x27b/0x330
>> [<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
>> [<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
>> [<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
>> [<ffffffff8108b27c>] kthread+0x8c/0xa0
> Writeback is blocked on a locked page, and is waiting for IO
> completion.
>
>
>> Mar 21 06:34:20 ealxs00169 kernel: [1412400.891181] archiver.pl     D
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
>> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
>> [<ffffffff8165c6b3>] __down+0x1d/0x1f
>> [<ffffffff810912e1>] down+0x41/0x50
>> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
>> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
>> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
>> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
>> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
>> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
>> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
>> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
>> [<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
>> [<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
>> [<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
>> [<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
>> [<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
>> [<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
>> [<ffffffff81193e98>] evict+0xb8/0x1c0
>> [<ffffffff81194088>] iput_final+0xe8/0x210
>> [<ffffffff811941ee>] iput+0x3e/0x50
>> [<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
>> [<ffffffff811897b6>] sys_unlink+0x16/0x20
>> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b
> 120s later, unlinking a file, blocked waiting on an AGF lock.
>
>> Mar 21 06:34:20 ealxs00169 kernel: [1412400.891686] flush-8:16      D
> writeback again, 120s later.
>
>> Mar 21 06:36:20 ealxs00169 kernel: [1412520.861471] java            D
>> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
>> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
>> [<ffffffff8165c6b3>] __down+0x1d/0x1f
>> [<ffffffff810912e1>] down+0x41/0x50
>> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
>> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
>> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
>> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
>> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
>> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
>> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
>> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
>> [<ffffffffa0096a4b>] xfs_alloc_vextent+0x1cb/0x690 [xfs]
>> [<ffffffffa00a02c1>] xfs_bmap_btalloc+0x2b1/0x770 [xfs]
>> [<ffffffffa00a07a4>] xfs_bmap_alloc+0x24/0x40 [xfs]
>> [<ffffffffa00a594e>] xfs_bmapi_allocate+0xce/0x2d0 [xfs]
>> [<ffffffffa00a85c3>] xfs_bmapi_write+0x4c3/0x720 [xfs]
>> [<ffffffffa0086988>] xfs_iomap_write_allocate+0x178/0x330 [xfs]
>> [<ffffffffa00799c3>] xfs_map_blocks+0x183/0x250 [xfs]
>> [<ffffffffa007aa56>] xfs_vm_writepage+0x196/0x510 [xfs]
>> [<ffffffff81122217>] __writepage+0x17/0x40
>> [<ffffffff81122a0d>] write_cache_pages+0x20d/0x460
>> [<ffffffff81122caa>] generic_writepages+0x4a/0x70
>> [<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
>> [<ffffffff81123b71>] do_writepages+0x21/0x40
>> [<ffffffff81119f2b>] __filemap_fdatawrite_range+0x5b/0x60
>> [<ffffffff81119f82>] filemap_write_and_wait_range+0x52/0x80
>> [<ffffffffa007f05d>] xfs_file_fsync+0x5d/0x300 [xfs]
>> [<ffffffff811a7eb6>] do_fsync+0x56/0x80
>> [<ffffffff811a81e0>] sys_fsync+0x10/0x20
>> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b
> 120s later, fsync() blocks and AGF lock trying to do allocation.
>
>> Mar 21 06:36:20 ealxs00169 kernel: [1412520.862112] archiver.pl     D
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
>> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
>> [<ffffffff8165c6b3>] __down+0x1d/0x1f
>> [<ffffffff810912e1>] down+0x41/0x50
>> [<ffffffffa007c0f4>] xfs_buf_lock+0x44/0x110 [xfs]
>> [<ffffffffa007c295>] _xfs_buf_find+0xd5/0x240 [xfs]
>> [<ffffffffa007c684>] xfs_buf_get+0x34/0x1c0 [xfs]
>> [<ffffffffa007cf2d>] xfs_buf_read+0x2d/0x120 [xfs]
>> [<ffffffffa00d89ff>] xfs_trans_read_buf+0x2bf/0x470 [xfs]
>> [<ffffffffa0094775>] xfs_read_agf+0x65/0x1a0 [xfs]
>> [<ffffffffa00948e2>] xfs_alloc_read_agf+0x32/0x60 [xfs]
>> [<ffffffffa00966ff>] xfs_alloc_fix_freelist+0x42f/0x490 [xfs]
>> [<ffffffffa0096802>] xfs_free_extent+0xa2/0x120 [xfs]
>> [<ffffffffa00a6e14>] xfs_bmap_finish+0x164/0x1b0 [xfs]
>> [<ffffffffa00c143b>] xfs_itruncate_extents+0xdb/0x1e0 [xfs]
>> [<ffffffffa00c15a6>] xfs_itruncate_data+0x66/0x150 [xfs]
>> [<ffffffffa009056e>] xfs_inactive+0x2fe/0x460 [xfs]
>> [<ffffffffa008afad>] xfs_fs_evict_inode+0xad/0x110 [xfs]
>> [<ffffffff81193e98>] evict+0xb8/0x1c0
>> [<ffffffff81194088>] iput_final+0xe8/0x210
>> [<ffffffff811941ee>] iput+0x3e/0x50
>> [<ffffffff81188be3>] do_unlinkat+0x153/0x1d0
>> [<ffffffff811897b6>] sys_unlink+0x16/0x20
>> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b
> Still blocked on AGF in unlink.
>
>
>> Mar 21 06:38:20 ealxs00169 kernel: [1412640.821037] java            D
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165c157>] __mutex_lock_slowpath+0xd7/0x150
>> [<ffffffff8165bd6a>] mutex_lock+0x2a/0x50
>> [<ffffffff81188b1e>] do_unlinkat+0x8e/0x1d0
>> [<ffffffff811897b6>] sys_unlink+0x16/0x20
>> [<ffffffff81665842>] system_call_fastpath+0x16/0x1b
> 120s later, unlink blocked on a directory mutex.
>
> (others stuck on AGF lock).
>
>> From this moment on the services are not working anymore. Complete stuck.
> Yup, everything is stuck waiting for writeback to complete.
>
> can you reproduce this? does the IO subsystem go idle when this
> occurs? or is it running flat out?
We are not yet able to reproduce. But your added explanation might help
us in reproducing. The production environment is very dynamic in when
clients connect to the service and what kind of requests they do.

The colleague handling the incident tried to get some info but because
he was alarmed quit late he decided for the reboot instead to get our
production qorking again. For the next time we are preparing some
scripts to gather information which might tell more.
If you have any suggestions on commands we can run when it happens again
please let me know? Currently we are configuring kdump as suggested by
Mark and some other queries against our own software.
> Cheers,
>
> Dave.
Some extra info: this system has been running for about 1 year without
any problems. During this time it was never rebooted or restarted until
the issue on the 4th. Only change could be in the usage of java service
running on it by external clients. We are looking into this.

Thank you for looking into it.

Michel.

[-- Attachment #1.2: Type: text/html, Size: 11161 bytes --]

[-- Attachment #2: Type: text/plain, Size: 121 bytes --]

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: xfs blocks (blocked for more than 120 seconds)
  2014-03-24 17:36 ` Dave Chinner
  2014-03-25  8:59   ` Michel Verbraak(1st-Setup)
@ 2014-03-25  9:39   ` Michel Verbraak(1st-Setup)
  1 sibling, 0 replies; 7+ messages in thread
From: Michel Verbraak(1st-Setup) @ 2014-03-25  9:39 UTC (permalink / raw)
  To: Dave Chinner; +Cc: xfs


[-- Attachment #1.1: Type: text/plain, Size: 4724 bytes --]

op 24-03-14 18:36, Dave Chinner schreef:
> On Mon, Mar 24, 2014 at 10:24:45AM +0100, Michel Verbraak(1st-Setup) wrote:
>> Hi,
>>
>> We have a problem with one of our systems which is using XFS but we are
>> unable to find the problem. Recently we had two moments, Tuesday 4th of
>> March and Friday the 21st of March, where we had to reboot the system to
>> get it up and running again.
>>
>> What happens:
>> - The programs handling files on the XFS disc stop working when
>> creating, deleting or writing files. They do not error they are just
>> waiting on the command to complete.
>> - One of our programs, a java application, goes into very high cpu usage
>> (50%) which normally is at 1%. This could be something in our java
>> application but it happens at the moment handling files gets stuck.
>> - A nice restart of the programs does not succeed as wel a kill -9 does
>> not work.
>> - Trying to reboot the servers in a normal fashion does not work. As it
>> is a virtual machine we have to do a shutdown (unplug power) and start
>> it up again to get it up and running.
> ......
>> Following details I have for you:
>>
>> System OS: Ubuntu 12.04 LTS
>> Kernel: 3.2.0-37-generic #58-Ubuntu SMP Thu Jan 24 15:28:10 UTC 2013
>> x86_64 x86_64 x86_64 GNU/Linux
>> Server: Virtual machine in a VMWare setup.
>> Disc: 300GB direct attached LUN
>>
>> We have an exact clone of this system for our acceptance environment. In
>> this environment we are unable to reproduce this problem/situation.
>>
>> Differences between the two days is that our services on 2014-03-21 were
>> quit busy with a lot of file changes on the xfs disc and on 2014-03-04
>> the system was very quiet on the moment the kernel traces appear and the
>> services get stuck.
>>
>> Any help is appreciated.
>>
>> Regards Michel Verbraak.
>>
>> Following we see in the syslog on both moments (2014-03-04 and 2014-03-21):
>>
> ....
>> Mar 21 06:32:20 ealxs00169 kernel: [1412280.930543] flush-8:16      D
>> 0000000000000000     0 13864      2 0x00000000
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165b3ff>] io_schedule+0x8f/0xd0
>> [<ffffffff8111836e>] sleep_on_page+0xe/0x20
>> [<ffffffff8165baca>] __wait_on_bit_lock+0x5a/0xc0
>> [<ffffffff81118357>] __lock_page+0x67/0x70
>> [<ffffffff81122bd4>] write_cache_pages+0x3d4/0x460
>> [<ffffffff81122caa>] generic_writepages+0x4a/0x70
>> [<ffffffffa007980d>] xfs_vm_writepages+0x4d/0x60 [xfs]
>> [<ffffffff81123b71>] do_writepages+0x21/0x40
>> [<ffffffff811a2990>] writeback_single_inode+0x180/0x430
>> [<ffffffff811a3056>] writeback_sb_inodes+0x1b6/0x270
>> [<ffffffff811a31ae>] __writeback_inodes_wb+0x9e/0xd0
>> [<ffffffff811a345b>] wb_writeback+0x27b/0x330
>> [<ffffffff811a35af>] wb_check_old_data_flush+0x9f/0xb0
>> [<ffffffff811a4481>] wb_do_writeback+0x151/0x1d0
>> [<ffffffff811a4583>] bdi_writeback_thread+0x83/0x2a0
>> [<ffffffff8108b27c>] kthread+0x8c/0xa0
> Writeback is blocked on a locked page, and is waiting for IO
> completion.
We currently have the following options in fstab for the filesystem:
defaults,noatime,inode64,barrier=no

Now I read everywhere to turn of barrier you should specify "nobarrier".
Is our way of disabling wrong?

The disc used has "write cache" disabled:
[    2.875792] sd 3:0:0:0: Attached scsi generic sg2 type 0
[    2.876376] sd 3:0:0:0: [sdb] 629145600 512-byte logical blocks: (322
GB/300 GiB)
[    2.876879] sd 3:0:0:0: [sdb] Write Protect is off
[    2.877050] sd 3:0:0:0: [sdb] Mode Sense: 87 00 00 08
[    2.877890] sd 3:0:0:0: [sdb] Write cache: disabled, read cache:
enabled, doesn't support DPO or FUA
[    2.885634]  sdb: unknown partition table
...
[    5.132308] SGI XFS with ACLs, security attributes, realtime, large
block/inode numbers, no debug enabled
[    5.155268] SGI XFS Quota Management subsystem
[    5.159345] XFS (sdb): Mounting Filesystem
..


Following is output of xfs_info on sdb:
meta-data=/dev/sdb               isize=256    agcount=4, agsize=19660800
blks
         =                       sectsz=512   attr=2
data     =                       bsize=4096   blocks=78643200, imaxpct=25
         =                       sunit=0      swidth=0 blks
naming   =version 2              bsize=4096   ascii-ci=0
log      =internal               bsize=4096   blocks=38400, version=2
         =                       sectsz=512   sunit=0 blks, lazy-count=1
realtime =none                   extsz=4096   blocks=0, rtextents=0

>
>
>> Mar 21 06:34:20 ealxs00169 kernel: [1412400.891181] archiver.pl     D
>> [<ffffffff8165b34f>] schedule+0x3f/0x60
>> [<ffffffff8165b995>] schedule_timeout+0x2a5/0x320
>> [<ffffffff8165c5f0>] __down_common+0xa5/0xf5
>> [<ffffffff8165c6b3>] __down+0x1d/0x1f
.....
>> Cheers,
>>
>> Dave.
Michel.

[-- Attachment #1.2: Type: text/html, Size: 7118 bytes --]

[-- Attachment #2: Type: text/plain, Size: 121 bytes --]

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2014-03-25  9:40 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-03-24  9:24 xfs blocks (blocked for more than 120 seconds) Michel Verbraak(1st-Setup)
2014-03-24 13:27 ` Mark Tinguely
2014-03-24 13:40   ` Michel Verbraak(1st-Setup)
2014-03-24 16:12     ` Mark Tinguely
2014-03-24 17:36 ` Dave Chinner
2014-03-25  8:59   ` Michel Verbraak(1st-Setup)
2014-03-25  9:39   ` Michel Verbraak(1st-Setup)

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).