From mboxrd@z Thu Jan 1 00:00:00 1970
From: Bruno Prémont
Subject: Deadlock with nilfs on 2.6.31.4
Date: Wed, 21 Oct 2009 20:38:47 +0200
Message-ID: <20091021203847.26acab0a@neptune.home>
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Cc: linux-fsdevel@vger.kernel.org
To: users@nilfs.org
Return-path:
Received: from ppp-157-46.adsl.restena.lu ([158.64.157.46]:45415 "EHLO
 bonbons.gotdns.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
 with ESMTP id S1754590AbZJUSrP (ORCPT );
 Wed, 21 Oct 2009 14:47:15 -0400
Sender: linux-fsdevel-owner@vger.kernel.org
List-ID:

Hi,

nilfs seems to have some deadlocks that put processes in D-state (at least
on my ARM system).
This time around it seems that syslog-ng has been hit first. The previous
times it was most often collectd/rrdtool.

Kernel is vanilla 2.6.31.4 + a patch for a USB HID device.
System is ARM, Feroceon 88FR131, SheevaPlug.
nilfs is being used on an SD card (mmcblk0: mmc0:bc20 SD08G 7.60 GiB, mvsdio driver)

Bruno


Extracts from dmesg (less was attempting to read a logfile produced by syslog-ng):

INFO: task less:15839 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
less D c02c8610 0 15839 1742 0x00000001
[] (schedule+0x2a8/0x3b0) from [] (__mutex_lock_slowpath+0x88/0x140)
[] (__mutex_lock_slowpath+0x88/0x140) from [] (generic_file_llseek+0x24/0x64)
[] (generic_file_llseek+0x24/0x64) from [] (vfs_llseek+0x54/0x64)
[] (vfs_llseek+0x54/0x64) from [] (sys_llseek+0x74/0xcc)
[] (sys_llseek+0x74/0xcc) from [] (ret_fast_syscall+0x0/0x2c)


All stuck processes as listed by SysRq + T:

syslog-ng D c02c8610 0 1698 1 0x00000000
[] (schedule+0x2a8/0x3b0) from [] (schedule_timeout+0x14c/0x1e8)
[] (schedule_timeout+0x14c/0x1e8) from [] (io_schedule_timeout+0x34/0x58)
[] (io_schedule_timeout+0x34/0x58) from [] (congestion_wait+0x5c/0x80)
[] (congestion_wait+0x5c/0x80) from [] (balance_dirty_pages_ratelimited_nr+0xe8/0x290)
[] (balance_dirty_pages_ratelimited_nr+0xe8/0x290) from [] (generic_file_buffered_write+0x10c/0x348)
[] (generic_file_buffered_write+0x10c/0x348) from [] (__generic_file_aio_write_nolock+0x264/0x4f4)
[] (__generic_file_aio_write_nolock+0x264/0x4f4) from [] (generic_file_aio_write+0x74/0xe8)
[] (generic_file_aio_write+0x74/0xe8) from [] (do_sync_write+0xbc/0x100)
[] (do_sync_write+0xbc/0x100) from [] (vfs_write+0xb0/0x164)
[] (vfs_write+0xb0/0x164) from [] (sys_write+0x40/0x70)
[] (sys_write+0x40/0x70) from [] (ret_fast_syscall+0x0/0x2c)
less D c02c8610 0 15839 1742 0x00000001
[] (schedule+0x2a8/0x3b0) from [] (__mutex_lock_slowpath+0x88/0x140)
[] (__mutex_lock_slowpath+0x88/0x140) from [] (generic_file_llseek+0x24/0x64)
[] (generic_file_llseek+0x24/0x64) from [] (vfs_llseek+0x54/0x64)
[] (vfs_llseek+0x54/0x64) from [] (sys_llseek+0x74/0xcc)
[] (sys_llseek+0x74/0xcc) from [] (ret_fast_syscall+0x0/0x2c)
sshd D c02c8610 0 15844 15842 0x00000001
[] (schedule+0x2a8/0x3b0) from [] (schedule_timeout+0x14c/0x1e8)
[] (schedule_timeout+0x14c/0x1e8) from [] (io_schedule_timeout+0x34/0x58)
[] (io_schedule_timeout+0x34/0x58) from [] (congestion_wait+0x5c/0x80)
[] (congestion_wait+0x5c/0x80) from [] (balance_dirty_pages_ratelimited_nr+0xe8/0x290)
[] (balance_dirty_pages_ratelimited_nr+0xe8/0x290) from [] (generic_file_buffered_write+0x10c/0x348)
[] (generic_file_buffered_write+0x10c/0x348) from [] (__generic_file_aio_write_nolock+0x264/0x4f4)
[] (__generic_file_aio_write_nolock+0x264/0x4f4) from [] (generic_file_aio_write+0x74/0xe8)
[] (generic_file_aio_write+0x74/0xe8) from [] (do_sync_write+0xbc/0x100)
[] (do_sync_write+0xbc/0x100) from [] (vfs_write+0xb0/0x164)
[] (vfs_write+0xb0/0x164) from [] (sys_write+0x40/0x70)
[] (sys_write+0x40/0x70) from [] (ret_fast_syscall+0x0/0x2c)


nilfs related processes:

[40049.761881] segctord S c02c8610 0 859 2 0x00000000
[40049.761894] [] (schedule+0x2a8/0x3b0) from [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2])
[40049.761999] [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2]) from [] (kthread+0x7c/0x84)
[40049.762081] [] (kthread+0x7c/0x84) from [] (kernel_thread_exit+0x0/0x8)
[40049.762101] nilfs_cleaner S c02c8610 0 860 1 0x00000000
[40049.762115] [] (schedule+0x2a8/0x3b0) from [] (do_nanosleep+0xb0/0x110)
[40049.762137] [] (do_nanosleep+0xb0/0x110) from [] (hrtimer_nanosleep+0xa4/0x12c)
[40049.762161] [] (hrtimer_nanosleep+0xa4/0x12c) from [] (sys_nanosleep+0x9c/0xa4)
[40049.762181] [] (sys_nanosleep+0x9c/0xa4) from [] (ret_fast_syscall+0x0/0x2c)
[40049.762201] segctord S c02c8610 0 862 2 0x00000000
[40049.762214] [] (schedule+0x2a8/0x3b0) from [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2])
[40049.762298] [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2]) from [] (kthread+0x7c/0x84)
[40049.762377] [] (kthread+0x7c/0x84) from [] (kernel_thread_exit+0x0/0x8)
[40049.762397] nilfs_cleaner S c02c8610 0 863 1 0x00000000
[40049.762411] [] (schedule+0x2a8/0x3b0) from [] (do_nanosleep+0xb0/0x110)
[40049.762433] [] (do_nanosleep+0xb0/0x110) from [] (hrtimer_nanosleep+0xa4/0x12c)
[40049.762455] [] (hrtimer_nanosleep+0xa4/0x12c) from [] (sys_nanosleep+0x9c/0xa4)
[40049.762475] [] (sys_nanosleep+0x9c/0xa4) from [] (ret_fast_syscall+0x0/0x2c)
[40049.762495] segctord S c02c8610 0 865 2 0x00000000
[40049.762507] [] (schedule+0x2a8/0x3b0) from [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2])
[40049.762591] [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2]) from [] (kthread+0x7c/0x84)
[40049.762670] [] (kthread+0x7c/0x84) from [] (kernel_thread_exit+0x0/0x8)
[40049.762690] nilfs_cleaner S c02c8610 0 866 1 0x00000000
[40049.762703] [] (schedule+0x2a8/0x3b0) from [] (do_nanosleep+0xb0/0x110)
[40049.762726] [] (do_nanosleep+0xb0/0x110) from [] (hrtimer_nanosleep+0xa4/0x12c)
[40049.762748] [] (hrtimer_nanosleep+0xa4/0x12c) from [] (sys_nanosleep+0x9c/0xa4)
[40049.762768] [] (sys_nanosleep+0x9c/0xa4) from [] (ret_fast_syscall+0x0/0x2c)
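

To make the picture above easier to scan, the task header lines of a SysRq + T dump can be grouped by scheduler state (D = uninterruptible sleep, S = interruptible sleep). What follows is only a minimal sketch, not part of the original report: the field layout (name, state, PC, stack, pid, ppid, flags) is assumed from the lines quoted in this mail, and the names `TASK_RE`, `tasks_by_state` and `SAMPLE` are made up for illustration. It could well need adjusting for other kernel versions or architectures.

```python
import re
from collections import defaultdict

# Task header lines in the dump above look like
#   syslog-ng D c02c8610 0 1698 1 0x00000000
# optionally prefixed by a "[seconds.micros]" printk timestamp.
# This layout is an assumption taken from the quoted output only.
TASK_RE = re.compile(
    r"^(?:\[\s*\d+\.\d+\]\s*)?"   # optional printk timestamp
    r"(?P<comm>\S+)\s+"           # task name
    r"(?P<state>[A-Z])\s+"        # scheduler state letter
    r"(?P<pc>[0-9a-f]{8})\s+"     # saved PC (32-bit here)
    r"(?P<stack>\d+)\s+"          # free stack column
    r"(?P<pid>\d+)\s+"
    r"(?P<ppid>\d+)\s+"
    r"(?P<flags>0x[0-9a-f]+)\s*$"
)


def tasks_by_state(dump):
    """Return {state: [(comm, pid), ...]} for every task header line in dump."""
    groups = defaultdict(list)
    for line in dump.splitlines():
        match = TASK_RE.match(line.strip())
        if match:  # backtrace lines do not match and are skipped
            groups[match.group("state")].append(
                (match.group("comm"), int(match.group("pid")))
            )
    return dict(groups)


# A few lines copied from the dump above, used as sample input.
SAMPLE = """\
syslog-ng D c02c8610 0 1698 1 0x00000000
[] (schedule+0x2a8/0x3b0) from [] (schedule_timeout+0x14c/0x1e8)
less D c02c8610 0 15839 1742 0x00000001
sshd D c02c8610 0 15844 15842 0x00000001
[40049.761881] segctord S c02c8610 0 859 2 0x00000000
[40049.761894] [] (schedule+0x2a8/0x3b0) from [] (nilfs_segctor_thread+0x2d4/0x328 [nilfs2])
[40049.762101] nilfs_cleaner S c02c8610 0 860 1 0x00000000
"""

if __name__ == "__main__":
    for state, tasks in sorted(tasks_by_state(SAMPLE).items()):
        print(state, tasks)
```

On the sample lines this puts syslog-ng, less and sshd in the D group and segctord and nilfs_cleaner in the S group, matching the dump: the writers and the reader are blocked while the nilfs threads are merely sleeping.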