From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@bugzilla.kernel.org
Subject: [Bug 43292] jdb2 lockup with ext3 and nfs
Date: Wed, 30 May 2012 09:18:31 +0000 (UTC)
Message-ID: <20120530091831.258FC11FD1B@bugzilla.kernel.org>
References:
Mime-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
To: linux-ext4@vger.kernel.org
Return-path:
Received: from mail.kernel.org ([198.145.19.201]:47734 "EHLO mail.kernel.org"
rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
id S1750889Ab2E3JSf (ORCPT );
Wed, 30 May 2012 05:18:35 -0400
Received: from mail.kernel.org (localhost [127.0.0.1])
by mail.kernel.org (Postfix) with ESMTP id A100B20322
for ; Wed, 30 May 2012 09:18:33 +0000 (UTC)
Received: from bugzilla.kernel.org (bugzilla.kernel.org [198.145.19.204])
by mail.kernel.org (Postfix) with ESMTP id 2BA3920300
for ; Wed, 30 May 2012 09:18:31 +0000 (UTC)
In-Reply-To:
Sender: linux-ext4-owner@vger.kernel.org
List-ID:
https://bugzilla.kernel.org/show_bug.cgi?id=43292
--- Comment #4 from Jan Kara 2012-05-30 09:18:30 ---
Hmm, thanks for the data. So the jbd2 thread (which is blocking the rest) is
waiting for the following nfsd thread to finish a transaction:
[ 128.487611] nfsd D 0000000000000000 0 750 2 0x00000000
[ 128.487613] ffff880076833050 0000000000000046 ffff880076832f50 ffffffff00000000
[ 128.487615] ffff8800790ea040 ffff880076833fd8 ffff880076833fd8 ffff880076833fd8
[ 128.487618] ffffffff8189b020 ffff8800790ea040 0000000179880938 ffff880079880848
[ 128.487620] Call Trace:
[ 128.487624] [] ? queue_unplugged+0x59/0x110
[ 128.487627] [] ? __wait_on_buffer+0x30/0x30
[ 128.487629] [] schedule+0x3f/0x60
[ 128.487631] [] io_schedule+0x8f/0xd0
[ 128.487633] [] sleep_on_buffer+0xe/0x20
[ 128.487634] [] __wait_on_bit+0x60/0x90
[ 128.487637] [] ? __wait_on_buffer+0x30/0x30
[ 128.487639] [] out_of_line_wait_on_bit+0x7c/0x90
[ 128.487641] [] ? autoremove_wake_function+0x40/0x40
[ 128.487643] [] __wait_on_buffer+0x2e/0x30
[ 128.487648] [] ext4_mb_init_cache+0x203/0x9c0 [ext4]
[ 128.487651] [] ? __lru_cache_add+0x90/0xb0
[ 128.487656] [] ext4_mb_init_group+0x10e/0x210 [ext4]
[ 128.487660] [] ext4_mb_good_group+0x116/0x130 [ext4]
[ 128.487665] [] ext4_mb_regular_allocator+0x19b/0x420 [ext4]
[ 128.487669] [] ? ext4_mb_normalize_request+0x20d/0x500 [ext4]
[ 128.487674] [] ext4_mb_new_blocks+0x42e/0x5d0 [ext4]
[ 128.487678] [] ext4_alloc_branch+0x528/0x670 [ext4]
[ 128.487681] [] ? zone_statistics+0x99/0xc0
[ 128.487686] [] ext4_ind_map_blocks+0x328/0x7c0 [ext4]
[ 128.487690] [] ext4_map_blocks+0xda/0x1f0 [ext4]
[ 128.487694] [] _ext4_get_block+0xa6/0x160 [ext4]
[ 128.487698] [] ext4_get_block+0x16/0x20 [ext4]
[ 128.487701] [] __block_write_begin+0x1c7/0x590
[ 128.487705] [] ? noalloc_get_block_write+0x30/0x30 [ext4]
[ 128.487709] [] ext4_write_begin+0x162/0x390 [ext4]
[ 128.487711] [] generic_file_buffered_write+0x109/0x260
[ 128.487714] [] __generic_file_aio_write+0x245/0x460
[ 128.487716] [] ? __mutex_lock_slowpath+0x25d/0x350
[ 128.487718] [] generic_file_aio_write+0x6e/0xe0
[ 128.487722] [] ext4_file_write+0xaf/0x260 [ext4]
[ 128.487724] [] ? iget_locked+0x89/0x180
[ 128.487727] [] ? _fh_update.isra.8.part.9+0x60/0x60 [nfsd]
[ 128.487729] [] ? iput+0x42/0x1c0
[ 128.487733] [] ? ext4_llseek+0x110/0x110 [ext4]
[ 128.487735] [] do_sync_readv_writev+0xd2/0x110
[ 128.487738] [] ? _fh_update.isra.8.part.9+0x60/0x60 [nfsd]
[ 128.487741] [] ? __kmalloc+0x39/0x1a0
[ 128.487743] [] ? security_file_permission+0x2c/0xb0
[ 128.487745] [] ? rw_verify_area+0x61/0xf0
[ 128.487747] [] do_readv_writev+0xd4/0x1e0
[ 128.487751] [] ? ext4_file_open+0x6f/0x1e0 [ext4]
[ 128.487753] [] vfs_writev+0x35/0x60
[ 128.487757] [] nfsd_vfs_write.isra.9+0xeb/0x3e0 [nfsd]
[ 128.487759] [] ? dentry_open+0x4f/0x90
[ 128.487762] [] ? nfsd_open+0xa0/0x1a0 [nfsd]
[ 128.487766] [] nfsd_write+0xf8/0x110 [nfsd]
[ 128.487770] [] nfsd3_proc_write+0xbb/0x150 [nfsd]
[ 128.487773] [] nfsd_dispatch+0xfe/0x240 [nfsd]
[ 128.487777] [] svc_process+0x4bb/0x840 [sunrpc]
[ 128.487779] [] ? recalc_sigpending+0x1b/0x50
[ 128.487782] [] nfsd+0xc2/0x160 [nfsd]
[ 128.487785] [] ? 0xffffffffa0625fff
[ 128.487786] [] kthread+0x8c/0xa0
[ 128.487788] [] kernel_thread_helper+0x4/0x10
[ 128.487791] [] ? kthread_worker_fn+0x180/0x180
[ 128.487792] [] ? gs_change+0x13/0x13
That thread is waiting for a block bitmap to be read from disk. The question
is why the read would take so long (or never actually finish).
For a start, could you switch the IO scheduler to deadline? You can do that with:
echo "deadline" >/sys/block//queue/scheduler
for each disk. Then reproduce the problem and post the output of
echo w > /proc/sysrq-trigger here again. Thanks.
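If there are many disks, the two steps above could be scripted roughly as
follows. This is only a sketch: the function names and the SYSFS_BLOCK / SYSRQ
variables are made up here for illustration (they exist only so the loop can be
dry-run against a scratch directory), and it assumes the running kernel offers
an elevator named "deadline" and that the script is run as root.

```shell
#!/bin/sh
# Hypothetical helper, not part of any kernel tooling.
# SYSFS_BLOCK and SYSRQ are illustrative overrides; the defaults are the real paths.
SYSFS_BLOCK="${SYSFS_BLOCK:-/sys/block}"
SYSRQ="${SYSRQ:-/proc/sysrq-trigger}"

# Select the deadline elevator on every block device that exposes a
# writable queue/scheduler file, and print what each file reports afterwards.
set_deadline_all() {
    for f in "$SYSFS_BLOCK"/*/queue/scheduler; do
        [ -w "$f" ] || continue   # no scheduler file, or not writable: skip
        echo deadline > "$f" || echo "could not set deadline on $f" >&2
        printf '%s: %s\n' "$f" "$(cat "$f")"
    done
}

# Ask the kernel to log all tasks in uninterruptible (blocked) state,
# which is what writing 'w' to sysrq-trigger does.
dump_blocked_tasks() {
    echo w > "$SYSRQ"
}
```

After sourcing this, the intended sequence would be: run set_deadline_all,
reproduce the hang, run dump_blocked_tasks, and then collect the traces from
the kernel log (for example with dmesg).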