From mboxrd@z Thu Jan 1 00:00:00 1970
From: Vladimir Murzin
Subject: Re: Kernel NFS boot failure
Date: Wed, 3 Aug 2016 13:06:00 +0100
Message-ID: <57A1DE28.7020604@arm.com>
References: <674e6841-d2c5-ccae-6633-a699e848e6d2@ti.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Cc: "linux-omap@vger.kernel.org", Sekhar Nori, linux-arm
To: Grygorii Strashko, netdev
In-Reply-To: <674e6841-d2c5-ccae-6633-a699e848e6d2@ti.com>
Sender: "linux-arm-kernel"
Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=m.gmane.org@lists.infradead.org
List-Id: netdev.vger.kernel.org

Hi,

On 03/08/16 12:41, Grygorii Strashko wrote:
> Hi All,
>
> We observe Kernel boot failure while running NFS boot stress test (1000 iterations):
> - Linux version 4.7.0
> - am335x-evm (TI AM335x EVM)
> - failure rate 10-20 times per test.
> Originally this issue was reproduced using TI Kernel 4.4
> ( git://git.ti.com/ti-linux-kernel/ti-linux-kernel.git, branch: ti-linux-4.4.y)
> on both am335x-evm and am57xx-beagle-x15(am57xx-evm) platforms.
> This issue has not been reproduced with TI Kernel 4.1 before.
>
> The SysRq output shows that the system is stuck in nfs_fs_mount()
>
> [ 207.904632] [] (schedule) from [] (rpc_wait_bit_killable+0x2c/0xd8)
> [ 207.912996] [] (rpc_wait_bit_killable) from [] (__wait_on_bit+0x84/0xc0)
> [ 207.921812] [] (__wait_on_bit) from [] (out_of_line_wait_on_bit+0x64/0x70)
> [ 207.930810] [] (out_of_line_wait_on_bit) from [] (__rpc_execute+0x18c/0x544)
> [ 207.939988] [] (__rpc_execute) from [] (rpc_run_task+0x13c/0x158)
> [ 207.948166] [] (rpc_run_task) from [] (rpc_call_sync+0x44/0xc4)
> [ 207.956163] [] (rpc_call_sync) from [] (rpc_ping+0x48/0x68)
> [ 207.963796] [] (rpc_ping) from [] (rpc_create_xprt+0xec/0x164)
> [ 207.971702] [] (rpc_create_xprt) from [] (rpc_create+0xf0/0x1a0)
> [ 207.979794] [] (rpc_create) from [] (nfs_create_rpc_client+0xd4/0xec)
> [ 207.988338] [] (nfs_create_rpc_client) from [] (nfs_init_client+0x20/0x78)
> [ 207.997332] [] (nfs_init_client) from [] (nfs_create_server+0xa0/0x3bc)
> [ 208.006057] [] (nfs_create_server) from [] (nfs3_create_server+0x8/0x20)
> [ 208.014879] [] (nfs3_create_server) from [] (nfs_try_mount+0xc4/0x1f0)
> [ 208.023513] [] (nfs_try_mount) from [] (nfs_fs_mount+0x290/0x910)
> [ 208.031702] [] (nfs_fs_mount) from [] (mount_fs+0x44/0x168)
>
> Has anyone else seen this issue?
>
> I'd appreciate any help or advice related to this issue.

I did not look at the details, but because it is 4.4 and __wait_on_bit showed
up, you might want to look at [1].

[1] https://lkml.org/lkml/2015/11/20/472

Just my 2p.

Vladimir

>
> Thanks in advance.
>
> regards,
> -grygorii