* Re: [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds. [not found] ` <87zirywvms.fsf@yhuang-dev.intel.com> @ 2016-05-10 1:42 ` Eric Dumazet 2016-05-11 2:16 ` [LKP] " Huang, Ying 0 siblings, 1 reply; 4+ messages in thread From: Eric Dumazet @ 2016-05-10 1:42 UTC (permalink / raw) To: Huang, Ying Cc: David S. Miller, lkp, Xiaolong Ye, linux-kernel@vger.kernel.org On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote: > Hi, Eric, > > kernel test robot <ying.huang@linux.intel.com> writes: >> FYI, we noticed the following commit: >> >> git://internal_merge_and_test_tree devel-catchup-201604281529 >> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations") >> >> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory >> >> caused below changes: >> >> >> +--------------------------------------------------+------------+------------+ >> | | 210732d16d | 9317bb6982 | >> +--------------------------------------------------+------------+------------+ >> | boot_successes | 40 | 13 | >> | boot_failures | 0 | 27 | >> | INFO:task_blocked_for_more_than#seconds | 0 | 27 | >> | RIP:native_safe_halt | 0 | 20 | >> | RIP:native_write_msr_safe | 0 | 27 | >> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0 | 27 | >> | backtrace:__close_fd | 0 | 27 | >> | backtrace:SyS_close | 0 | 27 | >> | backtrace:cpu_startup_entry | 0 | 19 | >> | backtrace:watchdog | 0 | 27 | >> | RIP:__lock_acquire | 0 | 2 | >> | backtrace:rpc_async_schedule | 0 | 2 | >> | backtrace:lock_acquire | 0 | 1 | >> | RIP:delay_tsc | 0 | 1 | >> | backtrace:SYSC_epoll_wait | 0 | 1 | >> | backtrace:SyS_epoll_wait | 0 | 1 | >> | RIP:pvclock_clocksource_read | 0 | 1 | >> | RIP:xs_reclassify_socket | 0 | 1 | >> | backtrace:xs_tcp_setup_socket | 0 | 2 | >> | RIP:insert_work | 0 | 1 | >> +--------------------------------------------------+------------+------------+ > > We recently found this patch cause NFS hang in 0day/LKP test system. > The NFS export can be mounted, but after a while all read/write to NFS > mount blocked. This influenced the 0day/LKP testing. Could you help us > to fix this? > > Best Regards, > Huang, Ying I need to officially submit this patch : http://www.spinics.net/lists/netdev/msg375777.html Thanks. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds. 2016-05-10 1:42 ` [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds Eric Dumazet @ 2016-05-11 2:16 ` Huang, Ying 2016-05-13 3:01 ` Huang, Ying 0 siblings, 1 reply; 4+ messages in thread From: Huang, Ying @ 2016-05-11 2:16 UTC (permalink / raw) To: Eric Dumazet; +Cc: linux-kernel@vger.kernel.org, lkp, David S. Miller Eric Dumazet <edumazet@google.com> writes: > On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote: >> Hi, Eric, >> >> kernel test robot <ying.huang@linux.intel.com> writes: >>> FYI, we noticed the following commit: >>> >>> git://internal_merge_and_test_tree devel-catchup-201604281529 >>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations") >>> >>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory >>> >>> caused below changes: >>> >>> >>> +--------------------------------------------------+------------+------------+ >>> | | 210732d16d | 9317bb6982 | >>> +--------------------------------------------------+------------+------------+ >>> | boot_successes | 40 | 13 | >>> | boot_failures | 0 | 27 | >>> | INFO:task_blocked_for_more_than#seconds | 0 | 27 | >>> | RIP:native_safe_halt | 0 | 20 | >>> | RIP:native_write_msr_safe | 0 | 27 | >>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0 | 27 | >>> | backtrace:__close_fd | 0 | 27 | >>> | backtrace:SyS_close | 0 | 27 | >>> | backtrace:cpu_startup_entry | 0 | 19 | >>> | backtrace:watchdog | 0 | 27 | >>> | RIP:__lock_acquire | 0 | 2 | >>> | backtrace:rpc_async_schedule | 0 | 2 | >>> | backtrace:lock_acquire | 0 | 1 | >>> | RIP:delay_tsc | 0 | 1 | >>> | backtrace:SYSC_epoll_wait | 0 | 1 | >>> | backtrace:SyS_epoll_wait | 0 | 1 | >>> | RIP:pvclock_clocksource_read | 0 | 1 | >>> | RIP:xs_reclassify_socket | 0 | 1 | >>> | backtrace:xs_tcp_setup_socket | 0 | 2 | >>> | RIP:insert_work | 0 | 1 | >>> +--------------------------------------------------+------------+------------+ >> >> We recently found this patch cause NFS hang in 0day/LKP test system. >> The NFS export can be mounted, but after a while all read/write to NFS >> mount blocked. This influenced the 0day/LKP testing. Could you help us >> to fix this? >> >> Best Regards, >> Huang, Ying > > > I need to officially submit this patch : > http://www.spinics.net/lists/netdev/msg375777.html Thanks a lot! The patch fixed our NFS hang. Tested-by: Huang, Ying <ying.huang@intel.com> Is it possible to put the fixing patch near (preferably next) the patch trigger regression? Otherwise a bigger range of patches will not be bisectable. Best Regards, Huang, Ying ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds. 2016-05-11 2:16 ` [LKP] " Huang, Ying @ 2016-05-13 3:01 ` Huang, Ying 2016-05-13 3:38 ` Eric Dumazet 0 siblings, 1 reply; 4+ messages in thread From: Huang, Ying @ 2016-05-13 3:01 UTC (permalink / raw) To: Eric Dumazet Cc: lkp, linux-kernel@vger.kernel.org, David S. Miller, Huang, Ying "Huang, Ying" <ying.huang@intel.com> writes: > Eric Dumazet <edumazet@google.com> writes: >> On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote: >>> Hi, Eric, >>> >>> kernel test robot <ying.huang@linux.intel.com> writes: >>>> FYI, we noticed the following commit: >>>> >>>> git://internal_merge_and_test_tree devel-catchup-201604281529 >>>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations") >>>> >>>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory >>>> >>>> caused below changes: >>>> >>>> >>>> +--------------------------------------------------+------------+------------+ >>>> | | 210732d16d | 9317bb6982 | >>>> +--------------------------------------------------+------------+------------+ >>>> | boot_successes | 40 | 13 | >>>> | boot_failures | 0 | 27 | >>>> | INFO:task_blocked_for_more_than#seconds | 0 | 27 | >>>> | RIP:native_safe_halt | 0 | 20 | >>>> | RIP:native_write_msr_safe | 0 | 27 | >>>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0 | 27 | >>>> | backtrace:__close_fd | 0 | 27 | >>>> | backtrace:SyS_close | 0 | 27 | >>>> | backtrace:cpu_startup_entry | 0 | 19 | >>>> | backtrace:watchdog | 0 | 27 | >>>> | RIP:__lock_acquire | 0 | 2 | >>>> | backtrace:rpc_async_schedule | 0 | 2 | >>>> | backtrace:lock_acquire | 0 | 1 | >>>> | RIP:delay_tsc | 0 | 1 | >>>> | backtrace:SYSC_epoll_wait | 0 | 1 | >>>> | backtrace:SyS_epoll_wait | 0 | 1 | >>>> | RIP:pvclock_clocksource_read | 0 | 1 | >>>> | RIP:xs_reclassify_socket | 0 | 1 | >>>> | backtrace:xs_tcp_setup_socket | 0 | 2 | >>>> | RIP:insert_work | 0 | 1 | >>>> +--------------------------------------------------+------------+------------+ >>> >>> We recently found this patch cause NFS hang in 0day/LKP test system. >>> The NFS export can be mounted, but after a while all read/write to NFS >>> mount blocked. This influenced the 0day/LKP testing. Could you help us >>> to fix this? >>> >>> Best Regards, >>> Huang, Ying >> >> >> I need to officially submit this patch : >> http://www.spinics.net/lists/netdev/msg375777.html > > Thanks a lot! The patch fixed our NFS hang. > > Tested-by: Huang, Ying <ying.huang@intel.com> > > Is it possible to put the fixing patch near (preferably next) the patch > trigger regression? Otherwise a bigger range of patches will not be > bisectable. Hi, Eric, Is it possible for this to be fixed in linux-next ASAP? This has blocked us (0day functionality and performance test) from testing linux-next and many other git trees for near 1 week. Because NFS is used in our test infrastructure to save test result data. Best Regards, Huang, Ying ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds. 2016-05-13 3:01 ` Huang, Ying @ 2016-05-13 3:38 ` Eric Dumazet 0 siblings, 0 replies; 4+ messages in thread From: Eric Dumazet @ 2016-05-13 3:38 UTC (permalink / raw) To: Huang, Ying; +Cc: lkp, linux-kernel@vger.kernel.org, David S. Miller Oh right, sorry for the delay. On Thu, May 12, 2016 at 8:01 PM, Huang, Ying <ying.huang@intel.com> wrote: > "Huang, Ying" <ying.huang@intel.com> writes: > >> Eric Dumazet <edumazet@google.com> writes: >>> On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote: >>>> Hi, Eric, >>>> >>>> kernel test robot <ying.huang@linux.intel.com> writes: >>>>> FYI, we noticed the following commit: >>>>> >>>>> git://internal_merge_and_test_tree devel-catchup-201604281529 >>>>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations") >>>>> >>>>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory >>>>> >>>>> caused below changes: >>>>> >>>>> >>>>> +--------------------------------------------------+------------+------------+ >>>>> | | 210732d16d | 9317bb6982 | >>>>> +--------------------------------------------------+------------+------------+ >>>>> | boot_successes | 40 | 13 | >>>>> | boot_failures | 0 | 27 | >>>>> | INFO:task_blocked_for_more_than#seconds | 0 | 27 | >>>>> | RIP:native_safe_halt | 0 | 20 | >>>>> | RIP:native_write_msr_safe | 0 | 27 | >>>>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0 | 27 | >>>>> | backtrace:__close_fd | 0 | 27 | >>>>> | backtrace:SyS_close | 0 | 27 | >>>>> | backtrace:cpu_startup_entry | 0 | 19 | >>>>> | backtrace:watchdog | 0 | 27 | >>>>> | RIP:__lock_acquire | 0 | 2 | >>>>> | backtrace:rpc_async_schedule | 0 | 2 | >>>>> | backtrace:lock_acquire | 0 | 1 | >>>>> | RIP:delay_tsc | 0 | 1 | >>>>> | backtrace:SYSC_epoll_wait | 0 | 1 | >>>>> | backtrace:SyS_epoll_wait | 0 | 1 | >>>>> | RIP:pvclock_clocksource_read | 0 | 1 | >>>>> | RIP:xs_reclassify_socket | 0 | 1 | >>>>> | backtrace:xs_tcp_setup_socket | 0 | 2 | >>>>> | RIP:insert_work | 0 | 1 | >>>>> +--------------------------------------------------+------------+------------+ >>>> >>>> We recently found this patch cause NFS hang in 0day/LKP test system. >>>> The NFS export can be mounted, but after a while all read/write to NFS >>>> mount blocked. This influenced the 0day/LKP testing. Could you help us >>>> to fix this? >>>> >>>> Best Regards, >>>> Huang, Ying >>> >>> >>> I need to officially submit this patch : >>> http://www.spinics.net/lists/netdev/msg375777.html >> >> Thanks a lot! The patch fixed our NFS hang. >> >> Tested-by: Huang, Ying <ying.huang@intel.com> >> >> Is it possible to put the fixing patch near (preferably next) the patch >> trigger regression? Otherwise a bigger range of patches will not be >> bisectable. > > Hi, Eric, > > Is it possible for this to be fixed in linux-next ASAP? This has > blocked us (0day functionality and performance test) from testing > linux-next and many other git trees for near 1 week. Because NFS is > used in our test infrastructure to save test result data. > > Best Regards, > Huang, Ying ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2016-05-13 3:38 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <87eg9kgcwg.fsf@yhuang-dev.intel.com>
[not found] ` <87zirywvms.fsf@yhuang-dev.intel.com>
2016-05-10 1:42 ` [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds Eric Dumazet
2016-05-11 2:16 ` [LKP] " Huang, Ying
2016-05-13 3:01 ` Huang, Ying
2016-05-13 3:38 ` Eric Dumazet
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox