public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Re: [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds.
       [not found] ` <87zirywvms.fsf@yhuang-dev.intel.com>
@ 2016-05-10  1:42   ` Eric Dumazet
  2016-05-11  2:16     ` [LKP] " Huang, Ying
  0 siblings, 1 reply; 4+ messages in thread
From: Eric Dumazet @ 2016-05-10  1:42 UTC (permalink / raw)
  To: Huang, Ying
  Cc: David S. Miller, lkp, Xiaolong Ye, linux-kernel@vger.kernel.org

On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote:
> Hi, Eric,
>
> kernel test robot <ying.huang@linux.intel.com> writes:
>> FYI, we noticed the following commit:
>>
>> git://internal_merge_and_test_tree devel-catchup-201604281529
>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations")
>>
>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory
>>
>> caused below changes:
>>
>>
>> +--------------------------------------------------+------------+------------+
>> |                                                  | 210732d16d | 9317bb6982 |
>> +--------------------------------------------------+------------+------------+
>> | boot_successes                                   | 40         | 13         |
>> | boot_failures                                    | 0          | 27         |
>> | INFO:task_blocked_for_more_than#seconds          | 0          | 27         |
>> | RIP:native_safe_halt                             | 0          | 20         |
>> | RIP:native_write_msr_safe                        | 0          | 27         |
>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0          | 27         |
>> | backtrace:__close_fd                             | 0          | 27         |
>> | backtrace:SyS_close                              | 0          | 27         |
>> | backtrace:cpu_startup_entry                      | 0          | 19         |
>> | backtrace:watchdog                               | 0          | 27         |
>> | RIP:__lock_acquire                               | 0          | 2          |
>> | backtrace:rpc_async_schedule                     | 0          | 2          |
>> | backtrace:lock_acquire                           | 0          | 1          |
>> | RIP:delay_tsc                                    | 0          | 1          |
>> | backtrace:SYSC_epoll_wait                        | 0          | 1          |
>> | backtrace:SyS_epoll_wait                         | 0          | 1          |
>> | RIP:pvclock_clocksource_read                     | 0          | 1          |
>> | RIP:xs_reclassify_socket                         | 0          | 1          |
>> | backtrace:xs_tcp_setup_socket                    | 0          | 2          |
>> | RIP:insert_work                                  | 0          | 1          |
>> +--------------------------------------------------+------------+------------+
>
> We recently found this patch cause NFS hang in 0day/LKP test system.
> The NFS export can be mounted, but after a while all read/write to NFS
> mount blocked.  This influenced the 0day/LKP testing.  Could you help us
> to fix this?
>
> Best Regards,
> Huang, Ying


I need to officially submit this patch :
http://www.spinics.net/lists/netdev/msg375777.html

Thanks.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds.
  2016-05-10  1:42   ` [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds Eric Dumazet
@ 2016-05-11  2:16     ` Huang, Ying
  2016-05-13  3:01       ` Huang, Ying
  0 siblings, 1 reply; 4+ messages in thread
From: Huang, Ying @ 2016-05-11  2:16 UTC (permalink / raw)
  To: Eric Dumazet; +Cc: linux-kernel@vger.kernel.org, lkp, David S. Miller

Eric Dumazet <edumazet@google.com> writes:
> On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote:
>> Hi, Eric,
>>
>> kernel test robot <ying.huang@linux.intel.com> writes:
>>> FYI, we noticed the following commit:
>>>
>>> git://internal_merge_and_test_tree devel-catchup-201604281529
>>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations")
>>>
>>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory
>>>
>>> caused below changes:
>>>
>>>
>>> +--------------------------------------------------+------------+------------+
>>> |                                                  | 210732d16d | 9317bb6982 |
>>> +--------------------------------------------------+------------+------------+
>>> | boot_successes                                   | 40         | 13         |
>>> | boot_failures                                    | 0          | 27         |
>>> | INFO:task_blocked_for_more_than#seconds          | 0          | 27         |
>>> | RIP:native_safe_halt                             | 0          | 20         |
>>> | RIP:native_write_msr_safe                        | 0          | 27         |
>>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0          | 27         |
>>> | backtrace:__close_fd                             | 0          | 27         |
>>> | backtrace:SyS_close                              | 0          | 27         |
>>> | backtrace:cpu_startup_entry                      | 0          | 19         |
>>> | backtrace:watchdog                               | 0          | 27         |
>>> | RIP:__lock_acquire                               | 0          | 2          |
>>> | backtrace:rpc_async_schedule                     | 0          | 2          |
>>> | backtrace:lock_acquire                           | 0          | 1          |
>>> | RIP:delay_tsc                                    | 0          | 1          |
>>> | backtrace:SYSC_epoll_wait                        | 0          | 1          |
>>> | backtrace:SyS_epoll_wait                         | 0          | 1          |
>>> | RIP:pvclock_clocksource_read                     | 0          | 1          |
>>> | RIP:xs_reclassify_socket                         | 0          | 1          |
>>> | backtrace:xs_tcp_setup_socket                    | 0          | 2          |
>>> | RIP:insert_work                                  | 0          | 1          |
>>> +--------------------------------------------------+------------+------------+
>>
>> We recently found this patch cause NFS hang in 0day/LKP test system.
>> The NFS export can be mounted, but after a while all read/write to NFS
>> mount blocked.  This influenced the 0day/LKP testing.  Could you help us
>> to fix this?
>>
>> Best Regards,
>> Huang, Ying
>
>
> I need to officially submit this patch :
> http://www.spinics.net/lists/netdev/msg375777.html

Thanks a lot!  The patch fixed our NFS hang.

Tested-by: Huang, Ying <ying.huang@intel.com>

Is it possible to put the fixing patch near (preferably next) the patch
trigger regression?  Otherwise a bigger range of patches will not be
bisectable.

Best Regards,
Huang, Ying

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds.
  2016-05-11  2:16     ` [LKP] " Huang, Ying
@ 2016-05-13  3:01       ` Huang, Ying
  2016-05-13  3:38         ` Eric Dumazet
  0 siblings, 1 reply; 4+ messages in thread
From: Huang, Ying @ 2016-05-13  3:01 UTC (permalink / raw)
  To: Eric Dumazet
  Cc: lkp, linux-kernel@vger.kernel.org, David S. Miller, Huang, Ying

"Huang, Ying" <ying.huang@intel.com> writes:

> Eric Dumazet <edumazet@google.com> writes:
>> On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote:
>>> Hi, Eric,
>>>
>>> kernel test robot <ying.huang@linux.intel.com> writes:
>>>> FYI, we noticed the following commit:
>>>>
>>>> git://internal_merge_and_test_tree devel-catchup-201604281529
>>>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations")
>>>>
>>>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory
>>>>
>>>> caused below changes:
>>>>
>>>>
>>>> +--------------------------------------------------+------------+------------+
>>>> |                                                  | 210732d16d | 9317bb6982 |
>>>> +--------------------------------------------------+------------+------------+
>>>> | boot_successes                                   | 40         | 13         |
>>>> | boot_failures                                    | 0          | 27         |
>>>> | INFO:task_blocked_for_more_than#seconds          | 0          | 27         |
>>>> | RIP:native_safe_halt                             | 0          | 20         |
>>>> | RIP:native_write_msr_safe                        | 0          | 27         |
>>>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0          | 27         |
>>>> | backtrace:__close_fd                             | 0          | 27         |
>>>> | backtrace:SyS_close                              | 0          | 27         |
>>>> | backtrace:cpu_startup_entry                      | 0          | 19         |
>>>> | backtrace:watchdog                               | 0          | 27         |
>>>> | RIP:__lock_acquire                               | 0          | 2          |
>>>> | backtrace:rpc_async_schedule                     | 0          | 2          |
>>>> | backtrace:lock_acquire                           | 0          | 1          |
>>>> | RIP:delay_tsc                                    | 0          | 1          |
>>>> | backtrace:SYSC_epoll_wait                        | 0          | 1          |
>>>> | backtrace:SyS_epoll_wait                         | 0          | 1          |
>>>> | RIP:pvclock_clocksource_read                     | 0          | 1          |
>>>> | RIP:xs_reclassify_socket                         | 0          | 1          |
>>>> | backtrace:xs_tcp_setup_socket                    | 0          | 2          |
>>>> | RIP:insert_work                                  | 0          | 1          |
>>>> +--------------------------------------------------+------------+------------+
>>>
>>> We recently found this patch cause NFS hang in 0day/LKP test system.
>>> The NFS export can be mounted, but after a while all read/write to NFS
>>> mount blocked.  This influenced the 0day/LKP testing.  Could you help us
>>> to fix this?
>>>
>>> Best Regards,
>>> Huang, Ying
>>
>>
>> I need to officially submit this patch :
>> http://www.spinics.net/lists/netdev/msg375777.html
>
> Thanks a lot!  The patch fixed our NFS hang.
>
> Tested-by: Huang, Ying <ying.huang@intel.com>
>
> Is it possible to put the fixing patch near (preferably next) the patch
> trigger regression?  Otherwise a bigger range of patches will not be
> bisectable.

Hi, Eric,

Is it possible for this to be fixed in linux-next ASAP?  This has
blocked us (0day functionality and performance test) from testing
linux-next and many other git trees for near 1 week.  Because NFS is
used in our test infrastructure to save test result data.

Best Regards,
Huang, Ying

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [LKP] [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds.
  2016-05-13  3:01       ` Huang, Ying
@ 2016-05-13  3:38         ` Eric Dumazet
  0 siblings, 0 replies; 4+ messages in thread
From: Eric Dumazet @ 2016-05-13  3:38 UTC (permalink / raw)
  To: Huang, Ying; +Cc: lkp, linux-kernel@vger.kernel.org, David S. Miller

Oh right, sorry for the delay.

On Thu, May 12, 2016 at 8:01 PM, Huang, Ying <ying.huang@intel.com> wrote:
> "Huang, Ying" <ying.huang@intel.com> writes:
>
>> Eric Dumazet <edumazet@google.com> writes:
>>> On Mon, May 9, 2016 at 6:26 PM, Huang, Ying <ying.huang@linux.intel.com> wrote:
>>>> Hi, Eric,
>>>>
>>>> kernel test robot <ying.huang@linux.intel.com> writes:
>>>>> FYI, we noticed the following commit:
>>>>>
>>>>> git://internal_merge_and_test_tree devel-catchup-201604281529
>>>>> commit 9317bb69824ec8d078b0b786b6971aedb0af3d4f ("net: SOCKWQ_ASYNC_NOSPACE optimizations")
>>>>>
>>>>> on test machine: vm-kbuild-2G: 2 threads qemu-system-x86_64 -enable-kvm -cpu Haswell,+smep,+smap with 2G memory
>>>>>
>>>>> caused below changes:
>>>>>
>>>>>
>>>>> +--------------------------------------------------+------------+------------+
>>>>> |                                                  | 210732d16d | 9317bb6982 |
>>>>> +--------------------------------------------------+------------+------------+
>>>>> | boot_successes                                   | 40         | 13         |
>>>>> | boot_failures                                    | 0          | 27         |
>>>>> | INFO:task_blocked_for_more_than#seconds          | 0          | 27         |
>>>>> | RIP:native_safe_halt                             | 0          | 20         |
>>>>> | RIP:native_write_msr_safe                        | 0          | 27         |
>>>>> | Kernel_panic-not_syncing:hung_task:blocked_tasks | 0          | 27         |
>>>>> | backtrace:__close_fd                             | 0          | 27         |
>>>>> | backtrace:SyS_close                              | 0          | 27         |
>>>>> | backtrace:cpu_startup_entry                      | 0          | 19         |
>>>>> | backtrace:watchdog                               | 0          | 27         |
>>>>> | RIP:__lock_acquire                               | 0          | 2          |
>>>>> | backtrace:rpc_async_schedule                     | 0          | 2          |
>>>>> | backtrace:lock_acquire                           | 0          | 1          |
>>>>> | RIP:delay_tsc                                    | 0          | 1          |
>>>>> | backtrace:SYSC_epoll_wait                        | 0          | 1          |
>>>>> | backtrace:SyS_epoll_wait                         | 0          | 1          |
>>>>> | RIP:pvclock_clocksource_read                     | 0          | 1          |
>>>>> | RIP:xs_reclassify_socket                         | 0          | 1          |
>>>>> | backtrace:xs_tcp_setup_socket                    | 0          | 2          |
>>>>> | RIP:insert_work                                  | 0          | 1          |
>>>>> +--------------------------------------------------+------------+------------+
>>>>
>>>> We recently found this patch cause NFS hang in 0day/LKP test system.
>>>> The NFS export can be mounted, but after a while all read/write to NFS
>>>> mount blocked.  This influenced the 0day/LKP testing.  Could you help us
>>>> to fix this?
>>>>
>>>> Best Regards,
>>>> Huang, Ying
>>>
>>>
>>> I need to officially submit this patch :
>>> http://www.spinics.net/lists/netdev/msg375777.html
>>
>> Thanks a lot!  The patch fixed our NFS hang.
>>
>> Tested-by: Huang, Ying <ying.huang@intel.com>
>>
>> Is it possible to put the fixing patch near (preferably next) the patch
>> trigger regression?  Otherwise a bigger range of patches will not be
>> bisectable.
>
> Hi, Eric,
>
> Is it possible for this to be fixed in linux-next ASAP?  This has
> blocked us (0day functionality and performance test) from testing
> linux-next and many other git trees for near 1 week.  Because NFS is
> used in our test infrastructure to save test result data.
>
> Best Regards,
> Huang, Ying

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2016-05-13  3:38 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <87eg9kgcwg.fsf@yhuang-dev.intel.com>
     [not found] ` <87zirywvms.fsf@yhuang-dev.intel.com>
2016-05-10  1:42   ` [lkp] [net] 9317bb6982: INFO: task cat-kmsg:893 blocked for more than 300 seconds Eric Dumazet
2016-05-11  2:16     ` [LKP] " Huang, Ying
2016-05-13  3:01       ` Huang, Ying
2016-05-13  3:38         ` Eric Dumazet

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox