public inbox for mptcp@lists.linux.dev
* Re: rcu stalls during fstests runs for xfs
       [not found]           ` <aXyRRaOBkvENTlBE@shinmob>
@ 2026-02-06  9:33             ` Matthieu Baerts
  2026-02-06 10:02               ` Shinichiro Kawasaki
  0 siblings, 1 reply; 3+ messages in thread
From: Matthieu Baerts @ 2026-02-06  9:33 UTC
  To: Shinichiro Kawasaki
  Cc: Kunwu Chan, rcu@vger.kernel.org, linux-xfs@vger.kernel.org, hch,
	Paul E. McKenney, MPTCP Linux

Hi Shinichiro,

Sorry to jump in, but I *think* our CI for the MPTCP subsystem is
hitting the same issue.

On 30/01/2026 12:16, Shinichiro Kawasaki wrote:
> On Jan 29, 2026 / 15:19, Paul E. McKenney wrote:
> [...]
>>>>> I have seen the static-key pattern called out by Dave Chinner when running
>>>>> KASAN on large systems.  We worked around this by disabling KASAN's use
>>>>> of static keys.  In case you were running KASAN in these tests.
>>>>
>>>> As to KASAN, yes, I enable it in my test runs. I find three static-keys under
>>>> mm/kasan/*. I will think if they can be disabled in my test runs. Thanks.
>>>
>>> There is a set of Kconfig options that disables static branches.  If you
>>> cannot find them quickly, please let me know and I can look them up.
> 
> Thank you. But now I know the fix series by Thomas is available. I prioritize
> the evaluation of the fix series. Later on, I will try disabling the static-keys
> if it is required.
> 
>>
>> And Thomas Gleixner posted an alleged fix to the CID issue here:
>>
>> https://lore.kernel.org/lkml/20260129210219.452851594@kernel.org/
>>
>> Please let him know whether or not it helps.
> 
> Good to see this fix candidate series, thanks :) I have set up the patches and
> started my regular test runs. So far, the hangs have been observed once or twice
> a week. To confirm the effect of the fix series, I think two weeks runs will be
> required. Once I get the result, will share it on this thread and with Thomas.

I know it is only one week now, but did you see any effects so far? On
my side, I applied the v2 series -- which has been applied in
tip/sched/urgent -- but I still have issues, and it looks like it is
even more frequent. Maybe what I see is different. If you no longer see
the issues on your side after one week, I'm going to start a new thread
with my issues not to mix them.

Note that in my case, the issue is visible on a system where nested VMs
are used, with and without KASAN (enabled via debug.config), just after
having started a VSOCK listening socket via socat.

Cheers,
Matt
-- 
Sponsored by the NGI0 Core fund.



* Re: rcu stalls during fstests runs for xfs
  2026-02-06  9:33             ` rcu stalls during fstests runs for xfs Matthieu Baerts
@ 2026-02-06 10:02               ` Shinichiro Kawasaki
  2026-02-06 11:04                 ` Matthieu Baerts
  0 siblings, 1 reply; 3+ messages in thread
From: Shinichiro Kawasaki @ 2026-02-06 10:02 UTC
  To: Matthieu Baerts
  Cc: Kunwu Chan, rcu@vger.kernel.org, linux-xfs@vger.kernel.org, hch,
	Paul E. McKenney, MPTCP Linux

On Feb 06, 2026 / 10:33, Matthieu Baerts wrote:
> Hi Shinichiro,
> 
> Sorry to jump in, but I *think* our CI for the MPTCP subsystem is
> hitting the same issue.

Hi Matthieu,

> On 30/01/2026 12:16, Shinichiro Kawasaki wrote:
> > On Jan 29, 2026 / 15:19, Paul E. McKenney wrote:
> > [...]
> >>>>> I have seen the static-key pattern called out by Dave Chinner when running
> >>>>> KASAN on large systems.  We worked around this by disabling KASAN's use
> >>>>> of static keys.  In case you were running KASAN in these tests.
> >>>>
> >>>> As to KASAN, yes, I enable it in my test runs. I find three static-keys under
> >>>> mm/kasan/*. I will think if they can be disabled in my test runs. Thanks.
> >>>
> >>> There is a set of Kconfig options that disables static branches.  If you
> >>> cannot find them quickly, please let me know and I can look them up.
> > 
> > Thank you. But now I know the fix series by Thomas is available. I prioritize
> > the evaluation of the fix series. Later on, I will try disabling the static-keys
> > if it is required.
> > 
> >>
> >> And Thomas Gleixner posted an alleged fix to the CID issue here:
> >>
> >> https://lore.kernel.org/lkml/20260129210219.452851594@kernel.org/
> >>
> >> Please let him know whether or not it helps.
> > 
> > Good to see this fix candidate series, thanks :) I have set up the patches and
> > started my regular test runs. So far, the hangs have been observed once or twice
> > a week. To confirm the effect of the fix series, I think two weeks runs will be
> > required. Once I get the result, will share it on this thread and with Thomas.
> 
> I know it is only one week now, but did you see any effects so far?

No, I do not see any hang so far. And I hope there will be no hang in the
next week either. Fingers crossed...

> On
> my side, I applied the v2 series -- which has been applied in
> tip/sched/urgent -- but I still have issues, and it looks like it is
> even more frequent. Maybe what I see is different. If you no longer see
> the issues on your side after one week, I'm going to start a new thread
> with my issues not to mix them.
> 
> Note that in my case, the issue is visible on a system where nested VMs
> are used, with and without KASAN (enabled via debug.config), just after
> having started a VSOCK listening socket via socat.

I applied the v1 series on top of my test target xfs kernel branches enabling
KASAN.


* Re: rcu stalls during fstests runs for xfs
  2026-02-06 10:02               ` Shinichiro Kawasaki
@ 2026-02-06 11:04                 ` Matthieu Baerts
  0 siblings, 0 replies; 3+ messages in thread
From: Matthieu Baerts @ 2026-02-06 11:04 UTC
  To: Shinichiro Kawasaki
  Cc: Kunwu Chan, rcu@vger.kernel.org, linux-xfs@vger.kernel.org, hch,
	Paul E. McKenney, MPTCP Linux

On 06/02/2026 11:02, Shinichiro Kawasaki wrote:
> On Feb 06, 2026 / 10:33, Matthieu Baerts wrote:
>> Hi Shinichiro,
>>
>> Sorry to jump in, but I *think* our CI for the MPTCP subsystem is
>> hitting the same issue.
> 
> Hi Matthieu,
> 
>> On 30/01/2026 12:16, Shinichiro Kawasaki wrote:
>>> On Jan 29, 2026 / 15:19, Paul E. McKenney wrote:
>>> [...]
>>>>>>> I have seen the static-key pattern called out by Dave Chinner when running
>>>>>>> KASAN on large systems.  We worked around this by disabling KASAN's use
>>>>>>> of static keys.  In case you were running KASAN in these tests.
>>>>>>
>>>>>> As to KASAN, yes, I enable it in my test runs. I find three static-keys under
>>>>>> mm/kasan/*. I will think if they can be disabled in my test runs. Thanks.
>>>>>
>>>>> There is a set of Kconfig options that disables static branches.  If you
>>>>> cannot find them quickly, please let me know and I can look them up.
>>>
>>> Thank you. But now I know the fix series by Thomas is available. I prioritize
>>> the evaluation of the fix series. Later on, I will try disabling the static-keys
>>> if it is required.
>>>
>>>>
>>>> And Thomas Gleixner posted an alleged fix to the CID issue here:
>>>>
>>>> https://lore.kernel.org/lkml/20260129210219.452851594@kernel.org/
>>>>
>>>> Please let him know whether or not it helps.
>>>
>>> Good to see this fix candidate series, thanks :) I have set up the patches and
>>> started my regular test runs. So far, the hangs have been observed once or twice
>>> a week. To confirm the effect of the fix series, I think two weeks runs will be
>>> required. Once I get the result, will share it on this thread and with Thomas.
>>
>> I know it is only one week now, but did you see any effects so far?
> 
> No, I do not see any hang so far. And I hope there will be no hang in the
> next week either. Fingers crossed...

Thank you for your reply!

>> On
>> my side, I applied the v2 series -- which has been applied in
>> tip/sched/urgent -- but I still have issues, and it looks like it is
>> even more frequent. Maybe what I see is different. If you no longer see
>> the issues on your side after one week, I'm going to start a new thread
>> with my issues not to mix them.
>>
>> Note that in my case, the issue is visible on a system where nested VMs
>> are used, with and without KASAN (enabled via debug.config), just after
>> having started a VSOCK listening socket via socat.
> 
> I applied the v1 series on top of my test target xfs kernel branches enabling
> KASAN.

Sorry for the noise, I guess I have a different issue, even if the
traces look similar [1]. Hopefully someone can help me find the root
cause :)

[1]
https://github.com/multipath-tcp/mptcp_net-next/actions/runs/21723325004/job/62658752123#step:7:7288

Cheers,
Matt
-- 
Sponsored by the NGI0 Core fund.



end of thread

Thread overview: 3+ messages
     [not found] <aXdO52wh2rqTUi1E@shinmob>
     [not found] ` <IA1PR14MB565903564F4AA105AF6A21099791A@IA1PR14MB5659.namprd14.prod.outlook.com>
     [not found]   ` <fc611e8e-0da9-4b88-83ef-092d300307e3@paulmck-laptop>
     [not found]     ` <aXrl46PxeHQSpYbX@shinmob>
     [not found]       ` <13b25e07-d7b8-4b4e-a249-b6826b2eea39@paulmck-laptop>
     [not found]         ` <c33c3d3e-a59c-4f5a-a562-13e2cabc2faf@paulmck-laptop>
     [not found]           ` <aXyRRaOBkvENTlBE@shinmob>
2026-02-06  9:33             ` rcu stalls during fstests runs for xfs Matthieu Baerts
2026-02-06 10:02               ` Shinichiro Kawasaki
2026-02-06 11:04                 ` Matthieu Baerts
