Linux-mm Archive on lore.kernel.org
 help / color / mirror / Atom feed
* [Kernel Bug] INFO: task hung in zswap_decompress
@ 2026-06-09 11:51 Longxing Li
  2026-06-09 15:40 ` Nhat Pham
  0 siblings, 1 reply; 5+ messages in thread
From: Longxing Li @ 2026-06-09 11:51 UTC (permalink / raw)
  To: syzkaller, hannes, yosry, Nhat Pham, chengming.zhou, akpm,
	linux-mm, linux-kernel

Dear Linux kernel developers and maintainers,

We would like to report a new kernel bug found by our tool. INFO: task
hung in zswap_decompress. Details are as follows.

Kernel commit: v7.0.6
Kernel config: see attachment
report: see attachment

We are currently analyzing the root cause and  working on a
reproducible PoC. We will provide further updates in this thread as
soon as we have more information.

Best regards,
Longxing Li

==================================================================
https://drive.google.com/file/d/1Bx2unEf-QntjVi8g6Zw7QNO6OP4cjGO_/view?usp=drive_link

https://drive.google.com/file/d/16xzUrwOvwE67cnMPH3AhhNRWq6hr26Qj/view?usp=drive_link


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [Kernel Bug] INFO: task hung in zswap_decompress
  2026-06-09 11:51 [Kernel Bug] INFO: task hung in zswap_decompress Longxing Li
@ 2026-06-09 15:40 ` Nhat Pham
  2026-06-09 17:50   ` Yosry Ahmed
  0 siblings, 1 reply; 5+ messages in thread
From: Nhat Pham @ 2026-06-09 15:40 UTC (permalink / raw)
  To: Longxing Li
  Cc: syzkaller, hannes, yosry, chengming.zhou, akpm, linux-mm,
	linux-kernel, Kanchana P. Sridhar

On Tue, Jun 9, 2026 at 4:51 AM Longxing Li <coregee2000@gmail.com> wrote:
>
> Dear Linux kernel developers and maintainers,
>
> We would like to report a new kernel bug found by our tool. INFO: task
> hung in zswap_decompress. Details are as follows.
>
> Kernel commit: v7.0.6
> Kernel config: see attachment
> report: see attachment
>
> We are currently analyzing the root cause and  working on a
> reproducible PoC. We will provide further updates in this thread as
> soon as we have more information.
>
> Best regards,
> Longxing Li
>
> ==================================================================
> https://drive.google.com/file/d/1Bx2unEf-QntjVi8g6Zw7QNO6OP4cjGO_/view?usp=drive_link
>
> https://drive.google.com/file/d/16xzUrwOvwE67cnMPH3AhhNRWq6hr26Qj/view?usp=drive_link

+ Kanchana, who last worked on this piece of code


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [Kernel Bug] INFO: task hung in zswap_decompress
  2026-06-09 15:40 ` Nhat Pham
@ 2026-06-09 17:50   ` Yosry Ahmed
  2026-06-09 18:36     ` Kanchana P. Sridhar
  0 siblings, 1 reply; 5+ messages in thread
From: Yosry Ahmed @ 2026-06-09 17:50 UTC (permalink / raw)
  To: Nhat Pham
  Cc: Longxing Li, syzkaller, hannes, chengming.zhou, akpm, linux-mm,
	linux-kernel, Kanchana P. Sridhar

On Tue, Jun 9, 2026 at 8:40 AM Nhat Pham <nphamcs@gmail.com> wrote:
>
> On Tue, Jun 9, 2026 at 4:51 AM Longxing Li <coregee2000@gmail.com> wrote:
> >
> > Dear Linux kernel developers and maintainers,
> >
> > We would like to report a new kernel bug found by our tool. INFO: task
> > hung in zswap_decompress. Details are as follows.
> >
> > Kernel commit: v7.0.6
> > Kernel config: see attachment
> > report: see attachment

If I am reading the report correctly, it seems like we are doing
swapin from the page fault path, and waiting for the per-CPU mutex
that is held by kswapd. Since we can sleep waiting for decompression
while holding the mutex, it's possible that we have some kind of
priority inversion where kswap held the lock, went to sleep, and
didn't run again for a while. But that always been possible for a long
time AFAICT.

Do you have any more details? Is this a new regression (observed when
upgrading to v7.0.6), or is it possible this was a pre-existing issue
and you just found it on this kernel?

> >
> > We are currently analyzing the root cause and  working on a
> > reproducible PoC. We will provide further updates in this thread as
> > soon as we have more information.

Yeah more details like a known-good kernel version, or even better a
reproducer, would certainly help a lot.

> >
> > Best regards,
> > Longxing Li
> >
> > ==================================================================
> > https://drive.google.com/file/d/1Bx2unEf-QntjVi8g6Zw7QNO6OP4cjGO_/view?usp=drive_link
> >
> > https://drive.google.com/file/d/16xzUrwOvwE67cnMPH3AhhNRWq6hr26Qj/view?usp=drive_link
>
> + Kanchana, who last worked on this piece of code


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [Kernel Bug] INFO: task hung in zswap_decompress
  2026-06-09 17:50   ` Yosry Ahmed
@ 2026-06-09 18:36     ` Kanchana P. Sridhar
  2026-06-10  4:56       ` Lance Yang
  0 siblings, 1 reply; 5+ messages in thread
From: Kanchana P. Sridhar @ 2026-06-09 18:36 UTC (permalink / raw)
  To: Yosry Ahmed
  Cc: Nhat Pham, Longxing Li, syzkaller, hannes, chengming.zhou, akpm,
	linux-mm, linux-kernel, Kanchana P. Sridhar

On Tue, Jun 9, 2026 at 10:50 AM Yosry Ahmed <yosry@kernel.org> wrote:
>
> On Tue, Jun 9, 2026 at 8:40 AM Nhat Pham <nphamcs@gmail.com> wrote:
> >
> > On Tue, Jun 9, 2026 at 4:51 AM Longxing Li <coregee2000@gmail.com> wrote:
> > >
> > > Dear Linux kernel developers and maintainers,
> > >
> > > We would like to report a new kernel bug found by our tool. INFO: task
> > > hung in zswap_decompress. Details are as follows.
> > >
> > > Kernel commit: v7.0.6
> > > Kernel config: see attachment
> > > report: see attachment
>
> If I am reading the report correctly, it seems like we are doing
> swapin from the page fault path, and waiting for the per-CPU mutex
> that is held by kswapd. Since we can sleep waiting for decompression
> while holding the mutex, it's possible that we have some kind of
> priority inversion where kswap held the lock, went to sleep, and
> didn't run again for a while. But that always been possible for a long
> time AFAICT.
>
> Do you have any more details? Is this a new regression (observed when
> upgrading to v7.0.6), or is it possible this was a pre-existing issue
> and you just found it on this kernel?
>
> > >
> > > We are currently analyzing the root cause and  working on a
> > > reproducible PoC. We will provide further updates in this thread as
> > > soon as we have more information.
>
> Yeah more details like a known-good kernel version, or even better a
> reproducer, would certainly help a lot.
>
> > >
> > > Best regards,
> > > Longxing Li
> > >
> > > ==================================================================
> > > https://drive.google.com/file/d/1Bx2unEf-QntjVi8g6Zw7QNO6OP4cjGO_/view?usp=drive_link
> > >
> > > https://drive.google.com/file/d/16xzUrwOvwE67cnMPH3AhhNRWq6hr26Qj/view?usp=drive_link
> >
> > + Kanchana, who last worked on this piece of code

Thanks Nhat and Yosry. I agree with Yosry, having a known-good kernel
version would be helpful.

Also, it appears the kernel stack-trace is from before the merging of
the per-CPU acomp_ctx simplifications wrt the mutex, and resources'
lifetime being tied to that of the zswap pool.

Looking forward to more details.

Thanks,
Kanchana


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [Kernel Bug] INFO: task hung in zswap_decompress
  2026-06-09 18:36     ` Kanchana P. Sridhar
@ 2026-06-10  4:56       ` Lance Yang
  0 siblings, 0 replies; 5+ messages in thread
From: Lance Yang @ 2026-06-10  4:56 UTC (permalink / raw)
  To: kanchanapsridhar2026, yosry, nphamcs, coregee2000
  Cc: syzkaller, hannes, chengming.zhou, akpm, linux-mm, linux-kernel,
	Lance Yang


On Tue, Jun 09, 2026 at 08:36:29AM -1000, Kanchana P. Sridhar wrote:
>On Tue, Jun 9, 2026 at 10:50 AM Yosry Ahmed <yosry@kernel.org> wrote:
>>
>> On Tue, Jun 9, 2026 at 8:40 AM Nhat Pham <nphamcs@gmail.com> wrote:
>> >
>> > On Tue, Jun 9, 2026 at 4:51 AM Longxing Li <coregee2000@gmail.com> wrote:
>> > >
>> > > Dear Linux kernel developers and maintainers,

Thanks for reporting this!

>> > >
>> > > We would like to report a new kernel bug found by our tool. INFO: task
>> > > hung in zswap_decompress. Details are as follows.
>> > >
>> > > Kernel commit: v7.0.6
>> > > Kernel config: see attachment
>> > > report: see attachment
>>
>> If I am reading the report correctly, it seems like we are doing
>> swapin from the page fault path, and waiting for the per-CPU mutex
>> that is held by kswapd. Since we can sleep waiting for decompression
>> while holding the mutex, it's possible that we have some kind of
>> priority inversion where kswap held the lock, went to sleep, and
>> didn't run again for a while. But that always been possible for a long
>> time AFAICT.

Cool!

Worth rerunning with CONFIG_DETECT_HUNG_TASK_BLOCKER=y, Should be on by
default with CONFIG_DETECT_HUNG_TASK=y, but I don't see it in the config.

With that enabled, the kernel should hopefully tell us which task likely
owns the mutex :) If kswapd is sitting on the per-CPU zswap mutex and not
getting scheduled back in, that should be pretty clear ;)

CONFIG_DETECT_HUNG_TASK=y
CONFIG_DETECT_HUNG_TASK_BLOCKER=y

# detect after 10s in D state instead of the default 120s
echo 10 > /proc/sys/kernel/hung_task_timeout_secs

# 0 means use hung_task_timeout_secs as the check interval
echo 0 > /proc/sys/kernel/hung_task_check_interval_secs

>>
>> Do you have any more details? Is this a new regression (observed when
>> upgrading to v7.0.6), or is it possible this was a pre-existing issue
>> and you just found it on this kernel?
>>
>> > >
>> > > We are currently analyzing the root cause and  working on a
>> > > reproducible PoC. We will provide further updates in this thread as
>> > > soon as we have more information.
>>
>> Yeah more details like a known-good kernel version, or even better a
>> reproducer, would certainly help a lot.
>>
>> > >
>> > > Best regards,
>> > > Longxing Li
>> > >
>> > > ==================================================================
>> > > https://drive.google.com/file/d/1Bx2unEf-QntjVi8g6Zw7QNO6OP4cjGO_/view?usp=drive_link
>> > >
>> > > https://drive.google.com/file/d/16xzUrwOvwE67cnMPH3AhhNRWq6hr26Qj/view?usp=drive_link
>> >
>> > + Kanchana, who last worked on this piece of code
>
>Thanks Nhat and Yosry. I agree with Yosry, having a known-good kernel
>version would be helpful.
>
>Also, it appears the kernel stack-trace is from before the merging of
>the per-CPU acomp_ctx simplifications wrt the mutex, and resources'
>lifetime being tied to that of the zswap pool.
>
>Looking forward to more details.

+1

Cheers, Lance


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2026-06-10  4:56 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-06-09 11:51 [Kernel Bug] INFO: task hung in zswap_decompress Longxing Li
2026-06-09 15:40 ` Nhat Pham
2026-06-09 17:50   ` Yosry Ahmed
2026-06-09 18:36     ` Kanchana P. Sridhar
2026-06-10  4:56       ` Lance Yang

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox