zswap_writeback_entry crashes in 6.9.5

linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed

* zswap_writeback_entry crashes in 6.9.5
@ 2024-06-30 17:58 Pedro Falcato
  2024-07-01  1:07 ` Nhat Pham
                   ` (2 more replies)
  0 siblings, 3 replies; 7+ messages in thread
From: Pedro Falcato @ 2024-06-30 17:58 UTC (permalink / raw)
  To: mm
  Cc: Linux Regressions, Johannes Weiner, Yosry Ahmed, Nhat Pham,
	Chengming Zhou, Christian Heusel

Hi everyone,

Reporting a bug I've experienced over the last few releases, all in
6.9. I don't have a repro for it (seems to happen randomly?). Distro
is Arch Linux (which is itself minimally patched, config
https://gitlab.archlinux.org/archlinux/packaging/packages/linux/-/blob/e1057d426590bd2064df83779f8644de519874c0/config).

I understand 6.9.5 is a little old, but as I have no repro I'm posting
this in case someone has any hints or has hit this before.

dmesg:
BUG: unable to handle page fault for address: 00000000000315af
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 5 PID: 90 Comm: kswapd0 Tainted: P     U     OE
6.9.5-arch1-1 #1 b9e5462a84a73f67b5c7c6b73f88d2a6349ae768
Hardware name: ASUSTeK COMPUTER INC. X542URR/X542URR, BIOS X542URR.310
10/29/2021
RIP: 0010:zswap_writeback_entry+0x128/0x1f0
Code: 89 ef e8 3b 5a a8 00 48 89 de 48 89 ef e8 20 f4 ff ff 65 48 ff
05 d8 82 68 7e 4c 8b 6d 38 4d 85 ed 74 1a 66 90 e8 f8 2a dc ff <49> 8b
7d 10 be 6e 00 00 00 e8 fa eb ff ff e8 75 7d dc ff 48 89 ef
RSP: 0018:ffffbc740049b848 EFLAGS: 00010202
RAX: 0000000000000001 RBX: ffffdcd8c489c480 RCX: 0000000000001000
RDX: ffff9e0401e1ec00 RSI: 0000000000000f80 RDI: ffff9e04b3641bc8
RBP: ffff9e04603b15a0 R08: 0000000000000080 R09: ffff9e0422713000
R10: 0000000000000000 R11: 0000000000000000 R12: ffffbc740049b850
R13: 000000000003159f R14: 00000000001d6747 R15: ffff9e040d207750
FS:  0000000000000000(0000) GS:ffff9e0566c80000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00000000000315af CR3: 000000020a420003 CR4: 00000000003706f0
Call Trace:
<TASK>
? __die_body.cold+0x19/0x27
? page_fault_oops+0x15a/0x2b0
? exc_page_fault+0x81/0x190
? asm_exc_page_fault+0x26/0x30
? zswap_writeback_entry+0x128/0x1f0
? zswap_writeback_entry+0x128/0x1f0
shrink_memcg_cb+0x82/0xd0
__list_lru_walk_one+0xa3/0x1b0
? __pfx_shrink_memcg_cb+0x10/0x10
? __pfx_shrink_memcg_cb+0x10/0x10
list_lru_walk_one+0x5d/0x90
zswap_shrinker_scan+0xbd/0x120
do_shrink_slab+0x143/0x360
shrink_slab+0x2a9/0x3b0
shrink_one+0x120/0x1f0
shrink_node+0x962/0xbb0
balance_pgdat+0x4c5/0x960
? psi_task_switch+0xd6/0x230
? finish_task_switch.isra.0+0x99/0x2e0
kswapd+0x1f5/0x380
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_kswapd+0x10/0x10
kthread+0xcf/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x31/0x50
? __pfx_kthread+0x10/0x10
ret_from_fork_asm+0x1a/0x30
</TASK>
Modules linked in: snd_seq_dummy rfcomm snd_hrtimer snd_seq
xt_conntrack xt_MASQUERADE nf_conntrack_netlink xt_addrtype nft_compat
br_netfilter xfrm_interface xfrm6_tunnel tunnel4 tunnel6 xfrm_user
xfrm_algo nf
t_masq nft_ct nft_reject_ipv4 nf_reject_ipv4 nft_reject nft_chain_nat
nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables libcrc32c
bridge stp llc overlay cmac algif_hash algif_skcipher af_alg bnep vbox
netflt(OE) vboxnetadp(OE) vboxdrv(OE) nvidia_uvm(POE) nvidia_drm(POE)
nvidia_modeset(POE) snd_hda_codec_hdmi nvidia(POE)
intel_uncore_frequency intel_uncore_frequency_common snd_soc_avs
snd_soc_hda_codec snd_so
c_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_sst_ipc
snd_soc_sst_dsp intel_tcc_cooling x86_pkg_temp_thermal
snd_soc_acpi_intel_match snd_hda_codec_realtek intel_powerclamp
snd_soc_acpi coretemp snd_hda_codec
_generic snd_soc_core snd_hda_scodec_component snd_compress kvm_intel
ac97_bus vfat snd_pcm_dmaengine crct10dif_pclmul fat snd_hda_intel
crc32_pclmul polyval_clmulni
snd_intel_dspcfg ath10k_pci polyval_generic btusb uvcvideo
snd_intel_sdw_acpi gf128mul ath10k_core videobuf2_vmalloc
ghash_clmulni_intel btrtl snd_usb_audio sha512_ssse3 btintel uvc
snd_hda_codec ath sha256_ss
se3 processor_thermal_device_pci_legacy videobuf2_memops
snd_usbmidi_lib btbcm processor_thermal_device videobuf2_v4l2
sha1_ssse3 spi_pxa2xx_platform btmtk hid_multitouch aesni_intel
8250_dw dw_dmac mac80211 le
dtrig_netdev snd_ump processor_thermal_wt_hint snd_hda_core
crypto_simd videodev bluetooth processor_thermal_rfim snd_rawmidi
cryptd r8169 iTCO_wdt asus_nb_wmi snd_hwdep videobuf2_common
processor_thermal_rapl
rapl asus_wmi intel_pmc_bxt libarc4 snd_seq_device ecdh_generic mc
mousedev joydev snd_pcm intel_rapl_msr ee1004 mei_hdcp mei_pxp
iTCO_vendor_support intel_cstate snd_timer intel_rapl_common realtek
platform_pr
ofile cfg80211 intel_uncore pcspkr wmi_bmof i2c_i801 sparse_keymap
processor_thermal_wt_req snd mei_me mdio_devres intel_lpss_pci
processor_thermal_power_floor i2c_smbus rfkill
processor_thermal_mbox intel_xhci_usb_role_switch intel_lpss soundcore
libphy mei intel_pch_thermal roles intel_soc_dts_iosf idma64
i2c_hid_acpi i2c_hid int3403_thermal intel_pmc_core
int340x_thermal_zone inte
l_vsec pmt_telemetry int3400_thermal acpi_thermal_rel
pinctrl_sunrisepoint pmt_class acpi_pad asus_wireless mac_hid kvmgt
mdev vfio_iommu_type1 vfio iommufd kvm i2c_dev sg crypto_user dm_mod
loop nfnetlink ip_t
ables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic
usbhid i915 serio_raw i2c_algo_bit atkbd drm_buddy libps2 ttm
vivaldi_fmap mxm_wmi intel_gtt crc32c_intel drm_display_helper
xhci_pci cec xhci_p
ci_renesas i8042 video serio wmi
CR2: 00000000000315af
---[ end trace 0000000000000000 ]---
RIP: 0010:zswap_writeback_entry+0x128/0x1f0
Code: 89 ef e8 3b 5a a8 00 48 89 de 48 89 ef e8 20 f4 ff ff 65 48 ff
05 d8 82 68 7e 4c 8b 6d 38 4d 85 ed 74 1a 66 90 e8 f8 2a dc ff <49> 8b
7d 10 be 6e 00 00 00 e8 fa eb ff ff e8 75 7d dc ff 48 89 ef
RSP: 0018:ffffbc740049b848 EFLAGS: 00010202
RAX: 0000000000000001 RBX: ffffdcd8c489c480 RCX: 0000000000001000
RDX: ffff9e0401e1ec00 RSI: 0000000000000f80 RDI: ffff9e04b3641bc8
RBP: ffff9e04603b15a0 R08: 0000000000000080 R09: ffff9e0422713000
R10: 0000000000000000 R11: 0000000000000000 R12: ffffbc740049b850
R13: 000000000003159f R14: 00000000001d6747 R15: ffff9e040d207750
FS:  0000000000000000(0000) GS:ffff9e0566c80000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00000000000315af CR3: 000000020a420003 CR4: 00000000003706f0
note: kswapd0[90] exited with irqs disabled
------------[ cut here ]------------
WARNING: CPU: 5 PID: 90 at kernel/exit.c:827 do_exit+0x8c7/0xaf0
Modules linked in: snd_seq_dummy rfcomm snd_hrtimer snd_seq
xt_conntrack xt_MASQUERADE nf_conntrack_netlink xt_addrtype nft_compat
br_netfilter xfrm_interface xfrm6_tunnel tunnel4 tunnel6 xfrm_user
xfrm_algo nf
t_masq nft_ct nft_reject_ipv4 nf_reject_ipv4 nft_reject nft_chain_nat
nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables libcrc32c
bridge stp llc overlay cmac algif_hash algif_skcipher af_alg bnep vbox
netflt(OE) vboxnetadp(OE) vboxdrv(OE) nvidia_uvm(POE) nvidia_drm(POE)
nvidia_modeset(POE) snd_hda_codec_hdmi nvidia(POE)
intel_uncore_frequency intel_uncore_frequency_common snd_soc_avs
snd_soc_hda_codec snd_so
c_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_sst_ipc
snd_soc_sst_dsp intel_tcc_cooling x86_pkg_temp_thermal
snd_soc_acpi_intel_match snd_hda_codec_realtek intel_powerclamp
snd_soc_acpi coretemp snd_hda_codec
_generic snd_soc_core snd_hda_scodec_component snd_compress kvm_intel
ac97_bus vfat snd_pcm_dmaengine crct10dif_pclmul fat snd_hda_intel
crc32_pclmul polyval_clmulni
snd_intel_dspcfg ath10k_pci polyval_generic btusb uvcvideo
snd_intel_sdw_acpi gf128mul ath10k_core videobuf2_vmalloc
ghash_clmulni_intel btrtl snd_usb_audio sha512_ssse3 btintel uvc
snd_hda_codec ath sha256_ss
se3 processor_thermal_device_pci_legacy videobuf2_memops
snd_usbmidi_lib btbcm processor_thermal_device videobuf2_v4l2
sha1_ssse3 spi_pxa2xx_platform btmtk hid_multitouch aesni_intel
8250_dw dw_dmac mac80211 le
dtrig_netdev snd_ump processor_thermal_wt_hint snd_hda_core
crypto_simd videodev bluetooth processor_thermal_rfim snd_rawmidi
cryptd r8169 iTCO_wdt asus_nb_wmi snd_hwdep videobuf2_common
processor_thermal_rapl
rapl asus_wmi intel_pmc_bxt libarc4 snd_seq_device ecdh_generic mc
mousedev joydev snd_pcm intel_rapl_msr ee1004 mei_hdcp mei_pxp
iTCO_vendor_support intel_cstate snd_timer intel_rapl_common realtek
platform_pr
ofile cfg80211 intel_uncore pcspkr wmi_bmof i2c_i801 sparse_keymap
processor_thermal_wt_req snd mei_me mdio_devres intel_lpss_pci
processor_thermal_power_floor i2c_smbus rfkill
processor_thermal_mbox intel_xhci_usb_role_switch intel_lpss soundcore
libphy mei intel_pch_thermal roles intel_soc_dts_iosf idma64
i2c_hid_acpi i2c_hid int3403_thermal intel_pmc_core
int340x_thermal_zone inte
l_vsec pmt_telemetry int3400_thermal acpi_thermal_rel
pinctrl_sunrisepoint pmt_class acpi_pad asus_wireless mac_hid kvmgt
mdev vfio_iommu_type1 vfio iommufd kvm i2c_dev sg crypto_user dm_mod
loop nfnetlink ip_t
ables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic
usbhid i915 serio_raw i2c_algo_bit atkbd drm_buddy libps2 ttm
vivaldi_fmap mxm_wmi intel_gtt crc32c_intel drm_display_helper
xhci_pci cec xhci_p
ci_renesas i8042 video serio wmi
CPU: 5 PID: 90 Comm: kswapd0 Tainted: P     UD    OE
6.9.5-arch1-1 #1 b9e5462a84a73f67b5c7c6b73f88d2a6349ae768
Hardware name: ASUSTeK COMPUTER INC. X542URR/X542URR, BIOS X542URR.310
10/29/2021
RIP: 0010:do_exit+0x8c7/0xaf0
Code: 08 00 00 e9 09 fe ff ff 49 8d 7c 24 18 e8 61 4f 07 00 e9 3b f8
ff ff 48 8b bb 20 06 00 00 31 f6 e8 7e e1 ff ff e9 83 fd ff ff <0f> 0b
e9 b1 f7 ff ff 48 89 df e8 ca 67 12 00 e9 54 f9 ff ff 0f 0b
RSP: 0018:ffffbc740049bec8 EFLAGS: 00010282
RAX: 0000000400000000 RBX: ffff9e0401e1ec00 RCX: 0000000000000000
RDX: 0000000000000001 RSI: 0000000000002710 RDI: ffff9e0401017380
RBP: 0000000000000009 R08: 0000000000000000 R09: ffffbc740049bdb8
R10: ffffffff836b21a8 R11: 0000000000000003 R12: ffff9e040101f500
R13: ffff9e0401017380 R14: 0000000000000000 R15: 0000000000000000
FS:  0000000000000000(0000) GS:ffff9e0566c80000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00000000000315af CR3: 000000020a420003 CR4: 00000000003706f0
Call Trace:
<TASK>
? do_exit+0x8c7/0xaf0
? __warn.cold+0x8e/0xe8
? do_exit+0x8c7/0xaf0
? report_bug+0xff/0x140
? handle_bug+0x3c/0x80
? exc_invalid_op+0x17/0x70
? asm_exc_invalid_op+0x1a/0x20
? do_exit+0x8c7/0xaf0
? do_exit+0x71/0xaf0
make_task_dead+0x90/0x90
rewind_stack_and_make_dead+0x16/0x20
RIP: 0000:0x0
Code: Unable to access opcode bytes at 0xffffffffffffffd6.
RSP: 0000:0000000000000000 EFLAGS: 00000000 ORIG_RAX: 0000000000000000
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
</TASK>
---[ end trace 0000000000000000 ]---

A little disassembly around zswap_writeback_entry+0x128 (+296 in the disasm):
   0xffffffff813a97ab <+267>:   call   0xffffffff813a8bd0 <zswap_decompress>
  0xffffffff813a97b0 <+272>:   incq   %gs:0x7ec882d8(%rip)        #
0x31a90 <vm_event_states+880>
  0xffffffff813a97b8 <+280>:   mov    0x38(%rbp),%r13
  0xffffffff813a97bc <+284>:   test   %r13,%r13
  0xffffffff813a97bf <+287>:   je     0xffffffff813a97db
<zswap_writeback_entry+315>
  0xffffffff813a97c1 <+289>:   jmp    0xffffffff813a97db
<zswap_writeback_entry+315>
  0xffffffff813a97c3 <+291>:   call   0xffffffff8116c2c0 <__rcu_read_lock>
  0xffffffff813a97c8 <+296>:   mov    0x10(%r13),%rdi
  0xffffffff813a97cc <+300>:   mov    $0x6e,%esi
  0xffffffff813a97d1 <+305>:   call   0xffffffff813a83d0
<count_memcg_events.constprop.0>
  0xffffffff813a97d6 <+310>:   call   0xffffffff81171550 <__rcu_read_unlock>
  0xffffffff813a97db <+315>:   mov    %rbp,%rdi
  0xffffffff813a97de <+318>:   call   0xffffffff813a9520 <zswap_entry_free>
  0xffffffff813a97e3 <+323>:   lock orb $0x8,(%rbx)
  0xffffffff813a97e7 <+327>:   lock orb $0x4,0x2(%rbx)
  0xffffffff813a97ec <+332>:   mov    %r12,%rsi
  0xffffffff813a97ef <+335>:   mov    %rbx,%rdi
  0xffffffff813a97f2 <+338>:   call   0xffffffff8139f2e0 <__swap_writepage>

which (by my manual inspection) more or less cleanly lands us around:


zswap_decompress(entry, &folio->page);

count_vm_event(ZSWPWB);
if (entry->objcg)
    count_objcg_event(entry->objcg, ZSWPWB);

zswap_entry_free(entry);

/* folio is up to date */
folio_mark_uptodate(folio);

/* move it to the tail of the inactive list after end_writeback */
folio_set_reclaim(folio);

/* start writeback */
__swap_writepage(folio, &wbc);



It looks like entry->objcg is garbage? My second-to-last repro of this
(in my logs) has an equally bogus but different address (0x1317), on
6.9.1.

Again, really sorry I can't give out more information, but my repro
really is "Use the system for a bunch of days in a row and we'll
Eventually(tm) hit this", which really isn't suited for bisection or
anything of the sort :)

PS: I tested my RAM for a few memtest86 cycles, to be sure it wasn't
just a RAM stick gone bad.

Thanks!
Pedro


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-06-30 17:58 zswap_writeback_entry crashes in 6.9.5 Pedro Falcato
@ 2024-07-01  1:07 ` Nhat Pham
  2024-07-19 18:57   ` Pedro Falcato
  2024-07-01  3:44 ` Chengming Zhou
  2024-07-02  0:33 ` Builder
  2 siblings, 1 reply; 7+ messages in thread
From: Nhat Pham @ 2024-07-01  1:07 UTC (permalink / raw)
  To: Pedro Falcato
  Cc: mm, Linux Regressions, Johannes Weiner, Yosry Ahmed,
	Chengming Zhou, Christian Heusel

On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
>
> Hi everyone,

Hi Pedro,

Thanks for the bug report! Taking a look now - some preliminary
questions to narrow down the suspects and aid the debugging process:

a) Do you observe this bug in 6.8? 6.10?

b) Have you run the faddr2line script to verify that the line that
triggers the crash is count_objcg_event(entry->objcg, ZSWPWB);?

c) Do you have a full dmesg log? Or maybe some other reproduction instructions?

If entry->objcg is garbage, then this smells like a lifetime/reference
counting issue. Either:

a) The zswap entry itself is garbage. Not impossible, but seems
unlikely. In 6.9, we effectively isolate the entry first through the
swap cache, then check and remove it from the zswap tree (under the
tree's lock). The former locks out concurrent accessors, and the
latter should have taken care of invalidated entries (and prevents
future invalidation attempts). Furthermore, after this, if the entry
is somehow garbage (i.e freed and recycled), it should also be
possible to blow up in the decompression step first, by feeding a
garbage handle to zsmalloc and crashing the kernel at that point. IOW,
we should also see zsmalloc crashes in addition to this particular
crash, no? I cannot think of any protection mechanism that applies to
the decompression step and not to count_objcg_event().

b) entry->objcg has been freed/recycled under us. This is much
trickier, as the culprit could be any holder of the objcg reference
who accidentally double-released the reference it held. That said, if
it only happened on zswap shrinker path, then maybe there is something
to this...

Let me muse on this a bit more. Please let us know if you have other
clues, traces, hints, or observation - it will help the investigation
a lot!

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-07-01  1:07 ` Nhat Pham
@ 2024-07-19 18:57   ` Pedro Falcato
  2024-07-19 20:47     ` Nhat Pham
  0 siblings, 1 reply; 7+ messages in thread
From: Pedro Falcato @ 2024-07-19 18:57 UTC (permalink / raw)
  To: Nhat Pham
  Cc: mm, Linux Regressions, Johannes Weiner, Yosry Ahmed,
	Chengming Zhou, Christian Heusel, yshuiv7

On Mon, Jul 1, 2024 at 2:07 AM Nhat Pham <nphamcs@gmail.com> wrote:
>
> On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
> >
> > Hi everyone,
>
> Hi Pedro,
>

Hi Nhat,

Sorry for necroing, but I've been really busy and this issue totally
left my mind. Following up so I don't leave you hanging:

> Thanks for the bug report! Taking a look now - some preliminary
> questions to narrow down the suspects and aid the debugging process:
>
> a) Do you observe this bug in 6.8? 6.10?

Yes.
>
> b) Have you run the faddr2line script to verify that the line that
> triggers the crash is count_objcg_event(entry->objcg, ZSWPWB);?

I did not (distro kernel does not seem to have debug symbols), but
it's pretty clear from disassembly.
>
> c) Do you have a full dmesg log? Or maybe some other reproduction instructions?
>
> If entry->objcg is garbage, then this smells like a lifetime/reference
> counting issue. Either:
>
> a) The zswap entry itself is garbage. Not impossible, but seems
> unlikely. In 6.9, we effectively isolate the entry first through the
> swap cache, then check and remove it from the zswap tree (under the
> tree's lock). The former locks out concurrent accessors, and the
> latter should have taken care of invalidated entries (and prevents
> future invalidation attempts). Furthermore, after this, if the entry
> is somehow garbage (i.e freed and recycled), it should also be
> possible to blow up in the decompression step first, by feeding a
> garbage handle to zsmalloc and crashing the kernel at that point. IOW,
> we should also see zsmalloc crashes in addition to this particular
> crash, no? I cannot think of any protection mechanism that applies to
> the decompression step and not to count_objcg_event().
>
> b) entry->objcg has been freed/recycled under us. This is much
> trickier, as the culprit could be any holder of the objcg reference
> who accidentally double-released the reference it held. That said, if
> it only happened on zswap shrinker path, then maybe there is something
> to this...
>

I have a separate theory. I also run the NVIDIA proprietary drivers.
slabinfo -a shows us:
[...]
:0000080     <- zswap_entry Acpi-Parse kernfs_iattrs_cache
uvm_tools_replay_data_t Acpi-State audit_tree_mark
[...]

See the uvm_tools_replay_data_t there? Yeah, it's entirely possible
some random nvidia.ko bug has been corrupting zswap_entry from time to
time (which explains why e.g the big server people have not seen
this).
I'm not sure if Yuxuan is running the same driver, but their kernel is
also proprietary-tainted.

As such I'll refrain from posting more about this or similar bugs
until I can get a guarantee it happens with a non-tainted kernel
(fwiw, I have not seen crashes for 2 weeks or so, hopefully this issue
is fixed).

Again, sorry for not checking the taint before posting this, and thank
you for your time :)

-- 
Pedro


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-07-19 18:57   ` Pedro Falcato
@ 2024-07-19 20:47     ` Nhat Pham
  0 siblings, 0 replies; 7+ messages in thread
From: Nhat Pham @ 2024-07-19 20:47 UTC (permalink / raw)
  To: Pedro Falcato
  Cc: mm, Linux Regressions, Johannes Weiner, Yosry Ahmed,
	Chengming Zhou, Christian Heusel, yshuiv7

On Fri, Jul 19, 2024 at 11:57 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
>
> On Mon, Jul 1, 2024 at 2:07 AM Nhat Pham <nphamcs@gmail.com> wrote:
> >
> > On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
> > >
> > > Hi everyone,
> >
> > Hi Pedro,
> >
>
> I have a separate theory. I also run the NVIDIA proprietary drivers.
> slabinfo -a shows us:
> [...]
> :0000080     <- zswap_entry Acpi-Parse kernfs_iattrs_cache
> uvm_tools_replay_data_t Acpi-State audit_tree_mark
> [...]
>
> See the uvm_tools_replay_data_t there? Yeah, it's entirely possible
> some random nvidia.ko bug has been corrupting zswap_entry from time to
> time (which explains why e.g the big server people have not seen
> this).

Oh this is fascinating. I would never have guessed this. Thanks for
doing the investigation!

> I'm not sure if Yuxuan is running the same driver, but their kernel is
> also proprietary-tainted.
>
> As such I'll refrain from posting more about this or similar bugs
> until I can get a guarantee it happens with a non-tainted kernel
> (fwiw, I have not seen crashes for 2 weeks or so, hopefully this issue
> is fixed).

Crossing my fingers :)

>
> Again, sorry for not checking the taint before posting this, and thank
> you for your time :)

No worries at all :) Let me know if there are new developments
regarding this issue, or if you are able to confirm one way or another
:)


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-06-30 17:58 zswap_writeback_entry crashes in 6.9.5 Pedro Falcato
  2024-07-01  1:07 ` Nhat Pham
@ 2024-07-01  3:44 ` Chengming Zhou
  2024-07-02  0:33 ` Builder
  2 siblings, 0 replies; 7+ messages in thread
From: Chengming Zhou @ 2024-07-01  3:44 UTC (permalink / raw)
  To: Pedro Falcato, mm
  Cc: Linux Regressions, Johannes Weiner, Yosry Ahmed, Nhat Pham,
	Christian Heusel

On 2024/7/1 01:58, Pedro Falcato wrote:
> Hi everyone,
> 
> Reporting a bug I've experienced over the last few releases, all in
> 6.9. I don't have a repro for it (seems to happen randomly?). Distro
> is Arch Linux (which is itself minimally patched, config
> https://gitlab.archlinux.org/archlinux/packaging/packages/linux/-/blob/e1057d426590bd2064df83779f8644de519874c0/config).
> 
> I understand 6.9.5 is a little old, but as I have no repro I'm posting
> this in case someone has any hints or has hit this before.
> 
> dmesg:
> BUG: unable to handle page fault for address: 00000000000315af
> #PF: supervisor read access in kernel mode
> #PF: error_code(0x0000) - not-present page
> PGD 0 P4D 0
> Oops: 0000 [#1] PREEMPT SMP PTI
> CPU: 5 PID: 90 Comm: kswapd0 Tainted: P     U     OE
> 6.9.5-arch1-1 #1 b9e5462a84a73f67b5c7c6b73f88d2a6349ae768
> Hardware name: ASUSTeK COMPUTER INC. X542URR/X542URR, BIOS X542URR.310
> 10/29/2021
> RIP: 0010:zswap_writeback_entry+0x128/0x1f0
[...]
> note: kswapd0[90] exited with irqs disabled

I'm wondering why this kswapd has irqs disabled? Seems we don't disable
interrupts in zswap_writeback_entry()?


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-06-30 17:58 zswap_writeback_entry crashes in 6.9.5 Pedro Falcato
  2024-07-01  1:07 ` Nhat Pham
  2024-07-01  3:44 ` Chengming Zhou
@ 2024-07-02  0:33 ` Builder
  2024-07-02 15:28   ` Nhat Pham
  2 siblings, 1 reply; 7+ messages in thread
From: Builder @ 2024-07-02  0:33 UTC (permalink / raw)
  To: pedro.falcato
  Cc: chengming.zhou, christian, hannes, linux-mm, nphamcs, regressions,
	yosryahmed

On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
>
> Hi everyone,

Hi,

I think I have hit this problem a well. I actually reported this on RedHat's
bug tracker a while back, along with a couple of stack traces:

https://bugzilla.redhat.com/show_bug.cgi?id=2275252

Reverting the commit I mentioned there seems to make this problem go away for
me. This is a long shot, but I am curious if it will also fix the problem for
you.

(Also inserting myself into this thread so I will get updates.)

Regards,
Yuxuan Shui


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zswap_writeback_entry crashes in 6.9.5
  2024-07-02  0:33 ` Builder
@ 2024-07-02 15:28   ` Nhat Pham
  0 siblings, 0 replies; 7+ messages in thread
From: Nhat Pham @ 2024-07-02 15:28 UTC (permalink / raw)
  To: Builder
  Cc: pedro.falcato, chengming.zhou, christian, hannes, linux-mm,
	regressions, yosryahmed

On Mon, Jul 1, 2024 at 5:33 PM Builder <yshuiv7@gmail.com> wrote:
>
> On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@gmail.com> wrote:
> >
> > Hi everyone,
>
> Hi,
>
> I think I have hit this problem a well. I actually reported this on RedHat's
> bug tracker a while back, along with a couple of stack traces:
>
> https://bugzilla.redhat.com/show_bug.cgi?id=2275252
>
> Reverting the commit I mentioned there seems to make this problem go away for
> me. This is a long shot, but I am curious if it will also fix the problem for
> you.
>
> (Also inserting myself into this thread so I will get updates.)
>
> Regards,
> Yuxuan Shui

This looks like a different issue. The hang-up is one task waiting for
the mutex lock (&acomp_ctx->mutex), whose holder is the other task
that crashes. Looking at that trace in particular, the line that
triggers the BUG_ON call (mm/zswap.c:1395):

BUG_ON(crypto_wait_req(crypto_acomp_decompress(acomp_ctx->req),
&acomp_ctx->wait));

is the compressor failing to decompress the data. This looks like some
sort of memory corruption, and could happen for a lot of reasons - a
zswap bug, a backend allocator bug, a compression library bug, or a
hardware issue that corrupts memory.

If it only happens on 6.8.9 (and not 6.8.5), then it's likely some
changes in between, but I'd be very surprised if the bug somehow comes
from the patch you reverted. If you look at the patch's content, all
it does is essentially handling the case where the shrinker receives a
NULL memcg, by using an alternative source of stats. It could
potentially reveal the problems previously hidden, but definitely not
the cause of those problems itself. I'd recommend that you send a
separate bug report with the build config, steps to reproduce, and
more information about your setup overall (what backend allocator are
you using for zswap - it should be zsmalloc btw, what compression
algorithm you are using, etc.)

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2024-07-19 20:47 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-06-30 17:58 zswap_writeback_entry crashes in 6.9.5 Pedro Falcato
2024-07-01  1:07 ` Nhat Pham
2024-07-19 18:57   ` Pedro Falcato
2024-07-19 20:47     ` Nhat Pham
2024-07-01  3:44 ` Chengming Zhou
2024-07-02  0:33 ` Builder
2024-07-02 15:28   ` Nhat Pham

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).