* tree first key mismatch detected (reproducible error) @ 2020-01-25 11:37 Thorsten Hirsch 2020-01-25 11:46 ` Andrei Borzenkov 2020-01-25 12:15 ` Qu Wenruo 0 siblings, 2 replies; 8+ messages in thread From: Thorsten Hirsch @ 2020-01-25 11:37 UTC (permalink / raw) To: linux-btrfs Hi, here's a btrfs problem that started happening today on my main computer: BTRFS error (device nvme0n1p3): tree first key mismatch detected, bytenr=109690880 parent_transid=1329869 key expected=(48044838912,168,12288) has=(48045363200,168,12288) It always occurs some minutes after booting, sometimes even seconds after booting. The partition is then remounted read-only. I already tried scrubbing the partition (aborts itself after some seconds) and balancing (seems to trigger the error immediately and doesn't even start). I attached some more output of dmesg. The distribution is Arch Linux and the kernel is the most recent one in Arch's default kernel package: 5.4.14-arch1-1 (I upgraded from 5.4.13 to 5.4.14 just yesterday). Best regards, Thorsten [Jan25 12:00] BTRFS error (device nvme0n1p3): tree first key mismatch detected, bytenr=109690880 parent_transid=1329869 key expected=(48044838912,168,12288) has=(48045363200,168,12288) [ +0,000003] ------------[ cut here ]------------ [ +0,000001] BTRFS: Transaction aborted (error -117) [ +0,000041] WARNING: CPU: 7 PID: 382 at fs/btrfs/extent-tree.c:3080 __btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] [ +0,000000] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp llc edac_mce_amd kvm_amd snd_hda_codec_ca0110 snd_hda_codec_generic wmi_bmof kvm ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_nhlt pktcdvd irqbypass snd_hda_codec uvcvideo snd_hda_core snd_hwdep videobuf2_vmalloc snd_pcm videobuf2_memops nls_iso8859_1 videobuf2_v4l2 nls_cp437 videobuf2_common snd_timer crct10dif_pclmul vfat crc32_pclmul videodev fat snd joydev ghash_clmulni_intel input_leds mousedev mc psmouse aesni_intel r8169 crypto_simd realtek cryptd ccp glue_helper k10temp i2c_piix4 soundcore libphy rng_core wmi gpio_amdpt evdev mac_hid pinctrl_amd acpi_cpufreq fuse vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) sg crypto_user ip_tables x_tables sr_mod cdrom sd_mod hid_generic usbhid hid serio_raw atkbd libps2 ahci libahci libata xhci_pci [ +0,000018] xhci_hcd scsi_mod i8042 serio amdgpu gpu_sched i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm agpgart btrfs libcrc32c crc32c_generic crc32c_intel xor raid6_pq [ +0,000005] CPU: 7 PID: 382 Comm: btrfs-transacti Tainted: G OE 5.4.14-arch1-1 #1 [ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. AB350M-DS3H/AB350M-DS3H-CF, BIOS F50a 11/27/2019 [ +0,000010] RIP: 0010:__btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] [ +0,000001] Code: e8 c1 ee 00 00 8b 4c 24 38 85 c9 0f 84 39 fe ff ff 48 8b 54 24 48 e9 04 fe ff ff 44 89 fe 48 c7 c7 a0 ce 30 c0 e8 ba 48 c4 d1 <0f> 0b 48 8b 3c 24 44 89 f9 ba 08 0c 00 00 48 c7 c6 a0 20 30 c0 e8 [ +0,000001] RSP: 0018:ffff8fc081363ba0 EFLAGS: 00010286 [ +0,000001] RAX: 0000000000000000 RBX: 0000000000000192 RCX: 0000000000000000 [ +0,000000] RDX: 0000000000000001 RSI: 0000000000000096 RDI: 00000000ffffffff [ +0,000001] RBP: 0000000b3090a000 R08: 000000000000049b R09: 0000000000000004 [ +0,000000] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8b958a1c9c40 [ +0,000001] R13: 0000000000000000 R14: 0000000000000001 R15: 00000000ffffff8b [ +0,000001] FS: 0000000000000000(0000) GS:ffff8b958e9c0000(0000) knlGS:0000000000000000 [ +0,000000] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ +0,000001] CR2: 00007fdcf263d000 CR3: 000000032f11a000 CR4: 00000000003406e0 [ +0,000001] Call Trace: [ +0,000012] ? __btrfs_run_delayed_refs+0xc9f/0xff0 [btrfs] [ +0,000009] __btrfs_run_delayed_refs+0x25e/0xff0 [btrfs] [ +0,000011] btrfs_run_delayed_refs+0x6a/0x180 [btrfs] [ +0,000013] btrfs_start_dirty_block_groups+0x28e/0x470 [btrfs] [ +0,000011] btrfs_commit_transaction+0x116/0x9b0 [btrfs] [ +0,000003] ? _raw_spin_unlock+0x16/0x30 [ +0,000010] ? join_transaction+0x108/0x3a0 [btrfs] [ +0,000010] transaction_kthread+0x13a/0x180 [btrfs] [ +0,000002] kthread+0xfb/0x130 [ +0,000010] ? btrfs_cleanup_transaction+0x560/0x560 [btrfs] [ +0,000001] ? kthread_park+0x90/0x90 [ +0,000001] ret_from_fork+0x1f/0x40 [ +0,000002] ---[ end trace 51366456523028bd ]--- [ +0,000001] BTRFS: error (device nvme0n1p3) in __btrfs_free_extent:3080: errno=-117 unknown [ +0,000001] BTRFS info (device nvme0n1p3): forced readonly [ +0,000002] BTRFS: error (device nvme0n1p3) in btrfs_run_delayed_refs:2188: errno=-117 unknown ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 11:37 tree first key mismatch detected (reproducible error) Thorsten Hirsch @ 2020-01-25 11:46 ` Andrei Borzenkov 2020-01-25 12:23 ` Qu Wenruo 2020-01-25 12:15 ` Qu Wenruo 1 sibling, 1 reply; 8+ messages in thread From: Andrei Borzenkov @ 2020-01-25 11:46 UTC (permalink / raw) To: Thorsten Hirsch, linux-btrfs 25.01.2020 14:37, Thorsten Hirsch пишет: > Hi, here's a btrfs problem that started happening today on my main computer: > > BTRFS error (device nvme0n1p3): tree first key mismatch detected, > bytenr=109690880 parent_transid=1329869 key > expected=(48044838912,168,12288) has=(48045363200,168,12288) > This looks like bit flip 48044838912 == B2FB21000 48045363200 == B2FBA1000 with usual recommendation to check your RAM. > It always occurs some minutes after booting, sometimes even seconds > after booting. The partition is then remounted read-only. I already > tried scrubbing the partition (aborts itself after some seconds) and > balancing (seems to trigger the error immediately and doesn't even > start). > > I attached some more output of dmesg. The distribution is Arch Linux > and the kernel is the most recent one in Arch's default kernel > package: 5.4.14-arch1-1 (I upgraded from 5.4.13 to 5.4.14 just > yesterday). > > Best regards, > Thorsten > > [Jan25 12:00] BTRFS error (device nvme0n1p3): tree first key mismatch > detected, bytenr=109690880 parent_transid=1329869 key > expected=(48044838912,168,12288) has=(48045363200,168,12288) > [ +0,000003] ------------[ cut here ]------------ > [ +0,000001] BTRFS: Transaction aborted (error -117) > [ +0,000041] WARNING: CPU: 7 PID: 382 at fs/btrfs/extent-tree.c:3080 > __btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > [ +0,000000] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack > xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo > xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack > nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp llc edac_mce_amd > kvm_amd snd_hda_codec_ca0110 snd_hda_codec_generic wmi_bmof kvm > ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_nhlt pktcdvd > irqbypass snd_hda_codec uvcvideo snd_hda_core snd_hwdep > videobuf2_vmalloc snd_pcm videobuf2_memops nls_iso8859_1 > videobuf2_v4l2 nls_cp437 videobuf2_common snd_timer crct10dif_pclmul > vfat crc32_pclmul videodev fat snd joydev ghash_clmulni_intel > input_leds mousedev mc psmouse aesni_intel r8169 crypto_simd realtek > cryptd ccp glue_helper k10temp i2c_piix4 soundcore libphy rng_core wmi > gpio_amdpt evdev mac_hid pinctrl_amd acpi_cpufreq fuse vboxnetflt(OE) > vboxnetadp(OE) vboxdrv(OE) sg crypto_user ip_tables x_tables sr_mod > cdrom sd_mod hid_generic usbhid hid serio_raw atkbd libps2 ahci > libahci libata xhci_pci > [ +0,000018] xhci_hcd scsi_mod i8042 serio amdgpu gpu_sched > i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt > fb_sys_fops drm agpgart btrfs libcrc32c crc32c_generic crc32c_intel > xor raid6_pq > [ +0,000005] CPU: 7 PID: 382 Comm: btrfs-transacti Tainted: G > OE 5.4.14-arch1-1 #1 > [ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. > AB350M-DS3H/AB350M-DS3H-CF, BIOS F50a 11/27/2019 > [ +0,000010] RIP: 0010:__btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > [ +0,000001] Code: e8 c1 ee 00 00 8b 4c 24 38 85 c9 0f 84 39 fe ff ff > 48 8b 54 24 48 e9 04 fe ff ff 44 89 fe 48 c7 c7 a0 ce 30 c0 e8 ba 48 > c4 d1 <0f> 0b 48 8b 3c 24 44 89 f9 ba 08 0c 00 00 48 c7 c6 a0 20 30 c0 > e8 > [ +0,000001] RSP: 0018:ffff8fc081363ba0 EFLAGS: 00010286 > [ +0,000001] RAX: 0000000000000000 RBX: 0000000000000192 RCX: 0000000000000000 > [ +0,000000] RDX: 0000000000000001 RSI: 0000000000000096 RDI: 00000000ffffffff > [ +0,000001] RBP: 0000000b3090a000 R08: 000000000000049b R09: 0000000000000004 > [ +0,000000] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8b958a1c9c40 > [ +0,000001] R13: 0000000000000000 R14: 0000000000000001 R15: 00000000ffffff8b > [ +0,000001] FS: 0000000000000000(0000) GS:ffff8b958e9c0000(0000) > knlGS:0000000000000000 > [ +0,000000] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ +0,000001] CR2: 00007fdcf263d000 CR3: 000000032f11a000 CR4: 00000000003406e0 > [ +0,000001] Call Trace: > [ +0,000012] ? __btrfs_run_delayed_refs+0xc9f/0xff0 [btrfs] > [ +0,000009] __btrfs_run_delayed_refs+0x25e/0xff0 [btrfs] > [ +0,000011] btrfs_run_delayed_refs+0x6a/0x180 [btrfs] > [ +0,000013] btrfs_start_dirty_block_groups+0x28e/0x470 [btrfs] > [ +0,000011] btrfs_commit_transaction+0x116/0x9b0 [btrfs] > [ +0,000003] ? _raw_spin_unlock+0x16/0x30 > [ +0,000010] ? join_transaction+0x108/0x3a0 [btrfs] > [ +0,000010] transaction_kthread+0x13a/0x180 [btrfs] > [ +0,000002] kthread+0xfb/0x130 > [ +0,000010] ? btrfs_cleanup_transaction+0x560/0x560 [btrfs] > [ +0,000001] ? kthread_park+0x90/0x90 > [ +0,000001] ret_from_fork+0x1f/0x40 > [ +0,000002] ---[ end trace 51366456523028bd ]--- > [ +0,000001] BTRFS: error (device nvme0n1p3) in > __btrfs_free_extent:3080: errno=-117 unknown > [ +0,000001] BTRFS info (device nvme0n1p3): forced readonly > [ +0,000002] BTRFS: error (device nvme0n1p3) in > btrfs_run_delayed_refs:2188: errno=-117 unknown > ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 11:46 ` Andrei Borzenkov @ 2020-01-25 12:23 ` Qu Wenruo 2020-01-25 15:44 ` Thorsten Hirsch 0 siblings, 1 reply; 8+ messages in thread From: Qu Wenruo @ 2020-01-25 12:23 UTC (permalink / raw) To: Andrei Borzenkov, Thorsten Hirsch, linux-btrfs [-- Attachment #1.1: Type: text/plain, Size: 5266 bytes --] On 2020/1/25 下午7:46, Andrei Borzenkov wrote: > 25.01.2020 14:37, Thorsten Hirsch пишет: >> Hi, here's a btrfs problem that started happening today on my main computer: >> >> BTRFS error (device nvme0n1p3): tree first key mismatch detected, >> bytenr=109690880 parent_transid=1329869 key >> expected=(48044838912,168,12288) has=(48045363200,168,12288) >> > > This looks like bit flip > > 48044838912 == B2FB21000 > 48045363200 == B2FBA1000 > > with usual recommendation to check your RAM. > Ops, forgot the case of bitflip. Just as mentioned by Andrei, make sure the memory problem is solved, then `btrfs check`. Thanks, Qu >> It always occurs some minutes after booting, sometimes even seconds >> after booting. The partition is then remounted read-only. I already >> tried scrubbing the partition (aborts itself after some seconds) and >> balancing (seems to trigger the error immediately and doesn't even >> start). >> >> I attached some more output of dmesg. The distribution is Arch Linux >> and the kernel is the most recent one in Arch's default kernel >> package: 5.4.14-arch1-1 (I upgraded from 5.4.13 to 5.4.14 just >> yesterday). >> >> Best regards, >> Thorsten >> >> [Jan25 12:00] BTRFS error (device nvme0n1p3): tree first key mismatch >> detected, bytenr=109690880 parent_transid=1329869 key >> expected=(48044838912,168,12288) has=(48045363200,168,12288) >> [ +0,000003] ------------[ cut here ]------------ >> [ +0,000001] BTRFS: Transaction aborted (error -117) >> [ +0,000041] WARNING: CPU: 7 PID: 382 at fs/btrfs/extent-tree.c:3080 >> __btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] >> [ +0,000000] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack >> xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo >> xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack >> nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp llc edac_mce_amd >> kvm_amd snd_hda_codec_ca0110 snd_hda_codec_generic wmi_bmof kvm >> ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_nhlt pktcdvd >> irqbypass snd_hda_codec uvcvideo snd_hda_core snd_hwdep >> videobuf2_vmalloc snd_pcm videobuf2_memops nls_iso8859_1 >> videobuf2_v4l2 nls_cp437 videobuf2_common snd_timer crct10dif_pclmul >> vfat crc32_pclmul videodev fat snd joydev ghash_clmulni_intel >> input_leds mousedev mc psmouse aesni_intel r8169 crypto_simd realtek >> cryptd ccp glue_helper k10temp i2c_piix4 soundcore libphy rng_core wmi >> gpio_amdpt evdev mac_hid pinctrl_amd acpi_cpufreq fuse vboxnetflt(OE) >> vboxnetadp(OE) vboxdrv(OE) sg crypto_user ip_tables x_tables sr_mod >> cdrom sd_mod hid_generic usbhid hid serio_raw atkbd libps2 ahci >> libahci libata xhci_pci >> [ +0,000018] xhci_hcd scsi_mod i8042 serio amdgpu gpu_sched >> i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt >> fb_sys_fops drm agpgart btrfs libcrc32c crc32c_generic crc32c_intel >> xor raid6_pq >> [ +0,000005] CPU: 7 PID: 382 Comm: btrfs-transacti Tainted: G >> OE 5.4.14-arch1-1 #1 >> [ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. >> AB350M-DS3H/AB350M-DS3H-CF, BIOS F50a 11/27/2019 >> [ +0,000010] RIP: 0010:__btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] >> [ +0,000001] Code: e8 c1 ee 00 00 8b 4c 24 38 85 c9 0f 84 39 fe ff ff >> 48 8b 54 24 48 e9 04 fe ff ff 44 89 fe 48 c7 c7 a0 ce 30 c0 e8 ba 48 >> c4 d1 <0f> 0b 48 8b 3c 24 44 89 f9 ba 08 0c 00 00 48 c7 c6 a0 20 30 c0 >> e8 >> [ +0,000001] RSP: 0018:ffff8fc081363ba0 EFLAGS: 00010286 >> [ +0,000001] RAX: 0000000000000000 RBX: 0000000000000192 RCX: 0000000000000000 >> [ +0,000000] RDX: 0000000000000001 RSI: 0000000000000096 RDI: 00000000ffffffff >> [ +0,000001] RBP: 0000000b3090a000 R08: 000000000000049b R09: 0000000000000004 >> [ +0,000000] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8b958a1c9c40 >> [ +0,000001] R13: 0000000000000000 R14: 0000000000000001 R15: 00000000ffffff8b >> [ +0,000001] FS: 0000000000000000(0000) GS:ffff8b958e9c0000(0000) >> knlGS:0000000000000000 >> [ +0,000000] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> [ +0,000001] CR2: 00007fdcf263d000 CR3: 000000032f11a000 CR4: 00000000003406e0 >> [ +0,000001] Call Trace: >> [ +0,000012] ? __btrfs_run_delayed_refs+0xc9f/0xff0 [btrfs] >> [ +0,000009] __btrfs_run_delayed_refs+0x25e/0xff0 [btrfs] >> [ +0,000011] btrfs_run_delayed_refs+0x6a/0x180 [btrfs] >> [ +0,000013] btrfs_start_dirty_block_groups+0x28e/0x470 [btrfs] >> [ +0,000011] btrfs_commit_transaction+0x116/0x9b0 [btrfs] >> [ +0,000003] ? _raw_spin_unlock+0x16/0x30 >> [ +0,000010] ? join_transaction+0x108/0x3a0 [btrfs] >> [ +0,000010] transaction_kthread+0x13a/0x180 [btrfs] >> [ +0,000002] kthread+0xfb/0x130 >> [ +0,000010] ? btrfs_cleanup_transaction+0x560/0x560 [btrfs] >> [ +0,000001] ? kthread_park+0x90/0x90 >> [ +0,000001] ret_from_fork+0x1f/0x40 >> [ +0,000002] ---[ end trace 51366456523028bd ]--- >> [ +0,000001] BTRFS: error (device nvme0n1p3) in >> __btrfs_free_extent:3080: errno=-117 unknown >> [ +0,000001] BTRFS info (device nvme0n1p3): forced readonly >> [ +0,000002] BTRFS: error (device nvme0n1p3) in >> btrfs_run_delayed_refs:2188: errno=-117 unknown >> > [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 488 bytes --] ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 12:23 ` Qu Wenruo @ 2020-01-25 15:44 ` Thorsten Hirsch 2020-01-25 16:01 ` Martin Raiber 0 siblings, 1 reply; 8+ messages in thread From: Thorsten Hirsch @ 2020-01-25 15:44 UTC (permalink / raw) To: linux-btrfs Thanks, guys. However, checking the RAM with memtest86 hasn't revealed any errors. Currently I let it run another pass, but so far everything's good. Here's the output of btrfs check... [1/7] checking root items [2/7] checking extents leaf parent key incorrect 109690880 bad block 109690880 ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space cache [4/7] checking fs roots root 5 inode 3583162 errors 1040, bad file extent, some csum missing root 5 inode 3767022 errors 1040, bad file extent, some csum missing root 5 inode 3819591 errors 1040, bad file extent, some csum missing root 5 inode 4108194 errors 1040, bad file extent, some csum missing ERROR: errors found in fs roots Opening filesystem to check... Checking filesystem on /dev/nvme0n1p3 UUID: 26717c9f-df62-4c57-a482-b9e4880b31e6 found 6132469760 bytes used, error(s) found total csum bytes: 0 total tree bytes: 4161536 total fs tree bytes: 0 total extent tree bytes: 3850240 btree space waste bytes: 1115823 file data blocks allocated: 108003328 referenced 108003328 -- Thorsten Am Sa., 25. Jan. 2020 um 13:23 Uhr schrieb Qu Wenruo <quwenruo.btrfs@gmx.com>: > > > > On 2020/1/25 下午7:46, Andrei Borzenkov wrote: > > 25.01.2020 14:37, Thorsten Hirsch пишет: > >> Hi, here's a btrfs problem that started happening today on my main computer: > >> > >> BTRFS error (device nvme0n1p3): tree first key mismatch detected, > >> bytenr=109690880 parent_transid=1329869 key > >> expected=(48044838912,168,12288) has=(48045363200,168,12288) > >> > > > > This looks like bit flip > > > > 48044838912 == B2FB21000 > > 48045363200 == B2FBA1000 > > > > with usual recommendation to check your RAM. > > > > Ops, forgot the case of bitflip. > > Just as mentioned by Andrei, make sure the memory problem is solved, > then `btrfs check`. > > Thanks, > Qu > > >> It always occurs some minutes after booting, sometimes even seconds > >> after booting. The partition is then remounted read-only. I already > >> tried scrubbing the partition (aborts itself after some seconds) and > >> balancing (seems to trigger the error immediately and doesn't even > >> start). > >> > >> I attached some more output of dmesg. The distribution is Arch Linux > >> and the kernel is the most recent one in Arch's default kernel > >> package: 5.4.14-arch1-1 (I upgraded from 5.4.13 to 5.4.14 just > >> yesterday). > >> > >> Best regards, > >> Thorsten > >> > >> [Jan25 12:00] BTRFS error (device nvme0n1p3): tree first key mismatch > >> detected, bytenr=109690880 parent_transid=1329869 key > >> expected=(48044838912,168,12288) has=(48045363200,168,12288) > >> [ +0,000003] ------------[ cut here ]------------ > >> [ +0,000001] BTRFS: Transaction aborted (error -117) > >> [ +0,000041] WARNING: CPU: 7 PID: 382 at fs/btrfs/extent-tree.c:3080 > >> __btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > >> [ +0,000000] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack > >> xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo > >> xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack > >> nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp llc edac_mce_amd > >> kvm_amd snd_hda_codec_ca0110 snd_hda_codec_generic wmi_bmof kvm > >> ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_nhlt pktcdvd > >> irqbypass snd_hda_codec uvcvideo snd_hda_core snd_hwdep > >> videobuf2_vmalloc snd_pcm videobuf2_memops nls_iso8859_1 > >> videobuf2_v4l2 nls_cp437 videobuf2_common snd_timer crct10dif_pclmul > >> vfat crc32_pclmul videodev fat snd joydev ghash_clmulni_intel > >> input_leds mousedev mc psmouse aesni_intel r8169 crypto_simd realtek > >> cryptd ccp glue_helper k10temp i2c_piix4 soundcore libphy rng_core wmi > >> gpio_amdpt evdev mac_hid pinctrl_amd acpi_cpufreq fuse vboxnetflt(OE) > >> vboxnetadp(OE) vboxdrv(OE) sg crypto_user ip_tables x_tables sr_mod > >> cdrom sd_mod hid_generic usbhid hid serio_raw atkbd libps2 ahci > >> libahci libata xhci_pci > >> [ +0,000018] xhci_hcd scsi_mod i8042 serio amdgpu gpu_sched > >> i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt > >> fb_sys_fops drm agpgart btrfs libcrc32c crc32c_generic crc32c_intel > >> xor raid6_pq > >> [ +0,000005] CPU: 7 PID: 382 Comm: btrfs-transacti Tainted: G > >> OE 5.4.14-arch1-1 #1 > >> [ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. > >> AB350M-DS3H/AB350M-DS3H-CF, BIOS F50a 11/27/2019 > >> [ +0,000010] RIP: 0010:__btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > >> [ +0,000001] Code: e8 c1 ee 00 00 8b 4c 24 38 85 c9 0f 84 39 fe ff ff > >> 48 8b 54 24 48 e9 04 fe ff ff 44 89 fe 48 c7 c7 a0 ce 30 c0 e8 ba 48 > >> c4 d1 <0f> 0b 48 8b 3c 24 44 89 f9 ba 08 0c 00 00 48 c7 c6 a0 20 30 c0 > >> e8 > >> [ +0,000001] RSP: 0018:ffff8fc081363ba0 EFLAGS: 00010286 > >> [ +0,000001] RAX: 0000000000000000 RBX: 0000000000000192 RCX: 0000000000000000 > >> [ +0,000000] RDX: 0000000000000001 RSI: 0000000000000096 RDI: 00000000ffffffff > >> [ +0,000001] RBP: 0000000b3090a000 R08: 000000000000049b R09: 0000000000000004 > >> [ +0,000000] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8b958a1c9c40 > >> [ +0,000001] R13: 0000000000000000 R14: 0000000000000001 R15: 00000000ffffff8b > >> [ +0,000001] FS: 0000000000000000(0000) GS:ffff8b958e9c0000(0000) > >> knlGS:0000000000000000 > >> [ +0,000000] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > >> [ +0,000001] CR2: 00007fdcf263d000 CR3: 000000032f11a000 CR4: 00000000003406e0 > >> [ +0,000001] Call Trace: > >> [ +0,000012] ? __btrfs_run_delayed_refs+0xc9f/0xff0 [btrfs] > >> [ +0,000009] __btrfs_run_delayed_refs+0x25e/0xff0 [btrfs] > >> [ +0,000011] btrfs_run_delayed_refs+0x6a/0x180 [btrfs] > >> [ +0,000013] btrfs_start_dirty_block_groups+0x28e/0x470 [btrfs] > >> [ +0,000011] btrfs_commit_transaction+0x116/0x9b0 [btrfs] > >> [ +0,000003] ? _raw_spin_unlock+0x16/0x30 > >> [ +0,000010] ? join_transaction+0x108/0x3a0 [btrfs] > >> [ +0,000010] transaction_kthread+0x13a/0x180 [btrfs] > >> [ +0,000002] kthread+0xfb/0x130 > >> [ +0,000010] ? btrfs_cleanup_transaction+0x560/0x560 [btrfs] > >> [ +0,000001] ? kthread_park+0x90/0x90 > >> [ +0,000001] ret_from_fork+0x1f/0x40 > >> [ +0,000002] ---[ end trace 51366456523028bd ]--- > >> [ +0,000001] BTRFS: error (device nvme0n1p3) in > >> __btrfs_free_extent:3080: errno=-117 unknown > >> [ +0,000001] BTRFS info (device nvme0n1p3): forced readonly > >> [ +0,000002] BTRFS: error (device nvme0n1p3) in > >> btrfs_run_delayed_refs:2188: errno=-117 unknown > >> > > > ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 15:44 ` Thorsten Hirsch @ 2020-01-25 16:01 ` Martin Raiber 2020-01-26 11:17 ` Thorsten Hirsch 0 siblings, 1 reply; 8+ messages in thread From: Martin Raiber @ 2020-01-25 16:01 UTC (permalink / raw) To: Thorsten Hirsch, linux-btrfs On 25.01.2020 16:44 Thorsten Hirsch wrote: > Thanks, guys. > > However, checking the RAM with memtest86 hasn't revealed any errors. > Currently I let it run another pass, but so far everything's good. > Here's the output of btrfs check... just from my experience with non-ECC RAM: When I had RAM corruption it only occurred after a few days of uptime and only when I ran memtester on Linux. memtest86/memtest86+ didn't show any problems even when running for a week (and in multi cpu mode). > [1/7] checking root items > [2/7] checking extents > leaf parent key incorrect 109690880 > bad block 109690880 > ERROR: errors found in extent allocation tree or chunk allocation > [3/7] checking free space cache > [4/7] checking fs roots > root 5 inode 3583162 errors 1040, bad file extent, some csum missing > root 5 inode 3767022 errors 1040, bad file extent, some csum missing > root 5 inode 3819591 errors 1040, bad file extent, some csum missing > root 5 inode 4108194 errors 1040, bad file extent, some csum missing > ERROR: errors found in fs roots > Opening filesystem to check... > Checking filesystem on /dev/nvme0n1p3 > UUID: 26717c9f-df62-4c57-a482-b9e4880b31e6 > found 6132469760 bytes used, error(s) found > total csum bytes: 0 > total tree bytes: 4161536 > total fs tree bytes: 0 > total extent tree bytes: 3850240 > btree space waste bytes: 1115823 > file data blocks allocated: 108003328 > referenced 108003328 > ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 16:01 ` Martin Raiber @ 2020-01-26 11:17 ` Thorsten Hirsch 2020-01-26 13:03 ` Qu Wenruo 0 siblings, 1 reply; 8+ messages in thread From: Thorsten Hirsch @ 2020-01-26 11:17 UTC (permalink / raw) To: linux-btrfs Thank you, Martin. So I started memtester yesterday and meanwhile it has run 90 loops w/o any errors. Back to btrfs: - I could restore pretty much all data with "btrfs restore", except for some virtualbox disk images - "btrfs check --init-extent-tree" took some hours to finish, but I still couldn't mount the partition due to multiple "corrupt leaf" errors - mounting with "-o backuproot" resulted in the same error - "btrfs rescue super-recover" said everything was fine - after "btrfs rescue chunk-recover" or "btrfs check --repair" there was only 1 "corrupt leaf" error left, but mounting was still not possible So basically the mount errors after "btrfs check --init-extent-tree" and all later commands looked like this: [64385.439530] BTRFS critical (device nvme0n1p3): corrupt leaf: block=156450816 slot=30 extent bytenr=51548897280 len=262144 invalid generation, have 315981823 expect (0, 2265510] [64385.440779] BTRFS error (device nvme0n1p3): block=156450816 read time tree block corruption detected [64385.440785] BTRFS error (device nvme0n1p3): failed to read block groups: -5 [64385.493696] BTRFS error (device nvme0n1p3): open_ctree failed mount: /mnt/nvme: wrong fs type, bad option, bad superblock on /dev/nvme0n1p3, missing codepage or helper program, or other error. Then I gave up and called mkfs.btrfs. Currently the restored data is on its way back to the device. -- Thorsten Am Sa., 25. Jan. 2020 um 17:01 Uhr schrieb Martin Raiber <martin@urbackup.org>: > > On 25.01.2020 16:44 Thorsten Hirsch wrote: > > Thanks, guys. > > > > However, checking the RAM with memtest86 hasn't revealed any errors. > > Currently I let it run another pass, but so far everything's good. > > Here's the output of btrfs check... > > just from my experience with non-ECC RAM: > When I had RAM corruption it only occurred after a few days of uptime > and only when I ran memtester on Linux. memtest86/memtest86+ didn't show > any problems even when running for a week (and in multi cpu mode). > > > [1/7] checking root items > > [2/7] checking extents > > leaf parent key incorrect 109690880 > > bad block 109690880 > > ERROR: errors found in extent allocation tree or chunk allocation > > [3/7] checking free space cache > > [4/7] checking fs roots > > root 5 inode 3583162 errors 1040, bad file extent, some csum missing > > root 5 inode 3767022 errors 1040, bad file extent, some csum missing > > root 5 inode 3819591 errors 1040, bad file extent, some csum missing > > root 5 inode 4108194 errors 1040, bad file extent, some csum missing > > ERROR: errors found in fs roots > > Opening filesystem to check... > > Checking filesystem on /dev/nvme0n1p3 > > UUID: 26717c9f-df62-4c57-a482-b9e4880b31e6 > > found 6132469760 bytes used, error(s) found > > total csum bytes: 0 > > total tree bytes: 4161536 > > total fs tree bytes: 0 > > total extent tree bytes: 3850240 > > btree space waste bytes: 1115823 > > file data blocks allocated: 108003328 > > referenced 108003328 > > > ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-26 11:17 ` Thorsten Hirsch @ 2020-01-26 13:03 ` Qu Wenruo 0 siblings, 0 replies; 8+ messages in thread From: Qu Wenruo @ 2020-01-26 13:03 UTC (permalink / raw) To: Thorsten Hirsch, linux-btrfs [-- Attachment #1.1: Type: text/plain, Size: 2062 bytes --] On 2020/1/26 下午7:17, Thorsten Hirsch wrote: > Thank you, Martin. So I started memtester yesterday and meanwhile it > has run 90 loops w/o any errors. > Back to btrfs: > > - I could restore pretty much all data with "btrfs restore", except > for some virtualbox disk images > - "btrfs check --init-extent-tree" took some hours to finish, but I > still couldn't mount the partition due to multiple "corrupt leaf" > errors That's due to a bug in btrfs-progs where extent item generation is not reset properly. You can either use the devel branch https://github.com/kdave/btrfs-progs/tree/devel Or at least apply this commit to fix it, while without using all other patches which may break --init-extent-tree. https://github.com/kdave/btrfs-progs/commit/8d45dc270a3791d7217625190c9fc8f7cc129285 Or, you can just use v5.3 to skip such warning and do a full balance to reset the whole extent tree and call it a day. Thanks, Qu > - mounting with "-o backuproot" resulted in the same error > - "btrfs rescue super-recover" said everything was fine > - after "btrfs rescue chunk-recover" or "btrfs check --repair" there > was only 1 "corrupt leaf" error left, but mounting was still not > possible > > So basically the mount errors after "btrfs check --init-extent-tree" > and all later commands looked like this: > > [64385.439530] BTRFS critical (device nvme0n1p3): corrupt leaf: > block=156450816 slot=30 extent bytenr=51548897280 len=262144 invalid > generation, have 315981823 expect (0, 2265510] > [64385.440779] BTRFS error (device nvme0n1p3): block=156450816 read > time tree block corruption detected > [64385.440785] BTRFS error (device nvme0n1p3): failed to read block groups: -5 > [64385.493696] BTRFS error (device nvme0n1p3): open_ctree failed > mount: /mnt/nvme: wrong fs type, bad option, bad superblock on > /dev/nvme0n1p3, missing codepage or helper program, or other error. > > Then I gave up and called mkfs.btrfs. Currently the restored data is > on its way back to the device. > [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 488 bytes --] ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: tree first key mismatch detected (reproducible error) 2020-01-25 11:37 tree first key mismatch detected (reproducible error) Thorsten Hirsch 2020-01-25 11:46 ` Andrei Borzenkov @ 2020-01-25 12:15 ` Qu Wenruo 1 sibling, 0 replies; 8+ messages in thread From: Qu Wenruo @ 2020-01-25 12:15 UTC (permalink / raw) To: Thorsten Hirsch, linux-btrfs [-- Attachment #1.1: Type: text/plain, Size: 5467 bytes --] On 2020/1/25 下午7:37, Thorsten Hirsch wrote: > Hi, here's a btrfs problem that started happening today on my main computer: > > BTRFS error (device nvme0n1p3): tree first key mismatch detected, > bytenr=109690880 parent_transid=1329869 key > expected=(48044838912,168,12288) has=(48045363200,168,12288) This means your fs is already corrupted. The only good news is, that corruption is in extent tree. Thus you can still salvage your data in RO mode. > > It always occurs some minutes after booting, sometimes even seconds > after booting. The partition is then remounted read-only. I already > tried scrubbing the partition (aborts itself after some seconds) and > balancing (seems to trigger the error immediately and doesn't even > start). Please run `btrfs check` on the unmounted fs. (Since you're already using Arch, using latest arch iso looks like the best solution if it's your root fs). If feel like to have a adventure, you could try `btrfs check --init-extent-tree` after posting the `btrfs check` result. It can be very slow, and may not always fix your problem. > > I attached some more output of dmesg. The distribution is Arch Linux > and the kernel is the most recent one in Arch's default kernel > package: 5.4.14-arch1-1 (I upgraded from 5.4.13 to 5.4.14 just > yesterday). Arch's kernel is mostly upstream, which is mostly good for btrfs usage, so is its btrfs-progs version. Thanks, Qu > > Best regards, > Thorsten > > [Jan25 12:00] BTRFS error (device nvme0n1p3): tree first key mismatch > detected, bytenr=109690880 parent_transid=1329869 key > expected=(48044838912,168,12288) has=(48045363200,168,12288) > [ +0,000003] ------------[ cut here ]------------ > [ +0,000001] BTRFS: Transaction aborted (error -117) > [ +0,000041] WARNING: CPU: 7 PID: 382 at fs/btrfs/extent-tree.c:3080 > __btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > [ +0,000000] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack > xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo > xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack > nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp llc edac_mce_amd > kvm_amd snd_hda_codec_ca0110 snd_hda_codec_generic wmi_bmof kvm > ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_nhlt pktcdvd > irqbypass snd_hda_codec uvcvideo snd_hda_core snd_hwdep > videobuf2_vmalloc snd_pcm videobuf2_memops nls_iso8859_1 > videobuf2_v4l2 nls_cp437 videobuf2_common snd_timer crct10dif_pclmul > vfat crc32_pclmul videodev fat snd joydev ghash_clmulni_intel > input_leds mousedev mc psmouse aesni_intel r8169 crypto_simd realtek > cryptd ccp glue_helper k10temp i2c_piix4 soundcore libphy rng_core wmi > gpio_amdpt evdev mac_hid pinctrl_amd acpi_cpufreq fuse vboxnetflt(OE) > vboxnetadp(OE) vboxdrv(OE) sg crypto_user ip_tables x_tables sr_mod > cdrom sd_mod hid_generic usbhid hid serio_raw atkbd libps2 ahci > libahci libata xhci_pci > [ +0,000018] xhci_hcd scsi_mod i8042 serio amdgpu gpu_sched > i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt > fb_sys_fops drm agpgart btrfs libcrc32c crc32c_generic crc32c_intel > xor raid6_pq > [ +0,000005] CPU: 7 PID: 382 Comm: btrfs-transacti Tainted: G > OE 5.4.14-arch1-1 #1 > [ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. > AB350M-DS3H/AB350M-DS3H-CF, BIOS F50a 11/27/2019 > [ +0,000010] RIP: 0010:__btrfs_free_extent.isra.0+0x694/0x9e0 [btrfs] > [ +0,000001] Code: e8 c1 ee 00 00 8b 4c 24 38 85 c9 0f 84 39 fe ff ff > 48 8b 54 24 48 e9 04 fe ff ff 44 89 fe 48 c7 c7 a0 ce 30 c0 e8 ba 48 > c4 d1 <0f> 0b 48 8b 3c 24 44 89 f9 ba 08 0c 00 00 48 c7 c6 a0 20 30 c0 > e8 > [ +0,000001] RSP: 0018:ffff8fc081363ba0 EFLAGS: 00010286 > [ +0,000001] RAX: 0000000000000000 RBX: 0000000000000192 RCX: 0000000000000000 > [ +0,000000] RDX: 0000000000000001 RSI: 0000000000000096 RDI: 00000000ffffffff > [ +0,000001] RBP: 0000000b3090a000 R08: 000000000000049b R09: 0000000000000004 > [ +0,000000] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8b958a1c9c40 > [ +0,000001] R13: 0000000000000000 R14: 0000000000000001 R15: 00000000ffffff8b > [ +0,000001] FS: 0000000000000000(0000) GS:ffff8b958e9c0000(0000) > knlGS:0000000000000000 > [ +0,000000] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ +0,000001] CR2: 00007fdcf263d000 CR3: 000000032f11a000 CR4: 00000000003406e0 > [ +0,000001] Call Trace: > [ +0,000012] ? __btrfs_run_delayed_refs+0xc9f/0xff0 [btrfs] > [ +0,000009] __btrfs_run_delayed_refs+0x25e/0xff0 [btrfs] > [ +0,000011] btrfs_run_delayed_refs+0x6a/0x180 [btrfs] > [ +0,000013] btrfs_start_dirty_block_groups+0x28e/0x470 [btrfs] > [ +0,000011] btrfs_commit_transaction+0x116/0x9b0 [btrfs] > [ +0,000003] ? _raw_spin_unlock+0x16/0x30 > [ +0,000010] ? join_transaction+0x108/0x3a0 [btrfs] > [ +0,000010] transaction_kthread+0x13a/0x180 [btrfs] > [ +0,000002] kthread+0xfb/0x130 > [ +0,000010] ? btrfs_cleanup_transaction+0x560/0x560 [btrfs] > [ +0,000001] ? kthread_park+0x90/0x90 > [ +0,000001] ret_from_fork+0x1f/0x40 > [ +0,000002] ---[ end trace 51366456523028bd ]--- > [ +0,000001] BTRFS: error (device nvme0n1p3) in > __btrfs_free_extent:3080: errno=-117 unknown > [ +0,000001] BTRFS info (device nvme0n1p3): forced readonly > [ +0,000002] BTRFS: error (device nvme0n1p3) in > btrfs_run_delayed_refs:2188: errno=-117 unknown > [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 488 bytes --] ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2020-01-26 13:03 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2020-01-25 11:37 tree first key mismatch detected (reproducible error) Thorsten Hirsch 2020-01-25 11:46 ` Andrei Borzenkov 2020-01-25 12:23 ` Qu Wenruo 2020-01-25 15:44 ` Thorsten Hirsch 2020-01-25 16:01 ` Martin Raiber 2020-01-26 11:17 ` Thorsten Hirsch 2020-01-26 13:03 ` Qu Wenruo 2020-01-25 12:15 ` Qu Wenruo
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox