linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
@ 2013-09-20 22:25 Martin Steigerwald
  2013-09-21  2:01 ` Josef Bacik
                   ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Martin Steigerwald @ 2013-09-20 22:25 UTC (permalink / raw)
  To: linux-btrfs

Hi!

I tried to create a snapshot today like this:

merkaba:/mnt/debian-zeit> ls -l
insgesamt 0
drwxr-xr-x 1 root root 210 Sep 20 11:48 root
merkaba:/mnt/debian-zeit> btrfs subvol list /
ID 256 gen 21382 top level 5 path root
merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
merkaba:/mnt/debian-zeit#1>

on

martin@merkaba:~> cat /proc/version 
Linux version 3.12.0-rc1-tp520 (martin@merkaba) (gcc version 4.8.1 (Debian 4.8.1-10) ) #30 SMP PREEMPT Tue Sep 17 12:40:01 CEST 2013

and got this:

[   44.069468] ------------[ cut here ]------------
[   44.073291] WARNING: CPU: 3 PID: 3660 at fs/btrfs/uuid-tree.c:45 btrfs_uuid_tree_add+0x4e/0x2a0 [btrfs]()
[   44.073345] Modules linked in: ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables rfcomm bnep bluetooth cuse nfnetlin
k_queue nfnetlink_log nfnetlink cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) 
vboxdrv(O) binfmt_misc uinput ext4 crc16 mbcache jbd2 fuse ecryptfs sbs sbshc joydev hdaps(O) tp_smapi(O) thinkpad_ec(O) loop firewire_sbp2 dm_crypt sn
d_hda_codec_hdmi snd_hda_codec_conexant snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss x86_pkg_temp_thermal intel_powerclamp coretemp 
snd_pcm kvm_intel kvm snd_page_alloc iwldvm mac80211 microcode snd_seq_midi snd_seq_midi_event psmouse iwlwifi pcspkr serio_raw i2c_i801 snd_rawmidi lp
c_ich mfd_core cfg80211 snd_seq snd_seq_device thinkpad_acpi snd_timer tpm_tis tpm battery tpm_bios nvram ac rfkill snd evdev soundcore processor btrfs
 xor raid6_pq libcrc32c md_mod dm_mirror dm_region_hash dm_log dm_mod sg sr_mod cdrom sd_mod crc_t10dif crct10dif_pclmul crc32_pclmul crc32c_intel ghas
h_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul ahci glue_helper sata_sil24 ablk_helper libahci cryptd sdhci_pci libata ehci_pci sdhci firewire_ohc
i ehci_hcd mmc_core firewire_core crc_itu_t scsi_mod usbcore e1000e usb_common ptp pps_core thermal
[   44.073393] CPU: 3 PID: 3660 Comm: btrfs Tainted: G           O 3.12.0-rc1-tp520 #30
[   44.073394] Hardware name: LENOVO 42433WG/42433WG, BIOS 8AET63WW (1.43 ) 05/08/2013
[   44.073398]  0000000000000000 ffff880201e57a58 ffffffff81420f5f 0000000000000000
[   44.073401]  ffff880201e57a90 ffffffff81036a58 ffffffffa029928f ffff88020f8e11e0
[   44.073404]  ffff880201e57be0 0000000000000000 00000000000000fb ffff880201e57aa0
[   44.073405] Call Trace:
[   44.073413]  [<ffffffff81420f5f>] dump_stack+0x4e/0x82
[   44.073418]  [<ffffffff81036a58>] warn_slowpath_common+0x7a/0x93
[   44.073451]  [<ffffffffa029928f>] ? btrfs_uuid_tree_add+0x4e/0x2a0 [btrfs]
[   44.073454]  [<ffffffff81036b10>] warn_slowpath_null+0x15/0x17
[   44.073484]  [<ffffffffa029928f>] btrfs_uuid_tree_add+0x4e/0x2a0 [btrfs]
[   44.073511]  [<ffffffffa024b3ff>] ? btrfs_update_inode+0x41/0x85 [btrfs]
[   44.073536]  [<ffffffffa0245bf5>] create_pending_snapshot+0x605/0x6c5 [btrfs]
[   44.073559]  [<ffffffffa0245d22>] create_pending_snapshots+0x6d/0x89 [btrfs]
[   44.073581]  [<ffffffffa0246606>] btrfs_commit_transaction+0x3a0/0x857 [btrfs]
[   44.073610]  [<ffffffffa026b8c1>] btrfs_mksubvol.isra.27+0x253/0x37f [btrfs]
[   44.073637]  [<ffffffffa026bb0e>] btrfs_ioctl_snap_create_transid+0x121/0x151 [btrfs]
[   44.073662]  [<ffffffffa026bc70>] btrfs_ioctl_snap_create_v2+0xcb/0x11c [btrfs]
[   44.073688]  [<ffffffffa026e45c>] btrfs_ioctl+0x419/0x1e59 [btrfs]
[   44.073692]  [<ffffffff81428d04>] ? __do_page_fault+0x33d/0x3b1
[   44.073696]  [<ffffffff81008367>] ? __cycles_2_ns+0xd/0x4d
[   44.073701]  [<ffffffff8112b891>] vfs_ioctl+0x18/0x34
[   44.073704]  [<ffffffff8112c04d>] do_vfs_ioctl+0x32f/0x3e4
[   44.073708]  [<ffffffff8112c154>] SyS_ioctl+0x52/0x7d
[   44.073712]  [<ffffffff8142ad0b>] tracesys+0xdd/0xe2
[   44.073714] ---[ end trace 2968ecf24298c279 ]---
[   44.073716] ------------[ cut here ]------------
[   44.073743] WARNING: CPU: 3 PID: 3660 at fs/btrfs/uuid-tree.c:109 btrfs_uuid_tree_add+0x177/0x2a0 [btrfs]()
[   44.073777] Modules linked in: ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables rfcomm bnep bluetooth cuse nfnetlink_queue nfnetlink_log nfnetlink cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) binfmt_misc uinput ext4 crc16 mbcache jbd2 fuse ecryptfs sbs sbshc joydev hdaps(O) tp_smapi(O) thinkpad_ec(O) loop firewire_sbp2 dm_crypt snd_hda_codec_hdmi snd_hda_codec_conexant snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss x86_pkg_temp_thermal intel_powerclamp coretemp snd_pcm kvm_intel kvm snd_page_alloc iwldvm mac80211 microcode snd_seq_midi snd_seq_midi_event psmouse iwlwifi pcspkr serio_raw i2c_i801 snd_rawmidi lpc_ich mfd_core cfg80211 snd_seq snd_seq_device thinkpad_acpi snd_timer tpm_tis tpm battery tpm_bios nvram ac rfkill snd evdev soundcore processor btrfs xor raid6_pq libcrc32c md_mod dm_mirror dm_region_hash dm_log dm_mod sg sr_mod cdrom sd_mod crc_t10dif crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul ahci glue_helper sata_sil24 ablk_helper libahci cryptd sdhci_pci libata ehci_pci sdhci firewire_ohci ehci_hcd mmc_core firewire_core crc_itu_t scsi_mod usbcore e1000e usb_common ptp pps_core thermal
[   44.073804] CPU: 3 PID: 3660 Comm: btrfs Tainted: G        W  O 3.12.0-rc1-tp520 #30
[   44.073805] Hardware name: LENOVO 42433WG/42433WG, BIOS 8AET63WW (1.43 ) 05/08/2013
[   44.073808]  0000000000000000 ffff880201e57a58 ffffffff81420f5f 0000000000000000
[   44.073810]  ffff880201e57a90 ffffffff81036a58 ffffffffa02993b8 0000000000000000
[   44.073813]  ffff880201e57be0 0000000000000000 00000000000000fb ffff880201e57aa0
[   44.073814] Call Trace:
[   44.073817]  [<ffffffff81420f5f>] dump_stack+0x4e/0x82
[   44.073820]  [<ffffffff81036a58>] warn_slowpath_common+0x7a/0x93
[   44.073848]  [<ffffffffa02993b8>] ? btrfs_uuid_tree_add+0x177/0x2a0 [btrfs]
[   44.073851]  [<ffffffff81036b10>] warn_slowpath_null+0x15/0x17
[   44.073879]  [<ffffffffa02993b8>] btrfs_uuid_tree_add+0x177/0x2a0 [btrfs]
[   44.073904]  [<ffffffffa024b3ff>] ? btrfs_update_inode+0x41/0x85 [btrfs]
[   44.073926]  [<ffffffffa0245bf5>] create_pending_snapshot+0x605/0x6c5 [btrfs]
[   44.073949]  [<ffffffffa0245d22>] create_pending_snapshots+0x6d/0x89 [btrfs]
[   44.073971]  [<ffffffffa0246606>] btrfs_commit_transaction+0x3a0/0x857 [btrfs]
[   44.073998]  [<ffffffffa026b8c1>] btrfs_mksubvol.isra.27+0x253/0x37f [btrfs]
[   44.074022]  [<ffffffffa026bb0e>] btrfs_ioctl_snap_create_transid+0x121/0x151 [btrfs]
[   44.074046]  [<ffffffffa026bc70>] btrfs_ioctl_snap_create_v2+0xcb/0x11c [btrfs]
[   44.074071]  [<ffffffffa026e45c>] btrfs_ioctl+0x419/0x1e59 [btrfs]
[   44.074074]  [<ffffffff81428d04>] ? __do_page_fault+0x33d/0x3b1
[   44.074078]  [<ffffffff81008367>] ? __cycles_2_ns+0xd/0x4d
[   44.074081]  [<ffffffff8112b891>] vfs_ioctl+0x18/0x34
[   44.074085]  [<ffffffff8112c04d>] do_vfs_ioctl+0x32f/0x3e4
[   44.074088]  [<ffffffff8112c154>] SyS_ioctl+0x52/0x7d
[   44.074092]  [<ffffffff8142ad0b>] tracesys+0xdd/0xe2
[   44.074094] ---[ end trace 2968ecf24298c27a ]---
[   44.074095] ------------[ cut here ]------------
[   44.074112] WARNING: CPU: 3 PID: 3660 at fs/btrfs/super.c:255 __btrfs_abort_transaction+0x4f/0xff [btrfs]()
[   44.074113] btrfs: Transaction aborted (error -22)
[   44.074146] Modules linked in: ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables rfcomm bnep bluetooth cuse nfnetlink_queue nfnetlink_log nfnetlink cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) binfmt_misc uinput ext4 crc16 mbcache jbd2 fuse ecryptfs sbs sbshc joydev hdaps(O) tp_smapi(O) thinkpad_ec(O) loop firewire_sbp2 dm_crypt snd_hda_codec_hdmi snd_hda_codec_conexant snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss x86_pkg_temp_thermal intel_powerclamp coretemp snd_pcm kvm_intel kvm snd_page_alloc iwldvm mac80211 microcode snd_seq_midi snd_seq_midi_event psmouse iwlwifi pcspkr serio_raw i2c_i801 snd_rawmidi lpc_ich mfd_core cfg80211 snd_seq snd_seq_device thinkpad_acpi snd_timer tpm_tis tpm battery tpm_bios nvram ac rfkill snd evdev soundcore processor btrfs xor raid6_pq libcrc32c md_mod dm_mirror dm_region_hash dm_log dm_mod sg sr_mod cdrom sd_mod crc_t10dif crct10dif_pclmul crc32_pclmul crc32c_intel ghas
h_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul ahci glue_helper sata_sil24 ablk_helper libahci cryptd sdhci_pci libata ehci_pci sdhci firewire_ohci ehci_hcd mmc_core firewire_core crc_itu_t scsi_mod usbcore e1000e usb_common ptp pps_core thermal
[   44.074173] CPU: 3 PID: 3660 Comm: btrfs Tainted: G        W  O 3.12.0-rc1-tp520 #30
[   44.074174] Hardware name: LENOVO 42433WG/42433WG, BIOS 8AET63WW (1.43 ) 05/08/2013
[   44.074177]  0000000000000000 ffff880201e57a68 ffffffff81420f5f ffff880201e57ab0
[   44.074180]  ffff880201e57aa0 ffffffff81036a58 ffffffffa0225c91 00000000ffffffea
[   44.074182]  ffff8800cf97b000 ffff88020f8e11e0 ffffffffa02a19b0 ffff880201e57b00
[   44.074183] Call Trace:
[   44.074186]  [<ffffffff81420f5f>] dump_stack+0x4e/0x82
[   44.074189]  [<ffffffff81036a58>] warn_slowpath_common+0x7a/0x93
[   44.074207]  [<ffffffffa0225c91>] ? __btrfs_abort_transaction+0x4f/0xff [btrfs]
[   44.074212]  [<ffffffff81036ab8>] warn_slowpath_fmt+0x47/0x49
[   44.074231]  [<ffffffffa0225c91>] __btrfs_abort_transaction+0x4f/0xff [btrfs]
[   44.074253]  [<ffffffffa0245c64>] create_pending_snapshot+0x674/0x6c5 [btrfs]
[   44.074276]  [<ffffffffa0245d22>] create_pending_snapshots+0x6d/0x89 [btrfs]
[   44.074298]  [<ffffffffa0246606>] btrfs_commit_transaction+0x3a0/0x857 [btrfs]
[   44.074323]  [<ffffffffa026b8c1>] btrfs_mksubvol.isra.27+0x253/0x37f [btrfs]
[   44.074353]  [<ffffffffa026bb0e>] btrfs_ioctl_snap_create_transid+0x121/0x151 [btrfs]
[   44.074389]  [<ffffffffa026bc70>] btrfs_ioctl_snap_create_v2+0xcb/0x11c [btrfs]
[   44.074414]  [<ffffffffa026e45c>] btrfs_ioctl+0x419/0x1e59 [btrfs]
[   44.074418]  [<ffffffff81428d04>] ? __do_page_fault+0x33d/0x3b1
[   44.074421]  [<ffffffff81008367>] ? __cycles_2_ns+0xd/0x4d
[   44.074425]  [<ffffffff8112b891>] vfs_ioctl+0x18/0x34
[   44.074428]  [<ffffffff8112c04d>] do_vfs_ioctl+0x32f/0x3e4
[   44.074432]  [<ffffffff8112c154>] SyS_ioctl+0x52/0x7d
[   44.074436]  [<ffffffff8142ad0b>] tracesys+0xdd/0xe2
[   44.074444] ---[ end trace 2968ecf24298c27b ]---
[   44.074447] BTRFS error (device sdb3) in create_pending_snapshot:1321: errno=-22 unknown
[   44.074449] BTRFS info (device sdb3): forced readonly
[   44.074453] BTRFS warning (device sdb3): Skipping commit of aborted transaction.
[   44.074455] BTRFS error (device sdb3) in cleanup_transaction:1526: errno=-22 unknown

The snapshot was not created.



The first time I got this I immediately scrubbed that / filesystem. The
scrub was fine.

I tried doing this snapshot another time after a reboot, to see whether it
happens again. It happened again. And this time the scrub got two errors:

merkaba:~> btrfs scrub status /
scrub status for 2f5c334d-249b-4c89-95cc-18572f750bd7
        scrub started at Sat Sep 21 00:12:08 2013 and finished after 56 seconds
        total bytes scrubbed: 11.91GB with 2 errors
        error details: csum=2
        corrected errors: 0, uncorrectable errors: 2, unverified errors: 0


[   17.104914] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   17.105028] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   33.956693] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.794269] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.794375] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.794471] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.794634] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.795022] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   63.895130] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   63.895499] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   68.374843] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   68.375500] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   70.847884] btrfs_readpage_end_io_hook: 2 callbacks suppressed
[   70.847892] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   70.848082] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   70.848422] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   72.125945] btrfs: checksum error at logical 347918336 on dev /dev/sdb3, sector 679528, root 256, inode 703589, offset 122880, length 3984, links 1 (path: var/log/auth.log)
[   72.125955] btrfs: bdev /dev/sdb3 errs: wr 0, rd 0, flush 0, corrupt 3, gen 0
[   72.125959] btrfs: unable to fixup (regular) error at logical 347918336 on dev /dev/sdb3
[   72.206252] btrfs: checksum error at logical 366272512 on dev /dev/sdb3, sector 715376, root 256, inode 760517, offset 16384, length 3541, links 1 (path: root/.histfile)
[   72.206265] btrfs: bdev /dev/sdb3 errs: wr 0, rd 0, flush 0, corrupt 4, gen 0
[   72.206268] btrfs: unable to fixup (regular) error at logical 366272512 on dev /dev/sdb3
[   72.713399] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   72.715442] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   72.717669] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   74.810905] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   74.813084] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   74.815255] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   75.695764] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   76.409554] btrfs_readpage_end_io_hook: 2 callbacks suppressed
[   76.409561] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   76.411572] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   76.411830] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.205828] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.207987] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.210291] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.928918] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.931074] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   77.933276] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   78.693109] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   85.671161] btrfs_readpage_end_io_hook: 9 callbacks suppressed
[   85.671169] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   85.673191] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   85.675278] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   90.543999] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   90.545910] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[   90.546076] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  188.834397] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  188.834575] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  188.834674] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  189.805403] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  189.805577] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  189.805667] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  198.813652] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  198.813847] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  198.814033] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  200.063988] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  200.064164] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  200.064264] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  214.143681] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  214.143931] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  214.144060] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  217.938320] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  217.938474] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  217.938555] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  219.652380] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  219.652582] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  219.652746] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  220.495560] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  220.495776] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  220.495900] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  230.830936] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  230.831105] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  230.831202] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  243.215473] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  243.215566] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  243.218387] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  243.218473] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  248.000900] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  301.681557] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  301.681713] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  363.253315] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  363.253475] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  363.256074] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  363.256252] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  470.728931] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  470.729234] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  470.729443] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  478.882101] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  478.882526] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  478.882715] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  492.253997] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  492.254377] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142
[  492.254611] BTRFS info (device sdb3): csum failed ino 760517 off 16384 csum 2078849109 expected csum 2695320142



This is on an only about a month old 30 GB Intel mSATA SSDMCEAC030B3 which
reports SMART status as passed and has media wearout indicator still at 100
from 100. So I bet the SSD is still fine.

Will do self-tests tomorrow nonetheless.


I didn´t try this on another BTRFS filesystem and I do not feel comfortable
trying it on /home, although I have a backup from yesterday, but in a different
city :).


I will scrub /home tough, but I bet it is just fine.

I will keep / as is for now. Maybe you have some ideas where I can dig at.
Since this is a production laptop (well, heck, I still test rc1 kernels on
it), I will redo the filesystem if needed for proper operation.


To what I recall snapshots just worked fine with 3.11.

Thanks,
-- 
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA  B82F 991B EAAC A599 84C7ÿôèº{.nÇ+‰·Ÿ®‰­†+%ŠËÿ±éݶ\x17¥Šwÿº{.nÇ+‰·¥Š{±ý»k~ÏâžØ^n‡r¡ö¦zË\x1aëh™¨è­Ú&£ûàz¿äz¹Þ—ú+€Ê+zf£¢·hšˆ§~†­†Ûiÿÿïêÿ‘êçz_è®\x0fæj:+v‰¨þ)ߣøm

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-20 22:25 [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem Martin Steigerwald
@ 2013-09-21  2:01 ` Josef Bacik
  2013-09-21  2:16 ` Josef Bacik
  2013-09-21  2:34 ` Josef Bacik
  2 siblings, 0 replies; 8+ messages in thread
From: Josef Bacik @ 2013-09-21  2:01 UTC (permalink / raw)
  To: Martin Steigerwald; +Cc: linux-btrfs

On Sat, Sep 21, 2013 at 12:25:02AM +0200, Martin Steigerwald wrote:
> Hi!
> 
> I tried to create a snapshot today like this:
> 
> merkaba:/mnt/debian-zeit> ls -l
> insgesamt 0
> drwxr-xr-x 1 root root 210 Sep 20 11:48 root
> merkaba:/mnt/debian-zeit> btrfs subvol list /
> ID 256 gen 21382 top level 5 path root
> merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
> merkaba:/mnt/debian-zeit#1>
>

Do you happen to have your messages from the point where you first booted the
3.12 kernel?  Could you attach them please if you do?  Thanks,

Josef 

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-20 22:25 [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem Martin Steigerwald
  2013-09-21  2:01 ` Josef Bacik
@ 2013-09-21  2:16 ` Josef Bacik
  2013-09-21  2:34 ` Josef Bacik
  2 siblings, 0 replies; 8+ messages in thread
From: Josef Bacik @ 2013-09-21  2:16 UTC (permalink / raw)
  To: Martin Steigerwald; +Cc: linux-btrfs

On Sat, Sep 21, 2013 at 12:25:02AM +0200, Martin Steigerwald wrote:
> Hi!
> 
> I tried to create a snapshot today like this:
> 
> merkaba:/mnt/debian-zeit> ls -l
> insgesamt 0
> drwxr-xr-x 1 root root 210 Sep 20 11:48 root
> merkaba:/mnt/debian-zeit> btrfs subvol list /
> ID 256 gen 21382 top level 5 path root
> merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
> merkaba:/mnt/debian-zeit#1>
> 

Nevermind I've reproduced it, sorry about that I will get it fixed up and a
patch sent out right away.  Thanks,

Josef

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-20 22:25 [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem Martin Steigerwald
  2013-09-21  2:01 ` Josef Bacik
  2013-09-21  2:16 ` Josef Bacik
@ 2013-09-21  2:34 ` Josef Bacik
  2013-09-21  8:54   ` Martin Steigerwald
  2013-09-21 11:20   ` Ahmet Inan
  2 siblings, 2 replies; 8+ messages in thread
From: Josef Bacik @ 2013-09-21  2:34 UTC (permalink / raw)
  To: Martin Steigerwald; +Cc: linux-btrfs

On Sat, Sep 21, 2013 at 12:25:02AM +0200, Martin Steigerwald wrote:
> Hi!
> 
> I tried to create a snapshot today like this:
> 
> merkaba:/mnt/debian-zeit> ls -l
> insgesamt 0
> drwxr-xr-x 1 root root 210 Sep 20 11:48 root
> merkaba:/mnt/debian-zeit> btrfs subvol list /
> ID 256 gen 21382 top level 5 path root
> merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
> merkaba:/mnt/debian-zeit#1>
> 

You will want the patch I just sent,

Btrfs: create the uuid tree on remount rw

and that should fix the snapshot problems.  Thanks,

Josef

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-21  2:34 ` Josef Bacik
@ 2013-09-21  8:54   ` Martin Steigerwald
  2013-09-21  9:50     ` Martin Steigerwald
  2013-09-21 11:20   ` Ahmet Inan
  1 sibling, 1 reply; 8+ messages in thread
From: Martin Steigerwald @ 2013-09-21  8:54 UTC (permalink / raw)
  To: Josef Bacik; +Cc: linux-btrfs

Am Freitag, 20. September 2013, 22:34:15 schrieb Josef Bacik:
> On Sat, Sep 21, 2013 at 12:25:02AM +0200, Martin Steigerwald wrote:
> > Hi!
> > 
> > I tried to create a snapshot today like this:
> > 
> > merkaba:/mnt/debian-zeit> ls -l
> > insgesamt 0
> > drwxr-xr-x 1 root root 210 Sep 20 11:48 root
> > merkaba:/mnt/debian-zeit> btrfs subvol list /
> > ID 256 gen 21382 top level 5 path root
> > merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
> > merkaba:/mnt/debian-zeit#1>
> 
> You will want the patch I just sent,
> 
> Btrfs: create the uuid tree on remount rw
> 
> and that should fix the snapshot problems.  Thanks,

Many thanks, will compile a kernel with that patch.

Any idea how I can fix these checksum errors without reformatting? I would like 
to avoid it if possible.

In syslog it didn´t mention which files they affected, so maybe it is "just" 
metadata. But I just have single for metadata as per suggestion for SSDs.

Would it make sense to run fsck on it?

Thanks,
-- 
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA  B82F 991B EAAC A599 84C7

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-21  8:54   ` Martin Steigerwald
@ 2013-09-21  9:50     ` Martin Steigerwald
  0 siblings, 0 replies; 8+ messages in thread
From: Martin Steigerwald @ 2013-09-21  9:50 UTC (permalink / raw)
  To: Josef Bacik; +Cc: linux-btrfs

Am Samstag, 21. September 2013, 10:54:55 schrieb Martin Steigerwald:
> Am Freitag, 20. September 2013, 22:34:15 schrieb Josef Bacik:
> > On Sat, Sep 21, 2013 at 12:25:02AM +0200, Martin Steigerwald wrote:
> > > Hi!
> > > 
> > > I tried to create a snapshot today like this:
> > > 
> > > merkaba:/mnt/debian-zeit> ls -l
> > > insgesamt 0
> > > drwxr-xr-x 1 root root 210 Sep 20 11:48 root
> > > merkaba:/mnt/debian-zeit> btrfs subvol list /
> > > ID 256 gen 21382 top level 5 path root
> > > merkaba:/mnt/debian-zeit> btrfs subvol snap -r root root-2013-09-20
> > > merkaba:/mnt/debian-zeit#1>
> > 
> > You will want the patch I just sent,
> > 
> > Btrfs: create the uuid tree on remount rw
> > 
> > and that should fix the snapshot problems.  Thanks,
> 
> Many thanks, will compile a kernel with that patch.

Patch works. I can create snapshots again.

Reported-and-tested-by: Martin Steigerwald <martin@lichtvoll.de>

> Any idea how I can fix these checksum errors without reformatting? I would
> like to avoid it if possible.
> 
> In syslog it didn´t mention which files they affected, so maybe it is "just"
> metadata. But I just have single for metadata as per suggestion for SSDs.
> 
> Would it make sense to run fsck on it?

This problem remains. Could a rebalance of just the metadata help?

I just have these two csum errors but don´t see anything misbehaving.

Oh well, I had two csum errors, now it is one:

merkaba:~> btrfs scrub status /
scrub status for 2f5c334d-249b-4c89-95cc-18572f750bd7
        scrub started at Sat Sep 21 11:39:24 2013 and finished after 56 seconds
        total bytes scrubbed: 11.94GB with 1 errors
        error details: csum=1
        corrected errors: 0, uncorrectable errors: 1, unverified errors: 0


Uhm, I have a file name this time. I didn´t see it initially:
[   46.057796] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   46.057899] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   46.083806] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   46.083914] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   46.084012] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[   46.084102] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  128.121975] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  128.122159] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  128.428071] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  128.428182] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  140.865751] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  140.865903] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  140.867493] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  140.867736] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  140.868201] BTRFS info (device sdb3): csum failed ino 703589 off 122880 csum 4200103148 expected csum 331742955
[  152.628173] btrfs: checksum error at logical 347918336 on dev /dev/sdb3, sector 679528, root 256, inode 703589, offset 122880, length 3984, links 1 (path: var/log/auth.log)
[  152.643045] btrfs: checksum error at logical 347918336 on dev /dev/sdb3, sector 679528, root 279, inode 703589, offset 122880, length 3984, links 1 (path: var/log/auth.log)
[  152.643059] btrfs: bdev /dev/sdb3 errs: wr 0, rd 0, flush 0, corrupt 5, gen 0
[  152.643062] btrfs: unable to fixup (regular) error at logical 347918336 on dev /dev/sdb3


I just run

merkaba:~> truncate -s0 /var/log/auth.log


And rescrub:

That wasn´t enough. So maybe its the directory entry.


I will remove the file.

Didn´t help either.


Oh wait, I just created a snapshot. May be a good idea to delete it again.

Uhm:

[  436.342060] btrfs: checksum error at logical 347918336 on dev /dev/sdb3, sector 679528, root 279, inode 703589, offset 122880, length 3984, links 1 (path: var/log/auth.log)
[  436.342072] btrfs: bdev /dev/sdb3 errs: wr 0, rd 0, flush 0, corrupt 7, gen 0
[  436.342075] btrfs: unable to fixup (regular) error at logical 347918336 on dev /dev/sdb3
[  496.413453] btrfs: checksum error at logical 347918336 on dev /dev/sdb3, sector 679528, root 279, inode 703589, offset 122880: path resolving failed with ret=-2
[  496.413471] btrfs: bdev /dev/sdb3 errs: wr 0, rd 0, flush 0, corrupt 8, gen 0
[  496.413475] btrfs: unable to fixup (regular) error at logical 347918336 on dev /dev/sdb3


It should be gone by now. Maybe snapshot deletion didn´t yet complete.


There we go. All is happy again:

merkaba:~> btrfs scrub status /
scrub status for 2f5c334d-249b-4c89-95cc-18572f750bd7
        scrub started at Sat Sep 21 11:47:57 2013 and finished after 60 seconds
        total bytes scrubbed: 11.94GB with 0 errors

Thanks,
-- 
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA  B82F 991B EAAC A599 84C7

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-21  2:34 ` Josef Bacik
  2013-09-21  8:54   ` Martin Steigerwald
@ 2013-09-21 11:20   ` Ahmet Inan
  2013-09-22  9:39     ` Ahmet Inan
  1 sibling, 1 reply; 8+ messages in thread
From: Ahmet Inan @ 2013-09-21 11:20 UTC (permalink / raw)
  To: Josef Bacik; +Cc: Martin Steigerwald, linux-btrfs

> You will want the patch I just sent,
>
> Btrfs: create the uuid tree on remount rw
>
> and that should fix the snapshot problems.  Thanks,

thanks Josef - you can close this bug:

https://bugzilla.kernel.org/show_bug.cgi?id=61301

then. will try your patch later, too.

Ahmet

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem
  2013-09-21 11:20   ` Ahmet Inan
@ 2013-09-22  9:39     ` Ahmet Inan
  0 siblings, 0 replies; 8+ messages in thread
From: Ahmet Inan @ 2013-09-22  9:39 UTC (permalink / raw)
  To: Josef Bacik; +Cc: Martin Steigerwald, linux-btrfs

On Sat, Sep 21, 2013 at 1:20 PM, Ahmet Inan
<ainan@mathematik.uni-freiburg.de> wrote:
>> You will want the patch I just sent,
>>
>> Btrfs: create the uuid tree on remount rw
>>
>> and that should fix the snapshot problems.  Thanks,
>
> thanks Josef - you can close this bug:
>
> https://bugzilla.kernel.org/show_bug.cgi?id=61301
>
> then. will try your patch later, too.
>
ok, works for me, too.

changed state of bug to "resolved"

Ahmet

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2013-09-22  9:39 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-09-20 22:25 [REGRESSION] 3.12-rc1: Trying to create snapshot corrupted filesystem Martin Steigerwald
2013-09-21  2:01 ` Josef Bacik
2013-09-21  2:16 ` Josef Bacik
2013-09-21  2:34 ` Josef Bacik
2013-09-21  8:54   ` Martin Steigerwald
2013-09-21  9:50     ` Martin Steigerwald
2013-09-21 11:20   ` Ahmet Inan
2013-09-22  9:39     ` Ahmet Inan

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).