From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wi0-f176.google.com ([209.85.212.176]:33147 "EHLO mail-wi0-f176.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750803AbbJKLKq (ORCPT ); Sun, 11 Oct 2015 07:10:46 -0400 Received: by wiclk2 with SMTP id lk2so119833064wic.0 for ; Sun, 11 Oct 2015 04:10:45 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: References: Date: Sun, 11 Oct 2015 13:10:45 +0200 Message-ID: Subject: Re: kernel BUG at /linux/fs/btrfs/extent-tree.c:1833! From: Peter Becker To: Henk Slager , linux-btrfs@vger.kernel.org Content-Type: text/plain; charset=UTF-8 Sender: linux-btrfs-owner@vger.kernel.org List-ID: I also get some of this syslog entrys: [ 3840.616538] INFO: task btrfs-transacti:11423 blocked for more than 120 seconds. [ 3840.616541] Tainted: G U 4.2.3-040203-generic #201510030832 [ 3840.616542] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 3840.616543] btrfs-transacti D 0000000000000000 0 11423 2 0x00000000 [ 3840.616545] ffff88007bd2bb58 0000000000000046 ffff880215d7e600 ffff8802121d2640 [ 3840.616547] 0000000000000246 ffff88007bd2c000 ffff8801aa607928 ffff8801aa607940 [ 3840.616548] ffff88007bd2bba8 ffff8801aa607920 ffff88007bd2bb78 ffffffff817a6357 [ 3840.616549] Call Trace: [ 3840.616555] [] schedule+0x37/0x80 [ 3840.616569] [] btrfs_tree_lock+0x98/0x1c0 [btrfs] [ 3840.616572] [] ? prepare_to_wait_event+0xf0/0xf0 [ 3840.616576] [] btrfs_search_slot+0x6e5/0x9c0 [btrfs] [ 3840.616579] [] ? __set_page_dirty_nobuffers+0xe7/0x140 [ 3840.616584] [] btrfs_lookup_inode+0x2f/0xa0 [btrfs] [ 3840.616592] [] ? btrfs_delete_delayed_items+0x313/0x330 [btrfs] [ 3840.616598] [] ? btrfs_insert_delayed_items+0xa4/0x420 [btrfs] [ 3840.616604] [] __btrfs_update_delayed_inode+0x5a/0x1e0 [btrfs] [ 3840.616609] [] __btrfs_run_delayed_items+0x17c/0x210 [btrfs] [ 3840.616615] [] btrfs_run_delayed_items+0x13/0x20 [btrfs] [ 3840.616621] [] btrfs_commit_transaction+0x2c8/0xb10 [btrfs] [ 3840.616626] [] ? start_transaction+0x93/0x580 [btrfs] [ 3840.616631] [] transaction_kthread+0x1ba/0x240 [btrfs] [ 3840.616636] [] ? btrfs_cleanup_transaction+0x540/0x540 [btrfs] [ 3840.616637] [] kthread+0xc9/0xe0 [ 3840.616638] [] ? kthread_create_on_node+0x180/0x180 [ 3840.616640] [] ret_from_fork+0x3f/0x70 [ 3840.616641] [] ? kthread_create_on_node+0x180/0x180 Is this helpfull? The filesystem is usable but i need the new space. 2015-10-10 21:48 GMT+02:00 Peter Becker : > btrfs balance start -m /media/RAID > > complete with out any error but the resulte of device usage is confusing me. > Metadata on sdb and sdc are 2 GiB, but on sdd (the new added device) > is 4 GiB. And the 2. one that's confusing me, is that sdd has a > "System" entry but sdb and sdc dosn't > > floyd@nas ~ $ sudo btrfs dev us /media/RAID/ > /dev/sdb, ID: 1 > Device size: 2.73TiB > Data,RAID1: 2.11TiB > Metadata,RAID1: 2.00GiB > System,RAID1: 32.00MiB > Unallocated: 628.49GiB > > /dev/sdc, ID: 2 > Device size: 2.73TiB > Data,RAID1: 2.11TiB > Metadata,RAID1: 2.00GiB > Unallocated: 628.52GiB > > /dev/sdd, ID: 3 > Device size: 2.73TiB > Data,RAID1: 792.00GiB > Metadata,RAID1: 4.00GiB > System,RAID1: 32.00MiB > Unallocated: 1.95TiB > > 2015-10-10 21:23 GMT+02:00 Peter Becker : >> Hi Henk, >> >> i have try it with kernel 4.1.6 and 4.2.3; btrfs progs 4.2.1 and 4.2.2 >> .. the same error. >> System freeze after 70% of balancing. >> >> Scrub complete without error. >> >> has someone a hint what i can do now? >> >> 2015-10-09 15:52 GMT+02:00 Henk Slager : >>> Hi Peter, >>> >>> I would try to add the mount option skip_balance for your raid1 >>> pool first, then see if you can use your system as you normally would. >>> I assume you can live without explicit (re-)balance for some time, >>> i.e. that the original disks are not too full. >>> >>> I recently did also some disks add/remove and also raid profile >>> convert and found out that kernel 4.2.x did crash my system with >>> various kernel bugs. So I switched back to 4.1.6 and although other >>> bugs hit me (see https://bugzilla.kernel.org/show_bug.cgi?id=104371 ) >>> the actions I wanted did complete. >>> >>> Using "btrfs check --repair" has never resulted in succes for me (for >>> some root filesystems (single profiles for s m d) on real and virual >>> machines), so I would only use that once you have your files backed up >>> on some other (cloned) filesystem. >>> >>> /Henk >>> >>> On Fri, Oct 9, 2015 at 9:41 AM, Peter Becker wrote: >>>> >>>> At first i add a new device to my btrfs raid1 pool and start balance. >>>> After ~5 hours, balanace hangs and cpu-usage goes to 100% (kworker/u4 >>>> use all cpu-power). >>>> >>>> What should i do now? Run "btrfs check --repair" on all devices? >>>> >>>> Kernel: 4.2.3-040203-generic >>>> Btrfs progs v4.2.1 >>>> >>>> Full Syslog: https://bugzilla.kernel.org/show_bug.cgi?id=105681 >>>> >>>> From Syslog: >>>> >>>> [16880.495586] kernel BUG at /home/kernel/COD/linux/fs/btrfs/extent-tree.c:1833! >>>> [16880.495603] invalid opcode: 0000 [#1] SMP >>>> [16880.495614] Modules linked in: xt_nat veth xt_conntrack xt_addrtype >>>> br_netfilter nvram dm_thin_pool dm_persistent_data msr dm_bio_prison >>>> dm_bufio libcrc32c ir_lirc_codec ir_xmp_decoder lirc_dev >>>> ir_mce_kbd_decoder ir_sharp_decoder ir_sony_decoder ir_sanyo_decoder >>>> ir_jvc_decoder ir_rc6_decoder ir_rc5_decoder ir_nec_decoder rc_rc6_mce >>>> xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 >>>> iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 mceusb nf_nat_ipv4 >>>> rc_core nf_nat nf_conntrack input_leds joydev xt_tcpudp bridge stp llc >>>> iptable_filter ip_tables x_tables autofs4 eeepc_wmi asus_wmi >>>> sparse_keymap dm_multipath scsi_dh intel_rapl iosf_mbi >>>> x86_pkg_temp_thermal intel_powerclamp kvm crct10dif_pclmul >>>> crc32_pclmul snd_seq_midi snd_seq_midi_event snd_rawmidi >>>> ghash_clmulni_intel cryptd snd_hda_codec_hdmi rfcomm snd_seq serio_raw >>>> bnep snd_hda_codec_realtek snd_hda_codec_generic bluetooth >>>> snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm >>>> snd_seq_device lpc_ich snd_timer mei_me mei snd shpchp mac_hid >>>> soundcore parport_pc ppdev nfsd nct6775 hwmon_vid coretemp auth_rpcgss >>>> nfs_acl nfs lockd grace binfmt_misc sunrpc lp parport fscache >>>> nls_iso8859_1 btrfs xor raid6_pq dm_mirror dm_region_hash dm_log >>>> hid_generic usbhid hid uas usb_storage psmouse ahci libahci wmi i915 >>>> video i2c_algo_bit drm_kms_helper drm e1000e ptp pps_core >>>> [16880.495944] CPU: 0 PID: 5967 Comm: btrfs Tainted: G U >>>> 4.2.3-040203-generic #201510030832 >>>> [16880.495964] Hardware name: ASUS All Series/H87I-PLUS, BIOS 2003 11/05/2014 >>>> [16880.495979] task: ffff8800a918b300 ti: ffff8800a9f00000 task.ti: >>>> ffff8800a9f00000 >>>> [16880.495995] RIP: 0010:[] [] >>>> insert_inline_extent_backref+0xc6/0xd0 [btrfs] >>>> [16880.496028] RSP: 0018:ffff8800a9f03718 EFLAGS: 00010293 >>>> [16880.496042] RAX: 0000000000000000 RBX: 0000000000000000 RCX: ffff8800a9f03750 >>>> [16880.496060] RDX: 0000000000000001 RSI: 0000000000000001 RDI: 0000000000000000 >>>> [16880.496077] RBP: ffff8800a9f03788 R08: 0000000000004000 R09: ffff8800a9f03608 >>>> [16880.496093] R10: 0000000000000000 R11: 0000000000000002 R12: ffff880214b11000 >>>> [16880.496109] R13: 000006a203798000 R14: 0000000000000000 R15: ffff8800d80c7990 >>>> [16880.496125] FS: 00007fd9f02fa900(0000) GS:ffff88021fa00000(0000) >>>> knlGS:0000000000000000 >>>> [16880.496146] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >>>> [16880.496159] CR2: 00007f438fbc2000 CR3: 00000000a90ae000 CR4: 00000000000406f0 >>>> [16880.496175] Stack: >>>> [16880.496180] 000006a203798000 0000000000000414 0000000000000000 >>>> 0000000000000000 >>>> [16880.496200] 0000000000000001 ffffffffc02cef1a ffff8800a9f037f8 >>>> 0000000000002bbd >>>> [16880.496220] ffff88016f19f2e0 ffff880214b10000 ffff88016f19f2e0 >>>> ffff8800d80c7990 >>>> [16880.496239] Call Trace: >>>> [16880.496251] [] ? btrfs_alloc_path+0x1a/0x20 [btrfs] >>>> [16880.496271] [] >>>> __btrfs_inc_extent_ref.isra.51+0x98/0x250 [btrfs] >>>> [16880.496295] [] >>>> __btrfs_run_delayed_refs+0xcfa/0x1070 [btrfs] >>>> [16880.496316] [] ? __percpu_counter_add+0x55/0x70 >>>> [16880.496337] [] >>>> btrfs_run_delayed_refs.part.73+0x6e/0x280 [btrfs] >>>> [16880.496360] [] btrfs_run_delayed_refs+0x17/0x20 [btrfs] >>>> [16880.496383] [] >>>> btrfs_should_end_transaction+0x49/0x60 [btrfs] >>>> [16880.496407] [] btrfs_drop_snapshot+0x439/0x830 [btrfs] >>>> [16880.496431] [] ? >>>> invalidate_extent_cache+0x160/0x1a0 [btrfs] >>>> [16880.496455] [] merge_reloc_roots+0xd2/0x230 [btrfs] >>>> [16880.496475] [] relocate_block_group+0x256/0x600 [btrfs] >>>> [16880.496495] [] >>>> btrfs_relocate_block_group+0x1c3/0x2d0 [btrfs] >>>> [16880.496517] [] >>>> btrfs_relocate_chunk.isra.39+0x3e/0xc0 [btrfs] >>>> [16880.496537] [] __btrfs_balance+0x48f/0x8c0 [btrfs] >>>> [16880.496556] [] btrfs_balance+0x37d/0x650 [btrfs] >>>> [16880.496575] [] ? btrfs_ioctl_balance+0x284/0x510 [btrfs] >>>> [16880.496594] [] btrfs_ioctl_balance+0x164/0x510 [btrfs] >>>> [16880.496613] [] btrfs_ioctl+0x56f/0x2470 [btrfs] >>>> [16880.496628] [] ? >>>> lru_cache_add_active_or_unevictable+0x2b/0xa0 >>>> [16880.496645] [] ? handle_mm_fault+0xb8a/0x1810 >>>> [16880.496658] [] ? vma_link+0xb9/0xc0 >>>> [16880.496670] [] do_vfs_ioctl+0x2cd/0x4b0 >>>> [16880.496684] [] ? __do_page_fault+0x1b7/0x430 >>>> [16880.496697] [] SyS_ioctl+0x79/0x90 >>>> [16880.496709] [] entry_SYSCALL_64_fastpath+0x16/0x75 >>>> [16880.496723] Code: 45 10 49 89 d9 48 8b 55 c8 4c 89 34 24 4c 89 e9 >>>> 4c 89 fe 4c 89 e7 48 89 44 24 10 8b 45 28 89 44 24 08 e8 4e e4 ff ff >>>> 31 c0 eb bb <0f> 0b 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 >>>> 41 57 >>>> [16880.496802] RIP [] >>>> insert_inline_extent_backref+0xc6/0xd0 [btrfs] >>>> [16880.496823] RSP >>>> [16880.502316] ---[ end trace bcd7d52a2c7cfdc7 ]--- >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html