From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from magic.merlins.org ([209.81.13.136]:47469 "EHLO mail1.merlins.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752457Ab3ERBZN (ORCPT ); Fri, 17 May 2013 21:25:13 -0400
Date: Fri, 17 May 2013 18:25:12 -0700
From: Marc MERLIN
To: Josef Bacik , linux-btrfs@vger.kernel.org
Subject: Re: kernel 3.8.8: btrfs still crashes on boot when it can't replay a log
Message-ID: <20130518012512.GD28168@merlins.org>
References: <20130516150918.GB26762@merlins.org> <20130517154811.GA19176@merlins.org> <20130516150918.GB26762@merlins.org> <20130517165456.GH1765@localhost.localdomain>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
In-Reply-To: <20130517154811.GA19176@merlins.org> <20130517165456.GH1765@localhost.localdomain>
Sender: linux-btrfs-owner@vger.kernel.org
List-ID:

On Fri, May 17, 2013 at 12:54:56PM -0400, Josef Bacik wrote:
> > If so, could you add this to the list of things to fix to make btrfs a bit
> > less scary to others? :)
> > (and of course more production ready, this repeated problem would kill any
> > server it happens on)
>
> This has been all fixed in 3.10. Thanks,

This is fantastic news, thanks a lot for that.

One question left:
How can I tell, in a problem like the one below, whether btrfs is having
issues or whether my hardware is hanging?
When this happened, I didn't get any SATA errors from the kernel, but I had
to reboot to clear all those hangs (after the reboot it was ok).

Thanks,
Marc

On Fri, May 17, 2013 at 08:48:11AM -0700, Marc MERLIN wrote:
> Sigh, last night my laptop hung again, I don't have a way to know why.
>
> When I rebooted with 3.9.2, soon after boot, I started to get this:
> INFO: task btrfs-transacti:520 blocked for more than 120 seconds.
> "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> btrfs-transacti D ffff8802139aa798 0 520 2 0x00000000
> ffff88021435b8d8 0000000000000046 ffffffff8108b708 0000000000000296
> ffff8802139aa380 ffff88021435bfd8 ffff88021435bfd8 0000000000013f00
> ffff88021552e380 ffff8802139aa380 ffff88021435b8e8 ffff8801da07f120
> Call Trace:
> [] ? arch_local_irq_save+0x15/0x1b
> [] schedule+0x5f/0x61
> [] btrfs_tree_lock+0x78/0x234
> [] ? add_wait_queue+0x44/0x44
> [] btrfs_lock_root_node+0x1d/0x3c
> [] btrfs_search_slot+0x184/0x517
> [] ? zone_statistics+0x77/0x7e
> [] lookup_inline_extent_backref+0x99/0x374
> [] ? cpuacct_charge+0x5f/0x67
> [] insert_inline_extent_backref+0x57/0xd4
> [] ? kmem_cache_alloc+0x87/0x109
> [] __btrfs_inc_extent_ref+0x93/0x1c2
> [] run_clustered_refs+0x705/0x7d4
> [] ? btrfs_find_ref_cluster+0xc7/0x120
> [] btrfs_run_delayed_refs+0x234/0x3da
> [] ? btrfs_run_ordered_operations+0x261/0x273
> [] btrfs_commit_transaction+0xac/0x886
> [] ? add_wait_queue+0x44/0x44
> [] transaction_kthread+0xe7/0x18a
> [] ? try_to_freeze+0x35/0x35
> [] kthread+0x88/0x90
> [] ? kthread_freezable_should_stop+0x39/0x39
> [] ret_from_fork+0x7c/0xb0
> [] ? kthread_freezable_should_stop+0x39/0x39
>
> INFO: task firefox-bin:8553 blocked for more than 120 seconds.
> "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> firefox-bin D ffff8801ede4ea58 0 8553 4885 0x00000080
> ffff8801c1e6f918 0000000000000086 ffffffff8108b708 0000000000000292
> ffff8801ede4e640 ffff8801c1e6ffd8 ffff8801c1e6ffd8 0000000000013f00
> ffff88021552a340 ffff8801ede4e640 ffff8801c1e6f928 ffff8801ce0299d0
> Call Trace:
> [] ? arch_local_irq_save+0x15/0x1b
> [] schedule+0x5f/0x61
> [] btrfs_tree_lock+0xd5/0x234
> [] ? add_wait_queue+0x44/0x44
> [] btrfs_search_slot+0x3d6/0x517
> [] lookup_inline_extent_backref+0x99/0x374
> [] insert_inline_extent_backref+0x57/0xd4
> [] ? kmem_cache_alloc+0x87/0x109
> [] __btrfs_inc_extent_ref+0x93/0x1c2
> [] run_clustered_refs+0x705/0x7d4
> [] ? _raw_spin_lock_irq+0x9/0x24
> [] ? btrfs_find_ref_cluster+0xc7/0x120
> [] btrfs_run_delayed_refs+0x234/0x3da
> [] ? btrfs_run_ordered_operations+0x261/0x273
> [] btrfs_commit_transaction+0xac/0x886
> [] ? _raw_spin_lock+0x1b/0x1f
> [] ? add_wait_queue+0x44/0x44
> [] ? btrfs_log_dentry_safe+0x43/0x51
> [] btrfs_sync_file+0x23b/0x277
> [] vfs_fsync_range+0x1e/0x20
> [] vfs_fsync+0x17/0x19
> [] do_fsync+0x35/0x53
> [] ? current_kernel_time+0x14/0x38
> [] sys_fsync+0xb/0xf
> [] system_call_fastpath+0x1a/0x1f

-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.
Microsoft is to operating systems ....
                                      .... what McDonalds is to gourmet cooking
Home page: http://marc.merlins.org/