From: Martin Steigerwald <Martin@lichtvoll.de>
To: Chris Mason <clm@fb.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: BTRFS hang with 3.16-rc5
Date: Sat, 19 Jul 2014 21:00:32 +0200 [thread overview]
Message-ID: <1780280.HkjhvDE2xN@merkaba> (raw)
In-Reply-To: <53CABB77.5090008@fb.com>
Am Samstag, 19. Juli 2014, 14:39:51 schrieb Chris Mason:
> On 07/19/2014 01:59 PM, Martin Steigerwald wrote:
> > Am Freitag, 18. Juli 2014, 09:36:06 schrieb Chris Mason:
> >> On 07/18/2014 03:51 AM, Martin Steigerwald wrote:
> >>> Am Dienstag, 15. Juli 2014, 09:21:40 schrieb Chris Mason:
> >>>> On 07/14/2014 05:58 PM, Martin Steigerwald wrote:
> >>>>> Am Montag, 14. Juli 2014, 16:12:22 schrieb Chris Mason:
> >>>>>> On 07/14/2014 11:10 AM, Martin Steigerwald wrote:
> >>>>>>> Am Montag, 14. Juli 2014, 17:04:22 schrieben Sie:
> >>>>>>>> Hi!
> >>>>>>>>
> >>>>>>>> While with 3.16-rc3 and rc4 I didn´t have a BTRFS hang in several
> >>>>>>>> days
> >>>>>>>> of
> >>>>>>>> usage, with 3-16-rc5 I had a hang again. Less than a hour since
> >>>>>>>> booting
> >>>>>>>> it.
> >>>>>>>>
> >>>>>>>> Since the hang bug I and others had with 3.15 and upto 3.16-rc2
> >>>>>>>> usually
> >>>>>>>> didn´t happen that quickly after boot and since backtrace looks a
> >>>>>>>> bit
> >>>>>>>> different from what I have in memory, I post this in a new thread.
> >>>>>>>> See thread "Blocked tasks on 3.15.1" for a discussion of previous
> >>>>>>>> hang
> >>>>>>>> issues.
> >>>>>>>
> >>>>>>> Probably good to add some basic information on the filesystem:
> >>>>>> Do you have compression enabled? I wasn't able to nail down the
> >>>>>> 3.15.1
> >>>>>> hang before vacation attacked me, but I'm hoping to track it down
> >>>>>> today.
> >>>>>
> >>>>> Yes. I have.
> >>>>>
> >>>>> It just hung again while I was playing PlaneShift.
> >>>>>
> >>>>> Back to 3.16-rc4 as rc5 seems to be broke here.
> >>>>
> >>>> The btrfs hang you're hitting goes back to 3.15. So 3.16-rc4 vs rc5
> >>>> shouldn't be a factor. Are you hitting other problems with 3.16?
> >>>
> >>> On this system it is a matter.
> >>>
> >>> 3.16-rc5: Two hangs in one day
> >>>
> >>> 3.16-rc4: No hang so far with three days uptime (well with hibernation
> >>> cycles in between)
> >>>
> >>> So easy observation for me: 3.16-rc4 fine, 3.16-rc5 broke.
> >>
> >> Can you please try this patch on rc5 and look for the printk:
> >>
> >> diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
> >> index 3668048..8ab56df 100644
> >> --- a/fs/btrfs/inode.c
> >> +++ b/fs/btrfs/inode.c
> >> @@ -8157,6 +8157,13 @@ void btrfs_destroy_inode(struct inode *inode)
> >>
> >> spin_unlock(&root->fs_info->ordered_root_lock);
> >>
> >> }
> >>
> >> + spin_lock(&root->fs_info->ordered_root_lock);
> >> + if (!list_empty(&BTRFS_I(inode)->ordered_operations)) {
> >> + list_del_init(&BTRFS_I(inode)->ordered_operations);
> >> +printk(KERN_CRIT "racing inode deletion with ordered
> >> operations!!!!!!!!!!!\n"); + }
> >> + spin_unlock(&root->fs_info->ordered_root_lock);
> >> +
> >>
> >> if (test_bit(BTRFS_INODE_HAS_ORPHAN_ITEM,
> >>
> >> &BTRFS_I(inode)->runtime_flags)) {
> >>
> >> btrfs_info(root->fs_info, "inode %llu still on the orphan
list",
> >
> > Did so and again got a hang.
> >
> > No racing inodes tough:
> >
> > merkaba:/boot> zgrep -i "racing inode" /var/log/syslog*
> > merkaba:/boot#1>
> >
> > Built kernel seems right:
> >
> > martin@merkaba:[…]> LANG=C grep -ir "racing inode" fs/btrfs
> > fs/btrfs/inode.c:printk(KERN_CRIT "racing inode deletion with ordered
> > operations!!!!!!!!!!!\n"); Binary file fs/btrfs/inode.o matches
> > Binary file fs/btrfs/btrfs.o matches
> > Binary file fs/btrfs/btrfs.ko matches
> >
> > Backtrace doesn´t seem to contain any function related to inodes.
> >
> >
> > Back to rc4 again for now.
> >
> >
> > These hangs seemed to occur first at writing several hundred MiB onto a
> > high speed SDHC card… yet, they persisted long after the write was
> > finished, upto to a point where I had to reboot cause machine hung on
> > trying to switch between tty7 (X11) and tty1 (for diagnosis).
>
> Ok, this is definitely the same hang reported on 3.15.1. Thanks for
> giving the patch a try, I've got another long running test going this
> weekend in hopes of triggering it here.
I found make-kpkg (from Debian kernel-package) trigger BTRFS hang quite
reliably with 3.14 and 3.15 at least after some update. Often during running
objcopy commands.
Example call:
make-kpkg -j4 --rootcmd fakeroot --initrd --append-to-version -tp520-btrfs-
delrace --revision 1 linux_image
--
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
prev parent reply other threads:[~2014-07-19 19:00 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-07-14 15:04 BTRFS hang with 3.16-rc5 Martin Steigerwald
2014-07-14 15:10 ` Martin Steigerwald
2014-07-14 17:51 ` Duncan
2014-07-14 22:03 ` Martin Steigerwald
2014-07-15 2:45 ` Duncan
2014-07-14 20:12 ` Chris Mason
2014-07-14 21:58 ` Martin Steigerwald
2014-07-15 13:21 ` Chris Mason
2014-07-15 15:08 ` Martin Steigerwald
2014-07-23 22:47 ` BTRFS hang with 3.16-rc5 (and also with 3.16-rc4) Martin Steigerwald
2014-07-24 14:58 ` Chris Mason
2014-07-24 16:24 ` Martin Steigerwald
2014-07-24 18:49 ` Martin Steigerwald
2014-07-24 20:04 ` Chris Mason
2014-07-28 22:57 ` Martin Steigerwald
2014-07-25 2:32 ` Duncan
2014-07-25 3:06 ` Nick Krause
[not found] ` <20140725080244.GA31950@carfax.org.uk>
2014-07-25 9:13 ` Hugo Mills
2014-07-28 13:20 ` David Sterba
2014-07-25 10:07 ` Martin Steigerwald
2014-07-25 4:51 ` Torbjørn
[not found] ` <20140725092800.GC25859@localhost.localdomain>
2014-07-25 10:22 ` Torbjørn
[not found] ` <53D23AF1.9010704@skagestad.org>
2014-07-25 11:37 ` Torbjørn
2014-07-25 16:14 ` Torbjørn
2014-07-28 10:00 ` Liu Bo
2014-07-28 11:11 ` Torbjørn
2014-07-29 10:18 ` Liu Bo
2014-07-29 15:07 ` Torbjørn
2014-07-30 5:09 ` Liu Bo
2014-07-18 7:51 ` BTRFS hang with 3.16-rc5 Martin Steigerwald
2014-07-18 13:36 ` Chris Mason
2014-07-19 17:59 ` Martin Steigerwald
2014-07-19 18:39 ` Chris Mason
2014-07-19 19:00 ` Martin Steigerwald [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1780280.HkjhvDE2xN@merkaba \
--to=martin@lichtvoll.de \
--cc=clm@fb.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).