From: Kai Krakow
To: linux-btrfs@vger.kernel.org
Subject: Re: hard freezes with 3.9.0 during io-intensive loads
Date: Thu, 16 May 2013 09:19:38 +0200

Jan Schmidt wrote:

>>>> Apparently, it's not fixed. The system does not freeze now, but it
>>>> threw multiple backtraces right in front of my Xorg session. The
>>>> backtraces look a little bit different now. Here's what I got:
>>>>
>>>> https://gist.github.com/kakra/8a340f006d01e146865d
>>>>
>>>> This occurred while running "bedup dedup --defrag --size-cutoff
>>>> $((1024*1024))", which was dedup'ing my backup volume with daily
>>>> snapshots filled by "rsync --inplace" - so I suppose some file
>>>> contents are pretty scattered.
>>>
>>> At least that looks different now. I'm not certain about all the
>>> fixes in btrfs-next. Can you give btrfs-next a try and bisect if it
>>> turns out good? That would be really helpful.
>>
>> I'd prefer not to bisect my production system kernel... That would
>> probably take ages, as running the "reproducible test" takes about
>> 30-60 minutes before the problem hits my system. At least unless you
>> have a suggestion for how to speed up the process... ;-)
>
> I see, I had hoped it would be something quicker.
>
>> I saw the pull request with those fixes, so I suspect they didn't go
>> into 3.9.1 but will rather go into 3.9.2?
>
> Probably. However, those patches obviously weren't enough to solve
> your problem. We don't submit a lot of things to stable, so they are
> likely to remain the only btrfs-related changes in there, which means
> they are unlikely to help with your problem.

I turned off autodefrag [1], which fixed these problems. So even
without bisecting I can at least say that the problem is probably
somewhere in the new snapshot-aware defragmentation code that came
with 3.9.0, or in changes related to its introduction. 3.9.2 still
does not fix anything. I'll go with autodefrag off for the moment
until I hear some news in that regard.

With this new information, is it still helpful to get a metadata
image from me [2]? The problem should be reproducible if you enable
autodefrag or defragment CoW'ed files [3].

Regards,
Kai
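
[1] For reference, a rough sketch of how I disabled it - this assumes
    the option was set via /etc/fstab; the device and mount point are
    just examples:

      # /etc/fstab, before:
      #   /dev/sda2  /home  btrfs  defaults,autodefrag  0 0
      # after:
      #   /dev/sda2  /home  btrfs  defaults             0 0

    I'm not sure a plain "mount -o remount" clears the flag on a 3.9
    kernel, so to be safe an umount/mount cycle (or a reboot) applies
    the change.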
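
[2] If it is, I'd create it roughly like this (untested, from memory;
    /dev/sdb1 and the mount point are just examples, and -s should
    sanitize file names so nothing private leaks into the image):

      umount /mnt/backup
      btrfs-image -c 9 -t 4 -s /dev/sdb1 /tmp/btrfs-metadata.img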
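
[3] A minimal sketch of what I mean, with hypothetical paths: take a
    snapshot so extents become shared, rewrite file contents in place
    so they fragment, then defragment the shared files:

      btrfs subvolume snapshot /mnt/backup /mnt/backup/.snap-test
      rsync --inplace -a /some/changed/data/ /mnt/backup/data/
      # defragment the now CoW-shared files one by one:
      find /mnt/backup/data -type f \
        -exec btrfs filesystem defragment {} \;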