All of lore.kernel.org
 help / color / mirror / Atom feed
From: Marc MERLIN <marc@merlins.org>
To: Hugo Mills <hugo@carfax.org.uk>, linux-btrfs@vger.kernel.org
Subject: Re: 4.11.3: BTRFS critical (device dm-1): unable to add free space :-17 => btrfs check --repair runs clean
Date: Tue, 20 Jun 2017 16:12:03 -0700	[thread overview]
Message-ID: <20170620231202.GB5303@merlins.org> (raw)
In-Reply-To: <20170620154429.GC22987@merlins.org>

[-- Attachment #1: Type: text/plain, Size: 3513 bytes --]

On Tue, Jun 20, 2017 at 08:44:29AM -0700, Marc MERLIN wrote:
> On Tue, Jun 20, 2017 at 03:36:01PM +0000, Hugo Mills wrote:
> > > Thanks for having a look. Is it a bug, or is it a problem with my storage
> > > subsystem?
> > 
> >    Well, I'd say it's probably a problem with some inconsistent data
> > on the disk. How that data got there is another matter -- it may be
> > due to a bug which wrote the inconsistent data some time ago, and has
> > only now been found out.
>  
> Understood.
> 
> > > "space cache will be invalidated " => doesn't that mean that my cache was
> > > already cleared by check --repair, or are you saying I need to clear it
> > > again?
> > 
> >    I'm never quite sure about that one. :)
> > 
> >    It can't hurt to clear it manually as well.
> 
> Sounds good, done.
 
Except it didn't help :(
It worked for a while, and failed again.

It looks like I'm hitting a persistent bug :(

[   86.383988] BTRFS: device label dshelf2 devid 1 transid 37975 /dev/mapper/dshelf2
[   98.232529] BTRFS info (device dm-1): use lzo compression
[   98.251982] BTRFS info (device dm-1): disk space caching is enabled
[   98.274847] BTRFS info (device dm-1): has skinny extents
[  104.171597] BTRFS info (device dm-1): detected SSD devices, enabling SSD mode
[  165.429894] BTRFS error (device dm-1): Duplicate entries in free space cache, dumping
[  165.455673] BTRFS warning (device dm-1): failed to load free space cache for block group 2039601954816, rebuilding it now
[  234.221435] BTRFS warning (device dm-1): block group 2837392130048 has wrong amount of free space
[  234.249264] BTRFS warning (device dm-1): failed to load free space cache for block group 2837392130048, rebuilding it now
[  234.636396] BTRFS warning (device dm-1): block group 2885173641216 has wrong amount of free space
[  234.664015] BTRFS warning (device dm-1): failed to load free space cache for block group 2885173641216, rebuilding it now
[  242.042940] BTRFS warning (device dm-1): block group 3116565004288 has wrong amount of free space
[  242.071207] BTRFS warning (device dm-1): failed to load free space cache for block group 3116565004288, rebuilding it now
[  273.910918] BTRFS warning (device dm-1): block group 3209980542976 has wrong amount of free space
[  273.937625] BTRFS warning (device dm-1): failed to load free space cache for block group 3209980542976, rebuilding it now
[  298.578615] BTRFS warning (device dm-1): block group 2305889927168 has wrong amount of free space
[  298.605250] BTRFS warning (device dm-1): failed to load free space cache for block group 2305889927168, rebuilding it now
[  873.265687] BTRFS: Transaction aborted (error -17)
[  873.948245] BTRFS: error (device dm-1) in btrfs_run_delayed_refs:2961: errno=-17 Object already exists
[  873.978884] BTRFS info (device dm-1): forced readonly

Given that check --repair ran clean when I ran it yesterday after this first happened,
and I then ran  mount -o clear_cache , the cache got rebuilt, and I got the problem again, 
this is not looking good, seems like a persistent bug :-/

I'm now going to remount this with nospace_cache to see if your guess about
space_cache was correct.
Other suggestions also welcome :)

Marc
-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.
Microsoft is to operating systems ....
                                      .... what McDonalds is to gourmet cooking
Home page: http://marc.merlins.org/  

[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 291 bytes --]

  reply	other threads:[~2017-06-20 23:12 UTC|newest]

Thread overview: 77+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-06-20 14:39 4.11.3: BTRFS critical (device dm-1): unable to add free space :-17 => btrfs check --repair runs clean Marc MERLIN
2017-06-20 15:23 ` Hugo Mills
2017-06-20 15:26   ` Marc MERLIN
2017-06-20 15:36     ` Hugo Mills
2017-06-20 15:44       ` Marc MERLIN
2017-06-20 23:12         ` Marc MERLIN [this message]
2017-06-20 23:58           ` Marc MERLIN
2017-06-21  3:31           ` Chris Murphy
2017-06-21  3:43             ` Marc MERLIN
2017-06-21 15:13               ` How to fix errors that check --mode lomem finds, but --mode normal doesn't? Marc MERLIN
2017-06-21 23:22                 ` Chris Murphy
2017-06-22  0:48                   ` Marc MERLIN
2017-06-22  2:22                 ` Qu Wenruo
2017-06-22  2:53                   ` Marc MERLIN
2017-06-22  4:08                     ` Qu Wenruo
2017-06-23  4:06                       ` Marc MERLIN
2017-06-23  8:54                         ` Lu Fengqi
2017-06-23 16:17                           ` Marc MERLIN
2017-06-24  2:34                             ` Marc MERLIN
2017-06-26 10:46                               ` Lu Fengqi
2017-06-27 23:11                                 ` Marc MERLIN
2017-06-28  7:10                                   ` Lu Fengqi
2017-05-01 17:06                                     ` 4.11 relocate crash, null pointer Marc MERLIN
2017-05-01 18:08                                       ` 4.11 relocate crash, null pointer + rolling back a filesystem by X hours? Marc MERLIN
2017-05-02  1:50                                         ` Chris Murphy
2017-05-02  3:23                                           ` Marc MERLIN
2017-05-02  4:56                                             ` Chris Murphy
2017-05-02  5:11                                               ` Marc MERLIN
2017-05-02 18:47                                                 ` btrfs check --repair: failed to repair damaged filesystem, aborting Marc MERLIN
2017-05-03  6:00                                                   ` Marc MERLIN
2017-05-03  6:17                                                     ` Marc MERLIN
2017-05-03  6:32                                                       ` Roman Mamedov
2017-05-03 20:40                                                         ` Marc MERLIN
2017-07-07  5:37                                                 ` ctree.c:197: update_ref_for_cow: BUG_ON `ret` triggered, value -5 Marc MERLIN
2017-07-07  5:39                                                   ` Marc MERLIN
2017-07-07  9:33                                                     ` Lu Fengqi
2017-07-07 16:38                                                       ` Marc MERLIN
2017-07-09  4:34                                                         ` 4.11.6 / more corruption / root 15455 has a root item with a more recent gen (33682) compared to the found root node (0) Marc MERLIN
2017-07-09  5:05                                                           ` We really need a better/working btrfs check --repair Marc MERLIN
2017-07-09  6:34                                                           ` 4.11.6 / more corruption / root 15455 has a root item with a more recent gen (33682) compared to the found root node (0) Marc MERLIN
2017-07-09  7:57                                                           ` Martin Steigerwald
2017-07-09  9:16                                                             ` Paul Jones
2017-07-09 11:17                                                               ` Duncan
2017-07-09 13:00                                                                 ` Martin Steigerwald
2017-07-29 19:29                                                                 ` Imran Geriskovan
2017-07-29 23:38                                                                   ` Duncan
2017-07-30 14:54                                                                     ` Imran Geriskovan
2017-07-31  4:53                                                                       ` Duncan
2017-07-31 20:32                                                                         ` Imran Geriskovan
2017-08-01  1:36                                                                           ` Duncan
2017-08-01 15:18                                                                             ` Imran Geriskovan
2017-07-31 21:07                                                             ` Ivan Sizov
2017-07-31 21:17                                                               ` Marc MERLIN
2017-07-31 21:39                                                                 ` Ivan Sizov
2017-08-01 16:41                                                                   ` Ivan Sizov
2017-07-31 22:00                                                                 ` Justin Maggard
2017-08-01  6:38                                                                   ` Marc MERLIN
2017-05-02 19:59                                               ` 4.11 relocate crash, null pointer + rolling back a filesystem by X hours? Kai Krakow
2017-05-02  5:01                                             ` Duncan
2017-05-02 19:53                                               ` Kai Krakow
2017-05-23 16:58                                               ` Marc MERLIN
2017-05-24 10:16                                                 ` Duncan
2017-05-05  1:19                                             ` Qu Wenruo
2017-05-05  2:10                                               ` Qu Wenruo
2017-05-05  2:40                                               ` Marc MERLIN
2017-05-05  5:03                                                 ` Qu Wenruo
2017-05-05 15:43                                                   ` Marc MERLIN
2017-05-17 18:23                                                     ` Kai Krakow
2017-05-05  1:13                                         ` Qu Wenruo
2017-06-28 14:43                                     ` How to fix errors that check --mode lomem finds, but --mode normal doesn't? Marc MERLIN
2017-06-29 13:36                                       ` Lu Fengqi
2017-06-29 15:30                                         ` Marc MERLIN
2017-06-30 14:59                                           ` Lu Fengqi
2017-06-22  4:08                     ` Qu Wenruo
2017-06-21 12:04           ` 4.11.3: BTRFS critical (device dm-1): unable to add free space :-17 => btrfs check --repair runs clean Duncan
2017-06-21  3:26         ` Chris Murphy
2017-06-21  4:06           ` Marc MERLIN

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20170620231202.GB5303@merlins.org \
    --to=marc@merlins.org \
    --cc=hugo@carfax.org.uk \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.