linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Duncan <1i5t5.duncan@cox.net>
To: linux-btrfs@vger.kernel.org
Subject: Re: Out of space on small-ish partition, clobber and other methods haven't worked
Date: Thu, 21 Jan 2016 10:41:25 +0000 (UTC)	[thread overview]
Message-ID: <pan$a0430$c59023ae$8ef223cd$7c2025cc@cox.net> (raw)
In-Reply-To: CAJCQCtSyc1YAsVgNcPCeA5fW8cofwqSVuF69acZo+GYm2rUx4A@mail.gmail.com

Chris Murphy posted on Wed, 20 Jan 2016 19:28:35 -0700 as excerpted:

> On Wed, Jan 20, 2016 at 2:22 PM, Jerry Steinhauer
> <jerry.steinhauer@singlewire.com> wrote:
> 
>> % rm a.file
>> rm: cannot remove 'a.file': No space left on device
>> % cat /dev/null > a.file
>> -sh: a.file: No space left on device

>> % btrfs fi df /data
>> System, single: total=32.00MiB, used=4.00KiB
>> Data+Metadata, single: total=506.00MiB, used=500.39MiB
>> GlobalReserve, single: total=12.00MiB, used=6.45MiB
> 
> 
> I see somewhere between 6MiB and 12MiB that should be available for file
> removal.

I don't.  See that global reserve?  6.45 MiB into its emergency reserve, 
so effectively -6.45 MiB of space available for file removal.

First of all, any time global reserve is used at all the filesystem is in 
very dire straits, and he's 6.45 MiB into the 12.00 MiB global reserve, 
so that alone tells us "we're not in Kansas any more!" =8^0

Second, the btrfs fi show (which you didn't quote) says 540 MiB capacity.

System		 32 MiB total, can't be used for anything else
Data+Metadata	506 MiB total, shared data/metadata as it's a small 
filesystem  (See why I didn't list global reserve here, below.)


Total		538 MiB chunked out.  While that's 2 MiB from the 
reported 540 capacity, I don't believe system includes the reserved space 
(for boot loader, etc) at the beginning of the partition.  Between that 
and the limits of the chunk-allocator, he's likely all chunked-out, no 
possibility of allocating further chunks.

Global reserve is normally reserved from metadata, which of course is 
shared data/metadata here, due to the size of the filesystem (which makes 
shared a practical necessity, the problems would be much worse if data 
and metadata chunks were separate!).

So of the 506 MiB in data/metadata, 12 MiB are global reserve.  Which 
means there's only 494 MiB of normal data/metadata space, plus 12 MiB of 
global reserve.

But the DF shows 500.39 MiB of data/metadata used, which means we're 
roughly 6.4 MiB past normal data/metadata usage into the emergency use 
only global reserve, which is indeed (roughly) what global reserve shows, 
6.45 MiB used.

So as I said, that btrfs is in pretty severely dire straits!  Not only is 
all the available data/metadata space used, but we're well past half way 
into the emergency global reserve as well.  No WONDER there's no space 
left even to delete a file (which because btrfs is COW, copy-on-write, 
requires metadata space even to delete a file, as the metadata block 
containing the original data cannot be rewritten in place and must be 
written elsewhere... thus answering the question of why btrfs needs space 
even on the unlink).


As for solutions, there's still a couple things (plus one already tried) 
to try to get out of the situation:

0) Try clobbering the file, reducing it to zero size, but you did and 
that didn't work.  It might have if the btrfs wasn't already so far into 
global reserve.

1) As CMurphy says (with two Chris Ms on the list that isn't clear, so 
CMurphy it is), try a later kernel, either 4.1.x or 4.4.  AFAIK there 
were a few patches having to do with ENOSPC errors and allowing file 
deletes to take from global reserve, as the result should be more room 
afterward and that's exactly the sort of thing global reserve is supposed 
to be there for.  Tho it's just a try, no guarantees.

2) This could be difficult on embedded, but the other option is 
temporarily adding a second device (btrfs device add), to give the 
filesystem a bit of work with.  That takes space as well, but luckily, I 
believe it's system-chunk space, and there's plenty of room there, so it 
should be possible.

The idea is to get enough metadata space to work with to get out of the 
fix by deleting a file or the like (normally, a balance could help as 
well, but that primarily helps to reclaim empty chunks from say data, so 
they can be reassigned to metadata, and since this is shared data/
metadata, that's unlikely to help).

Then when the filesystem is back to usable and enough has been deleted so 
what's on the temporary second device will fit back on the first device 
again, btrfs device delete the second one.

I'm unfamiliar with how small an added device can be and still be useful 
at that level, or more precisely, how the system chunk shrinks with total 
device size, but the one small data point I have here is a 256 MiB /boot, 
which has a 16 MiB system chunk, so I'm guessing it should shrink at 
least that far.

So let's say 16 MiB system, and it's into global reserve by ~6.5 MiB, so 
we want to give it at least that much more, plus something to work with.

So I'd suggest a 24 MiB or if it's available, 32 MiB, second device, at 
minimum.  Smaller can be tried, with the hope that the system chunk 
shrinks to say 8 MiB or smaller if the device is small enough, but I'm 
not sure it will.

As for actually making available a device on embedded, if there's no USB 
port available and thus the "simple" solution of plugging in a thumb 
drive is out of the question... maybe there's enough memory to create a 
tmpfs and do a loopback file on it, then add that loopback file as the 
temporary second device.  Of course if the power dies or the system 
otherwise crashes when part of the filesystem's on that tmpfs... not good 
news.  And obviously in that case it /better/ be temporary, because you 
can't reboot without losing that tmpfs and with it the loopback.  But if 
there's no other way to get access to a suitable device and the system 
and power is stable enough...


So that answers the what to do to exit that state question, and in a 
parenthetical  I answered the question of why it's requiring space to 
unlink -- btrfs is cow, copy-on-write, so even unlinking a file requires 
space to copy the metadata block containing the information about that 
file for the write.  And it makes the third question moot, as we have the 
root cause already -- the cow nature of btrfs.


Meanwhile, one more thing to address.  Despite what various distros may 
claim, here on this list, btrfs is considered "stablizING, but not yet 
fully stable or mature."  Production usage, particularly without backups, 
isn't recommended, occasional bugs can be expected, and the standard 
recommendation is using no older than the last two of either current 
kernels or LTS kernels.   With the just released 4.4 being an LTS kernel, 
that makes 4.1 the previous one back and the oldest recommended kernel, 
tho with 4.4 being so new, still being on the LTS before that, 3.18, 
would still be somewhat acceptable if you're already working on updating 
to 4.1.  But before that, while we'll try to support as best we can, 
chances are very good among the first requests is going to be to update 
to something not so ancient.

Under those conditions, honestly, it may be that btrfs isn't yet stable 
enough to be the right choice, particularly for embedded projects that 
are supposed to be field-usable without backups and without available 
technical maintenance for some years.  As I said, we're stabilizing, and 
actually, I'm not sure about the devs (I'm a list regular and btrfs user, 
not a dev) and other list regulars, but it may be that with LTS 4.4 we'll 
extend the informal support scope to three LTS series and thus support 
3.18 awhile longer, but btrfs is definitely not where /I'd/ recommend 
using btrfs on designed to be field usable without ready backups or 
direct tech supervision embedded, just yet.

OTOH, if it's embedded but with backups and direct tech supervision, then 
btrfs may be just fine, if you're willing to put up with the occasional 
bug and accept that you must be prepared to actually have to use those 
backups, should one of those occasional bugs require it, and if keeping 
generally to the last two LTS (or current) kernel series is acceptable.

-- 
Duncan - List replies preferred.   No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master."  Richard Stallman


  reply	other threads:[~2016-01-21 10:41 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-01-20 21:22 Out of space on small-ish partition, clobber and other methods haven't worked Jerry Steinhauer
2016-01-21  2:28 ` Chris Murphy
2016-01-21 10:41   ` Duncan [this message]
2016-01-21 17:40     ` Chris Murphy
2016-01-22 12:08       ` Duncan
2016-01-22 12:11     ` Jerry Steinhauer
     [not found]   ` <CAHRikPHt7SmFhzQsZ-XKLYSbwCAgCeccEFXbw+YXBobJx8w1Ew@mail.gmail.com>
2016-01-23 15:23     ` Jerry Steinhauer

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='pan$a0430$c59023ae$8ef223cd$7c2025cc@cox.net' \
    --to=1i5t5.duncan@cox.net \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).