From: "Austin S. Hemmelgarn" <ahferroin7@gmail.com>
To: Qu Wenruo <quwenruo.btrfs@gmx.com>,
kreijack@inwind.it, linux-btrfs <linux-btrfs@vger.kernel.org>
Subject: Re: [btrfs-progs] Bug in mkfs.btrfs -r
Date: Fri, 1 Sep 2017 08:05:03 -0400 [thread overview]
Message-ID: <c3b7396a-f3c6-40e8-f85e-32fa1150046c@gmail.com> (raw)
In-Reply-To: <20381200-2f35-2edc-a20b-8852744ed050@gmx.com>
On 2017-09-01 07:49, Qu Wenruo wrote:
>
> On 2017年09月01日 19:28, Austin S. Hemmelgarn wrote:
>> On 2017-08-31 20:13, Qu Wenruo wrote:
>>>
>>> On 2017年09月01日 01:27, Goffredo Baroncelli wrote:
>>>> Hi All,
>>>>
>>>> I found a bug in mkfs.btrfs, when it is used the option '-r'. It
>>>> seems that it is not visible the full disk.
>>>
>>> Despite the new bug you found, -r has several existing bugs.
>> Is this actually a bug though? Every other filesystem creation tool
>> that I know of that offers functionality like this generates the
>> filesystem just large enough to contain the data you want in it, so I
>> would argue that making this use the whole device is actually breaking
>> consistency with other tools, not to mention removing functionality
>> that is useful (even aside from the system image generation use case I
>> mentioned, there are other practical applications (seed 'device'
>> generation comes to mind).
>
> Well, then documentation bug.
>
> And I'm not sure the chunk size is correct or optimized.
> Even for btrfs-convert, which will make data chunks very scattered, we
> still try to make a large chunk to cover scattered data extents.
For a one-shot or read-only filesystem though, a maximally sized chunk
is probably suboptimal. Suppose you use this to generate a base image
for a system in the form of a seed device. This actually ends up being
a pretty easy way to get factory reset functionality. It's also a case
where you want the base image to take up as little space as possible, so
that the end-user usable storage space is as much as possible. In that
case, if your base image doesn't need an exact multiple of 1GB for data
chunks, then using 1GB data chunks is not the best choice for at least
the final data chunk (because the rest of that 1GB gets wasted). A
similar argument applies for metadata.
>
> At least to me, it's not the case for chunk created by -r option.
>
> BTW, seed device is RO anyway, how much or how less spare space we have
> is not a problem at all.
That really depends on how you look at it. Aside from the above
example, there's the rather specific question of why you would not want
to avoid wasting space. The filesystem is read-only, which means that
any 'free space' on that filesystem is completely unusable, can't be
reclaimed for anything else, and in general is just a waste.
>
> So to me, even follow other tools -r, we should follow the normal extent
> allocator behavior to create data/metadata, and then set the device size
> to end of its dev extents.
I don't entirely agree, but I think I've made my point well enough above.
>>>
>>> For example it will create dev extent starting from physical offset
>>> 0, while kernel and mkfs will avoid that range, as 0~1M on each
>>> device is reserved.
>>>
>>> According to the code, -r will modify chunk layout by itself, not the
>>> traditional way kernel is doing.
>>>
>>> I'll fix them (if I'm not a lazybone), before that fix, please don't
>>> use -r option as it's not well maintained or fully tested.
>> FWIW, based on my own testing, filesystems generated with '-r' work
>> just fine as long as you don't try to embed boot code in the FS itself.
>
> It works fine because btrfs extent allocator will try to avoid superblock.
> But it doesn't mean we should put dev extents into 0~1M range.
>
> In fact, there is a deprecated mount option, alloc_start, to set how
> many bytes we should reserve for *each* device.
> And since it's deprecated, we'd better follow the 1M reservation for
> each device.
>
> Anyway, kernel balance and plain mkfs won't create chunk stripe in 0~1M
> range of each device, mkfs -r should also follow it.
Agreed, although the comments I made above about wasted space do still
apply here (albeit to a lesser degree, 1MB is not going to make much of
a difference for most people).
next prev parent reply other threads:[~2017-09-01 12:05 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-08-31 17:27 [btrfs-progs] Bug in mkfs.btrfs -r Goffredo Baroncelli
2017-08-31 18:49 ` Austin S. Hemmelgarn
2017-08-31 20:29 ` Goffredo Baroncelli
2017-09-01 11:49 ` Austin S. Hemmelgarn
2017-09-01 0:13 ` Qu Wenruo
2017-09-01 11:28 ` Austin S. Hemmelgarn
2017-09-01 11:49 ` Qu Wenruo
2017-09-01 12:05 ` Austin S. Hemmelgarn [this message]
2017-09-01 12:19 ` Qu Wenruo
2017-09-01 12:47 ` Austin S. Hemmelgarn
2017-09-01 13:54 ` Qu Wenruo
2017-09-01 14:07 ` Austin S. Hemmelgarn
2017-09-02 4:03 ` Duncan
2017-09-05 3:57 ` Duncan
2017-09-01 11:54 ` Qu Wenruo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c3b7396a-f3c6-40e8-f85e-32fa1150046c@gmail.com \
--to=ahferroin7@gmail.com \
--cc=kreijack@inwind.it \
--cc=linux-btrfs@vger.kernel.org \
--cc=quwenruo.btrfs@gmx.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).