Re: Large multi-device BTRFS array (usually) fails to mount on boot.

linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: "Joshua" <joshua@mailmag.net>
To: "Graham Cobb" <g.btrfs@cobb.uk.net>, linux-btrfs@vger.kernel.org
Subject: Re: Large multi-device BTRFS array (usually) fails to mount on  boot.
Date: Fri, 19 Feb 2021 17:42:10 +0000	[thread overview]
Message-ID: <2cf10687560a2acb0b9445dfe570867b@mailmag.net> (raw)
In-Reply-To: <8ae61a4f-8a7f-332f-a6b6-4ad808cc88c8@cobb.uk.net>

February 3, 2021 3:16 PM, "Graham Cobb" <g.btrfs@cobb.uk.net> wrote:

> On 03/02/2021 21:54, joshua@mailmag.net wrote:
> 
>> Good Evening.
>> 
>> I have a large BTRFS array, (14 Drives, ~100 TB RAW) which has been having problems mounting on
>> boot without timing out. This causes the system to drop to emergency mode. I am then able to mount
>> the array in emergency mode and all data appears fine, but upon reboot it fails again.
>> 
>> I actually first had this problem around a year ago, and initially put considerable effort into
>> extending the timeout in systemd, as I believed that to be the problem. However, all the methods I
>> attempted did not work properly or caused the system to continue booting before the array was
>> mounted, causing all sorts of issues. Eventually, I was able to almost completely resolve it by
>> defragmenting the extent tree and subvolume tree for each subvolume. (btrfs fi defrag
>> /mountpoint/subvolume/) This seemed to reduce the time required to mount, and made it mount on boot
>> the majority of the time.
> 
> Not what you asked, but adding "x-systemd.mount-timeout=180s" to the
> mount options in /etc/fstab works reliably for me to extend the timeout.
> Of course, my largest filesystem is only 20TB, across only two devices
> (two lvm-over-LUKS, each on separate physical drives) but it has very
> heavy use of snapshot creation and deletion. I also run with commit=15
> as power is not too reliable here and losing power is the most frequent
> cause of a reboot.

Thanks for the suggestion, but I have not been able to get this method to work either.

Here's what my fstab looks like, let me know if this is not what you meant!

UUID={snip} /         ext4  errors=remount-ro 0 0
UUID={snip} /mnt/data btrfs defaults,noatime,compress-force=zstd:2,x-systemd.mount-timeout=300s 0 0

However, the system still fails to mount in less than 5 minutes, and drops to emergency mode.
Upon checking dmesg logs, it is clear the system is only wait 120 seconds, before giving up on mounting, and dropping to emergency mode.

--Joshua

next prev parent reply	other threads:[~2021-02-19 17:43 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-02-03 21:54 Large multi-device BTRFS array (usually) fails to mount on boot joshua
2021-02-03 23:08 ` Graham Cobb
2021-02-04  0:56 ` Qu Wenruo
2021-02-06  5:00 ` Joshua
2021-02-19 17:42 ` Joshua [this message]
2021-02-19 22:45   ` Graham Cobb
2021-02-19 23:56   ` Joshua

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=2cf10687560a2acb0b9445dfe570867b@mailmag.net \
    --to=joshua@mailmag.net \
    --cc=g.btrfs@cobb.uk.net \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).