From: M G Berberich <btrfs@oss.m-berberich.de>
To: linux-btrfs@vger.kernel.org
Subject: Btrfs filesystem-fail observations and hints
Date: Tue, 14 Jul 2015 13:47:05 +0200 [thread overview]
Message-ID: <20150714114705.GI23491@forwiss.uni-passau.de> (raw)
Hello,
at the weekend we had a disk-fail in a 5-disk BtrFS-RAID1
setup. Ideally one failing disk in a RAID1 setup should (at least
temporarily) degrade the filesystem and inform root about the
situation, but should let the rest of the system unaffected. That’s
not what happend. Processes accessing the filesystem hung
device-waiting and the filesystem itself “hung” too, producing lots of
BTRFS: lost page write due to I/O error on /dev/sdd
BTRFS: bdev /dev/sdd errs: wr …, rd …, flush 0, corrupt 0, gen 0
messages. Attempts to reboot the system regularly failed. Only after
physically removing the failed (hotplugable) disk from the system, it
was possible to reboot the system somewhat normal.
Afterwards, trying to get the system running again, the following
observation where made:
· “btrfs device delete missing”
There seems to be no straight-forward way to monitor the progress of
the “rebalancing” of the filesystem. It took about 6 hours and while
it was possible to estimate the time of finish by watching “btrfs fi
show” and extrapolating device-usagem, a method to monitor the
progess like “btrfs balance status” would be fine. (“btrfs balance
status” says “No balance found on …”)
· “btrfs fi df”
During “btrfs device delete missing”-rebalance “btrfs fi df” does
not reflect the current state of the filesystem. It says p.e.
Data, RAID1: total=1.46TiB, used=1.46TiB
Data, single: total=8.00MiB, used=0.00B
while actually, depending of the advance of the rebalance, about 0
to 300 GByte have only one copy on the devices. So p.e.
Data, RAID1: total=1.1TiB, used=1.1TiB
Data, single: total=290GiB, used=290GiB
would be better reflecting the state of the system.
MfG
bmg
--
„Des is völlig wurscht, was heut beschlos- | M G Berberich
sen wird: I bin sowieso dagegn!“ | berberic@fmi.uni-passau.de
(SPD-Stadtrat Kurt Schindler; Regensburg) | www.fmi.uni-passau.de/~berberic
reply other threads:[~2015-07-14 11:56 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150714114705.GI23491@forwiss.uni-passau.de \
--to=btrfs@oss.m-berberich.de \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).