From: Christoph Anton Mitterer <calestyo@scientia.net>
To: Qu Wenruo <quwenruo@cn.fujitsu.com>, linux-btrfs@vger.kernel.org
Subject: Re: corruption: yet another one after deleting a ro snapshot
Date: Mon, 16 Jan 2017 03:56:23 +0100 [thread overview]
Message-ID: <1484535383.7577.53.camel@scientia.net> (raw)
In-Reply-To: <4d7e21ed-a8f6-6c75-4a14-ff43201bd27b@cn.fujitsu.com>
[-- Attachment #1: Type: text/plain, Size: 3086 bytes --]
On Mon, 2017-01-16 at 09:38 +0800, Qu Wenruo wrote:
> So the fs is REALLY corrupted.
*sigh* ... (not as in fuck-I'm-loosing-my-data™ ... but as in *sigh*
another-possibly-deeply-hidden-bug-in-btrfs-that-might-eventually-
cause-data-loss...)
> BTW, lowmem mode seems to have a new false alert when checking the
> block
> group item.
Anything you want to check me there?
> Did you have any "lightweight" method to reproduce the bug?
Na, not at all... as I've said this already happened to me once before,
and in both cases I was cleaning up old ro-snapshots.
At least in the current case the fs was only ever filled via
send/receive (well apart from minor mkdirs or so)... so there shouldn't
have been any "extreme ways" of using it.
I think (but not sure), that this was also the case on the other
occasion that happened to me with a different fs (i.e. I think it was
also a backup 8TB disk).
> For example, on a 1G btrfs fs with moderate operations, for example
> 15min or so, to reproduce the bug?
Well I could try to produce it, but I guess you'd have far better means
to do so.
As I've said I was mostly doing send (with -p) | receive to do
incremental backups... and after a while I was cleaning up the old
snapshots on the backup fs.
Of course the snapshot subvols are pretty huge.. as I've said close to
8TB (7.5 or so)... everything from quite big files (4GB) to very small,
smylinks (no device/sockets/fifos)... perhaps some hardlinks...
Some refcopied files. The whole fs has compression enabled.
> > Shall I rw-mount the fs and do sync and wait and retry? Or is there
> > anything else that you want me to try before in order to get the
> > kernel
> > bug (if any) or btrfs-progs bug nailed down?
>
> Personally speaking, rw mount would help, to verify if it's just a
> bug
> that will disappear after the deletion is done.
Well but than we might loose any chance to further track it down.
And even if it would go away, it would still at least be a bug in terms
of fsck false positive.... if not more (in the sense of... corruptions
may happen if some affect parts of the fs are used while not cleaned up
again).
> But considering the size of your fs, it may not be a good idea as we
> don't have reliable method to recover/rebuild extent tree yet.
So what do you effectively want now?
Wait and try something else?
RW mount and recheck to see whether it goes away with that? (And even
if, should I rather re-create/populate the fs from scratch just to be
sure?
What I can also offer in addition... as mentioned some times
previously, I do have full lists of the reg-files/dirs/symlinks as well
as SHA512 sums of each of the reg-files, as they are expected to be on
the fs respectively the snapshot.
So I can offer to do a full verification pass of these, to see whether
anything is missing or (file)data actually corrupted.
Of course that will take a while, and even if everything verifies, I'm
still not really sure whether I'd trust that fs anymore ;-)
Cheers,
Chris.
[-- Attachment #2: smime.p7s --]
[-- Type: application/x-pkcs7-signature, Size: 5930 bytes --]
next prev parent reply other threads:[~2017-01-16 2:56 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-01-12 1:07 corruption: yet another one after deleting a ro snapshot Christoph Anton Mitterer
2017-01-12 1:13 ` Christoph Anton Mitterer
2017-01-12 1:25 ` Qu Wenruo
2017-01-12 2:28 ` Christoph Anton Mitterer
2017-01-12 2:38 ` Qu Wenruo
2017-01-15 17:04 ` Christoph Anton Mitterer
2017-01-16 1:38 ` Qu Wenruo
2017-01-16 2:56 ` Christoph Anton Mitterer [this message]
2017-01-16 3:16 ` Qu Wenruo
2017-01-16 4:53 ` Christoph Anton Mitterer
2017-01-16 5:47 ` Qu Wenruo
2017-01-16 22:07 ` Christoph Anton Mitterer
2017-01-17 8:53 ` Qu Wenruo
2017-01-17 10:39 ` Christoph Anton Mitterer
2017-01-18 0:41 ` Qu Wenruo
2017-01-18 1:20 ` Christoph Anton Mitterer
-- strict thread matches above, loose matches on Subject: below --
2017-01-12 10:27 Giuseppe Della Bianca
2017-01-16 11:06 Giuseppe Della Bianca
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1484535383.7577.53.camel@scientia.net \
--to=calestyo@scientia.net \
--cc=linux-btrfs@vger.kernel.org \
--cc=quwenruo@cn.fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).