From: Nix <nix@esperi.org.uk>
To: Michael Lyle <mlyle@lyle.org>
Cc: Alexandr Kuznetsov <progmachine@xenlab.one>,
linux-bcache@vger.kernel.org
Subject: Re: bcache failure hangs something in kernel
Date: Tue, 14 Nov 2017 13:27:03 +0000
Message-ID: <87r2t1xa6w.fsf@esperi.org.uk>
In-Reply-To: <024c8d28-9ddb-09fe-c2b0-8a7d0aed493d@lyle.org> (Michael Lyle's message of "Fri, 13 Oct 2017 01:11:41 -0700")

On 13 Oct 2017, Michael Lyle said:
> On 10/13/2017 12:59 AM, Alexandr Kuznetsov wrote:
>> I thought that lvm is old, mature and safe technology, but here it is
>> stuck, then manually interrupted, and the result is catastrophic data corruption.
>> lvm sits on top of that sandwich of block devices, on layer of
>> /dev/bcache* devices. Another question here is how crazy lvm could
>> damage data outside of /dev/bcache* devices? This means that some
>> necessary io buffer range checks are missing inside bcache.
>
> I don't know what commands you ran. I've never seen/heard of a bcache
> superblock corrupted, and I believe the mappings/shrink are appropriate.

I have also had corruption on a writethrough bcache (atop RAID-6, with
LVM PVs within it) causing (rootfs) mount failure: bucket corruption,
IIRC. Every time I rebooted I got warnings that bcache couldn't clean up
in time, and I suspect this caused corruption in the end (fairly fast,
actually: less than a month after I started using bcache, when it had
only just finished populating).

The thing is in "none" cache mode at the moment, waiting for me to revamp my
shutdown process to rotate the initramfs into place at shutdown so I can
unmount the rootfs and stop the bcache, in the hope that that might give
it a chance to shut down neatly. (Even so, finding that dirty shutdown
can corrupt the bcache is unpleasant. I guess nobody does powerfail
tests? How do most people shut down their bcache-on-rootfs systems?)
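
For reference, a minimal sketch (not part of the original message) of
checking a bcache device's mode and state from a shutdown script. It
assumes the device is named bcache0 and relies on the sysfs files
cache_mode and state described in the kernel's bcache documentation,
where cache_mode lists every mode and marks the active one in square
brackets. The function names are hypothetical.

```python
import re
from pathlib import Path

# Assumption: the backing device shows up as bcache0; adjust as needed.
BCACHE_DIR = Path("/sys/block/bcache0/bcache")


def active_mode(cache_mode_text: str) -> str:
    """Return the bracketed (currently selected) entry of a cache_mode listing."""
    match = re.search(r"\[(\w+)\]", cache_mode_text)
    if match is None:
        raise ValueError(f"no active mode marked in {cache_mode_text!r}")
    return match.group(1)


def read_status(bcache_dir: Path = BCACHE_DIR) -> dict:
    """Read the active cache mode and the device state from sysfs."""
    return {
        "mode": active_mode((bcache_dir / "cache_mode").read_text()),
        "state": (bcache_dir / "state").read_text().strip(),
    }


if __name__ == "__main__":
    if BCACHE_DIR.is_dir():
        print(read_status())
    else:
        print(f"{BCACHE_DIR} not found; is bcache0 registered?")
```

This only reports what the kernel exposes; whether a given mode and
state combination is actually safe to power off in is exactly the open
question here.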
--
NULL && (void)
Thread overview: 13+ messages
2017-10-12 12:49 bcache failure hangs something in kernel Alexandr Kuznetsov
2017-10-12 18:12 ` Michael Lyle
2017-10-13 7:59 ` Alexandr Kuznetsov
2017-10-13 8:11 ` Michael Lyle
2017-10-13 9:10 ` Alexandr Kuznetsov
2017-10-13 9:13 ` Michael Lyle
2017-10-13 10:11 ` Alexandr Kuznetsov
2017-11-14 13:27 ` Nix [this message]
2017-11-14 17:20 ` Michael Lyle
2017-11-14 18:25 ` Nix
2017-11-14 19:03 ` Michael Lyle
2017-11-17 20:13 ` Nix
2017-11-15 8:44 ` Alexandr Kuznetsov