linux-ext4.vger.kernel.org archive mirror
From: Theodore Ts'o <tytso-3s7WtUTddSA@public.gmane.org>
To: Nix <nix-dKoSMcxRz+Te9xe1eoZjHA@public.gmane.org>
Cc: Eric Sandeen <sandeen-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
	linux-ext4-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	"J. Bruce Fields"
	<bfields-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>,
	Bryan Schumaker
	<bjschuma-HgOvQuBEEgTQT0dZR+AlfA@public.gmane.org>,
	Peng Tao <bergwolf-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	Trond.Myklebust-HgOvQuBEEgTQT0dZR+AlfA@public.gmane.org,
	gregkh-hQyY1W1yCW8ekmWlsbkhG0B+6BGkLq7r@public.gmane.org,
	linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: Apparent serious progressive ext4 data corruption bug in 3.6.3 (and other stable branches?)
Date: Fri, 26 Oct 2012 17:15:42 -0400
Message-ID: <20121026211542.GE8614@thunk.org>
In-Reply-To: <87objpx84k.fsf-AdTWujXS48Mg67Zj9sPl2A@public.gmane.org>

> This isn't the first time that journal_checksum has proven problematic.
> It's a shame that we're stuck between two error-inducing stools here...

The problem is that it currently bails out by aborting the entire
journal replay, and the file system will get left in a mess when it
does that.  It's actually safer today to just be blissfully ignorant
of a corrupted block in the journal, than to have the journal getting
aborted mid-replay when we detect a corrupted commit.

The plan is that eventually, we will have checksums on a
per-journalled block basis, instead of a per-commit basis, and when we
get a failed checksum, we skip the replay of that block, but we keep
going and replay all of the other blocks and commits.  We'll then set
the "file system corrupted" bit and force an e2fsck check.
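
As a rough illustration of that plan, here is a toy sketch in Python.
It is not jbd2 code: the JournalBlock type, the make_block() and
replay() helpers, and the dict standing in for the disk are all
hypothetical, and zlib.crc32 is only a stand-in for whatever checksum
the real implementation ends up using.  It shows a block with a bad
checksum being skipped while later blocks and commits are still
replayed, and a flag being returned to request a forced e2fsck.

```python
import zlib
from dataclasses import dataclass


@dataclass
class JournalBlock:
    target: int    # file system block number this journalled copy belongs to
    data: bytes    # journalled contents
    checksum: int  # checksum recorded when the block was written to the journal


def make_block(target, data):
    """Build a journal block with a matching per-block checksum."""
    return JournalBlock(target, data, zlib.crc32(data))


def replay(commits, disk):
    """Replay all commits; skip any block whose checksum fails, but keep going.

    Returns True if at least one block was skipped, i.e. the file system
    should be marked as corrupted and checked.
    """
    needs_fsck = False
    for commit in commits:
        for blk in commit:
            if zlib.crc32(blk.data) != blk.checksum:
                needs_fsck = True  # remember the damage, do not abort replay
                continue
            disk[blk.target] = blk.data
    return needs_fsck


# Two commits; one block in the first commit is corrupted in the journal.
disk = {}
good = make_block(10, b"inode table")
bad = make_block(11, b"bitmap")
bad.data = b"bitmaq"  # simulate corruption after the checksum was recorded
later = make_block(12, b"directory")

needs_fsck = replay([[good, bad], [later]], disk)
```

Here blocks 10 and 12 reach the disk, block 11 is left alone, and
needs_fsck comes back true, in contrast to a per-commit checksum that
would stop the whole replay at the first commit.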

The problem is this code isn't done yet, and journal_checksum is
really not ready for prime time.  When it is ready, my plan is to wire
it up so it is enabled by default; at the moment, it was intended for
developer experimentation only.  As I said, it's my fault for not
clearly labelling it "Not for you!", or putting it under an #ifdef to
prevent unwary civilians from coming across the feature and saying,
"oooh, shiny!" and turning it on.  :-(

					- Ted

