From: Martin <marogge@onlinehome.de>
To: Nix <nix@esperi.org.uk>
Cc: Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	linux-ext4@vger.kernel.org, tytso@mit.edu,
	stable@vger.kernel.org, gregkh@linuxfoundation.org
Subject: Re: Apparent serious progressive ext4 data corruption bug in 3.6.3 (and other stable branches?)
Date: Fri, 26 Oct 2012 22:44:41 +0200
Message-ID: <508AF639.30603@onlinehome.de>
In-Reply-To: <871uglyoap.fsf@spindle.srvr.nix>

On 10/26/2012 10:24 PM, Nix wrote:
> On 26 Oct 2012, Martin spake thusly:
[...]
>> I have studied my corruption problem more closely and can give you a
>> description of what happened below. Would you say this may be the same
>> bug?
>
> No. You want to keep up with the thread. Ted's first educated guess is
> not always guaranteed to be correct (though this is rare).

OK

>
>> Oct 15 19:56:12
>>
>> Computer is booted again in order to copy a few files to memory stick. Unbeknownst to me, the following entries are logged in the
>> system log:
>>
>> Oct 15 20:00:16 harold kernel: EXT4-fs error (device sda5): add_dirent_to_buf:1587: inode #655361: block 2629945: comm mount: bad
>> entry in directory: rec_len % 4 != 0 - offset=360(360), inode=655682, rec_len=18, name_len=5
>> Oct 15 20:00:16 harold kernel: Aborting journal on device sda5-8.
>> Oct 15 20:00:16 harold kernel: EXT4-fs (sda5): Remounting filesystem read-only
>> Oct 15 20:00:16 harold kernel: EXT4-fs error (device sda5) in ext4_evict_inode:238: Journal has aborted
>> Oct 15 20:00:16 harold kernel: EXT4-fs error (device sda5) in ext4_create:2120: IO failure
>
> That's an interesting failure, but looks slightly different to what I
> saw. No bad directory entries, no aborted journals: a replayed journal
> and subsequent corruption. Still damaged though, and after a journal
> abort I'm not surprised you had problems!
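
For readers unfamiliar with the "rec_len % 4 != 0" message in the log quoted above: ext4 directory entries carry a record length that must be a multiple of 4 and large enough to hold an 8-byte header plus the name. The following is only a rough Python sketch of that kind of sanity check, not the kernel's code. The function names, the 4096-byte block size, and every message string other than "rec_len % 4 != 0" are assumptions made for illustration.

```python
from typing import Optional


def dir_rec_len(name_len: int) -> int:
    """Smallest valid record length for a name of this size:
    8-byte fixed header plus the name, rounded up to a multiple of 4."""
    return (name_len + 8 + 3) & ~3


def check_dirent(rec_len: int, name_len: int, offset: int,
                 block_size: int = 4096) -> Optional[str]:
    """Return a description of the first problem found, or None if the
    entry passes these simplified checks. block_size is an assumed value."""
    if rec_len < dir_rec_len(1):
        return "rec_len is smaller than minimal"
    if rec_len % 4 != 0:
        return "rec_len % 4 != 0"
    if rec_len < dir_rec_len(name_len):
        return "rec_len is too small for name_len"
    if offset + rec_len > block_size:
        return "directory entry runs past end of block"
    return None


# Values taken from the logged error: offset=360, rec_len=18, name_len=5
print(check_dirent(rec_len=18, name_len=5, offset=360))
```

With the logged values, the entry fails on the alignment test because 18 is not a multiple of 4. A 5-character name would need a record length of at least 16 under the rounding rule sketched here.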

So my corrupt journal is simply the result of a user turning off the 
machine at a bad point in time? That's scary. In that scenario even the 
option data=journal wouldn't save me from harm, would it?

Funny this happens to someone who has always said that robustness is the 
most important quality of a filesystem (and who thinks data=writeback is 
madness).

>
>>                            I will try to rename them to their
>> proper name on another machine, and restore them on the target
>> machine. However, due to the sheer number this might take forever.
>
> I relearned this week that backups are good.

Backups are good, and always too old.

>
>> Also I am worried the problem might re-surface, as it has neither been
>> identified nor fixed.
>
> I'm seeing it on almost every reboot.

Indeed the symptoms look different.

>
>> NB: kernel was v3.5.5
>
> Hm, this provides possible evidence that the problem does indeed extend
> into 3.5.x.
>
>> with CK1 and BFQ patches, tainted by nvidia module.
>
> It's hard to reason about a kernel that's had *that* massive lump of
> binary junk applied to it, alas. This may or may not be the same
> problem: it has some common features with what I see, but not all.
>

True, I normally reproduce problems with vanilla kernels before
reporting them. In this case I was cleanly sniped, with no chance of
a replay so far.

