All of lore.kernel.org
 help / color / mirror / Atom feed
From: Theodore Ts'o <tytso@mit.edu>
To: Sander Smeenk <ssmeenk@freshdot.net>,
	Nathaniel W Filardo <nwf@cs.jhu.edu>
Cc: linux-ext4@vger.kernel.org
Subject: Re: ext4 metadata corruption bug?
Date: Wed, 23 Apr 2014 10:36:42 -0400	[thread overview]
Message-ID: <20140423143642.GA29925@thunk.org> (raw)
In-Reply-To: <20140423072311.GD10163@dot.freshdot.net>

OK, with the two of you reporting this problem, can you do me the
following so we can try to seriously dig into this:

First, of all, can you go through your log files and find me as many
instances of these two pairs of ext4 error messges:

EXT4-fs (vdd): pa ffff88000dea9b90: logic 0, phys. 1934464544, len 32
EXT4-fs error (device vdd): ext4_mb_release_inode_pa:3729: group 59035, free 14, pa_free 12

I want to see if there's any pattern in the physical block number (in
the two samples I have, they are always fairly large numbers), and in
the difference between the free and pa_free numbers.  

Secondly, can you send me the output of dumpe2fs -h for the file
systems in question.

Finally, since the both of you are seeing these messages fairly
frequently, would you be willing to run with a patched kernel?
Specifically, can you add a WARN_ON(1) to fs/ext4/mballoc.c here:

	if (free != pa->pa_free) {
		ext4_msg(e4b->bd_sb, KERN_CRIT,
			 "pa %p: logic %lu, phys. %lu, len %lu",
			 pa, (unsigned long) pa->pa_lstart,
			 (unsigned long) pa->pa_pstart,
			 (unsigned long) pa->pa_len);
		ext4_grp_locked_error(sb, group, 0, 0, "free %u, pa_free %u",
					free, pa->pa_free);
		WARN_ON(1); <---------------- add this line			
		/*
		 * pa is already deleted so we use the value obtained
		 * from the bitmap and continue.
		 */
	}

Then when it triggers, can you send me the stack trace that will be
triggered by the WARN_ON.

The two really interesting commonalities which I've seen so far is:

1)  You are both using virtualization via qemu/kvm

2)  You are both using file systems > 8TB.

Yes?  And Sander, you're not using a remote block device, correct?
You're using a local disk to back the large fileystem on the host OS
side?

Cheers,

					- Ted

  reply	other threads:[~2014-04-23 14:36 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20140409223820.GU10985@gradx.cs.jhu.edu>
     [not found] ` <CAGagf4eEzY4+3cfNWSEENTo1PKe40nq1Ne6ZzOLGm-O78W7RcA@mail.gmail.com>
2014-04-10  5:04   ` ext4 metadata corruption bug? Nathaniel W Filardo
2014-04-10 14:03     ` Theodore Ts'o
2014-04-10 16:33       ` Nathaniel W Filardo
2014-04-10 22:17         ` Theodore Ts'o
2014-04-20 16:32           ` Nathaniel W Filardo
2014-04-20 17:57             ` Theodore Ts'o
2014-04-23  7:23             ` Sander Smeenk
2014-04-23 14:36               ` Theodore Ts'o [this message]
2014-04-23 15:30                 ` Nathaniel W Filardo
2014-04-23 18:05                 ` Sander Smeenk
2014-04-29 15:22                 ` Nathaniel W Filardo
2014-05-01 16:25                 ` Nathaniel W Filardo
2014-05-06 15:42                   ` Theodore Ts'o
2014-05-06 15:51                     ` Nathaniel W Filardo
2014-07-31  2:37                       ` Theodore Ts'o
2014-08-06  8:53                         ` Sander Smeenk
2014-05-01 17:02                 ` Sander Smeenk
2014-05-06 14:22                   ` Sander Smeenk
2014-05-26 14:59                     ` Sander Smeenk

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140423143642.GA29925@thunk.org \
    --to=tytso@mit.edu \
    --cc=linux-ext4@vger.kernel.org \
    --cc=nwf@cs.jhu.edu \
    --cc=ssmeenk@freshdot.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.