linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Theodore Ts'o <tytso@mit.edu>
To: Sander Smeenk <ssmeenk@freshdot.net>,
	Nathaniel W Filardo <nwf@cs.jhu.edu>
Cc: linux-ext4@vger.kernel.org
Subject: Re: ext4 metadata corruption bug?
Date: Wed, 23 Apr 2014 10:36:42 -0400	[thread overview]
Message-ID: <20140423143642.GA29925@thunk.org> (raw)
In-Reply-To: <20140423072311.GD10163@dot.freshdot.net>

OK, with the two of you reporting this problem, can you do me the
following so we can try to seriously dig into this:

First, of all, can you go through your log files and find me as many
instances of these two pairs of ext4 error messges:

EXT4-fs (vdd): pa ffff88000dea9b90: logic 0, phys. 1934464544, len 32
EXT4-fs error (device vdd): ext4_mb_release_inode_pa:3729: group 59035, free 14, pa_free 12

I want to see if there's any pattern in the physical block number (in
the two samples I have, they are always fairly large numbers), and in
the difference between the free and pa_free numbers.  

Secondly, can you send me the output of dumpe2fs -h for the file
systems in question.

Finally, since the both of you are seeing these messages fairly
frequently, would you be willing to run with a patched kernel?
Specifically, can you add a WARN_ON(1) to fs/ext4/mballoc.c here:

	if (free != pa->pa_free) {
		ext4_msg(e4b->bd_sb, KERN_CRIT,
			 "pa %p: logic %lu, phys. %lu, len %lu",
			 pa, (unsigned long) pa->pa_lstart,
			 (unsigned long) pa->pa_pstart,
			 (unsigned long) pa->pa_len);
		ext4_grp_locked_error(sb, group, 0, 0, "free %u, pa_free %u",
					free, pa->pa_free);
		WARN_ON(1); <---------------- add this line			
		/*
		 * pa is already deleted so we use the value obtained
		 * from the bitmap and continue.
		 */
	}

Then when it triggers, can you send me the stack trace that will be
triggered by the WARN_ON.

The two really interesting commonalities which I've seen so far is:

1)  You are both using virtualization via qemu/kvm

2)  You are both using file systems > 8TB.

Yes?  And Sander, you're not using a remote block device, correct?
You're using a local disk to back the large fileystem on the host OS
side?

Cheers,

					- Ted

  reply	other threads:[~2014-04-23 14:36 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20140409223820.GU10985@gradx.cs.jhu.edu>
     [not found] ` <CAGagf4eEzY4+3cfNWSEENTo1PKe40nq1Ne6ZzOLGm-O78W7RcA@mail.gmail.com>
2014-04-10  5:04   ` ext4 metadata corruption bug? Nathaniel W Filardo
2014-04-10 14:03     ` Theodore Ts'o
2014-04-10 16:33       ` Nathaniel W Filardo
2014-04-10 22:17         ` Theodore Ts'o
2014-04-20 16:32           ` Nathaniel W Filardo
2014-04-20 17:57             ` Theodore Ts'o
2014-04-23  7:23             ` Sander Smeenk
2014-04-23 14:36               ` Theodore Ts'o [this message]
2014-04-23 15:30                 ` Nathaniel W Filardo
2014-04-23 18:05                 ` Sander Smeenk
2014-04-29 15:22                 ` Nathaniel W Filardo
2014-05-01 16:25                 ` Nathaniel W Filardo
2014-05-06 15:42                   ` Theodore Ts'o
2014-05-06 15:51                     ` Nathaniel W Filardo
2014-07-31  2:37                       ` Theodore Ts'o
2014-08-06  8:53                         ` Sander Smeenk
2014-05-01 17:02                 ` Sander Smeenk
2014-05-06 14:22                   ` Sander Smeenk
2014-05-26 14:59                     ` Sander Smeenk

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140423143642.GA29925@thunk.org \
    --to=tytso@mit.edu \
    --cc=linux-ext4@vger.kernel.org \
    --cc=nwf@cs.jhu.edu \
    --cc=ssmeenk@freshdot.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).