linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Ric Wheeler <rwheeler@redhat.com>
To: Andreas Dilger <adilger@sun.com>
Cc: nicholas.dokos@hp.com, linux-fsdevel@vger.kernel.org,
	Christoph Hellwig <hch@infradead.org>,
	Douglas Shakshober <dshaks@redhat.com>,
	Joshua Giles <jgiles@redhat.com>,
	Valerie Aurora <vaurora@redhat.com>,
	Eric Sandeen <esandeen@redhat.com>,
	Steven Whitehouse <swhiteho@redhat.com>,
	Edward Shishkin <edward@redhat.com>,
	Josef Bacik <jbacik@redhat.com>, Jeff Moyer <jmoyer@redhat.com>,
	Chris Mason <chris.mason@oracle.com>,
	"Whitney, Eric" <eric.whitney@hp.com>,
	Theodore Tso <tytso@mit.edu>
Subject: Re: large fs testing
Date: Thu, 28 May 2009 06:52:10 -0400	[thread overview]
Message-ID: <4A1E6CDA.3090500@redhat.com> (raw)
In-Reply-To: <20090528063003.GC3218@webber.adilger.int>

On 05/28/2009 02:30 AM, Andreas Dilger wrote:
> On May 26, 2009  18:17 -0400, Ric Wheeler wrote:
>> What I did get was the following from the fsck run:
>>
>> root@l82bi250:/home/redhat\aYou have new mail in /var/spool/mail/root
>> [root@l82bi250 redhat]# time /sbin/fsck.ext4 -tt -y /dev/mapper/Big_boy-Big_boy
>> e2fsck 1.41.4 (27-Jan-2009)
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 1596k/1177752k (1447k/150k), time: 1184.73/514.16/344.38
>> Pass 1: I/O read: 50655MB, write: 0MB, rate: 42.76MB/s
>> Pass 2: Checking directory structure
>> Entry '4a1590dc~~~~~~~~O4A0SMJ1VC34YQ1PD3B5DL9Q' in /da (188378)
>> references inode 196988 in group 30 where _INODE_UNINIT is set.
>> Fix? yes
>>
>> Restarting e2fsck from the beginning...
>> Group descriptor 15 checksum is invalid.  Fix? yes
>>
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 120396k/-1389015k (120134k/263k), time: 1134.71/522.48/323.65
>> Pass 1: I/O read: 50656MB, write: 0MB, rate: 44.64MB/s
>> Pass 2: Checking directory structure
>> Entry '4a15910c~~~~~~~~H8099TRM701Q29CSTCWBVIHJ' in /0b (404925)
>> references inode 413100 in group 62 where _INODE_UNINIT is set.
>> Fix? yes
>>
>> Restarting e2fsck from the beginning...
>> Group descriptor 31 checksum is invalid.  Fix? yes
>
> This looks like there is a patch of ours missing from the upstream e2fsprogs.
> We have a patch that will restart e2fsck only a single time for inodes
> beyond the high waterwark.  On a large filesystem like yours this would
> have cut 30 minutes off the e2fsck time.  I'll submit that separately.
>
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 231360k/246272k (231083k/278k), time: 1140.48/521.00/334.74
>> Pass 1: I/O read: 50658MB, write: 0MB, rate: 44.42MB/s
>> Pass 2: Checking directory structure
>> Pass 2: Memory used: 231360k/1290436k (231083k/278k), time: 538.22/264.56/83.49
>> Pass 2: I/O read: 13749MB, write: 0MB, rate: 25.55MB/s
>> Pass 3: Checking directory connectivity
>> Peak memory: Memory used: 231360k/1789000k (231083k/278k), time:
>> 4221.57/1947.37/1116.21
>> Pass 3A: Memory used: 231360k/1789000k (231083k/278k), time:  0.00/ 0.00/ 0.00
>> Pass 3A: I/O read: 0MB, write: 0MB, rate: 0.00MB/s
>> Pass 3: Memory used: 231360k/1290436k (231083k/278k), time:  9.99/ 0.26/ 1.37
>> Pass 3: I/O read: 1MB, write: 0MB, rate: 0.10MB/s
>> Pass 4: Checking reference counts
>> Pass 4: Memory used: 231360k/-1481575k (231082k/279k), time: 147.16/139.87/ 1.94
>
> Sign overflow here...  Looks like we exceed 2.5GB of memory here.   Still,
> not too bad considering this is a 80TB filesystem.

We fsck had a virtual size of around 10GB (5.4 resident in the 6GB of DRAM) when 
I checked...  I wonder if it would have been significantly faster without the 
excessive swap use (i.e., on a box with more memory)?

>
>> Pass 4: I/O read: 0MB, write: 0MB, rate: 0.00MB/s
>> Pass 5: Checking group summary information
>> Inode bitmap differences:  -(98404--98405)
>>
>> Note that it got truncated in Pass 5 - just after writing out some values
>> that look like they sign wrapped?
>>
>> -(103650--103655) -(103659--103660) -103663 -103665 -103667
>> -(103669--103670) -(103673--103676) -103679 -103684 -103687 -10
>
> No, this is what gets printed when there are inodes (or blocks) marked
> in the bitmap that are not in use.  It shouldn't be truncated however.
> You said the node crashed at this point?
>
> Cheers, Andreas

Yes - unfortunately, no logs or other indication of why it crashed and we did 
not have a serial console setup either so we don't have anything to go on.

I am going to push harder to get some large storage configurations that we can 
use for testing internally, so hopefully, we will have something to test on in a 
couple of months....

Ric

      reply	other threads:[~2009-05-28 10:53 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-05-23 13:53 large fs testing Ric Wheeler
2009-05-26 12:21 ` Joshua Giles
2009-05-26 12:28   ` Ric Wheeler
2009-05-26 17:39 ` Nick Dokos
2009-05-26 17:47   ` Ric Wheeler
2009-05-26 21:21     ` Andreas Dilger
2009-05-26 21:39       ` Theodore Tso
2009-05-26 22:17       ` Ric Wheeler
2009-05-28  6:30         ` Andreas Dilger
2009-05-28 10:52           ` Ric Wheeler [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4A1E6CDA.3090500@redhat.com \
    --to=rwheeler@redhat.com \
    --cc=adilger@sun.com \
    --cc=chris.mason@oracle.com \
    --cc=dshaks@redhat.com \
    --cc=edward@redhat.com \
    --cc=eric.whitney@hp.com \
    --cc=esandeen@redhat.com \
    --cc=hch@infradead.org \
    --cc=jbacik@redhat.com \
    --cc=jgiles@redhat.com \
    --cc=jmoyer@redhat.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=nicholas.dokos@hp.com \
    --cc=swhiteho@redhat.com \
    --cc=tytso@mit.edu \
    --cc=vaurora@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).