From: Ric Wheeler <rwheeler@redhat.com>
To: Andreas Dilger <adilger@sun.com>
Cc: nicholas.dokos@hp.com, linux-fsdevel@vger.kernel.org,
Christoph Hellwig <hch@infradead.org>,
Douglas Shakshober <dshaks@redhat.com>,
Joshua Giles <jgiles@redhat.com>,
Valerie Aurora <vaurora@redhat.com>,
Eric Sandeen <esandeen@redhat.com>,
Steven Whitehouse <swhiteho@redhat.com>,
Edward Shishkin <edward@redhat.com>,
Josef Bacik <jbacik@redhat.com>, Jeff Moyer <jmoyer@redhat.com>,
Chris Mason <chris.mason@oracle.com>,
"Whitney, Eric" <eric.whitney@hp.com>,
Theodore Tso <tytso@mit.edu>
Subject: Re: large fs testing
Date: Thu, 28 May 2009 06:52:10 -0400 [thread overview]
Message-ID: <4A1E6CDA.3090500@redhat.com> (raw)
In-Reply-To: <20090528063003.GC3218@webber.adilger.int>
On 05/28/2009 02:30 AM, Andreas Dilger wrote:
> On May 26, 2009 18:17 -0400, Ric Wheeler wrote:
>> What I did get was the following from the fsck run:
>>
>> root@l82bi250:/home/redhat\aYou have new mail in /var/spool/mail/root
>> [root@l82bi250 redhat]# time /sbin/fsck.ext4 -tt -y /dev/mapper/Big_boy-Big_boy
>> e2fsck 1.41.4 (27-Jan-2009)
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 1596k/1177752k (1447k/150k), time: 1184.73/514.16/344.38
>> Pass 1: I/O read: 50655MB, write: 0MB, rate: 42.76MB/s
>> Pass 2: Checking directory structure
>> Entry '4a1590dc~~~~~~~~O4A0SMJ1VC34YQ1PD3B5DL9Q' in /da (188378)
>> references inode 196988 in group 30 where _INODE_UNINIT is set.
>> Fix? yes
>>
>> Restarting e2fsck from the beginning...
>> Group descriptor 15 checksum is invalid. Fix? yes
>>
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 120396k/-1389015k (120134k/263k), time: 1134.71/522.48/323.65
>> Pass 1: I/O read: 50656MB, write: 0MB, rate: 44.64MB/s
>> Pass 2: Checking directory structure
>> Entry '4a15910c~~~~~~~~H8099TRM701Q29CSTCWBVIHJ' in /0b (404925)
>> references inode 413100 in group 62 where _INODE_UNINIT is set.
>> Fix? yes
>>
>> Restarting e2fsck from the beginning...
>> Group descriptor 31 checksum is invalid. Fix? yes
>
> This looks like there is a patch of ours missing from the upstream e2fsprogs.
> We have a patch that will restart e2fsck only a single time for inodes
> beyond the high waterwark. On a large filesystem like yours this would
> have cut 30 minutes off the e2fsck time. I'll submit that separately.
>
>> Pass 1: Checking inodes, blocks, and sizes
>> Pass 1: Memory used: 231360k/246272k (231083k/278k), time: 1140.48/521.00/334.74
>> Pass 1: I/O read: 50658MB, write: 0MB, rate: 44.42MB/s
>> Pass 2: Checking directory structure
>> Pass 2: Memory used: 231360k/1290436k (231083k/278k), time: 538.22/264.56/83.49
>> Pass 2: I/O read: 13749MB, write: 0MB, rate: 25.55MB/s
>> Pass 3: Checking directory connectivity
>> Peak memory: Memory used: 231360k/1789000k (231083k/278k), time:
>> 4221.57/1947.37/1116.21
>> Pass 3A: Memory used: 231360k/1789000k (231083k/278k), time: 0.00/ 0.00/ 0.00
>> Pass 3A: I/O read: 0MB, write: 0MB, rate: 0.00MB/s
>> Pass 3: Memory used: 231360k/1290436k (231083k/278k), time: 9.99/ 0.26/ 1.37
>> Pass 3: I/O read: 1MB, write: 0MB, rate: 0.10MB/s
>> Pass 4: Checking reference counts
>> Pass 4: Memory used: 231360k/-1481575k (231082k/279k), time: 147.16/139.87/ 1.94
>
> Sign overflow here... Looks like we exceed 2.5GB of memory here. Still,
> not too bad considering this is a 80TB filesystem.
We fsck had a virtual size of around 10GB (5.4 resident in the 6GB of DRAM) when
I checked... I wonder if it would have been significantly faster without the
excessive swap use (i.e., on a box with more memory)?
>
>> Pass 4: I/O read: 0MB, write: 0MB, rate: 0.00MB/s
>> Pass 5: Checking group summary information
>> Inode bitmap differences: -(98404--98405)
>>
>> Note that it got truncated in Pass 5 - just after writing out some values
>> that look like they sign wrapped?
>>
>> -(103650--103655) -(103659--103660) -103663 -103665 -103667
>> -(103669--103670) -(103673--103676) -103679 -103684 -103687 -10
>
> No, this is what gets printed when there are inodes (or blocks) marked
> in the bitmap that are not in use. It shouldn't be truncated however.
> You said the node crashed at this point?
>
> Cheers, Andreas
Yes - unfortunately, no logs or other indication of why it crashed and we did
not have a serial console setup either so we don't have anything to go on.
I am going to push harder to get some large storage configurations that we can
use for testing internally, so hopefully, we will have something to test on in a
couple of months....
Ric
prev parent reply other threads:[~2009-05-28 10:53 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-05-23 13:53 large fs testing Ric Wheeler
2009-05-26 12:21 ` Joshua Giles
2009-05-26 12:28 ` Ric Wheeler
2009-05-26 17:39 ` Nick Dokos
2009-05-26 17:47 ` Ric Wheeler
2009-05-26 21:21 ` Andreas Dilger
2009-05-26 21:39 ` Theodore Tso
2009-05-26 22:17 ` Ric Wheeler
2009-05-28 6:30 ` Andreas Dilger
2009-05-28 10:52 ` Ric Wheeler [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4A1E6CDA.3090500@redhat.com \
--to=rwheeler@redhat.com \
--cc=adilger@sun.com \
--cc=chris.mason@oracle.com \
--cc=dshaks@redhat.com \
--cc=edward@redhat.com \
--cc=eric.whitney@hp.com \
--cc=esandeen@redhat.com \
--cc=hch@infradead.org \
--cc=jbacik@redhat.com \
--cc=jgiles@redhat.com \
--cc=jmoyer@redhat.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=nicholas.dokos@hp.com \
--cc=swhiteho@redhat.com \
--cc=tytso@mit.edu \
--cc=vaurora@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).