public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: David Ford <david@blue-labs.org>
To: Andreas Dilger <adilger@turbolinux.com>
Cc: linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: Any known ext2 FS problems in 2.4.7?
Date: Fri, 03 Aug 2001 23:35:37 -0400	[thread overview]
Message-ID: <3B6B6D89.90804@blue-labs.org> (raw)
In-Reply-To: <200108032301.f73N18E01809@lynx.adilger.int>

Yes I'm sure, this machine has been running for a long time quite well 
save the reported problems.

a) fsck on boot, yes
b) manual fsck and reboot every few days, yes
c) fstab is correct :P
d) missing space isn't account for in du, lsof, or any other tool that 
i'm aware of
e) 4.5 gigs is a lot of space to eat over a couple days, particularly 
when this server's disk use varies by a couple hundred megs per day
f) only the e2fs partition has problems, the other partitions are fine, 
no journal replays or otherwise

After a fresh clean fsck'd start, the machine runs fine for a day or 
two, then in about one day disk space grinds away.  Bringing it to 
single user mode with nothing running but the kernel threads and a few 
init children, lsof shows nothing but a few libs open, and no deleted or 
otherwise open for write files.

All my other machines run fine.  Also note as I said, it only does this 
under 2.4.7, if I boot into 2.4.5 it will stay up for weeks until the 
reported-and-fixed OOPs kills it.  No disk space issues there.

David

Andreas Dilger wrote:

>David Ford writes:
>
>>I'm starting to go through a cycle every 2-3 days where I have to bring 
>>one particular machine down to init l1, kill any processes and remount 
>>RO, then run e2fsck on the e2fs partition.  Over that period of time, 
>>disk space is eaten without accouting. 'du' shows about 13 gigs used 
>>when I sum up all the directories.  Roughly 4.5 gigs is missing.  During 
>>e2fsck, there are many many pages of deleted inodes with zero dtime, ref 
>>count fixups, and free inode count fixups.  When I say many, I mean that 
>>this pIII 667 scrolls for about four minutes...
>>
>>There is nothing special about this partition, it doesn't do it while 
>>running 2.4.5-ac15, but I can't use that kernel either because it OOPSes 
>>as I reported.  That OOPS was fixed for 2.4.7, but this disk space issue 
>>is rather frustrating.  Fortunately all my other systems are reiserfs 
>>and work fine.
>>
>>/dev/ide/host0/bus0/target0/lun0/part2 on / type ext2 
>>(rw,usrquota=/usr/local/admin/system-info/quota-home)
>>
>>I haven't mucked with any /proc settings other than "16384" 
>> >/proc/sys/fs/file-max.  It's also worthy to note that this machine 
>>also likes to break and spontaneously reboot about once every day.  No 
>>klog, no console, no nothing, just bewm.  Again 2.4.5 didn't do this.
>>
>
>Are you sure you are running e2fsck on this partition at boot time?
>I mean, if it is rebooting spontaneously every day, but you need to run
>e2fsck manually to clean up the filesystem every 2-3 days, the fsck after
>reboot should already clean up the filesystem for you.  If you _don't_
>run e2fsck on this filesystem (you need a non-zero number in the 6th
>column of /etc/fstab) that would explain the problem.
>
>The "missing space" you are seeing is because files are being held open
>(thus not reported by "du", which only can check linked files, but reported
>by "df" which shows the whole filesystem stats).  If the files are held
>open at the time of a crash, then you need to run e2fsck to clean up all
>of these "orphans".  You should be able to see what process is causing this
>by running "lsof | grep deleted" to find open-but-deleted files.
>
>Note that reiserfs still has the same problem (AFAIK, I don't think it
>is fixed in the stock kernels, although there is a patch available),
>so even though it doesn't _need_ reiserfsck at boot time, you still
>don't get the space back until it is run.  If the other machines don't
>crash all the time, the space won't be "lost", so you may not notice it.
>Ext3 cleans up orphans at boot time (no fsck needed).
>
>Cheers, Andreas
>



  reply	other threads:[~2001-08-04  3:35 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2001-08-03 17:40 Any known ext2 FS problems in 2.4.7? David Ford
2001-08-03 23:01 ` Andreas Dilger
2001-08-04  3:35   ` David Ford [this message]
2001-08-04  7:14     ` David Ford
2001-08-04 23:32       ` Andreas Dilger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3B6B6D89.90804@blue-labs.org \
    --to=david@blue-labs.org \
    --cc=adilger@turbolinux.com \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox