public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Alexandru Coman <ghost_3k@gmx.net>
To: xfs@oss.sgi.com
Subject: help investigating some xfs errors
Date: Tue, 12 Jan 2010 17:32:01 +0200	[thread overview]
Message-ID: <4B4C95F1.20106@gmx.net> (raw)

Hello,

I'm having some problems with an XFS filesystem, and I'm wondering if
anyone can point me in the right direction, it would be greatly appreciated.

I have several XFS filesystems on top of LVM in a RAID-1 (mdadm) created
on a pair of 1TB SATA drives. Running on Linux (Debian, amd64). One of
the XFS filesystems is 600GB in size (65% used), storing ~19 mil files
under 100KB (jpeg), usually under high load (read+write). There are also
a few other smaller XFS partitions on the same drives. It has been
running like this for 11 months, until a few days ago when I started to
get a lot of errors.

On Jan 10, I got a few lines with "ata3: hard resetting link", after
which the partition could not be accessed, I couldn't umount/mount it.
All other partitions were fine. I rebooted the server, but that
filesystem still wouldn't mount (it said "Structure needs cleaning"), I
then ran xfs_repair on it, which reported that I needed to use the "-L"
option to destroy the log. I then ran "xfs_repair -L" which appeared to
fix a lot of errors, and then I was able to mount the filesystem again.
Everything appeared to be ok at that point.

Jan 10 night: a lot of xfs call traces start to appear in the log

Jan 11: xfs call traces along with
- xfs_force_shutdown(dm-4,0x8) called from line 1164 of file
fs/xfs/xfs_trans.c.  Return address = 0xffffffffa01999ff
- xfs_imap_to_bp: xfs_trans_read_buf()returned an error 5 on dm-4. 
Returning error.
- lots of "Filesystem "dm-4": xfs_log_force: error 5 returned."
The filesystem disappeared, but I could unmount and mount it again with
no errors. At this point I've also decided to update the kernel, and
switched from 2.6.26 to 2.6.30 Then ran xfs_repair which again found a
few errors.

Jan 12:  xfs call traces along with:
- Filesystem "dm-4": corrupt dinode 1293803384, extent total = 1,
nblocks = 0.  Unmount and run xfs_repair.
- Filesystem "dm-4": corrupt dinode 665458404, extent total = 1, nblocks
= 0.  Unmount and run xfs_repair.
- Filesystem "dm-4": corrupt dinode 225720890, extent total = 1, nblocks
= 0.  Unmount and run xfs_repair.
I then unmounted the fs and ran xfs_repair again. This time the output
was massive compared to the previous runs, and it put around ~ 100.000
files in lost+found.

Beside 3 lines on Jan 10 with "ata3: hard resetting link", there have
been no sign of possible hardware problems. The raid and the hdd's
appear to be fine, no errors. What's curious is that I'm experiencing
problems only with the large XFS filesystem, and there hasn't been not
even a single error in the logs about the other xfs partitions.

So, if anyone has any ideea what I can research next, to help me find
out more information about what's happening here...

I've uploaded some detailed logs at  http://ghost3k.net/xfs1/


Thanks,
Alexandru Coman

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

             reply	other threads:[~2010-01-12 15:31 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-01-12 15:32 Alexandru Coman [this message]
2010-01-12 20:26 ` help investigating some xfs errors Eric Sandeen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B4C95F1.20106@gmx.net \
    --to=ghost_3k@gmx.net \
    --cc=xfs@oss.sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox