From: Dave Chinner <david@fromorbit.com>
To: Simon Dray <sdray@Espial.com>
Cc: "xfs@oss.sgi.com" <xfs@oss.sgi.com>
Subject: Re: XFS issue xfs goes offline with various messages drive not recoverable without reboot
Date: Thu, 25 Sep 2014 18:12:54 +1000
Message-ID: <20140925081254.GH4758@dastard>
In-Reply-To: <dd6d1d6e9fa7469584e72574347bb088@BN1PR0301MB0625.namprd03.prod.outlook.com>
On Thu, Sep 25, 2014 at 07:30:23AM +0000, Simon Dray wrote:
> Dear Sirs
>
> I wonder if you can help with an issue we see recurring on a regular basis with one of our HP systems, which uses an HP 420 RAID controller
tl;dr: more information is needed about your system to make sense of
the problem. See here:
http://xfs.org/index.php/XFS_FAQ#Q:_What_information_should_I_include_when_reporting_a_problem.3F
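As a rough illustration of what that FAQ entry asks for, the following Python sketch collects a few of the basics (kernel version, xfsprogs version, and some /proc files) into one report. The selection of items and the helper names (`run`, `read`, `collect_report`) are assumptions made for this example, not an official checklist; the FAQ itself is the authority on what to include.

```python
import shutil
import subprocess
from pathlib import Path

# Commands and files to capture. This selection is an assumption for the
# sketch; extend it to match whatever the FAQ entry asks for.
COMMANDS = {
    "kernel version": ["uname", "-a"],
    "xfsprogs version": ["xfs_repair", "-V"],
}
FILES = ["/proc/meminfo", "/proc/mounts", "/proc/partitions"]


def run(cmd):
    """Run a command and return its combined output, or a note if it fails."""
    if shutil.which(cmd[0]) is None:
        return f"({cmd[0]} not found)"
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.TimeoutExpired) as exc:
        return f"({cmd[0]} failed: {exc})"
    return (result.stdout + result.stderr).strip()


def read(path):
    """Return the contents of a file, or a note if it cannot be read."""
    try:
        return Path(path).read_text(errors="replace")
    except OSError as exc:
        return f"(could not read {path}: {exc})"


def collect_report():
    """Gather command output and file contents into one dictionary."""
    report = {label: run(cmd) for label, cmd in COMMANDS.items()}
    for path in FILES:
        report[path] = read(path)
    return report


if __name__ == "__main__":
    for label, text in collect_report().items():
        print(f"==== {label} ====\n{text}\n")
```

The output would still need to be accompanied by the storage layout details and the complete kernel log, which a script like this cannot guess.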
> Action taken
>
> We first saw the following:
> [root@ content]# ls
> ls: cannot open directory .: Input/output error
The filesystem has shut down due to a fatal error.
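The "Input/output error" that ls printed is the C library's text for errno 5 (EIO), the same value that appears as `error 5` in the kernel messages quoted further down. A minimal Python sketch for decoding such a number on the machine at hand (`describe_errno` is a hypothetical helper name):

```python
import errno
import os


def describe_errno(code):
    """Return (symbolic name, human-readable message) for an errno value."""
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


name, message = describe_errno(5)
print(f"errno 5 = {name}: {message}")
```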
> We try to run:
> [root@ /]# xfs_check /dev/md0
> xfs_check: /dev/md0 contains a mounted and writable filesystem
> fatal error -- couldn't initialize XFS library
FYI, xfs_check was deprecated quite a while ago. It no longer
exists in current releases.
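On releases where xfs_check is gone, the usual substitute for a read-only check is `xfs_repair -n`, which scans the filesystem without modifying it. Like xfs_check, it expects the filesystem to be unmounted first. A minimal Python sketch under those assumptions; the helper names are hypothetical, and matching on the device name in /proc/mounts is a simplification:

```python
import subprocess


def mounted_devices(mounts_text):
    """Return the set of device names found in /proc/mounts-style text."""
    return {line.split()[0] for line in mounts_text.splitlines() if line.strip()}


def check_command(device):
    """Build the no-modify check command for a device."""
    return ["xfs_repair", "-n", device]


def dry_run_check(device, mounts_path="/proc/mounts"):
    """Run xfs_repair -n on device, refusing if it still appears mounted."""
    with open(mounts_path) as mounts:
        if device in mounted_devices(mounts.read()):
            raise RuntimeError(f"{device} is still mounted; unmount it first")
    return subprocess.run(check_command(device), capture_output=True, text=True)
```

Calling `dry_run_check("/dev/md0")` needs root privileges, and it only makes sense once the underlying storage is responding again.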
> We also tried to umount /dev/md0 before running xfs_check but no luck. We received the error: device is in use
That can happen if the storage has gone bad and IOs have been lost.
> We use xfs for one of our large raid file systems and we are seeing the xfs filesystem go offline with the following messages in dmesg
>
> messages-20140921:Sep 18 23:01: kernel: XFS (md0): Device md0: metadata write error block 0x5e28623d8
What messages occurred before this? Something reported an IO error
back to XFS, and so that something should have logged an error
message...
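One way to look for that earlier message is to print the log lines that immediately precede the first XFS entry. A minimal Python sketch; the function name, the `XFS (md0)` marker, and the 50-line window are choices made for illustration, and the log path in the usage note is an assumption based on the file name in the quoted messages:

```python
from collections import deque


def context_before_first_match(lines, needle="XFS (md0)", context=50):
    """Return (preceding lines, matching line) for the first line containing
    needle, keeping at most `context` preceding lines.
    Returns ([], None) if no line matches."""
    recent = deque(maxlen=context)
    for line in lines:
        if needle in line:
            return list(recent), line
        recent.append(line)
    return [], None
```

For example, `before, hit = context_before_first_match(open("/var/log/messages-20140921"))` would return up to 50 lines logged just before the first XFS (md0) entry. Errors from the block layer, md, or the RAID driver would be expected to show up there, if anything logged them at all.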
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): I/O error occurred: meta-data dev md0 block 0x445cccc40 ("xlog_iodone") error 5 buf count 32768
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): xfs_do_force_shutdown(0x2) called from line 891 of file fs/xfs/xfs_log.c. Return address = 0xffffffffa2c428dc
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): Log I/O Error Detected. Shutting down filesystem
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): Please umount the filesystem and rectify the problem(s)
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): xfs_imap_to_bp: xfs_trans_read_buf() returned error 5.
> messages-20140921:Sep 18 23:01:04 kernel: XFS (md0): xfs_iunlink_remove: xfs_itobp() returned error 5.
Yup, kernel code is at least 2 years old, because we removed
xfs_itobp in mid 2012... ;)
> In all occurrences the only way to recover from this is to reboot
> the system and allow xfs_repair to run during boot; this clears the
> issue until next time
>
> We have checked the RAID health and nothing seems to be amiss, if
> you could help with this it would be much appreciated
That's par for the course when hardware RAID goes AWOL - controllers
almost never report that they had a problem when they hang (e.g. the
firmware crashes, so it can't log an event to say it crashed).
But really, more information about your system and more complete
logs are needed to be able to make any progress triaging the
problem.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com