From: David Greaves <david@dgreaves.com>
To: Peter Rabbitson <rabbit+list@rabbit.us>
Cc: linux-raid@vger.kernel.org
Subject: Re: Help to decipher kernel io error log
Date: Thu, 28 Aug 2008 16:38:12 +0100
Message-ID: <48B6C664.6010108@dgreaves.com>
In-Reply-To: <48B677DB.4010306@rabbit.us>
Peter Rabbitson wrote:
> Greetings,
>
> This is not strictly a RAID question, but this is the best list I know
> of for this type of question. Two days ago my server ground to a halt
> for no apparent reason. There were tons of processes in D state, with
> no signs of any significant work being done. I attributed it to resource
> starvation (the server is pretty loaded), rebooted and went on with my
> life.
>
> Yesterday I received the log messages included at the bottom of this
> email. Since I am running a --level=10 --raid-devices=4 --layout=f3 I am
> not that worried about losing data, and decided to investigate. I
> removed (mdadm -r) the devices in question from the arrays, power cycled
> the server, and executed a full badblocks -svw /dev/sda run. It passed
> with flying colors.
>
> So here is my question - what does the log below signify (there are no
> omissions, this is all I got) - is my controller dying? Or is there
> indeed a well masked hard drive failure? Should I change the drive, the
> controller, or both?
Looks to me like a drive failed with a sector problem.
Then, quite possibly the sector was re-allocated.
What does
smartctl -a /dev/sda
say?
Run
man smartctl
to ensure you're informed :)
Then run:
smartctl -t long /dev/sda
(you may first need smartctl -s on /dev/sda to enable SMART on the drive)
Depending on the version of smartctl, you'll be given either a 'poll time' or a
completion time. It's safe to run
smartctl -a /dev/sda
before then, but make sure the self-test has completed, then post that output,
noting especially any differences from the earlier -a.
David
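
One way to spot the "differences to the earlier -a" is to save both reports to
files and compare the sector-related attributes. The following is only a
minimal sketch, not part of the original advice: the function name smart_diff
and the file names are hypothetical, and it assumes the usual attribute table
layout printed by smartctl -a (attribute name in column 2, raw value in
column 10) and the common attribute names Reallocated_Sector_Ct,
Current_Pending_Sector and Offline_Uncorrectable, which may differ on some
drives.

```shell
#!/bin/sh
# Hypothetical helper: compare two saved `smartctl -a` reports and print the
# sector-health attributes whose raw value changed between them.
# Assumes the standard attribute table: name in field 2, raw value in field 10.
smart_diff() {
    awk '
        $2 ~ /^(Reallocated_Sector_Ct|Current_Pending_Sector|Offline_Uncorrectable)$/ {
            if (FNR == NR)                 # still reading the first (before) file
                before[$2] = $10
            else if (before[$2] != $10)    # second (after) file: report changes
                print $2 ": " before[$2] " -> " $10
        }
    ' "$1" "$2"
}

# Intended use (run as root; file names are placeholders):
#   smartctl -a /dev/sda > before.txt
#   smartctl -t long /dev/sda      # wait until the self-test has completed
#   smartctl -a /dev/sda > after.txt
#   smart_diff before.txt after.txt
```

No output means none of the three raw values changed. A Reallocated_Sector_Ct
that went up would be consistent with the sector having been re-allocated.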