From: David Greaves <david@dgreaves.com>
To: Peter Rabbitson <rabbit+list@rabbit.us>
Cc: linux-raid@vger.kernel.org
Subject: Re: Help to decipher kernel io error log
Date: Thu, 28 Aug 2008 16:38:12 +0100 [thread overview]
Message-ID: <48B6C664.6010108@dgreaves.com> (raw)
In-Reply-To: <48B677DB.4010306@rabbit.us>
Peter Rabbitson wrote:
> Greetings,
>
> This is not a strictly raid question, but this is the best list I know
> of for this type of questions. Two days ago my server ground to a halt
> without apparent reasons. There were tons of processes in D state, with
> no signs of any significant work being done. I attributed it to resource
> starvation (the server is pretty loaded), rebooted and went on with my
> life.
>
> Yesterday I received the log messages included at the bottom of this
> email. Since I am running a --level=10 --raid-devices=4 --layout=f3 I am
> not that worried abiut losing data, and decided to investigate. I
> removed (mdadm -r) the devices in question from the arrays, power cycled
> the server, and executed a full badblocks -svw /dev/sda run. It passed
> with flying colors.
>
> So here is my question - what does the log below signify (there are no
> omissions, this is all I got) - is my controller dying? Or is there
> indeed a well masked hard drive failure? Should I change the drive, the
> controller, or both?
Looks to me like a drive failed with a sector problem.
Then, quite possibly the sector was re-allocated.
What does
smartctl -a /dev/sda
say?
Run
man smartctl
to ensure you're informed :)
Then run:
smartctl -t long /dev/sda
(you may need smartctl -o on /dev/sda)
Depending on the version of smartctl you'll be given a 'poll time' or completion
time. It's safe to run
smartctl -a /dev/sda
early, but make sure the selftest has completed and post the output of that -
especially noting any differences to the earlier -a.
David
prev parent reply other threads:[~2008-08-28 15:38 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-08-28 10:03 Help to decipher kernel io error log Peter Rabbitson
2008-08-28 15:38 ` David Greaves [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=48B6C664.6010108@dgreaves.com \
--to=david@dgreaves.com \
--cc=linux-raid@vger.kernel.org \
--cc=rabbit+list@rabbit.us \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.