From: Phil Turmel <philip@turmel.org>
To: Julie Ashworth <ashworth@berkeley.edu>
Cc: linux-raid@vger.kernel.org
Subject: Re: request help with RAID1 array that endlessly attempts to sync
Date: Wed, 18 Dec 2013 07:08:37 -0500 [thread overview]
Message-ID: <52B19045.5010102@turmel.org> (raw)
In-Reply-To: <20131218034556.GA9457@localhost.localdomain>
On 12/17/2013 10:45 PM, Julie Ashworth wrote:
> hi Phil,
> thanks again for your help. It was surprisingly easy to install the latest smarmontools.
>
> On 17-12-2013 14.43 -0500, Phil Turmel wrote:
>> I was interested in the reallocation counts, the current pending
>> sectors, and the scterc timeouts. The latter were not present, and are
>> important.
>
> ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
> 5 Reallocated_Sector_Ct PO--CK 100 100 036 - 3
> 197 Current_Pending_Sector -O--C- 100 100 000 - 1
> SCT Error Recovery Control:
> Read: 100 (10.0 seconds)
> Write: 100 (10.0 seconds)
>
> (I also attached the full output)
>
> I verified that a weekly scrub is performed via cron (default with Centos5), and there were no errors detected prior to the sync. The output is included in syslog reports.
Very good. You do not have a timeout mismatch problem. But the
behavior of /dev/sdb does not match its health. That suggests some
other problem is present, like a bad SATA cord or socket, a bad power
supply, bad cooling, et cetera.
>> But /dev/sdb has three relocations and only one pending error. That's
>> an old drive, but not sick. I'd be concerned that there're other
>> hardware issues in your system if the timeout issue is not part of the
>> problem.
>
> Should I run the sync (mdadm -a) in verbose mode? If so, what is the best way to terminate the current sync? By failing/removing /dev/sda?
I'd let the sync continue until it fails or completes. And if it
completes, exercise the array to see if it stays flaky. If it does not
complete, start swapping parts in the system.
Regards,
Phil
ps. I'll be offline all day today--I'm sure the list will chip in if
you need more help.
next prev parent reply other threads:[~2013-12-18 12:08 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-12-17 6:50 request help with RAID1 array that endlessly attempts to sync Julie Ashworth
2013-12-17 16:53 ` Julie Ashworth
2013-12-17 17:55 ` Phil Turmel
2013-12-17 19:26 ` Julie Ashworth
2013-12-17 19:43 ` Phil Turmel
2013-12-17 23:12 ` David C. Rankin
2013-12-18 3:45 ` Julie Ashworth
2013-12-18 12:08 ` Phil Turmel [this message]
2014-01-21 6:38 ` Julie Ashworth
2014-01-21 13:23 ` Phil Turmel
2014-02-25 0:16 ` Julie Ashworth
2013-12-17 18:12 ` Wilson Jonathan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=52B19045.5010102@turmel.org \
--to=philip@turmel.org \
--cc=ashworth@berkeley.edu \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).