From: Bill Davidsen <davidsen@tmr.com>
To: Maurice Hilarius <maurice@harddata.com>
Cc: Linux RAID <linux-raid@vger.kernel.org>
Subject: Re: Question: how to identify failing disk in a RAID1
Date: Mon, 14 Apr 2008 12:38:06 -0400 [thread overview]
Message-ID: <4803886E.3050007@tmr.com> (raw)
In-Reply-To: <4802CDA2.605@harddata.com>
Maurice Hilarius wrote:
> Bill Davidsen wrote:
>> ..
>>>> I am pretty sure that one of the drives has developed some issues
>>>> and needs to be replaced.
>>>> ..
>>
>> Very unexpected that the data would be bad without any hardware errors.
> I DID say:
> "I am pretty sure that one of the drives has developed some issues and
> needs to be replaced. "
>> Did you look at your logs to see if one of your drives, or perhasps
>> both, are getting hardware errors?
> Oh, I KNOW one does..
> The question is WHICH one?
>
I no longer have any old logs showing errors, but /var/log/messages
and/or dmesg should have an error message with a drive identification if
you are getting disk errors.
>> I would run a 'check' and and see what mdadm finds on the array, you
>> may have other problems.
>>
> Pardon my stupidity, care to share some syntax for that?
cd /sys/block/md0/md
echo check >sync_action; cat mismatch_cnt
That's the count of errors found. Replace 'check' with 'repair' to make
the errors go away, reboot, run 'check' again.
>> Actually, I think I would run memtest86 for at least a few hours,
>> starting from a really cold system (not just a cold boot, off for a
>> few hours).
> Did that already.
>> Your comment "on boot" may come from memory or other component which
>> needs to physically get up to temperature before working reliably.
>> Particularly if you don't get additional errors after you have been
>> up for a while.
>>
> It happens cold or hot.
>
>
> --
> Regards, Maurice
>
--
Bill Davidsen <davidsen@tmr.com>
"Woe unto the statesman who makes war without a reason that will still
be valid when the war is over..." Otto von Bismark
next prev parent reply other threads:[~2008-04-14 16:38 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-04-13 19:14 Question: how to identify failing disk in a RAID1 Maurice Hilarius
2008-04-13 19:29 ` Justin Piszcz
2008-04-14 1:14 ` Bill Davidsen
[not found] ` <4802CDA2.605@harddata.com>
2008-04-14 16:38 ` Bill Davidsen [this message]
[not found] ` <4804CD4F.7080303@harddata.com>
2008-04-15 18:14 ` Bill Davidsen
[not found] ` <48050DD6.7020404@harddata.com>
[not found] ` <48055EFA.8060505@tmr.com>
[not found] ` <480607F2.3060504@harddata.com>
2008-04-17 13:12 ` Bill Davidsen
[not found] ` <48076096.2020804@harddata.com>
2008-04-18 13:17 ` Bill Davidsen
[not found] ` <480F7105.9030405@harddata.com>
2008-04-23 18:54 ` Justin Piszcz
[not found] ` <480F8830.6020207@harddata.com>
2008-04-23 19:26 ` Justin Piszcz
2008-04-27 17:03 ` Keith Roberts
2008-04-27 19:28 ` Richard Scobie
2008-04-28 5:29 ` Keith Roberts
2008-04-28 6:06 ` Michael Tokarev
2008-04-28 7:01 ` Richard Scobie
2008-04-27 21:53 ` Mark Hahn
-- strict thread matches above, loose matches on Subject: below --
2008-04-18 17:36 David Lethe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4803886E.3050007@tmr.com \
--to=davidsen@tmr.com \
--cc=linux-raid@vger.kernel.org \
--cc=maurice@harddata.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.