From: Andreas Klauer <Andreas.Klauer@metamorpher.de>
To: Gandalf Corvotempesta <gandalf.corvotempesta@gmail.com>
Cc: Wolfgang Denk <wd@denx.de>, linux-raid@vger.kernel.org
Subject: Re: Disk Monitoring
Date: Thu, 29 Jun 2017 12:14:00 +0200 [thread overview]
Message-ID: <20170629101400.GA4604@metamorpher.de> (raw)
In-Reply-To: <CAJH6TXhJ8HoH=tnxreZ4EAFYQYDh05KHAQxwoLqp1f7zQM4N0g@mail.gmail.com>
On Thu, Jun 29, 2017 at 11:52:01AM +0200, Gandalf Corvotempesta wrote:
> disk0 has sector X (unused) failed.
> It's unused, thus, kernel knows nothing about that and is operting
> normally, no warning message or anything. If you don't access sector
> X, you wont be notified.
>
> Now, disk1 hard-fail. You have to replace that.
> During the resync, you have to resync the whole array, but disk0,
> sectorX is unreadable.
> The resync will fail and the whole array is down.
> Am I missing something?
Not really. It's just that you have to set up the monitoring yourself,
whichever way you feel comfortable with.
SMART has a selftest feature which causes the disk to read sectors.
You can do whole disk at once (long selftest) or in segments
(selective selftest). I prefer the selective since that allows you
to place the selftest in the time window of least activity.
Instead of spending an entire day (or two) testing the whole drive
you can put in an hour or two of testing every night and have it
cover the entire drive over X days.
mdadm can also perform RAID checks, reading everything including
parity, RAID layer would attempt to fix read errors then, and you
can also check mismatch_cnt.
The mdadm checks can also done region by region to distribute
load over several days but I think it's still not a direct option
for mdadm, the region can be set via /proc or /sys somewhere...
Both smartmontools and mdadm should be set up to run such checks
periodically, and instantly notify you by email if any problem occurs.
If a disk has problems, replace it, otherwise it's a gamble.
Whatever promises RAID makes regarding redundancy, it always
assumes the other drives to work 100%.
It's very unlikely to encounter read errors during rebuild if
you ran regular checks and didn't forcibly keep bad drives.
Regards
Andreas Klauer
next prev parent reply other threads:[~2017-06-29 10:14 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-06-28 13:19 Disk Monitoring Wolfgang Denk
2017-06-29 9:52 ` Gandalf Corvotempesta
2017-06-29 10:10 ` Reindl Harald
2017-06-29 10:14 ` Gandalf Corvotempesta
2017-06-29 10:37 ` Reindl Harald
2017-06-29 14:28 ` Wols Lists
2017-06-29 10:14 ` Andreas Klauer [this message]
2017-06-29 10:14 ` Mateusz Korniak
2017-06-29 10:16 ` Gandalf Corvotempesta
2017-06-29 14:33 ` Wols Lists
2017-06-30 12:35 ` Gandalf Corvotempesta
2017-06-30 14:35 ` Phil Turmel
2017-06-30 19:56 ` Anthony Youngman
2017-07-01 13:42 ` Drew
2017-07-01 14:12 ` Gandalf Corvotempesta
2017-07-01 15:36 ` Drew
2017-06-29 10:20 ` Mateusz Korniak
2017-06-29 10:25 ` Gandalf Corvotempesta
2017-06-29 10:34 ` Reindl Harald
-- strict thread matches above, loose matches on Subject: below --
2017-06-28 10:25 Gandalf Corvotempesta
2017-06-28 10:45 ` Johannes Truschnigg
2017-07-06 3:31 ` NeilBrown
2017-06-28 12:43 ` Wols Lists
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170629101400.GA4604@metamorpher.de \
--to=andreas.klauer@metamorpher.de \
--cc=gandalf.corvotempesta@gmail.com \
--cc=linux-raid@vger.kernel.org \
--cc=wd@denx.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).