From: Phil Turmel <philip@turmel.org>
To: Philip Hands <phil@hands.com>
Cc: Roger Heflin <rogerheflin@gmail.com>,
'LinuxRaid' <linux-raid@vger.kernel.org>
Subject: Re: RAID1 seems not to be able to scrub pending sectors shown by smart
Date: Sat, 24 Dec 2011 09:27:45 -0500 [thread overview]
Message-ID: <4EF5E161.5010001@turmel.org> (raw)
In-Reply-To: <8762h62sgb.fsf@poker.hands.com>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi Philip,
On 12/24/2011 05:07 AM, Philip Hands wrote:
[...]
> Last night I started a check of the RAID that contained most of the errors on
> that disk, and it's pretty much finished (81%), in which time the Pending
> sector count is back up to 53. [Erm, 83% and 54 now -- while writing
> this mail]
>
> Clearly it's not a particularly happy drive, so I guess that smart will
> eventually diagnose it as faulty, but in the mean time it may be a
> useful test case for mdadm.
>
> One of those newly pending sectors was found almost immediately, as I
> was able to see from the logs, and while that was being dealt with, it
> drove the system load up to about 18, and rendered the system
> unresponsive for at least 10 seconds, probably more like 20 or 30 (the
> normal load once it had chance to settle down again was about 2, on a 6
> core CPU, so it wasn't really that busy).
>
> [84% and 55 pending now -- with the first indication being a spike in
> load, followed a minute or two later by mention of the read problems in
> the logs, but apparently nothing logged by md, so presumably the read
> eventually succeeded]
>
>> I wonder if a patch might be possible that allows one to put an array
>> into a mode (or go into said mode once a badblock condition has
>> happened) that causes it to read from at least 2 possible data sources
>> and return whichever gets there first...
>
> Well, given that something appears to be blocking in a fairly
> disastrous way on the read that's not coming back, I was wondering if
> there might be some way of having a timeout on those reads that if one
> gets no response for long enough (say 10 seconds) reacts by getting the
> data from elsewhere, and overwriting the slow sector.
Have you set up TLER or SCTERC on these drives? I suspect you haven't, as
these long delays on read errors are typical of default error handling on
consumer drives.
Can you show the complete "smartctl -x" output for this failing drive?
Phil
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.18 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk714VwACgkQBP+iHzflm3BXmACffzNuNvh98KueHKUL06e9Ultj
ETcAn20P84PxbN3n6K0BlDoNsMpg1+2n
=2gBn
-----END PGP SIGNATURE-----
next prev parent reply other threads:[~2011-12-24 14:27 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-12-23 18:39 RAID1 seems not to be able to scrub pending sectors shown by smart Philip Hands
2011-12-23 19:59 ` Roger Heflin
2011-12-23 21:22 ` Philip Hands
2011-12-23 22:26 ` Roger Heflin
2011-12-24 10:07 ` Philip Hands
2011-12-24 14:27 ` Phil Turmel [this message]
2011-12-24 15:30 ` Philip Hands
2011-12-25 0:11 ` Phil Turmel
2011-12-24 15:54 ` Roger Heflin
2011-12-25 0:24 ` Phil Turmel
2011-12-25 15:07 ` Philip Hands
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4EF5E161.5010001@turmel.org \
--to=philip@turmel.org \
--cc=linux-raid@vger.kernel.org \
--cc=phil@hands.com \
--cc=rogerheflin@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox