From: "Pasi Kärkkäinen" <pasik@iki.fi>
To: Zygo Blaxell <u0oo5pgu@umail.furryterror.org>
Cc: Chris Murphy <lists@colorremedies.com>,
Hannes Reinecke <hare@suse.de>,
linux-raid@vger.kernel.org
Subject: Re: URE, link resets, user hostile defaults
Date: Fri, 19 Aug 2016 13:00:10 +0300 [thread overview]
Message-ID: <20160819100010.GA5195@reaktio.net> (raw)
In-Reply-To: <20160704214304.GZ13212@reaktio.net>
ping
Let's not forget this thread :)
-- Pasi
On Tue, Jul 05, 2016 at 12:43:04AM +0300, Pasi Kärkkäinen wrote:
> On Wed, Jun 29, 2016 at 08:17:51AM -0400, Zygo Blaxell wrote:
> > On Tue, Jun 28, 2016 at 11:33:36AM -0600, Chris Murphy wrote:
> > > On Tue, Jun 28, 2016 at 12:33 AM, Hannes Reinecke <hare@suse.de> wrote:
> > > > Can you post a message log detailing this problem?
> > >
> > > Just over the weekend Phil Turmel posted an email with a bunch of back
> > > reading on the subject of timeout mismatches for someone to read. I've
> > > lost track of how many user emails he's replied to, discovering this
> > > common misconfiguration, and get it straightened out and more often
> > > than not helping the user recover data that otherwise would have been
> > > lost *because* of hard link resetting instead of explicit read errors.
> >
> > OK, but the two links you provided are not examples of these.
> >
>
> Here's one of the threads where Phil explains the issue:
>
> http://marc.info/?l=linux-raid&m=133665797115876&w=2
>
> quote:
>
>
> "A very common report I see on this mailing list is people who have lost arrays
> where the drives all appear to be healthy.
> Given the large size of today's hard drives, even healthy drives will occasionally
> have an unrecoverable read error.
>
> When this happens in a raid array with a desktop drive without SCTERC,
> the driver times out and reports an error to MD. MD proceeds to
> reconstruct the missing data and tries to write it back to the bad
> sector. However, that drive is still trying to read the bad sector and
> ignores the controller. The write is immediately rejected. BOOM! The
> *write* error ejects that member from the array. And you are now
> degraded.
>
> If you don't notice the degraded array right away, you probably won't
> notice until a URE on another drive pops up. Once that happens, you
> can't complete a resync to revive the array.
>
> Running a "check" or "repair" on an array without TLER will have the
> opposite of the intended effect: any URE will kick a drive out instead
> of fixing it.
>
> In the same scenario with an enterprise drive, or a drive with SCTERC
> turned on, the drive read times out before the controller driver, the
> controller never resets the link to the drive, and the followup write
> succeeds. (The sector is either successfully corrected in place, or
> it is relocated by the drive.) No BOOM."
>
>
>
> -- Pasi
>
next prev parent reply other threads:[~2016-08-19 10:00 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-06-27 16:42 URE, link resets, user hostile defaults Chris Murphy
2016-06-28 6:33 ` Hannes Reinecke
2016-06-28 17:33 ` Chris Murphy
2016-06-28 18:28 ` Phil Turmel
2016-06-28 20:46 ` Wols Lists
2016-06-28 22:17 ` Chris Murphy
2016-06-29 6:01 ` Hannes Reinecke
2016-06-29 10:48 ` Pasi Kärkkäinen
2016-06-29 12:17 ` Zygo Blaxell
2016-06-29 18:16 ` Edward Kuns
2016-07-01 20:43 ` Chris Murphy
2016-07-04 6:00 ` Hannes Reinecke
2016-07-04 21:43 ` Pasi Kärkkäinen
2016-08-19 10:00 ` Pasi Kärkkäinen [this message]
2016-08-19 12:36 ` Phil Turmel
2016-08-19 15:30 ` Chris Murphy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160819100010.GA5195@reaktio.net \
--to=pasik@iki.fi \
--cc=hare@suse.de \
--cc=linux-raid@vger.kernel.org \
--cc=lists@colorremedies.com \
--cc=u0oo5pgu@umail.furryterror.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).