linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Pasi Kärkkäinen" <pasik@iki.fi>
To: Zygo Blaxell <u0oo5pgu@umail.furryterror.org>
Cc: Chris Murphy <lists@colorremedies.com>,
	Hannes Reinecke <hare@suse.de>,
	linux-raid@vger.kernel.org
Subject: Re: URE, link resets, user hostile defaults
Date: Fri, 19 Aug 2016 13:00:10 +0300	[thread overview]
Message-ID: <20160819100010.GA5195@reaktio.net> (raw)
In-Reply-To: <20160704214304.GZ13212@reaktio.net>


ping

Let's not forget this thread :)


-- Pasi

On Tue, Jul 05, 2016 at 12:43:04AM +0300, Pasi Kärkkäinen wrote:
> On Wed, Jun 29, 2016 at 08:17:51AM -0400, Zygo Blaxell wrote:
> > On Tue, Jun 28, 2016 at 11:33:36AM -0600, Chris Murphy wrote:
> > > On Tue, Jun 28, 2016 at 12:33 AM, Hannes Reinecke <hare@suse.de> wrote:
> > > > Can you post a message log detailing this problem?
> > >
> > > Just over the weekend Phil Turmel posted an email with a bunch of back
> > > reading on the subject of timeout mismatches for someone to read. I've
> > > lost track of how many user emails he's replied to, discovering this
> > > common misconfiguration, and get it straightened out and more often
> > > than not helping the user recover data that otherwise would have been
> > > lost *because* of hard link resetting instead of explicit read errors.
> > 
> > OK, but the two links you provided are not examples of these.
> > 
> 
> Here's one of the threads where Phil explains the issue:
> 
> http://marc.info/?l=linux-raid&m=133665797115876&w=2
> 
> quote:
> 
> 
> "A very common report I see on this mailing list is people who have lost arrays 
> where the drives all appear to be healthy.  
> Given the large size of today's hard drives, even healthy drives will occasionally 
> have an unrecoverable read error.
> 
> When this happens in a raid array with a desktop drive without SCTERC,
> the driver times out and reports an error to MD.  MD proceeds to
> reconstruct the missing data and tries to write it back to the bad
> sector.  However, that drive is still trying to read the bad sector and
> ignores the controller.  The write is immediately rejected.  BOOM!  The
> *write* error ejects that member from the array.  And you are now
> degraded.
> 
> If you don't notice the degraded array right away, you probably won't
> notice until a URE on another drive pops up.  Once that happens, you
> can't complete a resync to revive the array.
> 
> Running a "check" or "repair" on an array without TLER will have the
> opposite of the intended effect: any URE will kick a drive out instead
> of fixing it.
> 
> In the same scenario with an enterprise drive, or a drive with SCTERC
> turned on, the drive read times out before the controller driver, the
> controller never resets the link to the drive, and the followup write
> succeeds.  (The sector is either successfully corrected in place, or
> it is relocated by the drive.)  No BOOM."
> 
> 
> 
> -- Pasi
> 


  reply	other threads:[~2016-08-19 10:00 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-06-27 16:42 URE, link resets, user hostile defaults Chris Murphy
2016-06-28  6:33 ` Hannes Reinecke
2016-06-28 17:33   ` Chris Murphy
2016-06-28 18:28     ` Phil Turmel
2016-06-28 20:46       ` Wols Lists
2016-06-28 22:17         ` Chris Murphy
2016-06-29  6:01     ` Hannes Reinecke
2016-06-29 10:48       ` Pasi Kärkkäinen
2016-06-29 12:17     ` Zygo Blaxell
2016-06-29 18:16       ` Edward Kuns
2016-07-01 20:43         ` Chris Murphy
2016-07-04  6:00           ` Hannes Reinecke
2016-07-04 21:43       ` Pasi Kärkkäinen
2016-08-19 10:00         ` Pasi Kärkkäinen [this message]
2016-08-19 12:36           ` Phil Turmel
2016-08-19 15:30         ` Chris Murphy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160819100010.GA5195@reaktio.net \
    --to=pasik@iki.fi \
    --cc=hare@suse.de \
    --cc=linux-raid@vger.kernel.org \
    --cc=lists@colorremedies.com \
    --cc=u0oo5pgu@umail.furryterror.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).