linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* URE, link resets, user hostile defaults
@ 2016-06-27 16:42 Chris Murphy
  2016-06-28  6:33 ` Hannes Reinecke
  0 siblings, 1 reply; 16+ messages in thread
From: Chris Murphy @ 2016-06-27 16:42 UTC (permalink / raw)
  To: linux-raid

Hi,

Drives with SCT ERC not supported or unset, result in potentially long
error recoveries for marginal or bad sectors: upwards of 180 second
recovers are suggested.

The kernel's SCSI command timer default of 30 seconds, i.e.

cat /sys/block/<dev>/device/timeout

conspires to  undermine the deep recovery of most drives now on the
market. This by default misconfiguration results in problems list
regulars are very well aware of. It affects all raid configurations,
and even affects the non-RAID single drive use case. And it does so in
a way that doesn't happen on either Windows or macOS. Basically it is
linux kernel induced data loss, the drive very possibly could present
the requested data upon deep recovery being permitted, but the
kernel's command timer is reached before recovery completes, and
obliterates any possibility of recovering that data. By default.

This now seems to affect the majority of use cases. At one time 30
seconds might have been sane for a world with drives that had less
than 30 second recoveries for bad sectors. But that's no longer the
case.

I'm wondering if anyone has floated the idea of changing the kernels
default SCSI command timer? And if so, if there's a thread discussing
where that was rejected upstream? Or if this exposes other liabilities
that merits an alternative work around for what now amounts to a
defect. Maybe it needs to be a udev rule?

Perhaps ideally what we'd like to have is two timers. A timer that
reports back "slowness" for a drive to complete a queued command,
which could be used by e.g. scrubs to preemptively overwrite those
sectors rather than wait for read errors to happen. And then a timer
with a longer value would be the present timer that results in a link
reset once it's reached.

Thanks,

-- 
Chris Murphy

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2016-08-19 15:30 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-06-27 16:42 URE, link resets, user hostile defaults Chris Murphy
2016-06-28  6:33 ` Hannes Reinecke
2016-06-28 17:33   ` Chris Murphy
2016-06-28 18:28     ` Phil Turmel
2016-06-28 20:46       ` Wols Lists
2016-06-28 22:17         ` Chris Murphy
2016-06-29  6:01     ` Hannes Reinecke
2016-06-29 10:48       ` Pasi Kärkkäinen
2016-06-29 12:17     ` Zygo Blaxell
2016-06-29 18:16       ` Edward Kuns
2016-07-01 20:43         ` Chris Murphy
2016-07-04  6:00           ` Hannes Reinecke
2016-07-04 21:43       ` Pasi Kärkkäinen
2016-08-19 10:00         ` Pasi Kärkkäinen
2016-08-19 12:36           ` Phil Turmel
2016-08-19 15:30         ` Chris Murphy

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).