Robustness in the face of errors

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Robustness in the face of errors
@ 2002-11-16  9:56 John L. Bass
  2002-11-16 12:08 ` Neil Brown
  0 siblings, 1 reply; 4+ messages in thread
From: John L. Bass @ 2002-11-16  9:56 UTC (permalink / raw)
  To: linux-raid

On first error the system currently appears to just abandon a drive, forcing
the system into degraded mode for all I/O which follows. A much more reasonable
approach would be to not abandon the drive completely, but rather build a fast
lookup table with known bad blocks which would allow accesses to most areas of
the array to continue without degradation, and only areas that have bad blocks
would be forced into degraded mode.

Many drives will trash a sector if power drops when writing, and that sector
will generate read errors until written. It makes sense on those drives to
recover the data in degraded mode, and re-write followed by a verify. If the
verify fails, and the drive support dynamic sparing/remapping the sector
should be remapped, rewritten, and verified again. On a large 200GB arry, this
single feature would remove nearly a day of reconstruction time for normal
errors and sector failures, substantially improving realized reliability.

Doing dynamic error management would remove 99% of the gross software raid
device failures I have seen over the last year.

John Bass

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Robustness in the face of errors
  2002-11-16  9:56 John L. Bass
@ 2002-11-16 12:08 ` Neil Brown
  0 siblings, 0 replies; 4+ messages in thread
From: Neil Brown @ 2002-11-16 12:08 UTC (permalink / raw)
  To: John L. Bass; +Cc: linux-raid

On Saturday November 16, jbass@dmsd.com wrote:
> On first error the system currently appears to just abandon a drive, forcing
> the system into degraded mode for all I/O which follows. A much more reasonable
> approach would be to not abandon the drive completely, but rather build a fast
> lookup table with known bad blocks which would allow accesses to most areas of
> the array to continue without degradation, and only areas that have bad blocks
> would be forced into degraded mode.
> 
> Many drives will trash a sector if power drops when writing, and that sector
> will generate read errors until written. It makes sense on those drives to
> recover the data in degraded mode, and re-write followed by a verify. If the
> verify fails, and the drive support dynamic sparing/remapping the sector
> should be remapped, rewritten, and verified again. On a large 200GB arry, this
> single feature would remove nearly a day of reconstruction time for normal
> errors and sector failures, substantially improving realized reliability.
> 
> Doing dynamic error management would remove 99% of the gross software raid
> device failures I have seen over the last year.

You are largely correct...
I look forward to you providing (or sponsoring) code to do this. :-)

Maybe this should go on a FAQ as it does get mentioned from time to
time.
The answer is:
    Yes, it could be done.
    No, it hasn't been done.
    Patches are always welcome.

NeilBrown

^ permalink raw reply	[flat|nested] 4+ messages in thread

* RE: Robustness in the face of errors
@ 2002-11-18 20:07 Cress, Andrew R
  0 siblings, 0 replies; 4+ messages in thread
From: Cress, Andrew R @ 2002-11-18 20:07 UTC (permalink / raw)
  To: 'Neil Brown', jbass; +Cc: linux-raid

Yeah, this is logic that scsi couldn't do by itself, but md can, since it
can recover the data.

Also, wouldn't we want to check (and even set) the auto-reallocation
(AWRE/ARRE) mode page bits on the drive when md loads, to let the disk do as
much as it can with remapping?  Or does that belong outside of md?

Andy

-----Original Message-----
From: Neil Brown [mailto:neilb@cse.unsw.edu.au] 
Sent: Saturday, November 16, 2002 7:09 AM
To: jbass@dmsd.com
Cc: linux-raid@vger.kernel.org
Subject: Re: Robustness in the face of errors


On Saturday November 16, jbass@dmsd.com wrote:
> On first error the system currently appears to just abandon a drive,
forcing
> the system into degraded mode for all I/O which follows. A much more
reasonable
> approach would be to not abandon the drive completely, but rather build a
fast
> lookup table with known bad blocks which would allow accesses to most
areas of
> the array to continue without degradation, and only areas that have bad
blocks
> would be forced into degraded mode.
> 
> Many drives will trash a sector if power drops when writing, and that
sector
> will generate read errors until written. It makes sense on those drives to
> recover the data in degraded mode, and re-write followed by a verify. If
the
> verify fails, and the drive support dynamic sparing/remapping the sector
> should be remapped, rewritten, and verified again. On a large 200GB arry,
this
> single feature would remove nearly a day of reconstruction time for normal
> errors and sector failures, substantially improving realized reliability.
> 
> Doing dynamic error management would remove 99% of the gross software raid
> device failures I have seen over the last year.

You are largely correct...
I look forward to you providing (or sponsoring) code to do this. :-)

Maybe this should go on a FAQ as it does get mentioned from time to
time.
The answer is:
    Yes, it could be done.
    No, it hasn't been done.
    Patches are always welcome.

NeilBrown
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 4+ messages in thread

* RE: Robustness in the face of errors
@ 2002-11-18 20:18 John L. Bass
  0 siblings, 0 replies; 4+ messages in thread
From: John L. Bass @ 2002-11-18 20:18 UTC (permalink / raw)
  To: andrew.r.cress, jbass, neilb; +Cc: linux-raid

	Yeah, this is logic that scsi couldn't do by itself, but md can, since it
	can recover the data.

	Also, wouldn't we want to check (and even set) the auto-reallocation
	(AWRE/ARRE) mode page bits on the drive when md loads, to let the disk do as
	much as it can with remapping?  Or does that belong outside of md?

	Andy

There are limited spare resources in a drive, which are wasted if consumed by
"normal" transient errors. It's much better to recover/rewrite the sector inside
md, and if persistant then spare the sector at the drive level.

At Fortune Systems (largest M68K Unix mfgr in early 1980's) we tried auto sparing
on first error and it completely drove us crazy when the errors were introduced by
poor power and EMI coupling. The drives that were returned, were almost always good,
it was the system environment that triggered the majority of the errors.

I've been running software raid here on a large FC array ... and a number of relatively
normal errors have repeatedly taken the raid array off-line and potentially exposed
the data to corruption since the only recover is to "mkraid -R" and accept the data
state as it is.

John

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2002-11-18 20:18 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-11-18 20:18 Robustness in the face of errors John L. Bass
  -- strict thread matches above, loose matches on Subject: below --
2002-11-18 20:07 Cress, Andrew R
2002-11-16  9:56 John L. Bass
2002-11-16 12:08 ` Neil Brown

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).