linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: maarten <maarten@ultratux.net>
To: linux-raid@vger.kernel.org
Subject: Re: Spares and partitioning huge disks
Date: Sun, 9 Jan 2005 22:26:25 +0100	[thread overview]
Message-ID: <200501092226.25910.maarten@ultratux.net> (raw)
In-Reply-To: <20050109193353.GB9524@janus>

On Sunday 09 January 2005 20:33, Frank van Maarseveen wrote:
> On Sat, Jan 08, 2005 at 05:49:32PM +0100, maarten wrote:

> > However, IF during that
> > resync one other drive has a read error, it gets kicked too and the array
> > dies.  The chances of that happening are not very small;
>
> Ouch! never considered this. So, RAID5 will actually decrease reliability
> in a significant number of cases because:

> -	>1 read errors can cause a total break-down whereas it used
> 	to cause only a few userland I/O errors, disruptive but not foobar.

Well, yes and no.  You can decide to do a full backup in case you hadn't, 
prior to changing drives. And if it is _just_ a bad sector, you can 'assemble 
--force' yielding what you would've had in a non-raid setup; some file 
somewhere that's got corrupted. No big deal, ie. the same trouble as was 
caused without raid-5.

> -	disk replacement is quite risky. This is totally unexpected to me
> 	but it should have been obvious: there's no bad block list in MD
> 	so if we would postpone I/O errors during reconstruction then
> 	1:	it might cause silent data corruption when I/O error
> 		unexpectedly disappears.
> 	2:	we might silently loose redundancy in a number of places.

Not sure if I understood all of that, but I think you're saying that md 
_could_ disregard read errors _when_already_running_in_degraded_mode_ so as 
to preserve the array at all cost.  Hum.  That choice should be left to the 
user if it happens, he probably knows best what to choose in the 
circumstances.

No really, what would be best is that md made a difference between total media 
failure and sector failure.  If one sector is bad on one drive [and it gets 
kicked therefore] it should be possible when a further read error occurs on 
other media, to try and read the missing sector data from the kicked drive, 
who may well have the data there waiting, intact and all.

Don't know how hard that is really, but one could maybe think of pushing a 
disk in an intermediate state between "failed" and "good" like "in_disgrace" 
what signals to the end user "Don't remove this disk as yet; we may still 
need it, but add and resync a spare at your earliest convenience as we're 
running in degraded mode as of now".
Hmm.  Complicated stuff. :-)

This kind of error will get more and more predominant with growing media and 
decreasing disk quality. Statistically there is not a huge chance of getting 
a read failure on a 18GB scsi disk, but on a cheap(ish) 500 GB ATA disk that 
is an entrirely different ballpark. 

> I think RAID6 but especially RAID1 is safer.

Well, duh :)  At the expense of buying everything twice, sure it's safer :))

> A small side note on disk behavior:
> If it becomes possible to do block remapping at any level (MD, DM/LVM,
> FS) then we might not want to write to sectors with read errors at all
> but just remap the corresponding blocks by software as long as we have
> free blocks: save disk-internal spare sectors so the disk firmware can
> pre-emptively remap degraded but ECC correctable sectors upon read.

Well I dunno.  In ancient times, the OS was charged with remapping bad sectors 
back when disk drives had no intelligence.  Now we delegated that task to the 
disk.  I'm not sure reverting back to the old behaviour is a smart move.
But with raid, who knows...

And as it is I don't think you get the chance to save the disk-internal spare 
sectors; the disk handles that transparently so any higher layer cannot only 
not prevent that, but is even kept completely ignorant to it happening. 

Maarten



  reply	other threads:[~2005-01-09 21:26 UTC|newest]

Thread overview: 94+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-01-06 14:16 Spares and partitioning huge disks maarten
2005-01-06 16:46 ` Guy
2005-01-06 17:08   ` maarten
2005-01-06 17:31 ` Guy
2005-01-06 18:18   ` maarten
     [not found]     ` <41DD83DA.9040609@h3c.com>
2005-01-06 19:42       ` maarten
2005-01-07 20:59 ` Mario Holbe
2005-01-07 21:57   ` Guy
2005-01-08 10:22     ` Mario Holbe
2005-01-08 12:19       ` maarten
2005-01-08 16:33         ` Guy
2005-01-08 16:58           ` maarten
2005-01-08 14:52     ` Frank van Maarseveen
2005-01-08 15:50       ` Mario Holbe
2005-01-08 16:32       ` Guy
2005-01-08 17:16         ` maarten
2005-01-08 18:55           ` Guy
2005-01-08 19:25             ` maarten
2005-01-08 20:33               ` Mario Holbe
2005-01-08 23:01                 ` maarten
2005-01-09 10:10                   ` Mario Holbe
2005-01-09 16:23                     ` Guy
2005-01-09 16:36                       ` Michael Tokarev
2005-01-09 17:52                         ` Peter T. Breuer
2005-01-09 17:59                           ` Michael Tokarev
2005-01-09 18:34                             ` Peter T. Breuer
2005-01-09 20:28                             ` Guy
2005-01-09 20:47                               ` Peter T. Breuer
2005-01-10  7:19                                 ` Peter T. Breuer
2005-01-10  9:05                                   ` Guy
2005-01-10  9:38                                     ` Peter T. Breuer
2005-01-10 12:31                                   ` Peter T. Breuer
2005-01-10 13:19                                     ` Peter T. Breuer
2005-01-10 18:37                                       ` Peter T. Breuer
2005-01-11 11:34                                         ` Peter T. Breuer
2005-01-08 23:09               ` Guy
2005-01-09  0:56                 ` maarten
2005-01-13  2:05                 ` Neil Brown
2005-01-13  4:55                   ` Guy
2005-01-13  9:27                   ` Peter T. Breuer
2005-01-13 15:53                     ` Guy
2005-01-13 17:16                       ` Peter T. Breuer
2005-01-13 20:40                         ` Guy
2005-01-13 23:32                           ` Peter T. Breuer
2005-01-14  2:43                             ` Guy
2005-01-08 16:49       ` maarten
2005-01-08 19:01         ` maarten
2005-01-10 16:34           ` maarten
2005-01-10 16:36             ` Gordon Henderson
2005-01-10 17:10               ` maarten
2005-01-16 16:19                 ` 4 questions. Chieftec chassis case CA-01B, resync times, selecting ide driver module loading, raid5 :2 drives on same ide channel Mitchell Laks
2005-01-16 17:53                   ` Gordon Henderson
2005-01-16 18:22                   ` Maarten
2005-01-16 19:39                   ` Guy
2005-01-16 20:55                     ` Maarten
2005-01-16 21:58                       ` Guy
2005-01-10 17:13             ` Spares and partitioning huge disks Guy
2005-01-10 17:35               ` hard disk re-locates bad block on read Guy
2005-01-11 14:34                 ` Tom Coughlan
2005-01-11 22:43                   ` Guy
2005-01-12 13:51                     ` Tom Coughlan
2005-01-10 18:24               ` Spares and partitioning huge disks maarten
2005-01-10 20:09                 ` Guy
2005-01-10 21:21                   ` maarten
2005-01-11  1:04                   ` maarten
2005-01-10 18:40               ` maarten
2005-01-10 19:41                 ` Guy
2005-01-12 11:41               ` RAID-6 Gordon Henderson
2005-01-13  2:11                 ` RAID-6 Neil Brown
2005-01-15 16:12                   ` RAID-6 Gordon Henderson
2005-01-17  8:04                     ` RAID-6 Turbo Fredriksson
2005-01-11 10:09             ` Spares and partitioning huge disks KELEMEN Peter
2005-01-09 19:33         ` Frank van Maarseveen
2005-01-09 21:26           ` maarten [this message]
2005-01-09 22:29             ` Frank van Maarseveen
2005-01-09 23:16               ` maarten
2005-01-10  8:15                 ` Frank van Maarseveen
2005-01-14 17:29                   ` Dieter Stueken
2005-01-14 17:46                     ` maarten
2005-01-14 19:14                       ` Derek Piper
2005-01-15  0:13                     ` Michael Tokarev
2005-01-15  9:34                       ` Peter T. Breuer
2005-01-15  9:54                         ` Mikael Abrahamsson
2005-01-15 10:31                           ` Brad Campbell
2005-01-15 11:10                             ` Mikael Abrahamsson
2005-01-15 10:33                           ` Peter T. Breuer
2005-01-15 11:07                             ` Mikael Abrahamsson
2005-01-09 23:20             ` Guy
2005-01-10  7:42               ` Gordon Henderson
2005-01-10  9:03                 ` Guy
2005-01-10 12:21                   ` Stats... [RE: Spares and partitioning huge disks] Gordon Henderson
2005-01-10  0:42             ` Spares and partitioning huge disks Guy
  -- strict thread matches above, loose matches on Subject: below --
2005-01-13  9:53 Bene Martin
2005-01-13 10:11 ` Peter T. Breuer

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200501092226.25910.maarten@ultratux.net \
    --to=maarten@ultratux.net \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).