From: Pallai Roland <dap@mail.index.hu>
To: Lars Marowsky-Bree <lmb@suse.de>
Cc: linux-raid@vger.kernel.org
Subject: Re: [PATCH] proactive raid5 disk replacement for 2.6.11, updated
Date: Thu, 18 Aug 2005 16:13:03 +0200 [thread overview]
Message-ID: <1124374384.21362.114.camel@localhost.localdomain> (raw)
In-Reply-To: <20050818102458.GJ13344@marowsky-bree.de>
On Thu, 2005-08-18 at 12:24 +0200, Lars Marowsky-Bree wrote:
> On 2005-08-18T15:28:41, Neil Brown <neilb@cse.unsw.edu.au> wrote:
> > To handle read failures, I would like the first step to be to re-write
> > the failed block. I believe most (all?) drives will relocate the
> > block if a write cannot succeed at the normal location, so this will
> > often fix the problem.
>
> Yes. This would be highly useful.
yes, but I'm not sure that should be handled immediately, see my
previous mail
> > This possible doesn't handle the possibility of a write failing very
> > well, but I'm not sure what your approach does in that case. Could
> > you explain that?
>
> I think a failed write can't really be handled - it might be retried
> once or twice, but then the way to proceed is to kick the drive and
> rebuild the array.
I'm not sure that's a fatal error if that sector isn't readable too.
badblock tolerance comes to play there..
> > It also means that if the raid1 rebuild hits a read-error it cannot
> > cope whereas your code would just reconstruct the block from the rest
> > of the raid5.
>
> Good point. One way to fix this would be to have a callback to one level
> up "Hi, I can't read this section, can you reconstruct and give it to
> me?". (Which is a pretty ugly hack.)
I think it's simpler, just issue a generic read request to the parent,
special callback isn't needed
> However, that would also assume that the data on the disk which _can_ be
> read still can be trusted. I'm not sure I'd buy that myself, untrusted.
> But a periodic background consistency check for RAID might help convince
> users that this is indeed the case ;-)
> If you can no longer pro-actively reconstruct the disk because it has
> indeed failed, maybe treating it like a failed disk and rebuilding the
> array in the "classic" fashion isn't the worst idea, though.
yes, the chance for that will have been forever of course
--
dap
next prev parent reply other threads:[~2005-08-18 14:13 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-08-17 23:52 [PATCH] proactive raid5 disk replacement for 2.6.11, updated Pallai Roland
2005-08-18 1:55 ` Tyler
2005-08-18 5:28 ` Neil Brown
2005-08-18 10:24 ` Lars Marowsky-Bree
2005-08-18 14:13 ` Pallai Roland [this message]
2005-08-18 10:56 ` Michael Tokarev
2005-08-18 13:46 ` Pallai Roland
2005-08-19 14:58 ` Pallai Roland
2005-08-20 15:35 ` Pallai Roland
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1124374384.21362.114.camel@localhost.localdomain \
--to=dap@mail.index.hu \
--cc=linux-raid@vger.kernel.org \
--cc=lmb@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).