From: Dan Christensen <jdc@uwo.ca>
To: linux-raid@vger.kernel.org
Subject: Re: devices get kicked from RAID about once a month
Date: Thu, 03 Jun 2010 12:47:39 -0400
Message-ID: <87eigom5as.fsf@uwo.ca>
In-Reply-To: 4C07DA4E.70501@tmr.com
Bill Davidsen <davidsen@tmr.com> writes:
> Those logs don't show any information useful to me which tells me how
> long md waited, and I'm not able to parse any of the res: information
> to gain clarity. It would be nice if someone can parse that, but I
> can't. On timeout an elapsed time output would be nice to indicate
> what the time limit is.
I agree. It would also be nice to know whether there was in fact a read
error at that time (in which case I may just replace the drives to avoid
this problem) or whether it was some other communications glitch (in
which case I may suspect the power supply, try a newer kernel, etc.).
With the information at hand, I'm not sure how to fix this, and since
it often is a month or more between occurrences, trial and error is
not likely to help.
> I sure would like to see a timeout in ms [md?] in
> the /sys for the device and a flag for the array to not kick a drive
> for timeout until some number of consecutive timeouts have
> occurred.
That could be useful. And, as Neil said, if the SATA driver could be
told to use longer timeouts, that might help. Neil, if you think that's
a good idea, maybe you could put the request in with the SATA folks?
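For reference, a minimal sketch of reading and raising a per-device
timeout from user space. It assumes the relevant limit is the SCSI-layer
command timeout that Linux exposes, in seconds, at
/sys/block/<dev>/device/timeout (libata disks appear as SCSI devices).
Whether that is the timeout behind the kicks discussed here is not
established in this thread. The function names and the sysfs_root
parameter are made up for illustration.

```python
from pathlib import Path


def timeout_path(dev, sysfs_root="/sys"):
    """Path of the SCSI command timeout attribute for a disk such as 'sda'."""
    return Path(sysfs_root) / "block" / dev / "device" / "timeout"


def read_timeout(dev, sysfs_root="/sys"):
    """Return the current command timeout in seconds."""
    return int(timeout_path(dev, sysfs_root).read_text().strip())


def set_timeout(dev, seconds, sysfs_root="/sys"):
    """Write a new command timeout in seconds (needs root on a real system)."""
    if seconds <= 0:
        raise ValueError("timeout must be a positive number of seconds")
    timeout_path(dev, sysfs_root).write_text(f"{int(seconds)}\n")
```

On a real machine, set_timeout("sda", 120) would have to be run as root
and does not persist across reboots. The sysfs_root argument exists only
so the functions can be tried against a scratch directory.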
> I would hope that a drive with multiple partitions would get the
> partitions kicked, not the whole drive at once. So one slow sector
> wouldn't take out multiple arrays.
Only the affected partition gets kicked out, not the whole drive. That
saved me yesterday: I had timeouts on two drives in RAID5 arrays, but all
the arrays stayed up because the two kicked partitions happened to be in
different arrays.
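To illustrate the per-partition behaviour, here is a rough sketch that
lists the members md has marked faulty, per array, by looking for the
(F) flag in /proc/mdstat. The SAMPLE text is invented for illustration
and is not taken from the logs in this thread, and the function name is
hypothetical.

```python
import re

# An array line looks like: "md0 : active raid5 sdc1[2] sdb1[1] sda1[0](F)"
_ARRAY_LINE = re.compile(r"^(md\S*)\s*:\s*(.*)$")
# A member looks like "sda1[0]" optionally followed by flags such as "(F)".
_MEMBER = re.compile(r"(\S+?)\[\d+\]((?:\([A-Z]\))*)")

# Invented example: two different drives each lost one partition,
# but in different arrays, so each RAID5 array is only degraded.
SAMPLE = """\
Personalities : [raid6] [raid5] [raid4]
md0 : active raid5 sdc1[2] sdb1[1] sda1[0](F)
      1953519872 blocks level 5, 64k chunk, algorithm 2 [3/2] [_UU]
md1 : active raid5 sdc2[2] sdb2[1](F) sda2[0]
      1953519872 blocks level 5, 64k chunk, algorithm 2 [3/2] [U_U]
md2 : active raid5 sdc3[2] sdb3[1] sda3[0]
      1953519872 blocks level 5, 64k chunk, algorithm 2 [3/3] [UUU]
unused devices: <none>
"""


def faulty_members(mdstat_text):
    """Map each array name to the list of its members flagged (F)."""
    result = {}
    for line in mdstat_text.splitlines():
        match = _ARRAY_LINE.match(line)
        if not match:
            continue  # status lines, "Personalities", "unused devices"
        failed = [name for name, flags in _MEMBER.findall(match.group(2))
                  if "(F)" in flags]
        if failed:
            result[match.group(1)] = failed
    return result
```

On a live system the input would come from reading /proc/mdstat, e.g.
faulty_members(open("/proc/mdstat").read()). An array with one faulty
member in RAID5 is degraded but still up; two faulty members in the
same array would take it down.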
Dan