From: PFC <lists@peufeu.com>
To: Neil Brown <neilb@suse.de>, Gordon Henderson <gordon@drogon.net>
Cc: linux-raid@vger.kernel.org
Subject: Re: [PATCH 000 of 5] md: Introduction
Date: Thu, 19 Jan 2006 08:20:59 +0100
Message-ID: <op.s3l6o9krcigqcu@apollo13>
In-Reply-To: <17358.54414.410350.594083@cse.unsw.edu.au>
While we're at it, here's a little issue I had with RAID5; not really
the fault of md, but you might want to know...
I have a 5x250GB RAID5 array for home storage (digital photos, my
losslessly ripped CDs, etc.): 1 IDE drive and 4 SATA drives.
Now, it turns out one of the SATA drives is a Maxtor 6V250F0, and these
have problems: it died, was RMA'd, then died again. Finally, it turned
out this drive series is incompatible with nvidia SATA chipsets. A third
drive seems to work with the jumper set to SATA 150.
Back to the point.
The failure mode of these drives is an IDE command timeout, and that
takes a long time! So once the drive has failed, each command sent to it
takes forever. md will eventually reject the drive, but that takes hours;
meanwhile, the computer is unusable and the data is offline...
In this case, the really tempting solution is to hit the windows key (er,
the hard reset button); but doing this makes the array dirty and
degraded, so it won't mount, and all data is seemingly lost (well,
recoverable with a bit of hacking, /* goto error; */, but that's not very
clean...).
This isn't really an md issue, but it only becomes really annoying when
using RAID, because it makes a normal process (kicking a dead drive out)
so slow it's almost non-functional. Is there a way to modify the timeout
in question?
Note that, re-reading the log below, it says "Disk failure on sdd1,
disabling device. Operation continuing on 4 devices", but the errors keep
coming, and the array is still unreachable (i.e. cat /proc/mdstat hangs,
etc.). Hmm...
Thanks for the time.
Jan 8 21:38:41 apollo13 ReiserFS: md2: checking transaction log (md2)
Jan 8 21:39:11 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat 0x21
Jan 8 21:39:11 apollo13 ata4: translated ATA stat/err 0xca/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Jan 8 21:39:11 apollo13 ata4: status=0xca { Busy }
Jan 8 21:39:11 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan 8 21:39:11 apollo13 sdd: Current: sense key=0xb
Jan 8 21:39:11 apollo13 ASC=0x47 ASCQ=0x0
Jan 8 21:39:11 apollo13 Info fld=0x3f
Jan 8 21:39:11 apollo13 end_request: I/O error, dev sdd, sector 63
Jan 8 21:39:11 apollo13 raid5: Disk failure on sdd1, disabling device. Operation continuing on 4 devices
Jan 8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:39:41 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat 0x21
Jan 8 21:39:41 apollo13 ata4: translated ATA stat/err 0xca/00 to SCSI SK/ASC/ASCQ 0xb/47/00
Jan 8 21:39:41 apollo13 ata4: status=0xca { Busy }
Jan 8 21:39:41 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan 8 21:39:41 apollo13 sdd: Current: sense key=0xb
Jan 8 21:39:41 apollo13 ASC=0x47 ASCQ=0x0
Jan 8 21:39:41 apollo13 Info fld=0x9840097
Jan 8 21:39:41 apollo13 end_request: I/O error, dev sdd, sector 159645847
Jan 8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan 8 21:40:01 apollo13 cron[17973]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
Jan 8 21:40:11 apollo13 ata4: command 0x35 timeout, stat 0xd0 host_stat 0x21
Jan 8 21:40:11 apollo13 ata4: translated ATA stat/err 0x35/00 to SCSI SK/ASC/ASCQ 0x4/00/00
Jan 8 21:40:11 apollo13 ata4: status=0x35 { DeviceFault SeekComplete CorrectedError Error }
Jan 8 21:40:11 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan 8 21:40:11 apollo13 sdd: Current: sense key=0x4
Jan 8 21:40:11 apollo13 ASC=0x0 ASCQ=0x0
Jan 8 21:40:11 apollo13 end_request: I/O error, dev sdd, sector 465232831
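As an aside, the spacing between the command timeouts in the excerpt above can be checked with a short script. This is only a sketch under stated assumptions: the function names (timeout_events, gaps_in_seconds) and the default year are made up for illustration, and it only understands the "ataN: command 0x.. timeout" lines shown in this log.

```python
import re
from datetime import datetime

# Matches syslog lines such as:
#   Jan 8 21:39:11 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat
# Only the part up to "timeout" is needed, so wrapped lines still match.
TIMEOUT_RE = re.compile(
    r"^(\w{3})\s+(\d{1,2})\s+(\d{2}:\d{2}:\d{2})\s+\S+\s+"
    r"(ata\d+): command (0x[0-9a-fA-F]+) timeout"
)


def timeout_events(lines, year=2006):
    """Return (timestamp, port, command) for every ATA command timeout line.

    Syslog timestamps carry no year, so `year` is an assumption supplied
    by the caller.
    """
    events = []
    for line in lines:
        match = TIMEOUT_RE.match(line)
        if match:
            month, day, clock, port, command = match.groups()
            stamp = datetime.strptime(
                f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S"
            )
            events.append((stamp, port, command))
    return events


def gaps_in_seconds(events):
    """Seconds elapsed between each consecutive pair of timeout events."""
    stamps = [event[0] for event in events]
    return [int((b - a).total_seconds()) for a, b in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    sample = [
        "Jan 8 21:38:41 apollo13 ReiserFS: md2: checking transaction log (md2)",
        "Jan 8 21:39:11 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat",
        "Jan 8 21:39:41 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat",
        "Jan 8 21:40:11 apollo13 ata4: command 0x35 timeout, stat 0xd0 host_stat",
    ]
    print(gaps_in_seconds(timeout_events(sample)))
```

Run on the three timeout lines in this excerpt, it reports a gap of 30 seconds between each pair, which is consistent with every command to the dead drive stalling for about half a minute before the next one is attempted.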