linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: PFC <lists@peufeu.com>
To: Neil Brown <neilb@suse.de>, Gordon Henderson <gordon@drogon.net>
Cc: linux-raid@vger.kernel.org
Subject: Re: [PATCH 000 of 5] md: Introduction
Date: Thu, 19 Jan 2006 08:20:59 +0100	[thread overview]
Message-ID: <op.s3l6o9krcigqcu@apollo13> (raw)
In-Reply-To: <17358.54414.410350.594083@cse.unsw.edu.au>


	While we're at it, here's a little issue I had with RAID5 ; not really  
the fault of md, but you might want to know...

	I have a 5x250GB RAID5 array for home storage (digital photo, my lossless  
ripped cds, etc). 1 IDE Drive ave 4 SATA Drives.
	Now, turns out one of the SATA drives is a Maxtor 6V250F0, and these have  
problems ; it died, then was RMA'd, then died again. Finally, it turned  
out this drive series is incompatible with nvidia sata chipsets. A third  
drive seems to work, setting the jumper to SATA 150.
	Back to the point.

	Failure mode of these drives is an IDE command timeout. This takes a long  
time ! So, when the drive has failed, each command to it takes forever. md  
will eventually reject said drive, but it takes hours ; and meanwhile, the  
computer is unusable and data is offline...

	In this case, the really tempting solution is to hit the windows key (er,  
the hard reset button) ; but doing this, makes the array dirty and  
degraded, and it won't mount, and all data is seemingly lost. (well,  
recoverable with a bit of hacking /* goto error; */, but that's not very  
clean...)

	This isn't really a md issue, but it's really annoying only when using  
RAID, because it makes a normal process (kicking a dead drive out) so slow  
it's almost non-functional. Is there a way to modify the timeout in  
question ?

	Note that, re-reading the log below, it writes "Disk failure on sdd1,  
disabling device. Operation continuing on 4 devices", but errors continue  
to come, and the array is still unreachable (ie. cat /proc/mdstat hangs,  
etc). Hmm...

	Thanks for the time.


Jan  8 21:38:41 apollo13 ReiserFS: md2: checking transaction log (md2)
Jan  8 21:39:11 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat  
0x21
Jan  8 21:39:11 apollo13 ata4: translated ATA stat/err 0xca/00 to SCSI  
SK/ASC/ASCQ 0xb/47/00
Jan  8 21:39:11 apollo13 ata4: status=0xca { Busy }
Jan  8 21:39:11 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan  8 21:39:11 apollo13 sdd: Current: sense key=0xb
Jan  8 21:39:11 apollo13 ASC=0x47 ASCQ=0x0
Jan  8 21:39:11 apollo13 Info fld=0x3f
Jan  8 21:39:11 apollo13 end_request: I/O error, dev sdd, sector 63
Jan  8 21:39:11 apollo13 raid5: Disk failure on sdd1, disabling device.  
Operation continuing on 4 devices
Jan  8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:39:11 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:39:41 apollo13 ata4: command 0xca timeout, stat 0xd0 host_stat  
0x21
Jan  8 21:39:41 apollo13 ata4: translated ATA stat/err 0xca/00 to SCSI  
SK/ASC/ASCQ 0xb/47/00
Jan  8 21:39:41 apollo13 ata4: status=0xca { Busy }
Jan  8 21:39:41 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan  8 21:39:41 apollo13 sdd: Current: sense key=0xb
Jan  8 21:39:41 apollo13 ASC=0x47 ASCQ=0x0
Jan  8 21:39:41 apollo13 Info fld=0x9840097
Jan  8 21:39:41 apollo13 end_request: I/O error, dev sdd, sector 159645847
Jan  8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:39:41 apollo13 ATA: abnormal status 0xD0 on port 0x977
Jan  8 21:40:01 apollo13 cron[17973]: (root) CMD (test -x  
/usr/sbin/run-crons && /usr/sbin/run-crons )
Jan  8 21:40:11 apollo13 ata4: command 0x35 timeout, stat 0xd0 host_stat  
0x21
Jan  8 21:40:11 apollo13 ata4: translated ATA stat/err 0x35/00 to SCSI  
SK/ASC/ASCQ 0x4/00/00
Jan  8 21:40:11 apollo13 ata4: status=0x35 { DeviceFault SeekComplete  
CorrectedError Error }
Jan  8 21:40:11 apollo13 sd 3:0:0:0: SCSI error: return code = 0x8000002
Jan  8 21:40:11 apollo13 sdd: Current: sense key=0x4
Jan  8 21:40:11 apollo13 ASC=0x0 ASCQ=0x0
Jan  8 21:40:11 apollo13 end_request: I/O error, dev sdd, sector 465232831

  reply	other threads:[~2006-01-19  7:20 UTC|newest]

Thread overview: 83+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-01-17  6:56 [PATCH 000 of 5] md: Introduction NeilBrown
2006-01-17  6:56 ` [PATCH 001 of 5] md: Split disks array out of raid5 conf structure so it is easier to grow NeilBrown
2006-01-17 14:37   ` John Stoffel
2006-01-19  0:26     ` Neil Brown
2006-01-21  3:37       ` John Stoffel
2006-01-22 22:57         ` Neil Brown
2006-01-17  6:56 ` [PATCH 002 of 5] md: Allow stripes to be expanded in preparation for expanding an array NeilBrown
2006-01-17  6:56 ` [PATCH 003 of 5] md: Infrastructure to allow normal IO to continue while array is expanding NeilBrown
2006-01-17  6:56 ` [PATCH 004 of 5] md: Core of raid5 resize process NeilBrown
2006-01-17  6:56 ` [PATCH 005 of 5] md: Final stages of raid5 expand code NeilBrown
2006-01-17  9:55   ` Sander
2006-01-19  0:32     ` Neil Brown
2006-01-17  8:17 ` [PATCH 000 of 5] md: Introduction Michael Tokarev
     [not found]   ` <fd8d0180601170121s1e6a55b7o@mail.gmail.com>
2006-01-17  9:38     ` Francois Barre
2006-01-19  0:35       ` Neil Brown
2006-01-17  9:50   ` Sander
2006-01-17 11:26     ` Michael Tokarev
2006-01-17 11:37       ` Francois Barre
2006-01-17 14:03       ` Kyle Moffett
2006-01-19  0:28         ` Neil Brown
2006-01-17 16:08       ` Ross Vandegrift
2006-01-17 18:12         ` Michael Tokarev
2006-01-18  8:14           ` Sander
2006-01-18  8:37             ` Brad Campbell
2006-01-18  9:03             ` Alan Cox
2006-01-18 12:46             ` John Hendrikx
2006-01-18 12:51               ` Gordon Henderson
2006-01-18 23:51                 ` Neil Brown
2006-01-19  7:20                   ` PFC [this message]
2006-01-19  8:01                     ` dean gaudet
2006-01-18 23:54               ` Neil Brown
2006-01-19  0:22           ` Neil Brown
2006-01-19  9:01             ` Jakob Oestergaard
2006-01-17 22:38       ` Phillip Susi
2006-01-17 22:57         ` Neil Brown
2006-01-17 14:10   ` Steinar H. Gunderson
2006-01-17 15:07 ` Mr. James W. Laferriere
2006-01-19  0:23   ` Neil Brown
2006-01-22  4:42 ` Adam Kropelin
2006-01-22 22:52   ` Neil Brown
2006-01-23 23:02     ` Adam Kropelin
2006-01-23  1:08 ` John Hendrikx
2006-01-23  1:25   ` Neil Brown
2006-01-23  1:54     ` Kyle Moffett
2006-01-23  2:09     ` Mr. James W. Laferriere
2006-01-23  2:33       ` Neil Brown
  -- strict thread matches above, loose matches on Subject: below --
2006-01-17 21:38 Lincoln Dale (ltd)
2006-01-18 13:27 ` Jan Engelhardt
2006-01-18 23:19   ` Neil Brown
2006-01-19 15:33     ` Mark Hahn
2006-01-19 20:12     ` Jan Engelhardt
2006-01-19 21:22       ` Lars Marowsky-Bree
2006-01-19 22:17     ` Phillip Susi
2006-01-19 22:32       ` Neil Brown
2006-01-19 23:26         ` Phillip Susi
2006-01-19 23:43           ` Neil Brown
2006-01-20  2:17             ` Phillip Susi
2006-01-20 10:53               ` Lars Marowsky-Bree
2006-01-20 12:06                 ` Jens Axboe
2006-01-20 18:38                 ` Heinz Mauelshagen
2006-01-20 22:09                   ` Lars Marowsky-Bree
2006-01-21  0:06                     ` Heinz Mauelshagen
2006-01-20 18:41               ` Heinz Mauelshagen
2006-01-20 17:29             ` Ross Vandegrift
2006-01-20 18:36             ` Heinz Mauelshagen
2006-01-20 22:57               ` Lars Marowsky-Bree
2006-01-21  0:01                 ` Heinz Mauelshagen
2006-01-21  0:03                   ` Lars Marowsky-Bree
2006-01-21  0:08                     ` Heinz Mauelshagen
2006-01-21  0:13                       ` Lars Marowsky-Bree
2006-01-23  9:44                         ` Heinz Mauelshagen
2006-01-23 10:26                           ` Lars Marowsky-Bree
2006-01-23 10:38                             ` Heinz Mauelshagen
2006-01-23 10:45                               ` Lars Marowsky-Bree
2006-01-23 11:00                                 ` Heinz Mauelshagen
2006-01-23 12:54                           ` Ville Herva
2006-01-23 13:00                             ` Steinar H. Gunderson
2006-01-23 13:54                             ` Heinz Mauelshagen
2006-01-23 17:33                               ` Ville Herva
2006-01-24  2:02                             ` Phillip Susi
2006-01-20  7:51         ` Reuben Farrelly
2006-01-20  3:43           ` Andre' Breiler
2006-01-21  0:42             ` David Greaves

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=op.s3l6o9krcigqcu@apollo13 \
    --to=lists@peufeu.com \
    --cc=gordon@drogon.net \
    --cc=linux-raid@vger.kernel.org \
    --cc=neilb@suse.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).