public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Software raid not kicking devices out of the array
@ 2006-03-09 19:01 Anton Titov
  0 siblings, 0 replies; only message in thread
From: Anton Titov @ 2006-03-09 19:01 UTC (permalink / raw)
  To: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1016 bytes --]

Hello,

I have a server with 5 serial ata disks, 4 of them connected into 2
software raid1 devices. Today this server stopped responding (no ping,
nothing on the screen, even numlock not working) and after inspecting
logs I found 5 records like:

Mar  9 19:30:00 shaman ata5: status=0x51 { DriveReady SeekComplete
Error }
Mar  9 19:30:00 shaman ata5: error=0x0c { DriveStatusError }

(not consequent) before the freeze. First one was at 19:03 - about half
an hour before the freeze. I'm pretty sure, that the reason for server
stopping responding is hard drive failure.

So the question is, isn't raid supposed to kick the device out of the
array in case of io error? Surely I can write a script that monitors the
logs and kicks drives out, but this does not sound like a good solution.

The drive was still in the array after the reboot and after the reboot
it continued to issue such errors until I removed the drive from array
with mdadm -f.

I'm attaching dmesg of the machine after reboot.

Anton Titov
Host.bg

[-- Attachment #2: dmesg.shaman.gz --]
[-- Type: application/x-gzip, Size: 4827 bytes --]

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2006-03-09 19:01 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-03-09 19:01 Software raid not kicking devices out of the array Anton Titov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox