raid partition crash

* raid partition crash
@ 2003-08-23  2:30 Mikael Chambon
From: Mikael Chambon @ 2003-08-23  2:30 UTC
  To: linux-raid

Dear RAID users,

I have a root RAID 1 fileserver using two 120 GB IDE disks and a standard
Red Hat 2.4.20 kernel.
Two days ago I discovered that one of my RAID partitions had failed:

====== From my logs ============
Aug 15 23:00:05 mekare kernel: hdc: dma_intr: status=0x51 { DriveReady SeekComplete Error }
Aug 15 23:00:05 mekare kernel: hdc: dma_intr: error=0x40 { UncorrectableError }, LBAsect=73785359, sector=64487504
Aug 15 23:00:05 mekare kernel: end_request: I/O error, dev 16:06 (hdc), sector 64487504
Aug 15 23:00:05 mekare kernel: raid1: Disk failure on hdc6, disabling device.
Aug 15 23:00:05 mekare kernel: ^IOperation continuing on 1 devices
Aug 15 23:00:05 mekare kernel: raid1: hdc6: rescheduling block 64487504
Aug 15 23:00:05 mekare kernel: md: updating md0 RAID superblock on device
Aug 15 23:00:05 mekare kernel: md: hda6 [events: 0000001d]<6>(write) hda6's sb offset: 115411840
Aug 15 23:00:05 mekare kernel: md: recovery thread got woken up ...
Aug 15 23:00:05 mekare kernel: md0: no spare disk to reconstruct array! -- continuing in degraded mode
Aug 15 23:00:05 mekare kernel: md: recovery thread finished ...
Aug 15 23:00:05 mekare kernel: md: (skipping faulty hdc6 )
Aug 15 23:00:05 mekare kernel: raid1: hda6: redirecting sector 64487504 to another mirror
============================
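
As a reading aid for the log above: 2.4 kernels print the device in
"end_request" as hexadecimal major:minor, so "dev 16:06" can be decoded to
confirm which partition reported the error. The sketch below is only an
illustration; the function name decode_dev is hypothetical and not part of
any tool mentioned in this thread.

```shell
#!/bin/sh
# Decode a hexadecimal "major:minor" pair as printed by 2.4 kernels,
# e.g. "16:06" from the end_request line.
# decode_dev is a hypothetical helper name used for illustration only.
decode_dev() {
    major_hex=${1%%:*}    # text before the colon
    minor_hex=${1##*:}    # text after the colon
    echo "major $((0x$major_hex)) minor $((0x$minor_hex))"
}
```

Calling "decode_dev 16:06" gives major 22 (the second IDE channel, where
hdc is the master) and minor 6 (the sixth partition), which matches the
"Disk failure on hdc6" line.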

hda6 and hdc6 are my fileserver partitions, mirrored with RAID 1 as md0.
The other partitions on hdc seem to work fine.

====== From /etc/raidtab ==========
[.....skip....]

raiddev             /dev/md0
raid-level                  1
nr-raid-disks               2
chunk-size                  64k
persistent-superblock       1
nr-spare-disks              0
    device          /dev/hda6
    raid-disk     0
    device          /dev/hdc6
    raid-disk     1
[....skip.....]
=============================

========From /proc/mdstat=========
Personalities : [raid1]
read_ahead 1024 sectors
md0 : active raid1 hda6[0] hdc6[1](F)
      115411840 blocks [2/1] [U_]
md2 : active raid1 hda5[0] hdc5[1]
      532608 blocks [2/2] [UU]
md1 : active raid1 hda3[0] hdc3[1]
      2047680 blocks [2/2] [UU]
md3 : active raid1 hda2[0] hdc2[1]
      2047680 blocks [2/2] [UU]
md4 : active raid1 hda1[0] hdc1[1]
      20544 blocks [2/2] [UU]
unused devices: <none>
=============================
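
The "(F)" flag and the "[U_]" status in the output above are what mark a
failed member. A minimal sketch for listing such members from mdstat-style
text follows; it assumes the 2.4-era layout shown here, and the function
name failed_members is hypothetical.

```shell
#!/bin/sh
# Print "<array> <member>" for every member flagged (F) in
# /proc/mdstat-style input read from stdin.
# failed_members is a hypothetical helper name; the parsing assumes
# lines of the form "md0 : active raid1 hda6[0] hdc6[1](F)".
failed_members() {
    awk '
        $2 == ":" {
            for (i = 3; i <= NF; i++) {
                if ($i ~ /\(F\)$/) {
                    dev = $i
                    sub(/\[.*/, "", dev)   # strip "[1](F)" from "hdc6[1](F)"
                    print $1, dev
                }
            }
        }
    '
}
```

On a live system this would be run as "failed_members < /proc/mdstat".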

As the other hdc partitions seem to work fine, I assume there is no hardware
problem with hdc.
So I tried to reinsert hdc6 into md0 with: raidhotadd -a /dev/md0 /dev/hdc6
but I get the following error:

/dev/md0: can not hot-add disk: disk busy!

What do you think of this problem? Is this a hardware problem?
How can I reinsert hdc6 without stopping the RAID array? (The fileserver is
currently in production.)

Thanks a lot for your answer and sorry for this long email.

--
Mikael Chambon

* Re: raid partition crash
@ 2003-08-25  2:08 ` Mads Peter Bach
From: Mads Peter Bach @ 2003-08-25  2:08 UTC
  To: linux-raid

Mikael Chambon wrote:

> ========From /proc/mdstat=========
> Personalities : [raid1]
> read_ahead 1024 sectors
> md0 : active raid1 hda6[0] hdc6[1](F)
                             ^^---- hdc6 is still in the array; it is just
marked as failed.

>       115411840 blocks [2/1] [U_]
> md2 : active raid1 hda5[0] hdc5[1]
>       532608 blocks [2/2] [UU]
> md1 : active raid1 hda3[0] hdc3[1]
>       2047680 blocks [2/2] [UU]
> md3 : active raid1 hda2[0] hdc2[1]
>       2047680 blocks [2/2] [UU]
> md4 : active raid1 hda1[0] hdc1[1]
>       20544 blocks [2/2] [UU]
> unused devices: <none>
> =============================
> 
> 
> 
> As the other hdc partitions seem to work fine, I assume there is no hardware
> problem with hdc.

You shouldn't assume that. You should test this partition (read from and
write to it with dd, and see whether the error occurs again).
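
The read half of such a test could look like the sketch below. It is a
non-destructive illustration under assumptions: the function name read_test
and the 64k block size are my own choices, not something given in this
thread. The write half is left out on purpose, because writing to the
partition overwrites whatever is stored on it.

```shell
#!/bin/sh
# Read the whole device (or file) once and discard the data.
# dd exits non-zero if a read fails; the kernel log should then show
# the same dma_intr / end_request errors as before.
# read_test is a hypothetical helper name used for illustration only.
read_test() {
    dd if="$1" of=/dev/null bs=64k
}
```

For the partition in question this would be "read_test /dev/hdc6", run as
root, followed by a look at the kernel log for new errors on hdc.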

> So I tried to reinsert hdc6 into md0 with: raidhotadd -a /dev/md0 /dev/hdc6
> but I get the following error:
> 
> /dev/md0: can not hot-add disk: disk busy!

You'll need to remove the disk from the array before you can add it back.
At the moment it is only marked as failed.
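
With the raidtools package that provides raidhotadd, the remove-then-add
sequence could be sketched as below. This is an illustration, not a tested
procedure for this machine: the wrapper readd_member and the RUN variable
are hypothetical, and the script only prints the commands unless RUN is set
to an empty value.

```shell
#!/bin/sh
# Dry run by default: the commands are printed, not executed.
# Run with RUN= (empty) as root to execute them for real.
RUN=${RUN-echo}

# readd_member is a hypothetical wrapper name used for illustration only.
readd_member() {
    md=$1      # array, e.g. /dev/md0
    part=$2    # failed member, e.g. /dev/hdc6
    # Take the failed member out of the array first ...
    $RUN raidhotremove "$md" "$part" || return 1
    # ... then add it back, which starts a resync from the good mirror.
    $RUN raidhotadd "$md" "$part" || return 1
}
```

"readd_member /dev/md0 /dev/hdc6" prints the two commands in order. On
systems managed with mdadm, the corresponding steps would presumably be
"mdadm /dev/md0 --remove /dev/hdc6" and "mdadm /dev/md0 --add /dev/hdc6".
The progress of the resync can be followed in /proc/mdstat.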

-- 
Mads Peter Bach
Systemadministrator,  Det Humanistiske Fakultet, Aalborg Universitet
Kroghstræde 3 - 5.111, DK-9220 Aalborg Øst - (+45) 96358062
# whois MPB1-DK@whois.dk-hostmaster.dk
