* RAID5 software general crash
@ 2003-06-04 7:58 Compte centre de calcul UCP
2003-06-04 8:15 ` Mads Peter Bach
0 siblings, 1 reply; 2+ messages in thread
From: Compte centre de calcul UCP @ 2003-06-04 7:58 UTC (permalink / raw)
To: linux-raid
Hi,
I'm newbie in RAID5 Software.
I need some help (direct answer on the list or links) to understand and
repair my NFS server turning with RAID5 software.
It turn under
- an old Distro : Linux Mandrake release 7.1 (helium)
- an old kernel : Kernel 2.2.19-6.3mdk on an i686
- a raid version : mkraid version 0.90.0
We have 5 SCSI disk :
- 4 for the RAID5
- 1 for the spare
Sunday morning there was a crash. I don't now what happened exactly
until now, but the raid stopped.
On the screen there was log saying :
RAID5 : md0: Unrecoverable I/O erreor for Block...
I logged in and type
# cat /proc/mdstat
md0 : active raid5 sde1[5] sdd1[4](F) sdc1[3](F) sdb1[2](F) sda1[1](F)
(or something like that... the only thing I'm sure it's that there was
Flag (F) on the fourth SCSI disk !!!)
I shutdown the server (and all the client).
When I restart the server. The dmesg said :
---- begin of dmesg------
<snip>
autorun ...
considering sde1 ...
adding sde1 ...
adding sdd1 ...
adding sdc1 ...
adding sdb1 ...
adding sda1 ...
created md0
bind<sda1,1>
bind<sdb1,2>
bind<sdc1,3>
bind<sdd1,4>
bind<sde1,5>
running: <sde1><sdd1><sdc1><sdb1><sda1>
now!
sde1's event counter: 0000003f
sdd1's event counter: 0000003c
sdc1's event counter: 0000003c
sdb1's event counter: 0000003c
sda1's event counter: 0000003c
md: superblock update time inconsistency -- using the most recent one
freshest: sde1
md: kicking non-fresh sdd1 from array!
unbind<sdd1,4>
export_rdev(sdd1)
md: kicking non-fresh sdc1 from array!
unbind<sdc1,3>
export_rdev(sdc1)
md: kicking non-fresh sdb1 from array!
unbind<sdb1,2>
export_rdev(sdb1)
md: kicking non-fresh sda1 from array!
unbind<sda1,1>
export_rdev(sda1)
md0: removing former faulty sda1!
md0: removing former faulty sdb1!
md0: removing former faulty sdc1!
md0: removing former faulty sdd1!
md0: kicking faulty sde1!
unbind<sde1,0>
export_rdev(sde1)
md: md0: raid array is not clean -- starting background reconstruction
raid5 personality registered
md0: max total readahead window set to 1536k
md0: 3 data-disks, max readahead per data-disk: 512k
raid5: not enough operational devices for md0 (4/4 failed)
RAID5 conf printout:
--- rd:4 wd:0 fd:4
disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
disk 1, s:0, o:0, n:1 rd:1 us:1 dev:[dev 00:00]
disk 2, s:0, o:0, n:2 rd:2 us:1 dev:[dev 00:00]
disk 3, s:0, o:0, n:3 rd:3 us:1 dev:[dev 00:00]
disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
raid5: failed to run raid set md0
pers->run() failed ...
do_md_run() returned -22
... autorun DONE.
-----end of dmesg--------
My /etc/raidtab is :
# more /etc/raidtab
raiddev /dev/md0
raid-level 5
nr-raid-disks 4
nr-spare-disks 1
persistent-superblock 1
parity-algorithm left-symmetric
chunk-size 128
device /dev/sda1
raid-disk 0
device /dev/sdb1
raid-disk 1
device /dev/sdc1
raid-disk 2
device /dev/sdd1
raid-disk 3
device /dev/sde1
spare-disk 0
I checked all my physical devices with the scsi diagnostic tool from
Adaptec => no error !
I'm in trouble because All my device was in F flag (failure ???) in the
same time.
My adapter look good.
I already check on the web some HOW-TO and other cookbook. They talk
about a
mkraid command option --only-superblock or --force-resync but I haven't
this option.
I installed mdadm
# mdadm -V
mdadm - v1.2.0 - 13 Mar 2003
But I can't (or I don't understand how to) use safely for data.
Can someone help me?
Thank in advance
Nico
--
L'equipe du Service Informatique Recherche
Universite de Cergy-Pontoise
http://www.cdc.u-cergy.fr
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: RAID5 software general crash
2003-06-04 7:58 RAID5 software general crash Compte centre de calcul UCP
@ 2003-06-04 8:15 ` Mads Peter Bach
0 siblings, 0 replies; 2+ messages in thread
From: Mads Peter Bach @ 2003-06-04 8:15 UTC (permalink / raw)
To: linux-raid
Compte centre de calcul UCP wrote:
> Hi,
>
> I'm newbie in RAID5 Software.
> I need some help (direct answer on the list or links) to understand and
> repair my NFS server turning with RAID5 software.
[snip]
> Sunday morning there was a crash. I don't now what happened exactly
> until now, but the raid stopped.
[snip]
> sde1's event counter: 0000003f
> sdd1's event counter: 0000003c
> sdc1's event counter: 0000003c
> sdb1's event counter: 0000003c
> sda1's event counter: 0000003c
> md: superblock update time inconsistency -- using the most recent one
> freshest: sde1
Your superblocks are not in sync. See item 6.1 "Recovery from a multiple
disk failure" in the Linux Software RAID howto.
If you set /dev/sde1 as failed-disk instead of raid-disk in you
/etc/raidtab, and do a mkraid on the array, hotadd /dev/sde1 to the array
and set /dev/sde1 back to raid-disk, you should be back in business.
--
Mads Peter Bach
Systemadministrator, Det Humanistiske Fakultet, Aalborg Universitet
Kroghstræde 3 - 5.111, DK-9220 Aalborg Øst - (+45) 96358062
# whois MPB1-DK@whois.dk-hostmaster.dk
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2003-06-04 8:15 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-06-04 7:58 RAID5 software general crash Compte centre de calcul UCP
2003-06-04 8:15 ` Mads Peter Bach
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).