linux-raid.vger.kernel.org archive mirror
* MD bug or me being stupid?
@ 2005-04-22 10:45 Molle Bestefich
  2005-04-22 11:25 ` Molle Bestefich
  2005-05-12  8:55 ` Molle Bestefich
  0 siblings, 2 replies; 14+ messages in thread
From: Molle Bestefich @ 2005-04-22 10:45 UTC (permalink / raw)
  To: linux-raid

Just upgraded a MD RAID 5 box to 2.6.11 from 2.4.something.

Found out one disk was failing completely, got a replacement from Maxtor.  Neat.
Replaced disk, rebooted..
Added the new disk to the array with 'raidhotadd'.
MD started syncing.
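
(For the record, that was the old raidtools syntax, roughly

raidhotadd /dev/md1 /dev/sda1     # device name assumed from the list further down

with "mdadm /dev/md1 --add /dev/sda1" being the mdadm equivalent.)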

A couple of minutes into the process, it started *seriously* spamming
the console with messages:

==========================
Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
Apr 22 01:47:00 linux kernel: md: minimum _guaranteed_ reconstruction
speed: 1000 KB/sec/disc.
Apr 22 01:47:00 linux kernel: md: using maximum available idle IO bandwith (but
not more than 200000 KB/sec) for reconstruction.
Apr 22 01:47:00 linux kernel: md: using 128k window, over a total of
199141632 blocks.
Apr 22 01:47:00 linux kernel: md: md1: sync done.
Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
Apr 22 01:47:01 linux kernel: md: minimum _guaranteed_ reconstruction
speed: 1000 KB/sec/disc.
Apr 22 01:47:01 linux kernel: md: using maximum available idle IO bandwith (but
not more than 200000 KB/sec) for reconstruction.
Apr 22 01:47:01 linux kernel: md: using 128k window, over a total of
199141632 blocks.
Apr 22 01:47:01 linux kernel: md: md1: sync done.
==========================

I thought it had probably gone haywire and decided to start trashing my
data, so I pulled the plug and rebooted.  When examining the log
afterwards, I can see that the above messages repeat themselves.
cat /var/log/messages | grep md | grep 'Apr 22 01:47:01' | grep 'sync done'
tells me that the messages were repeated 12 times per second.  The
/var/log/messages file grew to 600kB before I pulled the plug.
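
(A quick way to get the per-second rate straight from the log, assuming
the standard syslog timestamp layout, is something like

grep 'md1: sync done' /var/log/messages | awk '{print $3}' | uniq -c

which prints how many "sync done" lines were logged in each second.)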

Noticed something strange during the next boot: Linux failed to recognize
one of the disks.  Booted into Maxtor PowerMax and it said various
weird things (seemingly different each time) about the disk too.  So I
decided to swap the ATA cable on that disk, which worked wonders - both
PowerMax and Linux talk to the disk fine now.

Now, when I boot the machine, the 6 disks in the array have these
event counters, according to mdadm:

sda1:  0.19704  (completely new, "blank" disk)
sdb1:  0.19704
sdc1:  0.144      (but why?)
sdd1:  0.19704
sde1:  0.19704  (this disk had the bad cable)
sdf1:  0.19704
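
For reference, those event counters come from the per-device superblocks
and can be read with something along these lines (exact output format
depends on the mdadm version):

for d in /dev/sd[abcdef]1; do
    echo -n "$d: "
    mdadm --examine $d | grep Events
done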

My questions now are (and yes, I know I had faulty hardware, but
that's also precisely why I use MD in the first place):

1. What's with the infinitely repeated md1: sync done messages?

2. Won't these event counters cause MD to think that the completely
blank disk /dev/sda1 has valid data, and that the disk which actually
has valid data (/dev/sdc1) does not, the next time it assembles and syncs?

3. How do I proceed from here, if I want to rescue my data?


Best regards, many thanks for MD and all that :-).

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: MD bug or me being stupid?
  2005-04-22 10:45 MD bug or me being stupid? Molle Bestefich
@ 2005-04-22 11:25 ` Molle Bestefich
  2005-05-12  8:55 ` Molle Bestefich
  1 sibling, 0 replies; 14+ messages in thread
From: Molle Bestefich @ 2005-04-22 11:25 UTC (permalink / raw)
  To: linux-raid

[-- Attachment #1: Type: text/plain, Size: 385 bytes --]

Hmm, I think the information in /var/log/messages is actually
interesting for MD debugging.

Seems there was a bad sector somewhere in the middle of all this,
which might have triggered something?

Attached (gzipped - sorry for the inconvenience, but it's 5 kB vs. 250 kB!)

I've cut out a lot of irrelevant cruft (kernel messages) and added
comments about what I did when.

[-- Attachment #2: linux-22apr-messages.gz --]
[-- Type: application/x-gzip, Size: 5713 bytes --]

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: MD bug or me being stupid?
  2005-04-22 10:45 MD bug or me being stupid? Molle Bestefich
  2005-04-22 11:25 ` Molle Bestefich
@ 2005-05-12  8:55 ` Molle Bestefich
  2005-05-13  2:55   ` Neil Brown
  1 sibling, 1 reply; 14+ messages in thread
From: Molle Bestefich @ 2005-05-12  8:55 UTC (permalink / raw)
  To: linux-raid; +Cc: neilb

On 4/22/05, Molle Bestefich wrote:
> Just upgraded a MD RAID 5 box to 2.6.11 from 2.4.something.
> 
> Found out one disk was failing completely, got a replacement from Maxtor.  Neat.
> Replaced disk, rebooted..
> Added the new disk to the array with 'raidhotadd'.
> MD started syncing.
> 
> A couple of minutes into the process, it started *seriously* spamming
> the console with messages:
> 
> ==========================
> Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
> Apr 22 01:47:00 linux kernel: md: minimum _guaranteed_ reconstruction
> speed: 1000 KB/sec/disc.
> Apr 22 01:47:00 linux kernel: md: using maximum available idle IO bandwith (but
> not more than 200000 KB/sec) for reconstruction.
> Apr 22 01:47:00 linux kernel: md: using 128k window, over a total of
> 199141632 blocks.
> Apr 22 01:47:00 linux kernel: md: md1: sync done.
> Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
> Apr 22 01:47:01 linux kernel: md: minimum _guaranteed_ reconstruction
> speed: 1000 KB/sec/disc.
> Apr 22 01:47:01 linux kernel: md: using maximum available idle IO bandwith (but
> not more than 200000 KB/sec) for reconstruction.
> Apr 22 01:47:01 linux kernel: md: using 128k window, over a total of
> 199141632 blocks.
> Apr 22 01:47:01 linux kernel: md: md1: sync done.
> ==========================

[snip]

> afterwards, I can see that the above messages repeat themselves.
> cat /var/log/messages | grep md | grep 'Apr 22 01:47:01' | grep 'sync done'
> tells me that the messages were repeated 12 times per second.  The

Ping!...
Neil, just wondering, any comments regarding this particular endless loop in MD?
(Anything I can test or some such?)

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: MD bug or me being stupid?
  2005-05-12  8:55 ` Molle Bestefich
@ 2005-05-13  2:55   ` Neil Brown
  2005-05-15 18:10     ` Molle Bestefich
  0 siblings, 1 reply; 14+ messages in thread
From: Neil Brown @ 2005-05-13  2:55 UTC (permalink / raw)
  To: Molle Bestefich; +Cc: linux-raid

On Thursday May 12, molle.bestefich@gmail.com wrote:
> On 4/22/05, Molle Bestefich wrote:
> > Just upgraded a MD RAID 5 box to 2.6.11 from 2.4.something.
> > 
> > Found out one disk was failing completely, got a replacement from Maxtor.  Neat.
> > Replaced disk, rebooted..
> > Added the new disk to the array with 'raidhotadd'.
> > MD started syncing.
> > 
> > A couple of minutes into the process, it started *seriously* spamming
> > the console with messages:
> > 
> > ==========================
> > Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
> > Apr 22 01:47:00 linux kernel: md: minimum _guaranteed_ reconstruction
> > speed: 1000 KB/sec/disc.
> > Apr 22 01:47:00 linux kernel: md: using maximum available idle IO bandwith (but
> > not more than 200000 KB/sec) for reconstruction.
> > Apr 22 01:47:00 linux kernel: md: using 128k window, over a total of
> > 199141632 blocks.
> > Apr 22 01:47:00 linux kernel: md: md1: sync done.
> > Apr 22 01:47:00 linux kernel: ..<6>md: syncing RAID array md1
> > Apr 22 01:47:01 linux kernel: md: minimum _guaranteed_ reconstruction
> > speed: 1000 KB/sec/disc.
> > Apr 22 01:47:01 linux kernel: md: using maximum available idle IO bandwith (but
> > not more than 200000 KB/sec) for reconstruction.
> > Apr 22 01:47:01 linux kernel: md: using 128k window, over a total of
> > 199141632 blocks.
> > Apr 22 01:47:01 linux kernel: md: md1: sync done.
> > ==========================
> 
> [snip]
> 
> > afterwards, I can see that the above messages repeat themselves.
> > cat /var/log/messages | grep md | grep 'Apr 22 01:47:01' | grep 'sync done'
> > tells me that the messages were repeated 12 times per second.  The
> 
> Ping!...
> Neil, just wondering, any comments regarding this particular endless loop in MD?
> (Anything I can test or some such?)

Thanks for the ping, things sometimes get lost in the noise....

This sounds a bit like the problem that is addressed by 
  md-make-raid5-and-raid6-robust-against-failure-during-recovery.patch 
in the current -mm patches (look in the brokenout directory).

This would only happen if you have multiple failed devices.  So maybe
while the rebuild was happening, another device failed (which seems to
happen more and more as device sizes are increasing and reliability is
going the other way).

Could this (another drive failure) be the case?
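
(One quick check, if the logs survived: look for a second raid5 "Disk
failure" event during the rebuild, e.g.

grep -i 'raid.*disk failure' /var/log/messages

and compare the superblock event counters of each member with
mdadm --examine; a member whose count lags well behind the others was
probably kicked out along the way.)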

NeilBrown

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: MD bug or me being stupid?
  2005-05-13  2:55   ` Neil Brown
@ 2005-05-15 18:10     ` Molle Bestefich
  2005-05-15 18:22       ` Strange behaviour on "toy array" Patrik Jonsson
  0 siblings, 1 reply; 14+ messages in thread
From: Molle Bestefich @ 2005-05-15 18:10 UTC (permalink / raw)
  To: Neil Brown; +Cc: linux-raid

Neil Brown wrote:
> Molle Bestefich wrote:
> > Ping!...
> > Neil, just wondering, any comments regarding this particular endless loop in MD?
> > (Anything I can test or some such?)
> 
> Thanks for the ping, things sometimes get lost in the noise....

Thanks for taking a look at it!

> This sounds a bit like the problem that is addressed by
>   md-make-raid5-and-raid6-robust-against-failure-during-recovery.patch
> in the current -mm patches (look in the brokenout directory).
> 
> This would only happen if you have multiple failed devices.  So maybe
> while the rebuild was happening, another device failed (which seems to
> happen more and more as device sizes are increasing and reliability is
> going the other way).
> 
> Could this (another drive failure) be the case?

Sounds correct!
There was both a real disk failure and a cable failure involved (first
message in this thread has details).

Apologies for bringing up already fixed bugs.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Strange behaviour on "toy array"
  2005-05-15 18:10     ` Molle Bestefich
@ 2005-05-15 18:22       ` Patrik Jonsson
  2005-05-15 20:09         ` David Greaves
  0 siblings, 1 reply; 14+ messages in thread
From: Patrik Jonsson @ 2005-05-15 18:22 UTC (permalink / raw)
  Cc: linux-raid

hi all,

I'm gearing up to set up a 2 TB RAID for our research group, and just
to see how this stuff works I made a loopback array on one of my
machines.  I created 5 loopback devices of 1 MB each, created a raid5
array and formatted it.  So far so good.  I could copy files on and off,
fail a disk with mdadm -f and then return it, and everything seemed to
work as I expected.  Then I decided to see what happens if things go
bad, so I failed one disk.  Fine, the array reports "clean, degraded"
but I can still access files.  Then I failed another, now expecting not
to be able to read anything.  But the array reports "clean, degraded"
and I can still access the files.  I then proceeded to fail ALL the
disks and the array was still "clean, degraded", and I could read the
files on it just as well as before???  Can anyone explain to me what's
going on here?  Was I seeing some cached version (given that the array
was so small)?

This is on a machine running Fedora Core 3 on ppc.

thanks,

/Patrik


^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
  2005-05-15 18:22       ` Strange behaviour on "toy array" Patrik Jonsson
@ 2005-05-15 20:09         ` David Greaves
  2005-05-15 20:55           ` Patrik Jonsson
  0 siblings, 1 reply; 14+ messages in thread
From: David Greaves @ 2005-05-15 20:09 UTC (permalink / raw)
  To: Patrik Jonsson; +Cc: linux-raid

Patrik Jonsson wrote:

> hi all,
>
> I'm gearing up to setting up a 2tb raid for our research group, and
> just to see how this stuff works I made a loopback array on one of my
> machines. I created 5 loopback devices of 1mb each, created a raid5
> array and formatted. so far so good. I could copy files on and off,
> fail a disk with mdadm -f  and then return it and everything seemed to
> work as i expected. Then I decided to see what happens if things go
> bad, so i fail one disk. fine, array reports "clean, degraded" but I
> can still access files. Then I fail another, now expecting not to be
> able to read anything. But, array reports "clean, degraded" and I can
> still access the files. I then proceeded to fail ALL disks and the
> array was still "clean, degraded" and I could read the files on it
> just as well as before??? Can anyone explain to me what's going on
> here? Was I seeing some cached version (given that the array was so
> small)?

I think you'd need to post the commands you used and the results of 
things like mdadm --detail and cat /proc/mdstat, plus the kernel
version, mdadm version, etc.

That way we can ensure you really did fail the right drives etc etc.

Right now it could be anything from (allowed!) user error to a weird ppc
thing...

FYI, I run a 1.2 TB array on 6x250 GB SATA drives (1 spare) with lvm2 and xfs.

David



^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
  2005-05-15 20:09         ` David Greaves
@ 2005-05-15 20:55           ` Patrik Jonsson
  2005-05-15 22:13             ` Guy
  0 siblings, 1 reply; 14+ messages in thread
From: Patrik Jonsson @ 2005-05-15 20:55 UTC (permalink / raw)
  To: David Greaves, linux-raid



David Greaves wrote:

>
>I think you'd need to post the commands you used and the results of 
>things like mdadm --detail and cat /proc/mdstat
>also kernel version, mdadm version etc.
>
>That way we can ensure you really did fail the right drives etc etc.
>
>Right now it could be anything from (allowed!) user error to a weird ppc
>thing...
>  
>
sure thing:
[root@localhost junk]# uname -a
Linux localhost.localdomain 2.6.9-prep #1 Tue Apr 19 16:00:33 PDT 2005 
ppc ppc ppc GNU/Linux
[root@localhost junk]# mdadm --version
mdadm - v1.6.0 - 4 June 2004
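
The loop1-5 backing files are just ~1 MB each, created beforehand with
something along these lines (exact sizes from memory):

for i in 1 2 3 4 5; do dd if=/dev/zero of=loop$i bs=1k count=1024; done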

now I do (loop1-5 are files):
losetup /dev/loop0 loop1
losetup /dev/loop1 loop2
losetup /dev/loop2 loop3
losetup /dev/loop3 loop4
losetup /dev/loop4 loop5
mdadm --create /dev/md0 -l 5 -n 5 /dev/loop0 /dev/loop1 /dev/loop2 
/dev/loop3 /dev/loop4
mkfs.ext3 /dev/md0
mount -t ext3 /dev/md0 junk

at this point, mdadm shows:
[root@localhost junk]# mdadm --detail /dev/md0
/dev/md0:
        Version : 00.90.01
  Creation Time : Sun May 15 13:41:24 2005
     Raid Level : raid5
     Array Size : 3840
    Device Size : 960
   Raid Devices : 5
  Total Devices : 5
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Sun May 15 13:45:34 2005
          State : clean
 Active Devices : 5
Working Devices : 5
 Failed Devices : 0
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

    Number   Major   Minor   RaidDevice State
       0       7        0        0      active sync   /dev/loop0
       1       7        1        1      active sync   /dev/loop1
       2       7        2        2      active sync   /dev/loop2
       3       7        3        3      active sync   /dev/loop3
       4       7        4        4      active sync   /dev/loop4
           UUID : b89aa5de:da1054f5:b052cc51:393d7435
         Events : 0.24

and /proc/mdstat:
[root@localhost junk]# cat /proc/mdstat
Personalities : [raid5]
md0 : active raid5 loop4[4] loop3[3] loop2[2] loop1[1] loop0[0]
      3840 blocks level 5, 64k chunk, algorithm 2 [5/5] [UUUUU]
     
unused devices: <none>

Now I fail (e.g.) /dev/loop0:
mdadm -f /dev/md0 /dev/loop0

and get:
[root@localhost junk]# mdadm --detail /dev/md0
/dev/md0:
        Version : 00.90.01
  Creation Time : Sun May 15 13:41:24 2005
     Raid Level : raid5
     Array Size : 3840
    Device Size : 960
   Raid Devices : 5
  Total Devices : 5
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Sun May 15 13:49:20 2005
          State : clean, degraded
 Active Devices : 4
Working Devices : 4
 Failed Devices : 1
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

    Number   Major   Minor   RaidDevice State
       0       0        0       -1      removed
       1       7        1        1      active sync   /dev/loop1
       2       7        2        2      active sync   /dev/loop2
       3       7        3        3      active sync   /dev/loop3
       4       7        4        4      active sync   /dev/loop4
       5       7        0       -1      faulty   /dev/loop0
           UUID : b89aa5de:da1054f5:b052cc51:393d7435
         Events : 0.27

and:
[root@localhost junk]# cat /proc/mdstat
Personalities : [raid5]
md0 : active raid5 loop4[4] loop3[3] loop2[2] loop1[1] loop0[5](F)
      3840 blocks level 5, 64k chunk, algorithm 2 [5/4] [_UUUU]
     
unused devices: <none>

continue failing drives:
[root@localhost junk]# mdadm -f /dev/md0 /dev/loop1
mdadm: set /dev/loop1 faulty in /dev/md0
[root@localhost junk]# mdadm -f /dev/md0 /dev/loop2
mdadm: set /dev/loop2 faulty in /dev/md0
[root@localhost junk]# mdadm -f /dev/md0 /dev/loop3
mdadm: set /dev/loop3 faulty in /dev/md0
[root@localhost junk]# mdadm -f /dev/md0 /dev/loop4
mdadm: set /dev/loop4 faulty in /dev/md0

now I get:
[root@localhost junk]# mdadm --detail /dev/md0
/dev/md0:
        Version : 00.90.01
  Creation Time : Sun May 15 13:41:24 2005
     Raid Level : raid5
     Array Size : 3840
    Device Size : 960
   Raid Devices : 5
  Total Devices : 5
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Sun May 15 13:51:09 2005
          State : clean, degraded
 Active Devices : 0
Working Devices : 0
 Failed Devices : 5
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

    Number   Major   Minor   RaidDevice State
       0       0        0       -1      removed
       1       0        0       -1      removed
       2       0        0       -1      removed
       3       0        0       -1      removed
       4       0        0       -1      removed
       5       7        4       -1      faulty   /dev/loop4
       6       7        3       -1      faulty   /dev/loop3
       7       7        2       -1      faulty   /dev/loop2
       8       7        1       -1      faulty   /dev/loop1
       9       7        0       -1      faulty   /dev/loop0

and:
Personalities : [raid5]
md0 : active raid5 loop4[5](F) loop3[6](F) loop2[7](F) loop1[8](F) 
loop0[9](F)
      3840 blocks level 5, 64k chunk, algorithm 2 [5/0] [_____]
     
unused devices: <none>

but I can still read the file on the filesystem that is mounted (ie in 
the "junk" dir).

Hope that contains all the info you need.

/Patrik


^ permalink raw reply	[flat|nested] 14+ messages in thread

* RE: Strange behaviour on "toy array"
  2005-05-15 20:55           ` Patrik Jonsson
@ 2005-05-15 22:13             ` Guy
  0 siblings, 0 replies; 14+ messages in thread
From: Guy @ 2005-05-15 22:13 UTC (permalink / raw)
  To: 'Patrik Jonsson', 'David Greaves', linux-raid

Ok, now put more files on your filesystem.

I think you are correct and the buffer cache is handling the reads.
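
A rough way to take the cache out of the picture (unmounting should drop
the cached pages for that filesystem) would be:

umount junk
mount -t ext3 /dev/md0 junk     # with every member failed, this should now error out

Writing more data than fits in RAM would show it too.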

> -----Original Message-----
> From: linux-raid-owner@vger.kernel.org [mailto:linux-raid-
> owner@vger.kernel.org] On Behalf Of Patrik Jonsson
> Sent: Sunday, May 15, 2005 4:55 PM
> To: David Greaves; linux-raid@vger.kernel.org
> Subject: Re: Strange behaviour on "toy array"
> 
> 
> 
> David Greaves wrote:
> 
> >
> >I think you'd need to post the commands you used and the results of
> >things like mdadm --detail and cat /proc/mdstat
> >also kernel version, mdadm version etc.
> >
> >That way we can ensure you really did fail the right drives etc etc.
> >
> >Right now it could be anything from (allowed!) user error to a weird ppc
> >thing...
> >
> >
> sure thing:
> [root@localhost junk]# uname -a
> Linux localhost.localdomain 2.6.9-prep #1 Tue Apr 19 16:00:33 PDT 2005
> ppc ppc ppc GNU/Linux
> [root@localhost junk]# mdadm --version
> mdadm - v1.6.0 - 4 June 2004
> 
> now I do (loop1-5 are files):
> losetup /dev/loop0 loop1
> losetup /dev/loop1 loop2
> losetup /dev/loop2 loop3
> losetup /dev/loop3 loop4
> losetup /dev/loop4 loop5
> mdadm --create /dev/md0 -l 5 -n 5 /dev/loop0 /dev/loop1 /dev/loop2
> /dev/loop3 /dev/loop4
> mkfs.ext3 /dev/md0
> mount -t ext3 /dev/md0 junk
> 
> at this point, mdadm shows:
> [root@localhost junk]# mdadm --detail /dev/md0
> /dev/md0:
>         Version : 00.90.01
>   Creation Time : Sun May 15 13:41:24 2005
>      Raid Level : raid5
>      Array Size : 3840
>     Device Size : 960
>    Raid Devices : 5
>   Total Devices : 5
> Preferred Minor : 0
>     Persistence : Superblock is persistent
> 
>     Update Time : Sun May 15 13:45:34 2005
>           State : clean
>  Active Devices : 5
> Working Devices : 5
>  Failed Devices : 0
>   Spare Devices : 0
> 
>          Layout : left-symmetric
>      Chunk Size : 64K
> 
>     Number   Major   Minor   RaidDevice State
>        0       7        0        0      active sync   /dev/loop0
>        1       7        1        1      active sync   /dev/loop1
>        2       7        2        2      active sync   /dev/loop2
>        3       7        3        3      active sync   /dev/loop3
>        4       7        4        4      active sync   /dev/loop4
>            UUID : b89aa5de:da1054f5:b052cc51:393d7435
>          Events : 0.24
> 
> and /proc/mdstat:
> [root@localhost junk]# cat /proc/mdstat
> Personalities : [raid5]
> md0 : active raid5 loop4[4] loop3[3] loop2[2] loop1[1] loop0[0]
>       3840 blocks level 5, 64k chunk, algorithm 2 [5/5] [UUUUU]
> 
> unused devices: <none>
> 
> Now I fail (e.g.) /dev/loop0:
> mdadm -f /dev/md0 /dev/loop0
> 
> and get:
> [root@localhost junk]# mdadm --detail /dev/md0
> /dev/md0:
>         Version : 00.90.01
>   Creation Time : Sun May 15 13:41:24 2005
>      Raid Level : raid5
>      Array Size : 3840
>     Device Size : 960
>    Raid Devices : 5
>   Total Devices : 5
> Preferred Minor : 0
>     Persistence : Superblock is persistent
> 
>     Update Time : Sun May 15 13:49:20 2005
>           State : clean, degraded
>  Active Devices : 4
> Working Devices : 4
>  Failed Devices : 1
>   Spare Devices : 0
> 
>          Layout : left-symmetric
>      Chunk Size : 64K
> 
>     Number   Major   Minor   RaidDevice State
>        0       0        0       -1      removed
>        1       7        1        1      active sync   /dev/loop1
>        2       7        2        2      active sync   /dev/loop2
>        3       7        3        3      active sync   /dev/loop3
>        4       7        4        4      active sync   /dev/loop4
>        5       7        0       -1      faulty   /dev/loop0
>            UUID : b89aa5de:da1054f5:b052cc51:393d7435
>          Events : 0.27
> 
> and:
> [root@localhost junk]# cat /proc/mdstat
> Personalities : [raid5]
> md0 : active raid5 loop4[4] loop3[3] loop2[2] loop1[1] loop0[5](F)
>       3840 blocks level 5, 64k chunk, algorithm 2 [5/4] [_UUUU]
> 
> unused devices: <none>
> 
> continue failing drives:
> [root@localhost junk]# mdadm -f /dev/md0 /dev/loop1
> mdadm: set /dev/loop1 faulty in /dev/md0
> [root@localhost junk]# mdadm -f /dev/md0 /dev/loop2
> mdadm: set /dev/loop2 faulty in /dev/md0
> [root@localhost junk]# mdadm -f /dev/md0 /dev/loop3
> mdadm: set /dev/loop3 faulty in /dev/md0
> [root@localhost junk]# mdadm -f /dev/md0 /dev/loop4
> mdadm: set /dev/loop4 faulty in /dev/md0
> 
> now I get:
> [root@localhost junk]# mdadm --detail /dev/md0
> /dev/md0:
>         Version : 00.90.01
>   Creation Time : Sun May 15 13:41:24 2005
>      Raid Level : raid5
>      Array Size : 3840
>     Device Size : 960
>    Raid Devices : 5
>   Total Devices : 5
> Preferred Minor : 0
>     Persistence : Superblock is persistent
> 
>     Update Time : Sun May 15 13:51:09 2005
>           State : clean, degraded
>  Active Devices : 0
> Working Devices : 0
>  Failed Devices : 5
>   Spare Devices : 0
> 
>          Layout : left-symmetric
>      Chunk Size : 64K
> 
>     Number   Major   Minor   RaidDevice State
>        0       0        0       -1      removed
>        1       0        0       -1      removed
>        2       0        0       -1      removed
>        3       0        0       -1      removed
>        4       0        0       -1      removed
>        5       7        4       -1      faulty   /dev/loop4
>        6       7        3       -1      faulty   /dev/loop3
>        7       7        2       -1      faulty   /dev/loop2
>        8       7        1       -1      faulty   /dev/loop1
>        9       7        0       -1      faulty   /dev/loop0
> 
> and:
> Personalities : [raid5]
> md0 : active raid5 loop4[5](F) loop3[6](F) loop2[7](F) loop1[8](F)
> loop0[9](F)
>       3840 blocks level 5, 64k chunk, algorithm 2 [5/0] [_____]
> 
> unused devices: <none>
> 
> but I can still read the file on the filesystem that is mounted (ie in
> the "junk" dir).
> 
> Hope that contains all the info you need.
> 
> /Patrik
> 
> -
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
       [not found] <20050516184049.90DDA11416@smtp.ucolick.org>
@ 2005-05-16 21:54 ` Patrik Jonsson
  2005-05-17  2:28   ` Guy
  0 siblings, 1 reply; 14+ messages in thread
From: Patrik Jonsson @ 2005-05-16 21:54 UTC (permalink / raw)
  To: Ruth Ivimey-Cook, linux-raid

Ruth Ivimey-Cook wrote:

>
>Yes, I believe this interpretation is correct. Moreover, I've seen this happen
>"for real": when two drives died on my raid5 array while I was playing, I
>started to see some I/O errors, but only for things that hadn't just been
>accessed: recently accessed things were returned fine. As time went by, even
>those disappeared.
>
>I must admit, it's rather disconcerting, but it is a logical result of having a
>block cache.
>  
>
This makes sense; however, I would have expected /proc/mdstat or 
something to tell me the array is DEAD.  It seems "clean, degraded" is 
not a proper description of a raid5 without any working drives... Or 
would this not happen until I tried to write to it (which I haven't 
gotten to yet)?

I must admit I don't remember seeing in the FAQ or anywhere what is 
supposed to happen when you lose more than one drive. I sort of expected 
to have the entire array go offline, but it seems it just limps along 
like a normal faulty drive would do?

/Patrik


^ permalink raw reply	[flat|nested] 14+ messages in thread

* RE: Strange behaviour on "toy array"
  2005-05-16 21:54 ` Patrik Jonsson
@ 2005-05-17  2:28   ` Guy
  2005-05-17  6:04     ` Patrik Jonsson
  0 siblings, 1 reply; 14+ messages in thread
From: Guy @ 2005-05-17  2:28 UTC (permalink / raw)
  To: 'Patrik Jonsson', 'Ruth Ivimey-Cook', linux-raid

My guess is it will not change state until it needs to access a disk.
So, try some writes!

> -----Original Message-----
> From: linux-raid-owner@vger.kernel.org [mailto:linux-raid-
> owner@vger.kernel.org] On Behalf Of Patrik Jonsson
> Sent: Monday, May 16, 2005 5:54 PM
> To: Ruth Ivimey-Cook; linux-raid@vger.kernel.org
> Subject: Re: Strange behaviour on "toy array"
> 
> Ruth Ivimey-Cook wrote:
> 
> >
> >Yes, I believe this interpretation is correct. Moreover, I've seen this
> happen
> >"for real": when two drives died on my raid5 array while I was playing, I
> >started to see some I/O errors, but only for things that hadn't just been
> >accessed: recently accessed things were returned fine. As time went by,
> even
> >those disappeared.
> >
> >I must admit, it's rather disconcerting, but it is a logical result of
> having a
> >block cache.
> >
> >
> This makes sense, however, I would have expected /proc/mdstat or
> something telling me the array is DEAD. It seems "clean, degraded" is
> not a proper description of a raid5 without any working drives... Or
> would this not happen until I tried to write to it (which I haven't
> gotten to yet)?
> 
> I must admit I don't remember seeing in the FAQ or anywhere what is
> supposed to happen when you lose more than one drive. I sort of expected
> to have the entire array go offline, but it seems it just limps along
> like a normal faulty drive would do?
> 
> /Patrik
> 
> -
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
  2005-05-17  2:28   ` Guy
@ 2005-05-17  6:04     ` Patrik Jonsson
  2005-05-17  7:12       ` Patrik Jonsson
  0 siblings, 1 reply; 14+ messages in thread
From: Patrik Jonsson @ 2005-05-17  6:04 UTC (permalink / raw)
  To: Guy; +Cc: 'Ruth Ivimey-Cook', linux-raid

Ok, so I did as Guy suggested, and tried to write to the array after 
failing more than one disk. It says:

[root@localhost raidtest]# echo test > junk/test
-bash: junk/test: Read-only file system

so that's at least an indication that not all is well. The syslog contains:

May 16 22:49:31 localhost kernel: raid5: Disk failure on loop2, 
disabling device. Operation continuing on 3 devices
May 16 22:49:31 localhost kernel: RAID5 conf printout:
May 16 22:49:31 localhost kernel:  --- rd:5 wd:3 fd:2
May 16 22:49:31 localhost kernel:  disk 1, o:1, dev:loop1
May 16 22:49:31 localhost kernel:  disk 2, o:0, dev:loop2
May 16 22:49:31 localhost kernel:  disk 3, o:1, dev:loop3
May 16 22:49:31 localhost kernel:  disk 4, o:1, dev:loop4
May 16 22:49:31 localhost kernel: RAID5 conf printout:
May 16 22:49:31 localhost kernel:  --- rd:5 wd:3 fd:2
May 16 22:49:31 localhost kernel:  disk 1, o:1, dev:loop1
May 16 22:49:31 localhost kernel:  disk 3, o:1, dev:loop3
May 16 22:49:31 localhost kernel:  disk 4, o:1, dev:loop4
May 16 22:49:39 localhost kernel: Buffer I/O error on device md0, 
logical block 112
May 16 22:49:39 localhost kernel: lost page write due to I/O error on md0
May 16 22:49:39 localhost kernel: Aborting journal on device md0.
May 16 22:49:44 localhost kernel: ext3_abort called.
May 16 22:49:44 localhost kernel: EXT3-fs error (device md0): 
ext3_journal_start_sb: Detected aborted journal
May 16 22:49:44 localhost kernel: Remounting filesystem read-only
May 16 22:50:14 localhost kernel: Buffer I/O error on device md0, 
logical block 19
May 16 22:50:14 localhost kernel: lost page write due to I/O error on md0

So I guess I'm happy with that; remounting read-only seems smart, since 
that way the disks aren't messed up any further.
Now I added the disks back with

mdadm /dev/md0 --add /dev/loop0
mdadm /dev/md0 --add /dev/loop2

and the (actual hard-) drive started chugging, the md0_raid5 process is 
sucking cpu and I don't know what it's trying to do... the system has 
become unresponsive, but the drive is still ticking. Is hot-adding the 
drives back in a bad thing to do?

This is educational, at least... :-)

/Patrik

Guy wrote:

>My guess is it will not change state until it needs to access a disk.
>So, try some writes!
>
>  
>
>  
>

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
  2005-05-17  6:04     ` Patrik Jonsson
@ 2005-05-17  7:12       ` Patrik Jonsson
  2005-05-17  8:41         ` David Greaves
  0 siblings, 1 reply; 14+ messages in thread
From: Patrik Jonsson @ 2005-05-17  7:12 UTC (permalink / raw)
  Cc: linux-raid

Just as a further comment on what happened when my system hung: The 
process [md0_sync] was rapidly respawning and the syslog filled with 
thousands of messages like these:

May 16 23:16:44 localhost kernel: md: syncing RAID array md0
May 16 23:16:44 localhost kernel: md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc.
May 16 23:16:44 localhost kernel: md: using maximum available idle IO bandwith (but not more than 200000 KB/sec) for reconstruction.
May 16 23:16:44 localhost kernel: md: using 128k window, over a total of 960 blocks.
May 16 23:16:44 localhost kernel: md: md0: sync done.
May 16 23:16:44 localhost kernel: md: syncing RAID array md0
May 16 23:16:44 localhost kernel: md: minimum _guaranteed_ reconstruction speed: 1000 KB/sec/disc.
May 16 23:16:44 localhost kernel: md: using maximum available idle IO bandwith (but not more than 200000 KB/sec) for reconstruction.
May 16 23:16:44 localhost kernel: md: using 128k window, over a total of 960 blocks.
May 16 23:16:45 localhost kernel: md: md0: sync done.
... etc etc...

I had to halt the system to make it stop. I tried to stop the array with 
mdadm -S /dev/md0 but got "device or resource busy".  Did I do something 
illegal here?

Thanks,

/Patrik

Patrik Jonsson wrote:

> Ok, so I did as Guy suggested, and tried to write to the array after 
> failing more than one disk. It says:
>
> [root@localhost raidtest]# echo test > junk/test
> -bash: junk/test: Read-only file system
>
> so that's at least an indication that not all is well. The syslog 
> contains:
>
> May 16 22:49:31 localhost kernel: raid5: Disk failure on loop2, 
> disabling device. Operation continuing on 3 devices
> May 16 22:49:31 localhost kernel: RAID5 conf printout:
> May 16 22:49:31 localhost kernel:  --- rd:5 wd:3 fd:2
> May 16 22:49:31 localhost kernel:  disk 1, o:1, dev:loop1
> May 16 22:49:31 localhost kernel:  disk 2, o:0, dev:loop2
> May 16 22:49:31 localhost kernel:  disk 3, o:1, dev:loop3
> May 16 22:49:31 localhost kernel:  disk 4, o:1, dev:loop4
> May 16 22:49:31 localhost kernel: RAID5 conf printout:
> May 16 22:49:31 localhost kernel:  --- rd:5 wd:3 fd:2
> May 16 22:49:31 localhost kernel:  disk 1, o:1, dev:loop1
> May 16 22:49:31 localhost kernel:  disk 3, o:1, dev:loop3
> May 16 22:49:31 localhost kernel:  disk 4, o:1, dev:loop4
> May 16 22:49:39 localhost kernel: Buffer I/O error on device md0, 
> logical block 112
> May 16 22:49:39 localhost kernel: lost page write due to I/O error on md0
> May 16 22:49:39 localhost kernel: Aborting journal on device md0.
> May 16 22:49:44 localhost kernel: ext3_abort called.
> May 16 22:49:44 localhost kernel: EXT3-fs error (device md0): 
> ext3_journal_start_sb: Detected aborted journal
> May 16 22:49:44 localhost kernel: Remounting filesystem read-only
> May 16 22:50:14 localhost kernel: Buffer I/O error on device md0, 
> logical block 19
> May 16 22:50:14 localhost kernel: lost page write due to I/O error on md0
>
> So I guess I'm happy with that, remounting to read-only seems smart, 
> that way the disks aren't messed up more.
> Now I added the disks back with
>
> mdadm /dev/md0 --add /dev/loop0
> mdadm /dev/md0 --add /dev/loop2
>
> and the (actual hard-) drive started chugging, the md0_raid5 process 
> is sucking cpu and I don't know what it's trying to do... the system 
> has become unresponsive, but the drive is still ticking. Is hot-adding 
> the drives back in a bad thing to do?
>
> This is educational, at least... :-)
>
> /Patrik
>
> Guy wrote:
>
>> My guess is it will not change state until it needs to access a disk.
>> So, try some writes!
>>
>>  
>>
>>  
>>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
>

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: Strange behaviour on "toy array"
  2005-05-17  7:12       ` Patrik Jonsson
@ 2005-05-17  8:41         ` David Greaves
  0 siblings, 0 replies; 14+ messages in thread
From: David Greaves @ 2005-05-17  8:41 UTC (permalink / raw)
  To: Patrik Jonsson; +Cc: linux-raid

IIRC, Neil submitted a fix for this looping very recently.

Yup - 13/May/05 03:55:

>Thanks for the ping, things sometimes get lost in the noise....
>
>This sounds a bit like the problem that is addressed by 
>  md-make-raid5-and-raid6-robust-against-failure-during-recovery.patch 
>in the current -mm patches (look in the brokenout directory).
>
>This would only happen if you have multiple failed devices.  So maybe
>while the rebuild was happening, another device failed (which seems to
>happen more and more as device sizes are increasing and reliability is
>going the other way).
>
>Could this (another drive failure) be the case?
>
>NeilBrown
>  
>

I would say that there's been a fair amount of bug-fixing activity since
2.6.9 and mdadm 1.6.
I'd be happy with them for 'simple' ext3 over md, but I'm pretty damned
sure that you'll have problems with xfs and/or lvm2.

I _strongly_ suggest 2.6.11.xx and mdadm 1.9 if you'll be playing with
xfs and/or lvm2 (which I suggest you do for the system you described).
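
As for the "device or resource busy" from mdadm -S: that's most likely
just because the ext3 filesystem was still mounted on md0.  And rather
than hot-adding the members back, the cleaner way out of that state is
probably to unmount, stop the array and force-assemble it, roughly:

umount junk
mdadm -S /dev/md0
mdadm --assemble --force /dev/md0 /dev/loop[0-4]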



David



Patrik Jonsson wrote:

> Just as a further comment on what happened when my system hung: The
> process [md0_sync] was rapidly respawning and the syslog filled with
> thousands of messages like these:
>
> May 16 23:16:44 localhost kernel: md: syncing RAID array md0
> May 16 23:16:44 localhost kernel: md: minimum _guaranteed_
> reconstruction speed\
> : 1000 KB/sec/disc.
> May 16 23:16:44 localhost kernel: md: using maximum available idle IO
> bandwith \
> (but not more than 200000 KB/sec) for reconstruction.
> May 16 23:16:44 localhost kernel: md: using 128k window, over a total
> of 960 bl\
> ocks.
> May 16 23:16:44 localhost kernel: md: md0: sync done.
> May 16 23:16:44 localhost kernel: md: syncing RAID array md0
> May 16 23:16:44 localhost kernel: md: minimum _guaranteed_
> reconstruction speed\
> : 1000 KB/sec/disc.
> May 16 23:16:44 localhost kernel: md: using maximum available idle IO
> bandwith \
> (but not more than 200000 KB/sec) for reconstruction.
> May 16 23:16:44 localhost kernel: md: using 128k window, over a total
> of 960 bl\
> ocks.
> May 16 23:16:45 localhost kernel: md: md0: sync done.
> ... etc etc...
>
> I had to halt the system to make it stop. I tried to stop the array
> with mdadm -S /dev/md0 but got "device or resource busy". Did i do
> something illegal here?
>
> Thanks,
>
> /Patrik



^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread

Thread overview: 14+ messages
2005-04-22 10:45 MD bug or me being stupid? Molle Bestefich
2005-04-22 11:25 ` Molle Bestefich
2005-05-12  8:55 ` Molle Bestefich
2005-05-13  2:55   ` Neil Brown
2005-05-15 18:10     ` Molle Bestefich
2005-05-15 18:22       ` Strange behaviour on "toy array" Patrik Jonsson
2005-05-15 20:09         ` David Greaves
2005-05-15 20:55           ` Patrik Jonsson
2005-05-15 22:13             ` Guy
     [not found] <20050516184049.90DDA11416@smtp.ucolick.org>
2005-05-16 21:54 ` Patrik Jonsson
2005-05-17  2:28   ` Guy
2005-05-17  6:04     ` Patrik Jonsson
2005-05-17  7:12       ` Patrik Jonsson
2005-05-17  8:41         ` David Greaves
