Change disk problem

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Change disk problem
       [not found] <S1753847AbYFHHWg/20080608072236Z+62@vger.kernel.org>
@ 2008-06-08  7:58 ` Jon Lurås
  2008-06-08 10:00   ` David Greaves
  0 siblings, 1 reply; 5+ messages in thread
From: Jon Lurås @ 2008-06-08  7:58 UTC (permalink / raw)
  To: linux-raid

Hello

I got a big problem with replacing disk on Software-RAID.
Whatever I do it just ends up as a spare disk.
I guess the problem is that the faulty disk was not removed from the RAID.
We just shut down and replace'd the disk.
Any idea of how to solve this?

Thanks in advance 

Jon 

DETAILS:

xxxxx:~# mdadm --detail /dev/md0
/dev/md0:
        Version : 00.90.00
  Creation Time : Sat Jun  7 22:04:01 2008
     Raid Level : raid5
     Array Size : 490223232 (467.51 GiB 501.99 GB)
    Device Size : 245111616 (233.76 GiB 250.99 GB)
   Raid Devices : 3
  Total Devices : 3
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Sat Jun  7 22:04:01 2008
          State : active, degraded
 Active Devices : 2
Working Devices : 2
 Failed Devices : 1
  Spare Devices : 0

         Layout : left-symmetric
     Chunk Size : 64K

           UUID : 7edd5dfe:14b3ffb2:304b0607:0d821dcd
         Events : 0.2

    Number   Major   Minor   RaidDevice State
       0      56        1        0      active sync   /dev/hdi1
       1      56       65        1      active sync   /dev/hdj1
       2       0        0        2      removed


xxxxx:~# mdadm --version
mdadm - v2.5.6 - 9 November 2006

xxxxx:~# uname -a
Linux xxxxx 2.4.27-3-386 xxxxxxxxxxxxxxxxxxxxxxxxxx

xxxxx:~# cat /etc/debian_version
Debian 4.0



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Change disk problem
  2008-06-08  7:58 ` Change disk problem Jon Lurås
@ 2008-06-08 10:00   ` David Greaves
       [not found]     ` <WorldClient-F200806081212.AA12040517@irisinfo.net>
  0 siblings, 1 reply; 5+ messages in thread
From: David Greaves @ 2008-06-08 10:00 UTC (permalink / raw)
  To: Jon Lurås; +Cc: linux-raid

Jon Lurås wrote:
> Hello
> 
> I got a big problem with replacing disk on Software-RAID.
> Whatever I do it just ends up as a spare disk.
> I guess the problem is that the faulty disk was not removed from the RAID.
> We just shut down and replace'd the disk.
> Any idea of how to solve this?

You say 'spare' but I think you mean 'that md/mdadm is ignoring it completely
(on the grounds that it's not showing literally as a 'spare' in the --detail output.

This is expected behaviour - md does not just randomly grab new disks; you have
to use 'mdadm /dev/md0 --add /dev/hdk1' (use the correct component) to give md
permission to use the new disk.

Note that you should have partitioned it too (ie 'just shut down and replace'
becomes, shut down, replace, partition, mdadm --add )

> 
>     Number   Major   Minor   RaidDevice State
>        0      56        1        0      active sync   /dev/hdi1
>        1      56       65        1      active sync   /dev/hdj1
>        2       0        0        2      removed
> 


Look at http://linux-raid.osdl.org/ for more info...

HTH

David

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Change disk problem
       [not found]     ` <WorldClient-F200806081212.AA12040517@irisinfo.net>
@ 2008-06-08 11:26       ` David Greaves
  2008-06-08 13:14         ` Jon Lurås
  0 siblings, 1 reply; 5+ messages in thread
From: David Greaves @ 2008-06-08 11:26 UTC (permalink / raw)
  To: Jon Lurås; +Cc: linux-raid@vger.kernel.org

Jon Lurås wrote:
> Hello 
> 
> Thank you for your input.
> 
> I gave to little information in the original mail. I had tried to 'add'
> the new disk. It shows up like this - se below. The bad thing is that then
> the device '/dev/hdj1' suddenly changes to 'spare'...
> and everything looks very bad :-(

OK, you also wrote (forgot to cc the list)
Jon Lurås wrote:
> Hi again
>
> A little more information:
> The only solution I found to the problem which arised after the 'add' og
> /dev/hdk1 is
> "mdadm --create /dev/md0 --verbose --level=5 --chunk=64 --raid-devices=3
> /dev/hdi1 /dev/hdj1 missing"
>
> The it looks like the original 'mdadm --detail /dev/md0' result.
>
> Jon

So I'm now quite confused.

Can you tell us what you did a step at a time.
This is what should have happened:

1 Original disk failed.
2 md0 went into degraded and mdadm --monitor sent you an email
3 you shutdown and replaced /dev/hdk (?)
4 you partitioned /dev/hdk to make hdk1
5 you used --add to add hdk1 to the array
6 all was well

now you seem to have done several 'interesting' things in addition (like issuing
a --create).
However, if you can go back to the original --detail given, a simple --add
should resolve your problems...

You may want to check your dmesg to make sure you don't have a hardware issue.

> demeter:/etc# mdadm --detail /dev/md0
> /dev/md0:
>         Version : 00.90.00
>   Creation Time : Sat Jun  7 22:04:01 2008
>      Raid Level : raid5
>      Array Size : 490223232 (467.51 GiB 501.99 GB)
>     Device Size : 245111616 (233.76 GiB 250.99 GB)
>    Raid Devices : 3
>   Total Devices : 4
> Preferred Minor : 0
>     Persistence : Superblock is persistent
> 
>     Update Time : Sun Jun  8 12:05:02 2008
>           State : active, degraded
>  Active Devices : 1
> Working Devices : 3
>  Failed Devices : 1
>   Spare Devices : 2
> 
>          Layout : left-symmetric
>      Chunk Size : 64K
> 
>            UUID : 7edd5dfe:14b3ffb2:304b0607:0d821dcd
>          Events : 0.8
> 
>     Number   Major   Minor   RaidDevice State
>        0      56        1        0      active sync   /dev/hdi1
>        1       0        0        1      removed
>        2       0        0        2      removed
> 
>        3      57        1        3      spare   /dev/hdk1
>        4      56       65        4      spare   /dev/hdj1

This doesn't look good.

what does --examine say about each of the components?

David
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Change disk problem
  2008-06-08 11:26       ` David Greaves
@ 2008-06-08 13:14         ` Jon Lurås
  2008-06-08 13:20           ` Justin Piszcz
  0 siblings, 1 reply; 5+ messages in thread
From: Jon Lurås @ 2008-06-08 13:14 UTC (permalink / raw)
  To: David Greaves, linux-raid@vger.kernel.org


Hi

I didn't check in dmesg all the time. Now I check again - and your are
correct there is a hardware problem (maybe) with my hdj. This has nothing
to do on the linux-raid list... "hdj dma_intr: status=0x51 { DriveReady
SeekComplete Error }"

Jon

-----Original Message-----
From: David Greaves <david@dgreaves.com>
To: Jon Lurås <jon@irisinfo.net>
Cc: "linux-raid@vger.kernel.org" <linux-raid@vger.kernel.org>
Date: Sun, 08 Jun 2008 12:26:45 +0100
Subject: Re: Change disk problem

> Jon Lurås wrote:
> > Hello 
> > 
> > Thank you for your input.
> > 
> > I gave to little information in the original mail. I had tried to
> 'add'
> > the new disk. It shows up like this - se below. The bad thing is that
> then
> > the device '/dev/hdj1' suddenly changes to 'spare'...
> > and everything looks very bad :-(
> 
> OK, you also wrote (forgot to cc the list)
> Jon Lurås wrote:
> > Hi again
> >
> > A little more information:
> > The only solution I found to the problem which arised after the 'add'
> og
> > /dev/hdk1 is
> > "mdadm --create /dev/md0 --verbose --level=5 --chunk=64
> --raid-devices=3
> > /dev/hdi1 /dev/hdj1 missing"
> >
> > The it looks like the original 'mdadm --detail /dev/md0' result.
> >
> > Jon
> 
> So I'm now quite confused.
> 
> Can you tell us what you did a step at a time.
> This is what should have happened:
> 
> 1 Original disk failed.
> 2 md0 went into degraded and mdadm --monitor sent you an email
> 3 you shutdown and replaced /dev/hdk (?)
> 4 you partitioned /dev/hdk to make hdk1
> 5 you used --add to add hdk1 to the array
> 6 all was well
> 
> now you seem to have done several 'interesting' things in addition
> (like issuing
> a --create).
> However, if you can go back to the original --detail given, a simple
> --add
> should resolve your problems...
> 
> You may want to check your dmesg to make sure you don't have a hardware
> issue.
> 
> > demeter:/etc# mdadm --detail /dev/md0
> > /dev/md0:
> >         Version : 00.90.00
> >   Creation Time : Sat Jun  7 22:04:01 2008
> >      Raid Level : raid5
> >      Array Size : 490223232 (467.51 GiB 501.99 GB)
> >     Device Size : 245111616 (233.76 GiB 250.99 GB)
> >    Raid Devices : 3
> >   Total Devices : 4
> > Preferred Minor : 0
> >     Persistence : Superblock is persistent
> > 
> >     Update Time : Sun Jun  8 12:05:02 2008
> >           State : active, degraded
> >  Active Devices : 1
> > Working Devices : 3
> >  Failed Devices : 1
> >   Spare Devices : 2
> > 
> >          Layout : left-symmetric
> >      Chunk Size : 64K
> > 
> >            UUID : 7edd5dfe:14b3ffb2:304b0607:0d821dcd
> >          Events : 0.8
> > 
> >     Number   Major   Minor   RaidDevice State
> >        0      56        1        0      active sync   /dev/hdi1
> >        1       0        0        1      removed
> >        2       0        0        2      removed
> > 
> >        3      57        1        3      spare   /dev/hdk1
> >        4      56       65        4      spare   /dev/hdj1
> 
> This doesn't look good.
> 
> what does --examine say about each of the components?
> 
> David


--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Change disk problem
  2008-06-08 13:14         ` Jon Lurås
@ 2008-06-08 13:20           ` Justin Piszcz
  0 siblings, 0 replies; 5+ messages in thread
From: Justin Piszcz @ 2008-06-08 13:20 UTC (permalink / raw)
  To: Jon Lurås; +Cc: David Greaves, linux-raid@vger.kernel.org

[-- Attachment #1: Type: TEXT/PLAIN, Size: 373 bytes --]



On Sun, 8 Jun 2008, Jon Lurås wrote:

>
> Hi
>
> I didn't check in dmesg all the time. Now I check again - and your are
> correct there is a hardware problem (maybe) with my hdj. This has nothing
> to do on the linux-raid list... "hdj dma_intr: status=0x51 { DriveReady
> SeekComplete Error }"
>
> Jon

What does:
smartctl -a /dev/hdj

Show?

Justin.

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2008-06-08 13:20 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <S1753847AbYFHHWg/20080608072236Z+62@vger.kernel.org>
2008-06-08  7:58 ` Change disk problem Jon Lurås
2008-06-08 10:00   ` David Greaves
     [not found]     ` <WorldClient-F200806081212.AA12040517@irisinfo.net>
2008-06-08 11:26       ` David Greaves
2008-06-08 13:14         ` Jon Lurås
2008-06-08 13:20           ` Justin Piszcz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).