Checking individual drive state

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Checking individual drive state
@ 2006-11-05 15:27 Bradshaw
  2006-11-05 20:44 ` dean gaudet
  0 siblings, 1 reply; 3+ messages in thread
From: Bradshaw @ 2006-11-05 15:27 UTC (permalink / raw)
  To: linux-raid

I've recently built a smallish RAID5 box as a storage area for my home 
network, using mdadm. However, one of the drives will not remain in the 
array for longer that around two days before it is removed. Readding it 
to the array does not throw any errors, leading me to believe that it's 
probably a problem with the controller, which is an add-in SATA card, as 
well as the other drive connected to it failing once.

I don't know how to scan the one disk for bad sectors, stopping the 
array and doing an fsck or similar throws errors, so I need help in 
determining whether the disc itself is faulty.

If the controller is to be replaced, how would I go about migrating the 
two discs to the new controller whilst maintaining the array?

Thanks in advance

Tom Bradshaw

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Checking individual drive state
  2006-11-05 15:27 Checking individual drive state Bradshaw
@ 2006-11-05 20:44 ` dean gaudet
  2006-11-06  2:47   ` Mike Hardy
  0 siblings, 1 reply; 3+ messages in thread
From: dean gaudet @ 2006-11-05 20:44 UTC (permalink / raw)
  To: Bradshaw; +Cc: linux-raid

On Sun, 5 Nov 2006, Bradshaw wrote:

> I've recently built a smallish RAID5 box as a storage area for my home
> network, using mdadm. However, one of the drives will not remain in the array
> for longer that around two days before it is removed. Readding it to the array
> does not throw any errors, leading me to believe that it's probably a problem
> with the controller, which is an add-in SATA card, as well as the other drive
> connected to it failing once.
> 
> I don't know how to scan the one disk for bad sectors, stopping the array and
> doing an fsck or similar throws errors, so I need help in determining whether
> the disc itself is faulty.

try swapping the cable first.  after that swap ports with another disk and 
see if the problem follows the port or the disk.

you can see if smartctl -a (from smartmontools) tells you anything 
interesting.  (it can be quite difficult, to impossible, to understand 
smartctl -a output though.  but if you've got errors in the SMART error 
log that's a good place to start.)

> If the controller is to be replaced, how would I go about migrating the two
> discs to the new controller whilst maintaining the array?

it depends on which method you're using to assemble the array at boot 
time.  in most cases if these aren't your root disks then a swap of two 
disks won't result in any troubles reassembling the array.  other device 
renames may cause problems depending on your distribution though -- but 
generally when two devices swap names within an array you should be fine.

you'll want to do the disk swap with the array offline (either shutdown 
the box or mdadm --stop the array).

-dean

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Checking individual drive state
  2006-11-05 20:44 ` dean gaudet
@ 2006-11-06  2:47   ` Mike Hardy
  0 siblings, 0 replies; 3+ messages in thread
From: Mike Hardy @ 2006-11-06  2:47 UTC (permalink / raw)
  To: dean gaudet; +Cc: Bradshaw, linux-raid

dean gaudet wrote:
> On Sun, 5 Nov 2006, Bradshaw wrote:

>> I don't know how to scan the one disk for bad sectors, stopping the array and
>> doing an fsck or similar throws errors, so I need help in determining whether
>> the disc itself is faulty.
> 
> try swapping the cable first.  after that swap ports with another disk and 
> see if the problem follows the port or the disk.
> 
> you can see if smartctl -a (from smartmontools) tells you anything 
> interesting.  (it can be quite difficult, to impossible, to understand 
> smartctl -a output though.  but if you've got errors in the SMART error 
> log that's a good place to start.)

I don't think SMART output is that hard to understand.

And checking the entire drive for errors is as easy as 'smartctl -t long
/dev/<drive>' usually. If it is SATA as you say, you may need to put a
'-d ata' in there.

Wait for however long it says to wait, then do a 'smartctl -a
/dev/<drive>' and you should see the self test log at the bottom. Did it
finish? If not, there are bad sectors. If there are bad sectors, you
should google the string 'BadBlockHowTo' to see if you can clear them
(after failing the drive out of the array)

Note that this won't tell you anything about cables or controllers or
power or anything else that could and may be wrong. It's just for the
drive media and firmware.

-Mike

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2006-11-06  2:47 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-11-05 15:27 Checking individual drive state Bradshaw
2006-11-05 20:44 ` dean gaudet
2006-11-06  2:47   ` Mike Hardy

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).