* help diagnosing bad disk
@ 2007-12-19 18:18 Jon Sabo
2007-12-19 18:23 ` Justin Piszcz
` (2 more replies)
0 siblings, 3 replies; 9+ messages in thread
From: Jon Sabo @ 2007-12-19 18:18 UTC (permalink / raw)
To: linux-raid
So I was trying to copy over some Indiana Jones wav files and it
wasn't going my way. I noticed that my software raid device showed:
/dev/md1 on / type ext3 (rw,errors=remount-ro)
Is this saying that it was remounted, read only because it found a
problem with the md1 meta device? That's what it looks like it's
saying but I can still write to /.
mdadm --detail showed:
root@recoil:/home/illsci# mdadm --detail /dev/md0
/dev/md0:
Version : 00.90.03
Creation Time : Mon Jul 30 21:47:14 2007
Raid Level : raid1
Array Size : 1951744 ( 1906.32 MiB 1998.59 MB)
Device Size : 1951744 (1906.32 MiB 1998.59 MB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 0
Persistence : Superblock is persistent
Update Time : Wed Dec 19 12:59:56 2007
State : clean, degraded
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
UUID : 157f716c:0e7aebca:c20741f6
:bb6099c9
Events : 0.28
Number Major Minor RaidDevice State
0 8 1 0 active sync /dev/sda1
1 0 0 1 removed
root@recoil:/home/illsci# mdadm --detail /dev/md1
/dev/md1:
Version : 00.90.03
Creation Time : Mon Jul 30 21:47:47 2007
Raid Level : raid1
Array Size : 974808064 (929.65 GiB 998.20 GB)
Device Size : 974808064 (929.65 GiB 998.20 GB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 1
Persistence : Superblock is persistent
Update Time : Wed Dec 19 13:14:53 2007
State : clean, degraded
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744
Events : 0.1990
Number Major Minor RaidDevice State
0 8 2 0 active sync /dev/sda2
1 0 0 1 removed
I have two 1 terabyte sata drives in this box. From what I was
reading wouldn't it show an F for the failed drive? I thought I would
see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F.
What is this saying and how do you know that its /dev/sdb and not some
other drive? It shows removed and that the state is clean, degraded.
Is that something you can recover from with out returning this disk
and putting in a new one to add to the raid1 array?
Thanks,
Jonathan
^ permalink raw reply [flat|nested] 9+ messages in thread* Re: help diagnosing bad disk 2007-12-19 18:18 help diagnosing bad disk Jon Sabo @ 2007-12-19 18:23 ` Justin Piszcz 2007-12-19 18:49 ` Jon Sabo 2007-12-19 19:16 ` Bill Davidsen 2007-12-19 21:45 ` Iustin Pop 2 siblings, 1 reply; 9+ messages in thread From: Justin Piszcz @ 2007-12-19 18:23 UTC (permalink / raw) To: Jon Sabo; +Cc: linux-raid On Wed, 19 Dec 2007, Jon Sabo wrote: > So I was trying to copy over some Indiana Jones wav files and it > wasn't going my way. I noticed that my software raid device showed: > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > Is this saying that it was remounted, read only because it found a > problem with the md1 meta device? That's what it looks like it's > saying but I can still write to /. > > mdadm --detail showed: > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > /dev/md0: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:14 2007 > Raid Level : raid1 > Array Size : 1951744 ( 1906.32 MiB 1998.59 MB) > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 0 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 12:59:56 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 157f716c:0e7aebca:c20741f6 > :bb6099c9 > Events : 0.28 > > Number Major Minor RaidDevice State > 0 8 1 0 active sync /dev/sda1 > 1 0 0 1 removed > > root@recoil:/home/illsci# mdadm --detail /dev/md1 > /dev/md1: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:47 2007 > Raid Level : raid1 > Array Size : 974808064 (929.65 GiB 998.20 GB) > Device Size : 974808064 (929.65 GiB 998.20 GB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 1 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 13:14:53 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > Events : 0.1990 > > Number Major Minor RaidDevice State > 0 8 2 0 active sync /dev/sda2 > 1 0 0 1 removed > > > I have two 1 terabyte sata drives in this box. From what I was > reading wouldn't it show an F for the failed drive? I thought I would > see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F. > What is this saying and how do you know that its /dev/sdb and not some > other drive? It shows removed and that the state is clean, degraded. > Is that something you can recover from with out returning this disk > and putting in a new one to add to the raid1 array? mdadm /dev/md1 -a /dev/sdb2 to re-add it back into the array What does cat /proc/mdstat show? I would also show us: smartctl -a /dev/sdb Justin. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 18:23 ` Justin Piszcz @ 2007-12-19 18:49 ` Jon Sabo 2007-12-19 19:43 ` Justin Piszcz 0 siblings, 1 reply; 9+ messages in thread From: Jon Sabo @ 2007-12-19 18:49 UTC (permalink / raw) To: Justin Piszcz; +Cc: linux-raid I found the problem. The power was unplugged from the drive. The sata power connectors aren't very good at securing the connector. I reattached the power connector to the sata drive and booted up. This is what it looks like now: root@recoil:/home/illsci# mdadm --detail /dev/md0 /dev/md0: Version : 00.90.03 Creation Time : Mon Jul 30 21:47:14 2007 Raid Level : raid1 Array Size : 1951744 (1906.32 MiB 1998.59 MB) Device Size : 1951744 (1906.32 MiB 1998.59 MB) Raid Devices : 2 Total Devices : 1 Preferred Minor : 0 Persistence : Superblock is persistent Update Time : Wed Dec 19 13:48:12 2007 State : clean, degraded Active Devices : 1 Working Devices : 1 Failed Devices : 0 Spare Devices : 0 UUID : 157f716c:0e7aebca:c20741f6:bb6099c9 Events : 0.44 Number Major Minor RaidDevice State 0 8 1 0 active sync /dev/sda1 1 0 0 1 removed root@recoil:/home/illsci# mdadm --detail /dev/md1 /dev/md1: Version : 00.90.03 Creation Time : Mon Jul 30 21:47:47 2007 Raid Level : raid1 Array Size : 974808064 (929.65 GiB 998.20 GB) Device Size : 974808064 (929.65 GiB 998.20 GB) Raid Devices : 2 Total Devices : 1 Preferred Minor : 1 Persistence : Superblock is persistent Update Time : Wed Dec 19 13:50:02 2007 State : clean, degraded Active Devices : 1 Working Devices : 1 Failed Devices : 0 Spare Devices : 0 UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 Events : 0.1498340 Number Major Minor RaidDevice State 0 0 0 0 removed 1 8 18 1 active sync /dev/sdb2 How do I put it back into the correct state? Thanks! Jonathan On Dec 19, 2007 1:23 PM, Justin Piszcz <jpiszcz@lucidpixels.com> wrote: > > > > On Wed, 19 Dec 2007, Jon Sabo wrote: > > > So I was trying to copy over some Indiana Jones wav files and it > > wasn't going my way. I noticed that my software raid device showed: > > > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > > > Is this saying that it was remounted, read only because it found a > > problem with the md1 meta device? That's what it looks like it's > > saying but I can still write to /. > > > > mdadm --detail showed: > > > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > > /dev/md0: > > Version : 00.90.03 > > Creation Time : Mon Jul 30 21:47:14 2007 > > Raid Level : raid1 > > Array Size : 1951744 ( 1906.32 MiB 1998.59 MB) > > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > > Raid Devices : 2 > > Total Devices : 1 > > Preferred Minor : 0 > > Persistence : Superblock is persistent > > > > Update Time : Wed Dec 19 12:59:56 2007 > > State : clean, degraded > > Active Devices : 1 > > Working Devices : 1 > > Failed Devices : 0 > > Spare Devices : 0 > > > > UUID : 157f716c:0e7aebca:c20741f6 > > :bb6099c9 > > Events : 0.28 > > > > Number Major Minor RaidDevice State > > 0 8 1 0 active sync /dev/sda1 > > 1 0 0 1 removed > > > > root@recoil:/home/illsci# mdadm --detail /dev/md1 > > /dev/md1: > > Version : 00.90.03 > > Creation Time : Mon Jul 30 21:47:47 2007 > > Raid Level : raid1 > > Array Size : 974808064 (929.65 GiB 998.20 GB) > > Device Size : 974808064 (929.65 GiB 998.20 GB) > > Raid Devices : 2 > > Total Devices : 1 > > Preferred Minor : 1 > > Persistence : Superblock is persistent > > > > Update Time : Wed Dec 19 13:14:53 2007 > > State : clean, degraded > > Active Devices : 1 > > Working Devices : 1 > > Failed Devices : 0 > > Spare Devices : 0 > > > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > > Events : 0.1990 > > > > Number Major Minor RaidDevice State > > 0 8 2 0 active sync /dev/sda2 > > 1 0 0 1 removed > > > > > > I have two 1 terabyte sata drives in this box. From what I was > > reading wouldn't it show an F for the failed drive? I thought I would > > see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F. > > What is this saying and how do you know that its /dev/sdb and not some > > other drive? It shows removed and that the state is clean, degraded. > > Is that something you can recover from with out returning this disk > > and putting in a new one to add to the raid1 array? > > mdadm /dev/md1 -a /dev/sdb2 to re-add it back into the array > > What does cat /proc/mdstat show? > > I would also show us: smartctl -a /dev/sdb > > Justin. > > ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 18:49 ` Jon Sabo @ 2007-12-19 19:43 ` Justin Piszcz 0 siblings, 0 replies; 9+ messages in thread From: Justin Piszcz @ 2007-12-19 19:43 UTC (permalink / raw) To: Jon Sabo; +Cc: linux-raid On Wed, 19 Dec 2007, Jon Sabo wrote: > I found the problem. The power was unplugged from the drive. The > sata power connectors aren't very good at securing the connector. I > reattached the power connector to the sata drive and booted up. This > is what it looks like now: > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > /dev/md0: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:14 2007 > Raid Level : raid1 > Array Size : 1951744 (1906.32 MiB 1998.59 MB) > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 0 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 13:48:12 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 157f716c:0e7aebca:c20741f6:bb6099c9 > Events : 0.44 > > Number Major Minor RaidDevice State > 0 8 1 0 active sync /dev/sda1 > 1 0 0 1 removed > root@recoil:/home/illsci# mdadm --detail /dev/md1 > /dev/md1: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:47 2007 > Raid Level : raid1 > Array Size : 974808064 (929.65 GiB 998.20 GB) > Device Size : 974808064 (929.65 GiB 998.20 GB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 1 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 13:50:02 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > Events : 0.1498340 > > Number Major Minor RaidDevice State > 0 0 0 0 removed > 1 8 18 1 active sync /dev/sdb2 > > > How do I put it back into the correct state? > > Thanks! mdadm /dev/md0 -a /dev/sdb1 mdadm /dev/md1 -a /dev/sda1 Weird that they got out out of sync on different drives. Justin. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 18:18 help diagnosing bad disk Jon Sabo 2007-12-19 18:23 ` Justin Piszcz @ 2007-12-19 19:16 ` Bill Davidsen 2007-12-19 19:09 ` Jon Sabo 2007-12-19 21:45 ` Iustin Pop 2 siblings, 1 reply; 9+ messages in thread From: Bill Davidsen @ 2007-12-19 19:16 UTC (permalink / raw) To: Jon Sabo; +Cc: linux-raid Jon Sabo wrote: > So I was trying to copy over some Indiana Jones wav files and it > wasn't going my way. I noticed that my software raid device showed: > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > Is this saying that it was remounted, read only because it found a > problem with the md1 meta device? That's what it looks like it's > saying but I can still write to /. > > mdadm --detail showed: > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > /dev/md0: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:14 2007 > Raid Level : raid1 > Array Size : 1951744 ( 1906.32 MiB 1998.59 MB) > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 0 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 12:59:56 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 157f716c:0e7aebca:c20741f6 > :bb6099c9 > Events : 0.28 > > Number Major Minor RaidDevice State > 0 8 1 0 active sync /dev/sda1 > 1 0 0 1 removed > > root@recoil:/home/illsci# mdadm --detail /dev/md1 > /dev/md1: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:47 2007 > Raid Level : raid1 > Array Size : 974808064 (929.65 GiB 998.20 GB) > Device Size : 974808064 (929.65 GiB 998.20 GB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 1 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 13:14:53 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > Events : 0.1990 > > Number Major Minor RaidDevice State > 0 8 2 0 active sync /dev/sda2 > 1 0 0 1 removed > > > I have two 1 terabyte sata drives in this box. From what I was > reading wouldn't it show an F for the failed drive? I thought I would > see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F. > What is this saying and how do you know that its /dev/sdb and not some > other drive? It shows removed and that the state is clean, degraded. > Is that something you can recover from with out returning this disk > and putting in a new one to add to the raid1 array? > You can try adding the partitions back to your array, but I suspect something bad has happened to your sdb drive, since it's failed out of both arrays. You can use dmesg to look for any additional information. Justin gave you the rest of the info you need to investigate, I'll not repeat it. ;-) -- Bill Davidsen <davidsen@tmr.com> "Woe unto the statesman who makes war without a reason that will still be valid when the war is over..." Otto von Bismark ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 19:16 ` Bill Davidsen @ 2007-12-19 19:09 ` Jon Sabo 2007-12-19 19:15 ` Jon Sabo 0 siblings, 1 reply; 9+ messages in thread From: Jon Sabo @ 2007-12-19 19:09 UTC (permalink / raw) To: Bill Davidsen; +Cc: linux-raid We'll here's the rest of the info I should have sent in the last email: root@recoil:/home/illsci# cat /proc/mdstat Personalities : [multipath] [raid1] md1 : active raid1 sdb2[1] 974808064 blocks [2/1] [_U] md0 : active raid1 sda1[0] 1951744 blocks [2/1] [U_] unused devices: <none> root@recoil:/home/illsci# dmesg | grep sdb sd 1:0:0:0: [sdb] 1953523055 512-byte hardware sectors (1000204 MB) sd 1:0:0:0: [sdb] Write Protect is off sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sd 1:0:0:0: [sdb] 1953523055 512-byte hardware sectors (1000204 MB) sd 1:0:0:0: [sdb] Write Protect is off sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sdb: sdb1 sdb2 sd 1:0:0:0: [sdb] Attached SCSI disk md: bind<sdb1> md: kicking non-fresh sdb1 from array! md: unbind<sdb1> md: export_rdev(sdb1) md: bind<sdb2> root@recoil:/home/illsci# dmesg | grep sda sd 0:0:0:0: [sda] 1953523055 512-byte hardware sectors (1000204 MB) sd 0:0:0:0: [sda] Write Protect is off sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sd 0:0:0:0: [sda] 1953523055 512-byte hardware sectors (1000204 MB) sd 0:0:0:0: [sda] Write Protect is off sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sda: sda1 sda2 sd 0:0:0:0: [sda] Attached SCSI disk md: bind<sda1> md: bind<sda2> md: kicking non-fresh sda2 from array! md: unbind<sda2> md: export_rdev(sda2) root@recoil:/home/illsci# smartctl -a /dev/sda smartctl version 5.36 [x86_64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ Device: ATA Hitachi HDS72101 Version: GKAO Serial number: GTJ000PAG2HZUC Device type: disk Local Time is: Wed Dec 19 14:13:47 2007 EST Device does not support SMART Error Counter logging not supported [GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on'] Device does not support Self Test logging root@recoil:/home/illsci# smartctl -a /dev/sdb smartctl version 5.36 [x86_64-unknown-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ Device: ATA Hitachi HDS72101 Version: GKAO Serial number: GTJ000PAG2K43C Device type: disk Local Time is: Wed Dec 19 14:13:49 2007 EST Device does not support SMART Error Counter logging not supported [GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on'] Device does not support Self Test logging On Dec 19, 2007 2:16 PM, Bill Davidsen <davidsen@tmr.com> wrote: > > Jon Sabo wrote: > > So I was trying to copy over some Indiana Jones wav files and it > > wasn't going my way. I noticed that my software raid device showed: > > > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > > > Is this saying that it was remounted, read only because it found a > > problem with the md1 meta device? That's what it looks like it's > > saying but I can still write to /. > > > > mdadm --detail showed: > > > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > > /dev/md0: > > Version : 00.90.03 > > Creation Time : Mon Jul 30 21:47:14 2007 > > Raid Level : raid1 > > Array Size : 1951744 ( 1906.32 MiB 1998.59 MB) > > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > > Raid Devices : 2 > > Total Devices : 1 > > Preferred Minor : 0 > > Persistence : Superblock is persistent > > > > Update Time : Wed Dec 19 12:59:56 2007 > > State : clean, degraded > > Active Devices : 1 > > Working Devices : 1 > > Failed Devices : 0 > > Spare Devices : 0 > > > > UUID : 157f716c:0e7aebca:c20741f6 > > :bb6099c9 > > Events : 0.28 > > > > Number Major Minor RaidDevice State > > 0 8 1 0 active sync /dev/sda1 > > 1 0 0 1 removed > > > > root@recoil:/home/illsci# mdadm --detail /dev/md1 > > /dev/md1: > > Version : 00.90.03 > > Creation Time : Mon Jul 30 21:47:47 2007 > > Raid Level : raid1 > > Array Size : 974808064 (929.65 GiB 998.20 GB) > > Device Size : 974808064 (929.65 GiB 998.20 GB) > > Raid Devices : 2 > > Total Devices : 1 > > Preferred Minor : 1 > > Persistence : Superblock is persistent > > > > Update Time : Wed Dec 19 13:14:53 2007 > > State : clean, degraded > > Active Devices : 1 > > Working Devices : 1 > > Failed Devices : 0 > > Spare Devices : 0 > > > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > > Events : 0.1990 > > > > Number Major Minor RaidDevice State > > 0 8 2 0 active sync /dev/sda2 > > 1 0 0 1 removed > > > > > > I have two 1 terabyte sata drives in this box. From what I was > > reading wouldn't it show an F for the failed drive? I thought I would > > see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F. > > What is this saying and how do you know that its /dev/sdb and not some > > other drive? It shows removed and that the state is clean, degraded. > > Is that something you can recover from with out returning this disk > > and putting in a new one to add to the raid1 array? > > > > You can try adding the partitions back to your array, but I suspect > something bad has happened to your sdb drive, since it's failed out of > both arrays. You can use dmesg to look for any additional information. > > Justin gave you the rest of the info you need to investigate, I'll not > repeat it. ;-) > > -- > Bill Davidsen <davidsen@tmr.com> > "Woe unto the statesman who makes war without a reason that will still > be valid when the war is over..." Otto von Bismark > > > ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 19:09 ` Jon Sabo @ 2007-12-19 19:15 ` Jon Sabo 2007-12-20 14:48 ` Bill Davidsen 0 siblings, 1 reply; 9+ messages in thread From: Jon Sabo @ 2007-12-19 19:15 UTC (permalink / raw) To: Bill Davidsen; +Cc: linux-raid I think I got it now. Thanks for your help! root@recoil:/home/illsci# mdadm --detail /dev/md0 /dev/md0: Version : 00.90.03 Creation Time : Mon Jul 30 21:47:14 2007 Raid Level : raid1 Array Size : 1951744 (1906.32 MiB 1998.59 MB) Device Size : 1951744 (1906.32 MiB 1998.59 MB) Raid Devices : 2 Total Devices : 1 Preferred Minor : 0 Persistence : Superblock is persistent Update Time : Wed Dec 19 14:15:31 2007 State : clean, degraded Active Devices : 1 Working Devices : 1 Failed Devices : 0 Spare Devices : 0 UUID : 157f716c:0e7aebca:c20741f6:bb6099c9 Events : 0.48 Number Major Minor RaidDevice State 0 8 1 0 active sync /dev/sda1 1 0 0 1 removed root@recoil:/home/illsci# mdadm --detail /dev/md1 /dev/md1: Version : 00.90.03 Creation Time : Mon Jul 30 21:47:47 2007 Raid Level : raid1 Array Size : 974808064 (929.65 GiB 998.20 GB) Device Size : 974808064 (929.65 GiB 998.20 GB) Raid Devices : 2 Total Devices : 1 Preferred Minor : 1 Persistence : Superblock is persistent Update Time : Wed Dec 19 14:19:06 2007 State : clean, degraded Active Devices : 1 Working Devices : 1 Failed Devices : 0 Spare Devices : 0 UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 Events : 0.1498998 Number Major Minor RaidDevice State 0 0 0 0 removed 1 8 18 1 active sync /dev/sdb2 root@recoil:/home/illsci# mdadm /dev/md0 -a /dev/sdb1 mdadm: re-added /dev/sdb1 root@recoil:/home/illsci# mdadm /dev/md1 -a /dev/sda2 mdadm: re-added /dev/sda2 root@recoil:/home/illsci# cat /proc/mdstat Personalities : [multipath] [raid1] md1 : active raid1 sda2[2] sdb2[1] 974808064 blocks [2/1] [_U] resync=DELAYED md0 : active raid1 sdb1[2] sda1[0] 1951744 blocks [2/1] [U_] [=================>...] recovery = 86.6% (1693504/1951744) finish=0.0min speed=80643K/sec unused devices: <none> root@recoil:/home/illsci# cat /proc/mdstat Personalities : [multipath] [raid1] md1 : active raid1 sda2[2] sdb2[1] 974808064 blocks [2/1] [_U] [>....................] recovery = 0.0% (86848/974808064) finish=186.9min speed=86848K/sec md0 : active raid1 sdb1[1] sda1[0] 1951744 blocks [2/2] [UU] unused devices: <none> On Dec 19, 2007 2:09 PM, Jon Sabo <jonathan.sabo@gmail.com> wrote: > We'll here's the rest of the info I should have sent in the last email: > > root@recoil:/home/illsci# cat /proc/mdstat > Personalities : [multipath] [raid1] > md1 : active raid1 sdb2[1] > 974808064 blocks [2/1] [_U] > > md0 : active raid1 sda1[0] > 1951744 blocks [2/1] [U_] > > unused devices: <none> > root@recoil:/home/illsci# dmesg | grep sdb > sd 1:0:0:0: [sdb] 1953523055 512-byte hardware sectors (1000204 MB) > sd 1:0:0:0: [sdb] Write Protect is off > sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 > sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't > support DPO or FUA > sd 1:0:0:0: [sdb] 1953523055 512-byte hardware sectors (1000204 MB) > sd 1:0:0:0: [sdb] Write Protect is off > sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 > sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't > support DPO or FUA > sdb: sdb1 sdb2 > sd 1:0:0:0: [sdb] Attached SCSI disk > md: bind<sdb1> > md: kicking non-fresh sdb1 from array! > md: unbind<sdb1> > md: export_rdev(sdb1) > md: bind<sdb2> > root@recoil:/home/illsci# dmesg | grep sda > sd 0:0:0:0: [sda] 1953523055 512-byte hardware sectors (1000204 MB) > sd 0:0:0:0: [sda] Write Protect is off > sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 > sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't > support DPO or FUA > sd 0:0:0:0: [sda] 1953523055 512-byte hardware sectors (1000204 MB) > sd 0:0:0:0: [sda] Write Protect is off > sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00 > sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't > support DPO or FUA > sda: sda1 sda2 > sd 0:0:0:0: [sda] Attached SCSI disk > md: bind<sda1> > md: bind<sda2> > md: kicking non-fresh sda2 from array! > md: unbind<sda2> > md: export_rdev(sda2) > > root@recoil:/home/illsci# smartctl -a /dev/sda > smartctl version 5.36 [x86_64-unknown-linux-gnu] Copyright (C) 2002-6 > Bruce Allen > Home page is http://smartmontools.sourceforge.net/ > > Device: ATA Hitachi HDS72101 Version: GKAO > Serial number: GTJ000PAG2HZUC > Device type: disk > Local Time is: Wed Dec 19 14:13:47 2007 EST > Device does not support SMART > > Error Counter logging not supported > > [GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on'] > Device does not support Self Test logging > root@recoil:/home/illsci# smartctl -a /dev/sdb > smartctl version 5.36 [x86_64-unknown-linux-gnu] Copyright (C) 2002-6 > Bruce Allen > Home page is http://smartmontools.sourceforge.net/ > > Device: ATA Hitachi HDS72101 Version: GKAO > Serial number: GTJ000PAG2K43C > Device type: disk > Local Time is: Wed Dec 19 14:13:49 2007 EST > Device does not support SMART > > Error Counter logging not supported > > [GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on'] > Device does not support Self Test logging > > > > > > On Dec 19, 2007 2:16 PM, Bill Davidsen <davidsen@tmr.com> wrote: > > > > Jon Sabo wrote: > > > So I was trying to copy over some Indiana Jones wav files and it > > > wasn't going my way. I noticed that my software raid device showed: > > > > > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > > > > > Is this saying that it was remounted, read only because it found a > > > problem with the md1 meta device? That's what it looks like it's > > > saying but I can still write to /. > > > > > > mdadm --detail showed: > > > > > > root@recoil:/home/illsci# mdadm --detail /dev/md0 > > > /dev/md0: > > > Version : 00.90.03 > > > Creation Time : Mon Jul 30 21:47:14 2007 > > > Raid Level : raid1 > > > Array Size : 1951744 ( 1906.32 MiB 1998.59 MB) > > > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > > > Raid Devices : 2 > > > Total Devices : 1 > > > Preferred Minor : 0 > > > Persistence : Superblock is persistent > > > > > > Update Time : Wed Dec 19 12:59:56 2007 > > > State : clean, degraded > > > Active Devices : 1 > > > Working Devices : 1 > > > Failed Devices : 0 > > > Spare Devices : 0 > > > > > > UUID : 157f716c:0e7aebca:c20741f6 > > > :bb6099c9 > > > Events : 0.28 > > > > > > Number Major Minor RaidDevice State > > > 0 8 1 0 active sync /dev/sda1 > > > 1 0 0 1 removed > > > > > > root@recoil:/home/illsci# mdadm --detail /dev/md1 > > > /dev/md1: > > > Version : 00.90.03 > > > Creation Time : Mon Jul 30 21:47:47 2007 > > > Raid Level : raid1 > > > Array Size : 974808064 (929.65 GiB 998.20 GB) > > > Device Size : 974808064 (929.65 GiB 998.20 GB) > > > Raid Devices : 2 > > > Total Devices : 1 > > > Preferred Minor : 1 > > > Persistence : Superblock is persistent > > > > > > Update Time : Wed Dec 19 13:14:53 2007 > > > State : clean, degraded > > > Active Devices : 1 > > > Working Devices : 1 > > > Failed Devices : 0 > > > Spare Devices : 0 > > > > > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > > > Events : 0.1990 > > > > > > Number Major Minor RaidDevice State > > > 0 8 2 0 active sync /dev/sda2 > > > 1 0 0 1 removed > > > > > > > > > I have two 1 terabyte sata drives in this box. From what I was > > > reading wouldn't it show an F for the failed drive? I thought I would > > > see that /dev/sdb1 and /dev/sdb2 were failed and it would show an F. > > > What is this saying and how do you know that its /dev/sdb and not some > > > other drive? It shows removed and that the state is clean, degraded. > > > Is that something you can recover from with out returning this disk > > > and putting in a new one to add to the raid1 array? > > > > > > > You can try adding the partitions back to your array, but I suspect > > something bad has happened to your sdb drive, since it's failed out of > > both arrays. You can use dmesg to look for any additional information. > > > > Justin gave you the rest of the info you need to investigate, I'll not > > repeat it. ;-) > > > > -- > > Bill Davidsen <davidsen@tmr.com> > > "Woe unto the statesman who makes war without a reason that will still > > be valid when the war is over..." Otto von Bismark > > > > > > > ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 19:15 ` Jon Sabo @ 2007-12-20 14:48 ` Bill Davidsen 0 siblings, 0 replies; 9+ messages in thread From: Bill Davidsen @ 2007-12-20 14:48 UTC (permalink / raw) To: Jon Sabo; +Cc: linux-raid Jon Sabo wrote: > I think I got it now. Thanks for your help! > Now just make our holiday cheer complete by waiting until the resync is complete and rebooting to be sure that everything is *really* back as it should be. ;-) > root@recoil:/home/illsci# mdadm --detail /dev/md0 > /dev/md0: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:14 2007 > Raid Level : raid1 > Array Size : 1951744 (1906.32 MiB 1998.59 MB) > Device Size : 1951744 (1906.32 MiB 1998.59 MB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 0 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 14:15:31 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 157f716c:0e7aebca:c20741f6:bb6099c9 > Events : 0.48 > > Number Major Minor RaidDevice State > 0 8 1 0 active sync /dev/sda1 > 1 0 0 1 removed > root@recoil:/home/illsci# mdadm --detail /dev/md1 > /dev/md1: > Version : 00.90.03 > Creation Time : Mon Jul 30 21:47:47 2007 > Raid Level : raid1 > Array Size : 974808064 (929.65 GiB 998.20 GB) > Device Size : 974808064 (929.65 GiB 998.20 GB) > Raid Devices : 2 > Total Devices : 1 > Preferred Minor : 1 > Persistence : Superblock is persistent > > Update Time : Wed Dec 19 14:19:06 2007 > State : clean, degraded > Active Devices : 1 > Working Devices : 1 > Failed Devices : 0 > Spare Devices : 0 > > UUID : 156a030e:9a6f8eb3:9b0c439e:d718e744 > Events : 0.1498998 > > Number Major Minor RaidDevice State > 0 0 0 0 removed > 1 8 18 1 active sync /dev/sdb2 > root@recoil:/home/illsci# mdadm /dev/md0 -a /dev/sdb1 > mdadm: re-added /dev/sdb1 > root@recoil:/home/illsci# mdadm /dev/md1 -a /dev/sda2 > mdadm: re-added /dev/sda2 > root@recoil:/home/illsci# cat /proc/mdstat > Personalities : [multipath] [raid1] > md1 : active raid1 sda2[2] sdb2[1] > 974808064 blocks [2/1] [_U] > resync=DELAYED > > md0 : active raid1 sdb1[2] sda1[0] > 1951744 blocks [2/1] [U_] > [=================>...] recovery = 86.6% (1693504/1951744) > finish=0.0min speed=80643K/sec > > unused devices: <none> > root@recoil:/home/illsci# cat /proc/mdstat > Personalities : [multipath] [raid1] > md1 : active raid1 sda2[2] sdb2[1] > 974808064 blocks [2/1] [_U] > [>....................] recovery = 0.0% (86848/974808064) > finish=186.9min speed=86848K/sec > > md0 : active raid1 sdb1[1] sda1[0] > 1951744 blocks [2/2] [UU] > > unused devices: <none> > -- Bill Davidsen <davidsen@tmr.com> "Woe unto the statesman who makes war without a reason that will still be valid when the war is over..." Otto von Bismark ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: help diagnosing bad disk 2007-12-19 18:18 help diagnosing bad disk Jon Sabo 2007-12-19 18:23 ` Justin Piszcz 2007-12-19 19:16 ` Bill Davidsen @ 2007-12-19 21:45 ` Iustin Pop 2 siblings, 0 replies; 9+ messages in thread From: Iustin Pop @ 2007-12-19 21:45 UTC (permalink / raw) To: Jon Sabo; +Cc: linux-raid On Wed, Dec 19, 2007 at 01:18:21PM -0500, Jon Sabo wrote: > So I was trying to copy over some Indiana Jones wav files and it > wasn't going my way. I noticed that my software raid device showed: > > /dev/md1 on / type ext3 (rw,errors=remount-ro) > > Is this saying that it was remounted, read only because it found a > problem with the md1 meta device? That's what it looks like it's > saying but I can still write to /. FYI, it means that it is currently "rw", and if there are errors, it will remount the filesystem readonly (as opposed to panic-ing). regards, iustin ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2007-12-20 14:48 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2007-12-19 18:18 help diagnosing bad disk Jon Sabo 2007-12-19 18:23 ` Justin Piszcz 2007-12-19 18:49 ` Jon Sabo 2007-12-19 19:43 ` Justin Piszcz 2007-12-19 19:16 ` Bill Davidsen 2007-12-19 19:09 ` Jon Sabo 2007-12-19 19:15 ` Jon Sabo 2007-12-20 14:48 ` Bill Davidsen 2007-12-19 21:45 ` Iustin Pop
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).