linux-raid.vger.kernel.org archive mirror
* Re: Crooked raid
@ 2005-11-16 15:41 Andrew Burgess
  2005-11-16 16:39 ` Guillaume Filion
  0 siblings, 1 reply; 10+ messages in thread
From: Andrew Burgess @ 2005-11-16 15:41 UTC (permalink / raw)
  To: linux-raid

>> Look at:
>>   mdadm -E /dev/hdc
>> If it has a superblock, zero it with 'mdadm --zero-superblock /dev/hdc'
>> Same for hdg

>I did this, rebooted and the system wouldn't reboot. Yikes! I was 
>however able to boot with giving root=/dev/hdc2 to the kernel.

I didn't realize we were talking about your root filesystem, or I would have
been more cautious (or maybe just kept my mouth shut). Glad you got it to boot!
Do you recall why it didn't want to boot? What are the kernel command-line args?

Did you check for a bootup script somewhere screwing things up?
  egrep -i 'raid|mdadm' /etc/rc.d/* /etc/rc.d/init.d/* /etc/*

And did you already say that you don't have a mdadm.conf file?

You might need to pick which mirror (hdc2 or hdg2) you trust more as your root
filesystem (since they may be different now) and then start over and follow the
HowTo for making a normal root filesystem into a raided one.

But if you can't find out why the system is still looking at hdc then it might
all happen again...

>Here's the relevant part of dmesg:
>device-mapper: 4.1.1-ioctl (2004-04-07) initialised: dm-devel@redhat.com
>md: can not import ide/host0/bus1/target0/lun0/part2, has active inodes!

I guess this is because you booted from it as hdc2, so it's busy and mdadm
won't try to use it?

Do you know why the system sometimes says ide/host0/bus1/target0/lun0/part2 and
sometimes says hd[cg]2? It's confusing...

>md0 : active raid1 ide/host2/bus1/target0/lun0/part2[0]
>       77508032 blocks [2/1] [U_]

>/dev/md0:
>    Raid Devices : 2
>   Total Devices : 2
>           State : active, degraded
>  Active Devices : 1
>Working Devices : 1
>  Failed Devices : 1
>        0      34        2        0      active sync   /dev/hdg2
>        1       0        0        1      faulty removed

>mdadm: No super block found on /dev/hdc (Expected magic a92b4efc, got 00000000)
>mdadm: No super block found on /dev/hdg (Expected magic a92b4efc, got 00000000)

Those two results are good

>/dev/hdc2:
..
>this     1      22        2        1      active sync   /dev/hdc2
>    0     0      34        2        0      active sync   /dev/hdg2
>    1     1      22        2        1      active sync   /dev/hdc2

>/dev/hdg2:
..
>this     0      34        2        0      active sync   /dev/hdg2
>    0     0      34        2        0      active sync   /dev/hdg2
>    1     1       0        0        1      faulty removed

I still don't see where hdc comes from. Sorry!

Any other raid superblocks around?
  for dev in /dev/hd? /dev/hd??; do mdadm -E "$dev"; done

And any raid autodetect partitions around?
  fdisk -l | grep raid
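
A minimal sketch of the autodetect check above, assuming fdisk describes
type-fd partitions as "Linux raid autodetect" and prints the device name in
the first column. The helper name list_raid_autodetect is made up for
illustration; it only filters text and never touches a disk.

```shell
#!/bin/sh
# Hypothetical helper: read 'fdisk -l' output on stdin and print the device
# name of every partition whose type description mentions "raid autodetect".
list_raid_autodetect() {
    grep -i 'raid autodetect' | awk '{ print $1 }'
}

# Usage, as root on the affected machine (left commented out on purpose):
#   fdisk -l 2>/dev/null | list_raid_autodetect
```

Any partition it lists is one the kernel may try to assemble at boot,
regardless of what mdadm.conf says.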



* Re: Crooked raid
@ 2005-11-16 16:07 Andrew Burgess
  0 siblings, 0 replies; 10+ messages in thread
From: Andrew Burgess @ 2005-11-16 16:07 UTC (permalink / raw)
  To: linux-raid

>I still don't see where hdc comes from. Sorry!

Maybe it's in the kernel boot args? Look in /boot/grub/grub.conf
if you use grub. Or /etc/lilo.conf or...
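
A rough sketch of that search, assuming the boot loader config lives in one
of the usual places. The helper name find_root_args is made up for
illustration, and /boot/grub/menu.lst in the usage line is an extra guess
for Debian-style GRUB installs, not something mentioned in this thread.

```shell
#!/bin/sh
# Hypothetical helper: for each readable file given, print the lines that
# mention root= as file:line-number:text.
find_root_args() {
    for cfg in "$@"; do
        if [ -r "$cfg" ]; then
            # The extra /dev/null argument makes grep print the file name
            # even when only one real file is searched.
            grep -n 'root=' "$cfg" /dev/null || true
        fi
    done
    return 0
}

# Usage (left commented out on purpose):
#   find_root_args /boot/grub/grub.conf /boot/grub/menu.lst /etc/lilo.conf
#   cat /proc/cmdline    # args the running kernel was actually booted with
```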


* Re: Crooked raid
@ 2005-11-15 18:44 Andrew Burgess
  2005-11-15 23:24 ` Guillaume Filion
  0 siblings, 1 reply; 10+ messages in thread
From: Andrew Burgess @ 2005-11-15 18:44 UTC (permalink / raw)
  To: gfk; +Cc: linux-raid

>Some time ago, I wanted to set up a software RAID-1 between hdc2 and
>hdg2. However, not being familiar with mdadm and software RAID, I ran a
>couple of bad commands. I don't remember the exact commands, but they
>amounted to putting /dev/hdc, /dev/hdc2, /dev/hdg and /dev/hdg2 in
>the same RAID.

Oops

>Because of this, I was getting all sorts of errors in my 
>dmesg (see dmesg-broken below), but otherwise the RAID would work fine.

>After setting my /etc/mdadm/mdadm.conf to:
>DEVICE /dev/hdc2 /dev/hdg2
>ARRAY /dev/md0 level=raid1 num-devices=2 UUID=b013e39b:ec629293:98df4657:97255939

>I tried to
>mdadm /dev/md0 --remove /dev/hdc
>but without success:
>mdadm: hot remove failed for /dev/hdc: No such device or address

Look at:
  mdadm -E /dev/hdc
If it has a superblock, zero it with 'mdadm --zero-superblock /dev/hdc'

Same for hdg

Just as a failsafe test, before you zero, compare mdadm -E for hdc2 and hdc;
they should print different things. You want to ensure you don't zero hd[cg]2
accidentally.
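
A minimal sketch of that failsafe, assuming the first line of each
'mdadm -E' dump is just the device-name header (as in the output quoted in
this thread), so it is skipped before comparing. The helper name
superblocks_differ is made up for illustration; it only compares two saved
text files and never writes to a disk.

```shell
#!/bin/sh
# Hypothetical helper: succeed only when two saved 'mdadm -E' dumps differ
# once the device-name header on line 1 is ignored.
superblocks_differ() {
    disk_body=$(sed '1d' "$1")   # dump of the whole disk, e.g. /dev/hdc
    part_body=$(sed '1d' "$2")   # dump of the partition, e.g. /dev/hdc2
    if [ "$disk_body" = "$part_body" ]; then
        echo "identical superblock dumps; do not zero anything" >&2
        return 1
    fi
    return 0
}

# Usage, as root on the affected machine (left commented out on purpose):
#   mdadm -E /dev/hdc  > /tmp/hdc.sb  2>&1
#   mdadm -E /dev/hdc2 > /tmp/hdc2.sb 2>&1
#   superblocks_differ /tmp/hdc.sb /tmp/hdc2.sb && mdadm --zero-superblock /dev/hdc
```

If the two dumps come out identical, stop and look more closely before
zeroing anything.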


* Crooked raid
@ 2005-11-15 15:20 Guillaume Filion
  2005-11-15 23:07 ` Neil Brown
  0 siblings, 1 reply; 10+ messages in thread
From: Guillaume Filion @ 2005-11-15 15:20 UTC (permalink / raw)
  To: linux-raid

Hi all,

Some time ago, I wanted to set up a software RAID-1 between hdc2 and
hdg2. However, not being familiar with mdadm and software RAID, I ran a
couple of bad commands. I don't remember the exact commands, but they
amounted to putting /dev/hdc, /dev/hdc2, /dev/hdg and /dev/hdg2 in
the same RAID. Because of this, I was getting all sorts of errors in my
dmesg (see dmesg-broken below), but otherwise the RAID would work fine.

After setting my /etc/mdadm/mdadm.conf to:
DEVICE /dev/hdc2 /dev/hdg2
ARRAY /dev/md0 level=raid1 num-devices=2 UUID=b013e39b:ec629293:98df4657:97255939

I no longer get these errors in my dmesg (see dmesg-now), but I can still
see inconsistent data when looking at md0 -- see the lines Total Devices
and Failed Devices in mdadm below.

I tried to
mdadm /dev/md0 --remove /dev/hdc
but without success:
mdadm: hot remove failed for /dev/hdc: No such device or address

I'm wondering what would be the easiest way to correct this. If 
possible, I'd prefer not having to start from scratch.

I'm running kernel 2.4.27-2-k7 on debian testing and using mdadm v1.12.0.

Thanks in advance,
GFK's

========= dmesg-broken ===========
device-mapper: 4.1.1-ioctl (2004-04-07) initialised: dm-devel@redhat.com
VFS: Disk change detected on device 21:00
  /dev/ide/host2/bus0/target0/lun0: p1
VFS: Disk change detected on device 21:00
  /dev/ide/host2/bus0/target0/lun0: p1
  /dev/ide/host2/bus0/target0/lun0: p1
  /dev/ide/host2/bus0/target0/lun0: p1
VFS: Disk change detected on device 21:00
  /dev/ide/host2/bus0/target0/lun0: p1
VFS: Disk change detected on device 21:00
  /dev/ide/host2/bus0/target0/lun0: p1
  [events: 0000005d]
md: bind<ide/host0/bus1/target0/lun0/part2,1>
  [events: 00000027]
md0: WARNING: ide/host0/bus1/target0/lun0/disc appears to be on the same physical disk as ide/host0/bus1/target0/lun0/part2. True
      protection against single-disk failure might be compromised.
md: bind<ide/host0/bus1/target0/lun0/disc,2>
  [events: 0000002a]
md: bind<ide/host2/bus1/target0/lun0/disc,3>
  [events: 0000005d]
md0: WARNING: ide/host2/bus1/target0/lun0/part2 appears to be on the same physical disk as ide/host2/bus1/target0/lun0/disc. True
      protection against single-disk failure might be compromised.
md: bind<ide/host2/bus1/target0/lun0/part2,4>
md: ide/host2/bus1/target0/lun0/part2's event counter: 0000005d
md: ide/host2/bus1/target0/lun0/disc's event counter: 0000002a
md: ide/host0/bus1/target0/lun0/disc's event counter: 00000027
md: ide/host0/bus1/target0/lun0/part2's event counter: 0000005d
md: superblock update time inconsistency -- using the most recent one
md: freshest: ide/host2/bus1/target0/lun0/part2
md: kicking non-fresh ide/host2/bus1/target0/lun0/disc from array!
md: unbind<ide/host2/bus1/target0/lun0/disc,3>
md: export_rdev(ide/host2/bus1/target0/lun0/disc)
md: kicking non-fresh ide/host0/bus1/target0/lun0/disc from array!
md: unbind<ide/host0/bus1/target0/lun0/disc,2>
md: export_rdev(ide/host0/bus1/target0/lun0/disc)
md0: max total readahead window set to 124k
raid1: device ide/host2/bus1/target0/lun0/part2 operational as mirror 0
raid1: device ide/host0/bus1/target0/lun0/part2 operational as mirror 1
raid1: raid set md0 active with 2 out of 2 mirrors
md: updating md0 RAID superblock on device
md: ide/host2/bus1/target0/lun0/part2 [events: 0000005e]<6>(write) ide/host2/bus1/target0/lun0/part2's sb offset: 77508032
md: ide/host0/bus1/target0/lun0/part2 [events: 0000005e]<6>(write) ide/host0/bus1/target0/lun0/part2's sb offset: 77508032
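
The log above suggests that md compares the per-device event counters and
kicks any member that lags behind the freshest one. A rough sketch of that
comparison over saved dmesg text, assuming the counters are always printed
as fixed-width lowercase hex (so plain string comparison orders them
correctly). The helper name stale_members is made up for illustration and
is not part of md or mdadm.

```shell
#!/bin/sh
# Hypothetical helper: read dmesg text on stdin and print every device whose
# event counter is lower than the highest counter seen, one per line.
stale_members() {
    awk -v q="'" '
        /event counter:/ {
            dev = $2
            sub(q "s$", "", dev)       # drop the trailing apostrophe-s
            count[dev] = $NF ""        # force string form of the hex value
            if (count[dev] > max) max = count[dev]
        }
        END {
            for (dev in count)
                if (count[dev] < max) print dev, count[dev]
        }
    ' | sort
}

# Usage (left commented out on purpose):
#   dmesg | stale_members
```

Fed the four "event counter" lines above, it lists the two whole-disk
members (the .../disc entries, counters 00000027 and 0000002a), which are
the same two devices the kernel reports kicking as non-fresh.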
========== dmesg-now =============
Partition check:
  /dev/ide/host0/bus1/target0/lun0: p1 p2
  /dev/ide/host2/bus0/target0/lun0: p1
  /dev/ide/host2/bus1/target0/lun0: p1 p2
  [events: 00000065]
md: bind<ide/host0/bus1/target0/lun0/part2,1>
  [events: 00000065]
md: bind<ide/host2/bus1/target0/lun0/part2,2>
md: ide/host2/bus1/target0/lun0/part2's event counter: 00000065
md: ide/host0/bus1/target0/lun0/part2's event counter: 00000065
md0: max total readahead window set to 124k
md0: 1 data-disks, max readahead per data-disk: 124k
raid1: device ide/host2/bus1/target0/lun0/part2 operational as mirror 0
raid1: device ide/host0/bus1/target0/lun0/part2 operational as mirror 1
raid1: raid set md0 active with 2 out of 2 mirrors
md: updating md0 RAID superblock on device
md: ide/host2/bus1/target0/lun0/part2 [events: 00000066]<6>(write) ide/host2/bus1/target0/lun0/part2's sb offset: 77508032
md: ide/host0/bus1/target0/lun0/part2 [events: 00000066]<6>(write) ide/host0/bus1/target0/lun0/part2's sb offset: 77508032
========== mdadm ==============
gfk@ali:~$ sudo mdadm --detail /dev/md0
/dev/md0:
         Version : 00.90.00
   Creation Time : Wed Dec 29 21:32:26 2004
      Raid Level : raid1
      Array Size : 77508032 (73.92 GiB 79.37 GB)
     Device Size : 77508032 (73.92 GiB 79.37 GB)
    Raid Devices : 2
   Total Devices : 3
Preferred Minor : 0
     Persistence : Superblock is persistent

     Update Time : Mon Nov 14 19:17:30 2005
           State : active
  Active Devices : 2
Working Devices : 2
  Failed Devices : 1
   Spare Devices : 0

            UUID : b013e39b:ec629293:98df4657:97255939
          Events : 0.102

     Number   Major   Minor   RaidDevice State
        0      34        2        0      active sync   /dev/hdg2
        1      22        2        1      active sync   /dev/hdc2
gfk@ali:~$
=============================
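
The mismatch between the summary counters and the device table can be
checked mechanically. A minimal sketch, assuming the 'mdadm --detail'
layout shown above (summary lines of the form "Name : value", table rows
starting with four integers). The helper name check_md_counts is made up
for illustration.

```shell
#!/bin/sh
# Hypothetical helper: read 'mdadm --detail' output on stdin, print the
# counters found, and fail when they disagree with the device table.
check_md_counts() {
    awk '
        /Total Devices :/  { total  = $NF }
        /Failed Devices :/ { failed = $NF }
        # Device-table rows start with four integers
        # (Number Major Minor RaidDevice).
        $1 ~ /^[0-9]+$/ && $2 ~ /^[0-9]+$/ &&
        $3 ~ /^[0-9]+$/ && $4 ~ /^[0-9]+$/ { rows++ }
        END {
            printf "total=%d failed=%d listed=%d\n", total, failed, rows
            exit (total == rows && failed == 0) ? 0 : 1
        }
    '
}

# Usage (left commented out on purpose):
#   mdadm --detail /dev/md0 | check_md_counts || echo "counters look off"
```

For the output posted above it prints "total=3 failed=1 listed=2" and
returns a non-zero status, which matches the inconsistency described in
this message.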

-- 
Guillaume Filion, ing. jr
Logidac Tech., Beaumont, Québec, Canada - http://logidac.com/
PGP Key and more: http://guillaume.filion.org/


end of thread, other threads:[~2005-11-21  3:50 UTC | newest]

Thread overview: 10+ messages
2005-11-16 15:41 Crooked raid Andrew Burgess
2005-11-16 16:39 ` Guillaume Filion
2005-11-16 23:55   ` Guillaume Filion
2005-11-21  3:50     ` Crooked raid [solved] Guillaume Filion
  -- strict thread matches above, loose matches on Subject: below --
2005-11-16 16:07 Crooked raid Andrew Burgess
2005-11-15 18:44 Andrew Burgess
2005-11-15 23:24 ` Guillaume Filion
2005-11-15 15:20 Guillaume Filion
2005-11-15 23:07 ` Neil Brown
2005-11-16 13:47   ` Guillaume Filion
