linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Newbie: Kernel panic during RAID1 test & reboot loses one disk
@ 2006-08-26 21:31 James Brown
  2006-08-28  1:48 ` Neil Brown
  0 siblings, 1 reply; 5+ messages in thread
From: James Brown @ 2006-08-26 21:31 UTC (permalink / raw)
  To: linux-raid

All,

I'm fairly new to Linux/Debian and have been trying to configure mdadm 
for RAID1 with 2x120Gb IDE disks. Unfortunately, I have two problems 
with the configuration and would really appreciate some advice.

* Problem 1: Since moving from 2.4 -> 2.6 kernel, a reboot kicks one 
device out of the array (c.f. post by Andreas Pelzner on 24th Aug 2006).

* Problem 2: When booting my system, unless both disks plugged in, I get 
a kernel panic (oh dear!):

 > mdadm md0 stopped
 > mdadm cannot open device /dev/hda6 no such device or address
 > mdadm /dev/hda6 has wrong uuid
 > mdadm no devices found for /dev/md0
 > ext3fs unable to read superblock
 > ecit 2 - unable to read superblock cramfs
 > kernel panic attempting to kill init

Here is the information about my system/config:

* System Info:
# uname -a
Linux cinzano. 2.6.8-3-386 #1 Sat Jul 15 09:26:40 UTC 2006 i686 GNU/Linux

* mdadm Config:
# cat /etc/mdadm/mdadm.conf
DEVICE partitions
ARRAY /dev/md1 level=raid1 num-devices=2 
UUID=cc518d12:0e602331:8715a849:6dac0873
    devices=/dev/hda7,/dev/hdc7
ARRAY /dev/md0 level=raid1 num-devices=2 
UUID=07c5cab1:1b86a5ca:f4599353:4ccfc5c1
    devices=/dev/hda6,/dev/hdc6

* After reboot:
# cat /proc/mdstat
Personalities : [raid1]
md1 : active raid1 hda7[0]
       4675264 blocks [2/1] [U_]
md0 : active raid1 hda6[0]
       101562816 blocks [2/1] [U_]

* After hotadding again:
# cat /proc/mdstat
Personalities : [raid1]
md1 : active raid1 hda7[0] hdc7[1]
       14675264 blocks [2/2] [UU]
md0 : active raid1 hdc6[1] hda6[0]
       101562816 blocks [2/2] [UU]

* Mdadm version
# apt-show-versions | grep mdadm
mdadm/stable uptodate 1.9.0-4sarge1

* System md logs don't mention hdc6
# grep md /var/log/messages
Aug 26 14:21:32 cinzano kernel: Kernel command line: root=/dev/md0 ro
Aug 26 14:21:32 cinzano kernel: md: md driver 0.90.0 MAX_MD_DEVS=256, 
MD_SB_DISKS=27
Aug 26 14:21:32 cinzano kernel: md: raid1 personality registered as nr 3
Aug 26 14:21:32 cinzano kernel: md: md0 stopped.
Aug 26 14:21:32 cinzano kernel: md: bind<hda6>
Aug 26 14:21:32 cinzano kernel: raid1: raid set md0 active with 1 out of 
2 mirrors
Aug 26 14:21:32 cinzano kernel: EXT3 FS on md0, internal journal
Aug 26 14:21:32 cinzano kernel: md: md1 stopped.
Aug 26 14:21:32 cinzano kernel: md: bind<hdc7>
Aug 26 14:21:32 cinzano kernel: md: bind<hda7>
Aug 26 14:21:32 cinzano kernel: raid1: raid set md1 active with 2 out of 
2 mirrors
Aug 26 14:21:32 cinzano kernel: EXT3 FS on md1, internal journal
Aug 26 14:25:43 cinzano kernel: Kernel command line: root=/dev/md0 ro
Aug 26 14:25:43 cinzano kernel: md: md driver 0.90.0 MAX_MD_DEVS=256, 
MD_SB_DISKS=27
Aug 26 14:25:43 cinzano kernel: md: raid1 personality registered as nr 3
Aug 26 14:25:43 cinzano kernel: md: md0 stopped.
Aug 26 14:25:43 cinzano kernel: md: bind<hda6>
Aug 26 14:25:43 cinzano kernel: raid1: raid set md0 active with 1 out of 
2 mirrors
Aug 26 14:25:43 cinzano kernel: EXT3-fs: md0: orphan cleanup on readonly fs
Aug 26 14:25:43 cinzano kernel: EXT3-fs: md0: 3 orphan inodes deleted
Aug 26 14:25:43 cinzano kernel: EXT3 FS on md0, internal journal
Aug 26 14:25:43 cinzano kernel: md: md1 stopped.
Aug 26 14:25:43 cinzano kernel: md: bind<hdc7>
Aug 26 14:25:43 cinzano kernel: md: bind<hda7>
Aug 26 14:25:43 cinzano kernel: raid1: raid set md1 active with 2 out of 
2 mirrors
Aug 26 14:25:43 cinzano kernel: EXT3 FS on md1, internal journal

* FDisk output
# fdisk -l
Disk /dev/hda: 120.0 GB, 120034123776 bytes
255 heads, 63 sectors/track, 14593 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

    Device Boot      Start         End      Blocks   Id  System
/dev/hda1               1       14593   117218241    5  Extended
/dev/hda5               1         122      979902   82  Linux swap/Sola.
/dev/hda6             123       12766   101562898+  fd  Linux raid auto.
/dev/hda7           12767       14593    14675346   fd  Linux raid auto.

Disk /dev/hdc: 120.0 GB, 120034123776 bytes
255 heads, 63 sectors/track, 14593 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

    Device Boot      Start         End      Blocks   Id  System
/dev/hdc1               1       14593   117218241    5  Extended
/dev/hdc5               1         122      979902   82  Linux swap/Sola.
/dev/hdc6             123       12766   101562898+  fd  Linux raid auto.
/dev/hdc7           12767       14593    14675346   fd  Linux raid auto.

Disk /dev/md0: 104.0 GB, 104000323584 bytes
2 heads, 4 sectors/track, 25390704 cylinders
Units = cylinders of 8 * 512 = 4096 bytes

Disk /dev/md0 doesn't contain a valid partition table

Disk /dev/md1: 15.0 GB, 15027470336 bytes
2 heads, 4 sectors/track, 3668816 cylinders
Units = cylinders of 8 * 512 = 4096 bytes

Disk /dev/md1 doesn't contain a valid partition table

* Grub config
# cat /boot/grub/menu.lst
[...]
# groot=(hd0,5)
[...]
title           Debian GNU/Linux, kernel 2.6.8-3-386
root            (hd0,5)
kernel          /boot/vmlinuz-2.6.8-3-386 root=/dev/md0 ro
initrd          /boot/initrd.img-2.6.8-3-386
savedefault
boot
[...]

* Mounts
# cat /etc/fstab
# <file system> <mount point>   <type>  <options>       <dump>  <pass>
proc            /proc           proc    defaults        0       0
/dev/md0        /               ext3    defaults,errors=remount-ro 0       1
/dev/md1        /var/mail       ext3    defaults        0       2
/dev/hda5       none            swap    sw              0       0
/dev/hdc5       none            swap    sw              0       0
/dev/hdd        /media/cdrom0   iso9660 ro,user,noauto  0       0

Many thanks to anyone who can help.

James.


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Newbie: Kernel panic during RAID1 test & reboot loses one disk
  2006-08-26 21:31 Newbie: Kernel panic during RAID1 test & reboot loses one disk James Brown
@ 2006-08-28  1:48 ` Neil Brown
  2006-08-28 21:53   ` James Brown
  0 siblings, 1 reply; 5+ messages in thread
From: Neil Brown @ 2006-08-28  1:48 UTC (permalink / raw)
  To: James Brown; +Cc: linux-raid

On Saturday August 26, jbrown@orange.net wrote:
> All,
> 
> I'm fairly new to Linux/Debian and have been trying to configure mdadm 
> for RAID1 with 2x120Gb IDE disks. Unfortunately, I have two problems 
> with the configuration and would really appreciate some advice.
> 
> * Problem 1: Since moving from 2.4 -> 2.6 kernel, a reboot kicks one 
> device out of the array (c.f. post by Andreas Pelzner on 24th Aug 2006).
> 
> * Problem 2: When booting my system, unless both disks plugged in, I get 
> a kernel panic (oh dear!):
> 
>  > mdadm md0 stopped
>  > mdadm cannot open device /dev/hda6 no such device or address
>  > mdadm /dev/hda6 has wrong uuid
>  > mdadm no devices found for /dev/md0
>  > ext3fs unable to read superblock
>  > ecit 2 - unable to read superblock cramfs
>  > kernel panic attempting to kill init

At a guess, I'd say something is wrong with your initramfs/initrd.
Can you look inside it and see what /etc/mdadm/mdadm.conf contains?

> * mdadm Config:
> # cat /etc/mdadm/mdadm.conf
> DEVICE partitions
> ARRAY /dev/md1 level=raid1 num-devices=2 
> UUID=cc518d12:0e602331:8715a849:6dac0873
>     devices=/dev/hda7,/dev/hdc7
> ARRAY /dev/md0 level=raid1 num-devices=2 
> UUID=07c5cab1:1b86a5ca:f4599353:4ccfc5c1
>     devices=/dev/hda6,/dev/hdc6

You don't want the 'devices=' lines, but they shouldn't be causing a
problem in this context.

> * System md logs don't mention hdc6
> # grep md /var/log/messages

 grep -C 5 md /var/log/messages
might be better as it gives a bit more context.l

But I'm betting on the initramfs being a problem.

NeilBrown

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Newbie: Kernel panic during RAID1 test & reboot loses one disk
  2006-08-28  1:48 ` Neil Brown
@ 2006-08-28 21:53   ` James Brown
  2006-08-28 22:08     ` James Brown
  2006-09-04  5:35     ` Neil Brown
  0 siblings, 2 replies; 5+ messages in thread
From: James Brown @ 2006-08-28 21:53 UTC (permalink / raw)
  To: Neil Brown; +Cc: linux-raid

Neil Brown wrote:
> On Saturday August 26, jbrown@orange.net wrote:
>> All,
>>
>> [...]
>>
>> * Problem 1: Since moving from 2.4 -> 2.6 kernel, a reboot kicks one 
>> device out of the array (c.f. post by Andreas Pelzner on 24th Aug 2006).
>>
>> * Problem 2: When booting my system, unless both disks plugged in, I get 
>> a kernel panic (oh dear!):
>>
>>  > mdadm md0 stopped
>>  > mdadm cannot open device /dev/hda6 no such device or address
>>  > mdadm /dev/hda6 has wrong uuid
>>  > mdadm no devices found for /dev/md0
>>  > ext3fs unable to read superblock
>>  > ecit 2 - unable to read superblock cramfs
>>  > kernel panic attempting to kill init
> 
> At a guess, I'd say something is wrong with your initramfs/initrd.
> Can you look inside it and see what /etc/mdadm/mdadm.conf contains?

Sure, this is the first time I've mounted an initrd, here goes:

# file /boot/initrd.img-2.6.8-3-386
[...]Linux Compressed ROM File System data, little endian size 4333568 
version #2 sorted_dirs CRC 0xa04ccaa3, edition 0, 2492 blocks, 312 files

# losetup /dev/loop0

# mkdir /tmp/initrdmount

# mount -t cramfs /dev/loop0 /tmp/initrdmount

# ls -al /tmp/initrdmount/etc/
total 1.0K
drwxr-xr-x  1 root root 64 1970-01-01 01:00 modprobe.d/
-rw-r--r--  1 root root  0 1970-01-01 01:00 mtab

There is no mdadm/mdadm.conf! What I should do about this?

> [...]
>> * System md logs don't mention hdc6
>> # grep md /var/log/messages
> 
>  grep -C 5 md /var/log/messages
> might be better as it gives a bit more context.l

I've put the messages here:

http://www.zen6780.zen.co.uk/messages.txt

Many thanks for your time.

James.

> But I'm betting on the initramfs being a problem.
> 
> NeilBrown
> -
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Newbie: Kernel panic during RAID1 test & reboot loses one disk
  2006-08-28 21:53   ` James Brown
@ 2006-08-28 22:08     ` James Brown
  2006-09-04  5:35     ` Neil Brown
  1 sibling, 0 replies; 5+ messages in thread
From: James Brown @ 2006-08-28 22:08 UTC (permalink / raw)
  To: James Brown; +Cc: linux-raid

James Brown wrote:

> [...]
> There is no mdadm/mdadm.conf! What I should do about this?

Having just read the post from Andreas Pelzner, perhaps I should create 
a new initrd:

 > Andreas Pelzner wrote:
> you told me the rigt way. I had to add the lines "raid1" and "md_mod" to
> /etc/mkinitrd/modules. After recreating the initrd image "mkinitrd -o
> /boot/initrd.img-2.6.17.8 /lib/modules/2.6.17.8" the server boots into
> both raid disk correctly.

I don't understand the "raid1" or "md_mod" steps here. Could anyone 
explain please?

Regards,

James.


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Newbie: Kernel panic during RAID1 test & reboot loses one disk
  2006-08-28 21:53   ` James Brown
  2006-08-28 22:08     ` James Brown
@ 2006-09-04  5:35     ` Neil Brown
  1 sibling, 0 replies; 5+ messages in thread
From: Neil Brown @ 2006-09-04  5:35 UTC (permalink / raw)
  To: James Brown; +Cc: linux-raid

On Monday August 28, jbrown@orange.net wrote:
> Neil Brown wrote:
> > On Saturday August 26, jbrown@orange.net wrote:
> >> All,
> >>
> >> [...]
> >>
> >> * Problem 1: Since moving from 2.4 -> 2.6 kernel, a reboot kicks one 
> >> device out of the array (c.f. post by Andreas Pelzner on 24th Aug 2006).
> >>
> >> * Problem 2: When booting my system, unless both disks plugged in, I get 
> >> a kernel panic (oh dear!):
> >>
> >>  > mdadm md0 stopped
> >>  > mdadm cannot open device /dev/hda6 no such device or address
> >>  > mdadm /dev/hda6 has wrong uuid
> >>  > mdadm no devices found for /dev/md0
> >>  > ext3fs unable to read superblock
> >>  > ecit 2 - unable to read superblock cramfs
> >>  > kernel panic attempting to kill init
> > 
> > At a guess, I'd say something is wrong with your initramfs/initrd.
> > Can you look inside it and see what /etc/mdadm/mdadm.conf contains?
> 
> Sure, this is the first time I've mounted an initrd, here goes:
> 
> # file /boot/initrd.img-2.6.8-3-386
> [...]Linux Compressed ROM File System data, little endian size 4333568 
> version #2 sorted_dirs CRC 0xa04ccaa3, edition 0, 2492 blocks, 312 files
> 
> # losetup /dev/loop0
> 
> # mkdir /tmp/initrdmount
> 
> # mount -t cramfs /dev/loop0 /tmp/initrdmount
> 
> # ls -al /tmp/initrdmount/etc/
> total 1.0K
> drwxr-xr-x  1 root root 64 1970-01-01 01:00 modprobe.d/
> -rw-r--r--  1 root root  0 1970-01-01 01:00 mtab
> 
> There is no mdadm/mdadm.conf! What I should do about this?

Sorry, I don't think.  You'll have to ask on some Debian list.  I
don't know the intricacies of Debian initrd.

NeilBrown

-- 
VGER BF report: U 0.5

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2006-09-04  5:35 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-08-26 21:31 Newbie: Kernel panic during RAID1 test & reboot loses one disk James Brown
2006-08-28  1:48 ` Neil Brown
2006-08-28 21:53   ` James Brown
2006-08-28 22:08     ` James Brown
2006-09-04  5:35     ` Neil Brown

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).