* no resync after hotadd
@ 2003-09-23 14:43 Cress, Andrew R
2003-09-25 7:18 ` Lars Marowsky-Bree
0 siblings, 1 reply; 3+ messages in thread
From: Cress, Andrew R @ 2003-09-23 14:43 UTC (permalink / raw)
To: linux-raid
[-- Attachment #1: Type: text/plain, Size: 5769 bytes --]
Neil,
I've seen this before, but it is very reproducable on the SuSE/UL system
I am testing on now.
When I hotadd a new/replacement disk (sdb) into the raid device, the
resync doesn't start.
From the superblocks, apparently the active disk still has total disks=1
so only 1 active, but the new disk superblock was written with total
disks=2 and both active.
The configuration is raid1 with sda and sdb, and sdb was hot-removed and
hot-added (each partition with mdadm -a).
I've included the resulting superblocks, and the syslog for the add
sequence is attached.
Is this a problem with the md driver?
Andy
# uname -a
Linux telcoraid 2.4.21-55-smp #1 SMP Wed Aug 27 19:53:48 UTC 2003 i686
unknown
# cat /proc/mdstat
Personalities : [raid1]
read_ahead 1024 sectors
md2 : active raid1 sdb2[2] sda2[1]
136448 blocks [2/1] [_U]
md1 : active raid1 sdb1[2] sda1[1]
80256 blocks [2/1] [_U]
md0 : active raid1 sdb3[2] sda3[1]
17518784 blocks [2/1] [_U]
unused devices: <none>
#
# mdadm -E /dev/sda1
/dev/sda1:
Magic : a92b4efc
Version : 00.90.00
UUID : 15e24cfe:af2ecf43:9c74fae1:7aabf6f0
Creation Time : Thu Sep 18 13:12:09 2003
Raid Level : raid1
Device Size : 80256 (78.38 MiB 82.18 MB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 1
Update Time : Mon Sep 22 12:58:41 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : 43828e3 - correct
Events : 0.28
Number Major Minor RaidDevice State
this 1 8 1 1 active sync /dev/sda1
0 0 0 0 0 faulty removed
1 1 8 1 1 active sync /dev/sda1
/dev/sdb1:
Magic : a92b4efc
Version : 00.90.00
UUID : 15e24cfe:af2ecf43:9c74fae1:7aabf6f0
Creation Time : Thu Sep 18 13:12:09 2003
Raid Level : raid1
Device Size : 80256 (78.38 MiB 82.18 MB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 1
Update Time : Mon Sep 22 09:37:09 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : 437fa2f - correct
Events : 0.25
Number Major Minor RaidDevice State
this 0 8 17 0 active sync /dev/sdb1
0 0 8 17 0 active sync /dev/sdb1
1 1 8 1 1 active sync /dev/sda1
/dev/sda2:
Magic : a92b4efc
Version : 00.90.00
UUID : 16df7960:13fe43ac:ac0cc4ba:42690535
Creation Time : Thu Sep 18 13:12:56 2003
Raid Level : raid1
Device Size : 136448 (133.25 MiB 139.72 MB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 2
Update Time : Mon Sep 22 12:58:41 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : 415a7d7f - correct
Events : 0.30
Number Major Minor RaidDevice State
this 1 8 2 1 active sync /dev/sda2
0 0 0 0 0 faulty removed
1 1 8 2 1 active sync /dev/sda2
/dev/sdb2:
Magic : a92b4efc
Version : 00.90.00
UUID : 16df7960:13fe43ac:ac0cc4ba:42690535
Creation Time : Thu Sep 18 13:12:56 2003
Raid Level : raid1
Device Size : 136448 (133.25 MiB 139.72 MB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 2
Update Time : Mon Sep 22 09:37:09 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : 415a4ecc - correct
Events : 0.27
Number Major Minor RaidDevice State
this 0 8 18 0 active sync /dev/sdb2
0 0 8 18 0 active sync /dev/sdb2
1 1 8 2 1 active sync /dev/sda2
/dev/sda3:
Magic : a92b4efc
Version : 00.90.00
UUID : 31b4390a:af5afa82:44c73f24:a2a36931
Creation Time : Thu Sep 18 13:13:12 2003
Raid Level : raid1
Device Size : 17518784 (16.71 GiB 17.94 GB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 0
Update Time : Mon Sep 22 12:58:40 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : f18a0e32 - correct
Events : 0.28
Number Major Minor RaidDevice State
this 1 8 3 1 active sync /dev/sda3
0 0 0 0 0 faulty removed
1 1 8 3 1 active sync /dev/sda3
/dev/sdb3:
Magic : a92b4efc
Version : 00.90.00
UUID : 31b4390a:af5afa82:44c73f24:a2a36931
Creation Time : Thu Sep 18 13:13:12 2003
Raid Level : raid1
Device Size : 17518784 (16.71 GiB 17.94 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Update Time : Mon Sep 22 09:37:08 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : f189df80 - correct
Events : 0.25
Number Major Minor RaidDevice State
this 0 8 19 0 active sync /dev/sdb3
0 0 8 19 0 active sync /dev/sdb3
1 1 8 3 1 active sync /dev/sda3
[-- Attachment #2: syslog --]
[-- Type: application/octet-stream, Size: 15516 bytes --]
Sep 23 10:14:38 telcoraid kernel: md: trying to hot-add sdb1 to md1 ...
Sep 23 10:14:38 telcoraid kernel: md: bind<sdb1,2>
Sep 23 10:14:38 telcoraid kernel: RAID1 conf printout:
Sep 23 10:14:38 telcoraid kernel: --- wd:1 rd:2 nd:1
Sep 23 10:14:38 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda1
Sep 23 10:14:38 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: RAID1 conf printout:
Sep 23 10:14:38 telcoraid kernel: --- wd:1 rd:2 nd:2
Sep 23 10:14:38 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda1
Sep 23 10:14:38 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:1 dev:sdb1
Sep 23 10:14:38 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:38 telcoraid kernel: md: updating md1 RAID superblock on device
Sep 23 10:14:38 telcoraid kernel: md: sdb1 (skipping alias)
Sep 23 10:14:38 telcoraid kernel: md: sda1 (skipping alias)
Sep 23 10:14:38 telcoraid kernel: md: recovery thread got woken up ...
Sep 23 10:14:38 telcoraid kernel: md: recovery thread finished ...
Sep 23 10:14:50 telcoraid kernel: md: trying to hot-add sdb2 to md2 ...
Sep 23 10:14:50 telcoraid kernel: md: bind<sdb2,2>
Sep 23 10:14:50 telcoraid kernel: RAID1 conf printout:
Sep 23 10:14:50 telcoraid kernel: --- wd:1 rd:2 nd:1
Sep 23 10:14:50 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda2
Sep 23 10:14:50 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: RAID1 conf printout:
Sep 23 10:14:50 telcoraid kernel: --- wd:1 rd:2 nd:2
Sep 23 10:14:50 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda2
Sep 23 10:14:50 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:1 dev:sdb2
Sep 23 10:14:50 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:14:50 telcoraid kernel: md: updating md2 RAID superblock on device
Sep 23 10:14:50 telcoraid kernel: md: sdb2 (skipping alias)
Sep 23 10:14:50 telcoraid kernel: md: sda2 (skipping alias)
Sep 23 10:14:50 telcoraid kernel: md: recovery thread got woken up ...
Sep 23 10:14:50 telcoraid kernel: md: recovery thread finished ...
Sep 23 10:15:13 telcoraid kernel: md: trying to hot-add sdb3 to md0 ...
Sep 23 10:15:13 telcoraid kernel: md: bind<sdb3,2>
Sep 23 10:15:13 telcoraid kernel: RAID1 conf printout:
Sep 23 10:15:13 telcoraid kernel: --- wd:1 rd:2 nd:1
Sep 23 10:15:13 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda3
Sep 23 10:15:13 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: RAID1 conf printout:
Sep 23 10:15:13 telcoraid kernel: --- wd:1 rd:2 nd:2
Sep 23 10:15:13 telcoraid kernel: disk 0, s:0, o:0, n:0 rd:0 us:1 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 1, s:0, o:1, n:1 rd:1 us:1 dev:sda3
Sep 23 10:15:13 telcoraid kernel: disk 2, s:1, o:0, n:2 rd:2 us:1 dev:sdb3
Sep 23 10:15:13 telcoraid kernel: disk 3, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 4, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 5, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 6, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 7, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 8, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 9, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 10, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 11, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 12, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 13, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 14, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 15, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 16, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 17, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 18, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 19, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 20, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 21, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 22, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 23, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 24, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 25, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: disk 26, s:0, o:0, n:0 rd:0 us:0 dev:[dev 00:00]
Sep 23 10:15:13 telcoraid kernel: md: updating md0 RAID superblock on device
Sep 23 10:15:13 telcoraid kernel: md: sdb3 (skipping alias)
Sep 23 10:15:13 telcoraid kernel: md: sda3 (skipping alias)
Sep 23 10:15:13 telcoraid kernel: md: recovery thread got woken up ...
Sep 23 10:15:13 telcoraid kernel: md: recovery thread finished ...
^ permalink raw reply [flat|nested] 3+ messages in thread
* RE: no resync after hotadd
@ 2003-09-24 18:56 Cress, Andrew R
0 siblings, 0 replies; 3+ messages in thread
From: Cress, Andrew R @ 2003-09-24 18:56 UTC (permalink / raw)
To: linux-raid
In looking at the code, hot_add_disk does:
mddev->sb>nr_disks++;
mddev->sb>spare_disks++;
mddev->sb>working_disks++;
then do_recovery checks
if (sb->active_disks == sb->raid_disks)
continue;
which seems ok to me, but I'd guess that the problem is in the counters
somewhere.
Perhaps it is looking at the new sdb sb instead of the one from sda?
Andy
-----Original Message-----
From: linux-raid-owner@vger.kernel.org
[mailto:linux-raid-owner@vger.kernel.org] On Behalf Of Cress, Andrew R
Sent: Tuesday, September 23, 2003 10:44 AM
To: linux-raid@vger.kernel.org
Subject: no resync after hotadd
Neil,
I've seen this before, but it is very reproducable on the SuSE/UL system
I am testing on now.
When I hotadd a new/replacement disk (sdb) into the raid device, the
resync doesn't start.
From the superblocks, apparently the active disk still has total disks=1
so only 1 active, but the new disk superblock was written with total
disks=2 and both active.
The configuration is raid1 with sda and sdb, and sdb was hot-removed and
hot-added (each partition with mdadm -a).
I've included the resulting superblocks, and the syslog for the add
sequence is attached.
Is this a problem with the md driver?
Andy
# uname -a
Linux telcoraid 2.4.21-55-smp #1 SMP Wed Aug 27 19:53:48 UTC 2003 i686
unknown
# cat /proc/mdstat
Personalities : [raid1]
read_ahead 1024 sectors
md2 : active raid1 sdb2[2] sda2[1]
136448 blocks [2/1] [_U]
md1 : active raid1 sdb1[2] sda1[1]
80256 blocks [2/1] [_U]
md0 : active raid1 sdb3[2] sda3[1]
17518784 blocks [2/1] [_U]
unused devices: <none>
#
# mdadm -E /dev/sda1
/dev/sda1:
Magic : a92b4efc
Version : 00.90.00
UUID : 15e24cfe:af2ecf43:9c74fae1:7aabf6f0
Creation Time : Thu Sep 18 13:12:09 2003
Raid Level : raid1
Device Size : 80256 (78.38 MiB 82.18 MB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 1
Update Time : Mon Sep 22 12:58:41 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : 43828e3 - correct
Events : 0.28
Number Major Minor RaidDevice State
this 1 8 1 1 active sync /dev/sda1
0 0 0 0 0 faulty removed
1 1 8 1 1 active sync /dev/sda1
/dev/sdb1:
Magic : a92b4efc
Version : 00.90.00
UUID : 15e24cfe:af2ecf43:9c74fae1:7aabf6f0
Creation Time : Thu Sep 18 13:12:09 2003
Raid Level : raid1
Device Size : 80256 (78.38 MiB 82.18 MB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 1
Update Time : Mon Sep 22 09:37:09 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : 437fa2f - correct
Events : 0.25
Number Major Minor RaidDevice State
this 0 8 17 0 active sync /dev/sdb1
0 0 8 17 0 active sync /dev/sdb1
1 1 8 1 1 active sync /dev/sda1
/dev/sda2:
Magic : a92b4efc
Version : 00.90.00
UUID : 16df7960:13fe43ac:ac0cc4ba:42690535
Creation Time : Thu Sep 18 13:12:56 2003
Raid Level : raid1
Device Size : 136448 (133.25 MiB 139.72 MB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 2
Update Time : Mon Sep 22 12:58:41 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : 415a7d7f - correct
Events : 0.30
Number Major Minor RaidDevice State
this 1 8 2 1 active sync /dev/sda2
0 0 0 0 0 faulty removed
1 1 8 2 1 active sync /dev/sda2
/dev/sdb2:
Magic : a92b4efc
Version : 00.90.00
UUID : 16df7960:13fe43ac:ac0cc4ba:42690535
Creation Time : Thu Sep 18 13:12:56 2003
Raid Level : raid1
Device Size : 136448 (133.25 MiB 139.72 MB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 2
Update Time : Mon Sep 22 09:37:09 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : 415a4ecc - correct
Events : 0.27
Number Major Minor RaidDevice State
this 0 8 18 0 active sync /dev/sdb2
0 0 8 18 0 active sync /dev/sdb2
1 1 8 2 1 active sync /dev/sda2
/dev/sda3:
Magic : a92b4efc
Version : 00.90.00
UUID : 31b4390a:af5afa82:44c73f24:a2a36931
Creation Time : Thu Sep 18 13:13:12 2003
Raid Level : raid1
Device Size : 17518784 (16.71 GiB 17.94 GB)
Raid Devices : 2
Total Devices : 1
Preferred Minor : 0
Update Time : Mon Sep 22 12:58:40 2003
State : dirty, no-errors
Active Devices : 1
Working Devices : 1
Failed Devices : 0
Spare Devices : 0
Checksum : f18a0e32 - correct
Events : 0.28
Number Major Minor RaidDevice State
this 1 8 3 1 active sync /dev/sda3
0 0 0 0 0 faulty removed
1 1 8 3 1 active sync /dev/sda3
/dev/sdb3:
Magic : a92b4efc
Version : 00.90.00
UUID : 31b4390a:af5afa82:44c73f24:a2a36931
Creation Time : Thu Sep 18 13:13:12 2003
Raid Level : raid1
Device Size : 17518784 (16.71 GiB 17.94 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Update Time : Mon Sep 22 09:37:08 2003
State : dirty, no-errors
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
Checksum : f189df80 - correct
Events : 0.25
Number Major Minor RaidDevice State
this 0 8 19 0 active sync /dev/sdb3
0 0 8 19 0 active sync /dev/sdb3
1 1 8 3 1 active sync /dev/sda3
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: no resync after hotadd
2003-09-23 14:43 no resync after hotadd Cress, Andrew R
@ 2003-09-25 7:18 ` Lars Marowsky-Bree
0 siblings, 0 replies; 3+ messages in thread
From: Lars Marowsky-Bree @ 2003-09-25 7:18 UTC (permalink / raw)
To: Cress, Andrew R, linux-raid
On 2003-09-23T10:43:55,
"Cress, Andrew R" <andrew.r.cress@intel.com> said:
> I've seen this before, but it is very reproducable on the SuSE/UL system
> I am testing on now.
Please try the latest kernel from
ftp.suse.com:/pub/people/mantel/next/RPM/ (if I got the path right).
There have been some md bugs fixed in the last few days, one of them
relating to resync not starting (which we attributed to the locking bugs
in md). I'd guess it has been fixes.
Otherwise, to lighten the load on the public channels, please post
UL/SuSE specific bugs via the SuSE/UL Bugzilla and only post to the Open
Source lists if you can reproduce with vanilla Linux. (If you can't,
please mention that in the bugzilla ;-)
Sincerely,
Lars Marowsky-Brée <lmb@suse.de>
--
High Availability & Clustering ever tried. ever failed. no matter.
SuSE Labs try again. fail again. fail better.
Research & Development, SUSE LINUX AG -- Samuel Beckett
-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2003-09-25 7:18 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-09-23 14:43 no resync after hotadd Cress, Andrew R
2003-09-25 7:18 ` Lars Marowsky-Bree
-- strict thread matches above, loose matches on Subject: below --
2003-09-24 18:56 Cress, Andrew R
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).