* equal size not large enough for RAID1?
@ 2008-03-05 9:05 Lars Täuber
[not found] ` <18383.24086.165437.659336@notabene.brown>
0 siblings, 1 reply; 4+ messages in thread
From: Lars Täuber @ 2008-03-05 9:05 UTC (permalink / raw)
To: linux-raid
Hallo list,
I'm new to this list. But I came across some strange problems I nowhere found a solution for on the net.
This is our situation:
monosan + duosan:
Linux duosan 2.6.22.17-0.1-default #1 SMP 2008/02/10 20:01:04 UTC x86_64 x86_64 x86_64 GNU/Linux
/proc/cpuinfo:
[...]
processor : 3
vendor_id : AuthenticAMD
cpu family : 15
model : 65
model name : Dual-Core AMD Opteron(tm) Processor 2216
stepping : 3
cpu MHz : 2412.402
cache size : 1024 KB
physical id : 1
siblings : 2
core id : 1
cpu cores : 2
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy
bogomips : 4825.08
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp tm stc
SAS controller:
07:00.0 Ethernet controller: MYRICOM Inc. Myri-10G Dual-Protocol NIC (10G-PCIE-8A)
There are 16 SATA drives multipathed connected to each server.
duosan exports its RAID 6 over AoE via qaoed
monosan sees this as etherd/e22.0
monosan# cat /proc/partitions:
9 4 12697912448 md4
9 9 12697912312 md9
152 5632 12697912448 etherd/e22.0
monosan:~ # cat /proc/mdstat
Personalities : [raid1] [raid0] [raid6] [raid5] [raid4]
md9 : active raid1 md4[0]
12697912312 blocks super 1.0 [2/1] [U_]
md4 : active raid6 dm-0[0] dm-8[14] dm-7[13] dm-6[12] dm-5[11] dm-4[10] dm-3[9] dm-2[8] dm-14[7] dm-13[6] dm-12[5] dm-11[4] dm-10[3] dm-9[2] dm-1[1]
12697912448 blocks level 6, 64k chunk, algorithm 2 [15/15] [UUUUUUUUUUUUUUU]
But then this:
monosan:~ # mdadm /dev/md9 -a /dev/etherd/e22.0
mdadm: /dev/etherd/e22.0 not large enough to join array
The md9 RAID1 was originally built with e22.0 as second drive. I just simulated a connection loss.
Why is this happening? As you can see md9 consists of md4 and e22.0 and both are equal in size: 12697912448 .
How can I debug this? Are there detailed logs anywhere?
Many thanks for any hint.
Lars
--
Informationstechnologie
Berlin-Brandenburgische Akademie der Wissenschaften
Jägerstrasse 22-23 10117 Berlin
Tel.: +49 30 20370-352 http://www.bbaw.de
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: equal size not large enough for RAID1?
[not found] ` <18383.24086.165437.659336@notabene.brown>
@ 2008-03-06 9:11 ` Lars Täuber
2008-03-07 3:43 ` Neil Brown
0 siblings, 1 reply; 4+ messages in thread
From: Lars Täuber @ 2008-03-06 9:11 UTC (permalink / raw)
To: linux-raid
Hi Neil,
Neil Brown <neilb@suse.de> schrieb:
> On Wednesday March 5, taeuber@bbaw.de wrote:
> > Hallo list,
>
> Hi.
>
> >
> > monosan# cat /proc/partitions:
> > 9 4 12697912448 md4
> > 9 9 12697912312 md9
> > 152 5632 12697912448 etherd/e22.0
> >
> >
> > monosan:~ # cat /proc/mdstat
> > Personalities : [raid1] [raid0] [raid6] [raid5] [raid4]
> > md9 : active raid1 md4[0]
> > 12697912312 blocks super 1.0 [2/1] [U_]
> >
> > md4 : active raid6 dm-0[0] dm-8[14] dm-7[13] dm-6[12] dm-5[11] dm-4[10] dm-3[9] dm-2[8] dm-14[7] dm-13[6] dm-12[5] dm-11[4] dm-10[3] dm-9[2] dm-1[1]
> > 12697912448 blocks level 6, 64k chunk, algorithm 2 [15/15] [UUUUUUUUUUUUUUU]
> >
> >
> >
> > But then this:
> > monosan:~ # mdadm /dev/md9 -a /dev/etherd/e22.0
> > mdadm: /dev/etherd/e22.0 not large enough to join array
>
> This would be because mdadm is reserving a bit of space in case you
> want to add an internal bitmap one day. And the version of mdadm you
> are now using is reserving more space than the version that was used
> to create the array does.
we are using SuSE 10.3 here. And I don't remember if mdadm was updated due to an online update. But I think it should be consistent/compatible with the version shipped in the original distribution.
What is this internal bitmap good for? Is there a documentation somewhere on the net about this?
>
> I should fix that...
>
> If you want a quick fix and are happy to compile your own mdadm, then
> edit super1.c and remove the line
>
> devsize -= choose_bm_space(devsize);
>
> in avail_size1().
I could compile, but on our productive server we don't compile anything that is shipped with the distribution due to the possibility to have security updates from SuSE.
I just repeated the following:
monosan:~ # mdadm -C /dev/md9 -l1 -n2 -x0 /dev/md4 /dev/etherd/e22.0
mdadm: Defaulting to verion 1.0 metadata
mdadm: /dev/md4 appears to be part of a raid array:
level=raid1 devices=2 ctime=Thu Mar 6 09:39:37 2008
mdadm: /dev/etherd/e22.0 appears to be part of a raid array:
level=raid1 devices=2 ctime=Thu Mar 6 09:39:37 2008
Continue creating array? y
mdadm: array /dev/md9 started.
monosan:~ # mdadm -V
mdadm - v2.6.2 - 21st May 2007
monosan:~ # mdadm /dev/md9 -f /dev/etherd/e22.0
mdadm: set /dev/etherd/e22.0 faulty in /dev/md9
monosan:~ # mdadm /dev/md9 -r /dev/etherd/e22.0
mdadm: hot removed /dev/etherd/e22.0
monosan:~ # mdadm /dev/md9 -a /dev/etherd/e22.0
mdadm: /dev/etherd/e22.0 not large enough to join array
monosan:~ # cat /etc/SuSE-release
openSUSE 10.3 (X86-64)
VERSION = 10.3
How come?
Thanks
Lars
--
Informationstechnologie
Berlin-Brandenburgische Akademie der Wissenschaften
Jägerstrasse 22-23 10117 Berlin
Tel.: +49 30 20370-352 http://www.bbaw.de
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: equal size not large enough for RAID1?
2008-03-06 9:11 ` Lars Täuber
@ 2008-03-07 3:43 ` Neil Brown
2008-03-07 11:08 ` Lars Täuber
0 siblings, 1 reply; 4+ messages in thread
From: Neil Brown @ 2008-03-07 3:43 UTC (permalink / raw)
To: Lars Täuber; +Cc: linux-raid
On Thursday March 6, taeuber@bbaw.de wrote:
>
> What is this internal bitmap good for? Is there a documentation
> somewhere on the net about this?
man md
and search for BITMAP. There is a section of BITMAP WRITE-INTENT
LOGGING.
>
> I just repeated the following:
>
> monosan:~ # mdadm -C /dev/md9 -l1 -n2 -x0 /dev/md4 /dev/etherd/e22.0
> mdadm: Defaulting to verion 1.0 metadata
> mdadm: /dev/md4 appears to be part of a raid array:
> level=raid1 devices=2 ctime=Thu Mar 6 09:39:37 2008
> mdadm: /dev/etherd/e22.0 appears to be part of a raid array:
> level=raid1 devices=2 ctime=Thu Mar 6 09:39:37 2008
> Continue creating array? y
> mdadm: array /dev/md9 started.
>
> monosan:~ # mdadm -V
> mdadm - v2.6.2 - 21st May 2007
>
> monosan:~ # mdadm /dev/md9 -f /dev/etherd/e22.0
> mdadm: set /dev/etherd/e22.0 faulty in /dev/md9
>
> monosan:~ # mdadm /dev/md9 -r /dev/etherd/e22.0
> mdadm: hot removed /dev/etherd/e22.0
>
> monosan:~ # mdadm /dev/md9 -a /dev/etherd/e22.0
> mdadm: /dev/etherd/e22.0 not large enough to join array
Hmm.. I just read through the change logs and I see there was a bug
here that was fixed in 2.6.3.
If you want an update from SuSE you would need to log a bug report in
the Novell bugzilla.
Alternately request v0.90 metadata (--metadata=0.90) That will avoid
the bug.
NeilBrown
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: equal size not large enough for RAID1?
2008-03-07 3:43 ` Neil Brown
@ 2008-03-07 11:08 ` Lars Täuber
0 siblings, 0 replies; 4+ messages in thread
From: Lars Täuber @ 2008-03-07 11:08 UTC (permalink / raw)
To: linux-raid
Hallo Neil,
> Hmm.. I just read through the change logs and I see there was a bug
> here that was fixed in 2.6.3.
> If you want an update from SuSE you would need to log a bug report in
> the Novell bugzilla.
>
> Alternately request v0.90 metadata (--metadata=0.90) That will avoid
> the bug.
>
> NeilBrown
thank you for this info. I'll try to get SuSE to correct this.
I just created a bugzilla account.
Thanks again.
Lars
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2008-03-07 11:08 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-03-05 9:05 equal size not large enough for RAID1? Lars Täuber
[not found] ` <18383.24086.165437.659336@notabene.brown>
2008-03-06 9:11 ` Lars Täuber
2008-03-07 3:43 ` Neil Brown
2008-03-07 11:08 ` Lars Täuber
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).