* some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
@ 2010-09-06 12:33 Janek Kozicki
2010-09-08 9:44 ` Janek Kozicki
0 siblings, 1 reply; 6+ messages in thread
From: Janek Kozicki @ 2010-09-06 12:33 UTC (permalink / raw)
To: linux-raid
[-- Attachment #1: Type: text/plain, Size: 4116 bytes --]
(oops, sending again, the attachment was too big)
Hello,
1. mdadm segfault -> write-mostly problem
I was changing my SAS drives last month, and now I have 3*Seagate
Cheetach 15k rpm, 147 GB connected to an Adaptec ASR-2405 controller.
During the migration process of /home I used an extra disc and used
lvm for live migration. Now it is on raid10 config. Works great.
For migration of /var and /tmp which are on raid1 + lvm I did a following trick:
old SAS drives inserted:
mdadm --add /dev/md1 --write-mostly /dev/sdf2 (add the extra disc to array)
mdadm --fail /dev/md1 /dev/sdc1 (fail old SASs disc)
mdadm --remove /dev/md1 /dev/sdc1
mdadm --fail /dev/md1 /dev/sdb1 (fail another old SAS disc)
mdadm --remove /dev/md1 /dev/sdb1
turn off PC, replace SAS drives with the new ones, turn on PC,
prepare partitions on fresh sda and sdb, then:
mdadm --add /dev/md1 /dev/sdc1
mdadm --add /dev/md1 /dev/sdb1
mdadm --fail /dev/md1 /dev/sdf2
mdadm --remove /dev/md1 /dev/sdf2
mdadm --grow --raid-devices=2 /dev/md1 (here I forgot to --zero-superblock on sdf2)
reboot.
During bootup, I see mdadm segfaulting (a message in dmesg saying
that mdadm segfaults). But I was able to bring up /dev/md1 manually,
however - only from /dev/sdf2, not from sdc1 + sdb1 (which complained
that metadata is not compatibile with metadata from other devices
belonging to md1).
Therefore since my /dev/md1 was up and working (albeit
from /dev/sdf2) I repeated the operation shown above, but this time
afterwards I used `dd` to zero completely sdf2. And this time it
worked.
So it works. But there's one strange quirk left: the two devices sdc1
and sdb1 are in --write-mostly mode! And I used this mode only for
sdf2. Not for other devices. Look at md1 here:
janek@atak:~$ cat /proc/mdstat
Personalities : [raid0] [raid1] [raid10]
md2 : active raid10 sda2[0] sdc2[2] sdb2[1]
185381376 blocks super 1.0 512K chunks 2 far-copies [3/3] [UUU]
bitmap: 2/6 pages [8KB], 16384KB chunk
md1 : active raid1 sdc1[3](W) sdb1[5](W)
9767416 blocks super 1.0 [2/2] [UU]
bitmap: 6/150 pages [24KB], 32KB chunk
md0 : active raid1 sde1[0] sdd1[2] sda1[1]
9767424 blocks [3/3] [UUU]
bitmap: 1/150 pages [4KB], 32KB chunk
unused devices: <none>
Is it possible to change that (W) flag? Does it decrease performance
or something, if both devices in the array are set to (W) ?
Attached the bootup dmesg, with mdadm segfaults.
2. smart ?
Now I am trying to get smartd running and supporting those SAS
drives. I managed to get some smart info from them with this command:
sudo modprobe sg
sudo smartctl --all /dev/sg4 -d scsi
but I don't know what should I write in /etc/smartd.conf
DEVICESCAN insists on not scanning /dev/sg? devices, even when
instructed directly to do so:
DEVICESCAN -d removable -d sata -d scsi,/dev/sg4 -d scsi,/dev/sg5 -d scsi,/dev/sg6 -d scsi,/dev/sg7 -n standby -m root -M exec /usr/share/smartmontools/smartd-runner
or not directly:
DEVICESCAN -d sata -d scsi -n standby -m root -M exec /usr/share/smartmontools/smartd-runner
I tried without DEVICESCAN, and putting instead those lines:
/dev/sde -H -l error -l selftest -t
/dev/sg4 -H -l error -l selftest -t
/dev/sg5 -H -l error -l selftest -t
/dev/sg6 -H -l error -l selftest -t
/dev/sg7 -H -l error -l selftest -t
still nothing.
Putting into /etc/default/smartmontools this line:
enable_smart="/dev/sde /dev/sg4 /dev/sg5 /dev/sg6 /dev/sg7"
doesn't help either.
I know that this is not an mdadm related problem, but since you all
here are dealing with HDDs I though I would ask, maybe someone will
know.
3. my system versions:
debian squeeze
fully up to date, except kernel version (due to TuxOnIce patch on vanilla kernel).
atak:~# uname -a
Linux atak 2.6.29-bpo.2-amd64 #1 SMP Fri Jul 10 15:23:52 CEST 2009 x86_64 GNU/Linux
atak:~# mdadm --version
mdadm - v3.1.2 - 10th March 2010
atak:~# smartd --version
smartd 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
thanks in advance for your help
--
Janek Kozicki http://janek.kozicki.pl/ |
[-- Attachment #2: mdadm-segfault.txt.gz --]
[-- Type: application/x-gzip, Size: 16006 bytes --]
^ permalink raw reply [flat|nested] 6+ messages in thread* Re: some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
2010-09-06 12:33 some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?) Janek Kozicki
@ 2010-09-08 9:44 ` Janek Kozicki
2010-09-14 13:13 ` Janek Kozicki
0 siblings, 1 reply; 6+ messages in thread
From: Janek Kozicki @ 2010-09-08 9:44 UTC (permalink / raw)
To: linux-raid
Umm... to reiterate one of my questions in above email:
> md1 : active raid1 sdc1[3](W) sdb1[5](W)
> 9767416 blocks super 1.0 [2/2] [UU]
> bitmap: 6/150 pages [24KB], 32KB chunk
Is it possible to disable that (W) --write-mostly flag? Does it
decrease performance or something, if both devices in the array are
set to (W) ?
thanks
--
Janek Kozicki http://janek.kozicki.pl/ |
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
2010-09-08 9:44 ` Janek Kozicki
@ 2010-09-14 13:13 ` Janek Kozicki
2010-09-18 18:01 ` Bill Davidsen
0 siblings, 1 reply; 6+ messages in thread
From: Janek Kozicki @ 2010-09-14 13:13 UTC (permalink / raw)
To: linux-raid
Janek Kozicki said: (by the date of Wed, 8 Sep 2010 11:44:15 +0200)
> Umm... to reiterate one of my questions in above email:
>
>
> > md1 : active raid1 sdc1[3](W) sdb1[5](W)
> > 9767416 blocks super 1.0 [2/2] [UU]
> > bitmap: 6/150 pages [24KB], 32KB chunk
>
> Is it possible to disable that (W) --write-mostly flag? Does it
> decrease performance or something, if both devices in the array are
> set to (W) ?
this seems to have fixed it:
mdadm --fail /dev/md1 /dev/sdb1
mdadm --remove /dev/md1 /dev/sdb1
mdadm --zero-superblock /dev/sdb1
mdadm --add --readwrite /dev/md1 /dev/sdb1
--
Janek Kozicki http://janek.kozicki.pl/ |
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
2010-09-14 13:13 ` Janek Kozicki
@ 2010-09-18 18:01 ` Bill Davidsen
2010-09-20 11:49 ` Janek Kozicki
0 siblings, 1 reply; 6+ messages in thread
From: Bill Davidsen @ 2010-09-18 18:01 UTC (permalink / raw)
To: Janek Kozicki; +Cc: linux-raid
Janek Kozicki wrote:
> Janek Kozicki said: (by the date of Wed, 8 Sep 2010 11:44:15 +0200)
>
>
>> Umm... to reiterate one of my questions in above email:
>>
>>
>>
>>> md1 : active raid1 sdc1[3](W) sdb1[5](W)
>>> 9767416 blocks super 1.0 [2/2] [UU]
>>> bitmap: 6/150 pages [24KB], 32KB chunk
>>>
>> Is it possible to disable that (W) --write-mostly flag? Does it
>> decrease performance or something, if both devices in the array are
>> set to (W) ?
>>
> this seems to have fixed it:
>
> mdadm --fail /dev/md1 /dev/sdb1
> mdadm --remove /dev/md1 /dev/sdb1
> mdadm --zero-superblock /dev/sdb1
> mdadm --add --readwrite /dev/md1 /dev/sdb1
>
>
>
Sorry for a late question, but in adding this to my notes, I realized I
wasn't clear on why this flag moved to the other partitions. Are you,
and if so can you clarify?
--
Bill Davidsen<davidsen@tmr.com>
"We can't solve today's problems by using the same thinking we
used in creating them." - Einstein
^ permalink raw reply [flat|nested] 6+ messages in thread* Re: some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
2010-09-18 18:01 ` Bill Davidsen
@ 2010-09-20 11:49 ` Janek Kozicki
0 siblings, 0 replies; 6+ messages in thread
From: Janek Kozicki @ 2010-09-20 11:49 UTC (permalink / raw)
To: Bill Davidsen; +Cc: linux-raid
Bill Davidsen said: (by the date of Sat, 18 Sep 2010 14:01:31 -0400)
> Sorry for a late question, but in adding this to my notes, I realized I
> wasn't clear on why this flag moved to the other partitions. Are you,
> and if so can you clarify?
If you read my first email in this thread, you will see an excerpt
from syslog saying that mdadm segfaulted during system bootup.
I think that this is the reason, because I cannot think of any other
possible reason.
--
Janek Kozicki http://janek.kozicki.pl/ |
^ permalink raw reply [flat|nested] 6+ messages in thread
[parent not found: <20100906141324.1a19fa4d@atak.bl.pg.gda.pl>]
* Re: some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?)
[not found] <20100906141324.1a19fa4d@atak.bl.pg.gda.pl>
@ 2010-09-06 12:17 ` Janek Kozicki
0 siblings, 0 replies; 6+ messages in thread
From: Janek Kozicki @ 2010-09-06 12:17 UTC (permalink / raw)
To: linux-raid
Janek Kozicki said: (by the date of Mon, 6 Sep 2010 14:13:24 +0200)
> Sep 1 18:23:33 atak kernel: [ 19.506889] mdadm[3703]: segfault at 134 ip 00007f903caefd9a sp 00007fff45025010 error 4 in libc.so.6[7f903caaa000+158000]
> Sep 1 18:23:33 atak kernel: [ 19.507677] mdadm[3706]: segfault at 134 ip 00007f5779212d9a sp 00007fff81748740 error 4 in libc.so.6[7f57791cd000+158000]
> Sep 1 18:23:33 atak kernel: [ 19.527658] mdadm[3705]: segfault at 134 ip 00007f5779212d9a sp 00007fff81748740 error 4 in libc.so.6[7f57791cd000+158000]
I should have all the debug information installed and System.map file
too. So if you need some file to identify what is 00007f903caefd9a,
please tell. Personally I don't know how to find what is
00007f903caefd9a. Just tell me what to do, if you feel like
investigating this further.
--
Janek Kozicki http://janek.kozicki.pl/ |
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2010-09-20 11:49 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-09-06 12:33 some non critical problems... (1. mdadm segfault -> write-mostly, 2. smart?) Janek Kozicki
2010-09-08 9:44 ` Janek Kozicki
2010-09-14 13:13 ` Janek Kozicki
2010-09-18 18:01 ` Bill Davidsen
2010-09-20 11:49 ` Janek Kozicki
[not found] <20100906141324.1a19fa4d@atak.bl.pg.gda.pl>
2010-09-06 12:17 ` Janek Kozicki
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).