From: Patrik Jonsson <patrik@ucolick.org>
Cc: linux-raid@vger.kernel.org
Subject: Re: Strange behaviour on "toy array"
Date: Tue, 17 May 2005 00:12:07 -0700 [thread overview]
Message-ID: <42899947.6080107@ucolick.org> (raw)
In-Reply-To: <42898989.8060201@ucolick.org>
Just as a further comment on what happened when my system hung: The
process [md0_sync] was rapidly respawning and the syslog filled with
thousands of messages like these:
May 16 23:16:44 localhost kernel: md: syncing RAID array md0
May 16 23:16:44 localhost kernel: md: minimum _guaranteed_
reconstruction speed\
: 1000 KB/sec/disc.
May 16 23:16:44 localhost kernel: md: using maximum available idle IO
bandwith \
(but not more than 200000 KB/sec) for reconstruction.
May 16 23:16:44 localhost kernel: md: using 128k window, over a total of
960 bl\
ocks.
May 16 23:16:44 localhost kernel: md: md0: sync done.
May 16 23:16:44 localhost kernel: md: syncing RAID array md0
May 16 23:16:44 localhost kernel: md: minimum _guaranteed_
reconstruction speed\
: 1000 KB/sec/disc.
May 16 23:16:44 localhost kernel: md: using maximum available idle IO
bandwith \
(but not more than 200000 KB/sec) for reconstruction.
May 16 23:16:44 localhost kernel: md: using 128k window, over a total of
960 bl\
ocks.
May 16 23:16:45 localhost kernel: md: md0: sync done.
... etc etc...
I had to halt the system to make it stop. I tried to stop the array with
mdadm -S /dev/md0 but got "device or resource busy". Did i do something
illegal here?
Thanks,
/Patrik
Patrik Jonsson wrote:
> Ok, so I did as Guy suggested, and tried to write to the array after
> failing more than one disk. It says:
>
> [root@localhost raidtest]# echo test > junk/test
> -bash: junk/test: Read-only file system
>
> so that's at least an indication that not all is well. The syslog
> contains:
>
> May 16 22:49:31 localhost kernel: raid5: Disk failure on loop2,
> disabling device. Operation continuing on 3 devices
> May 16 22:49:31 localhost kernel: RAID5 conf printout:
> May 16 22:49:31 localhost kernel: --- rd:5 wd:3 fd:2
> May 16 22:49:31 localhost kernel: disk 1, o:1, dev:loop1
> May 16 22:49:31 localhost kernel: disk 2, o:0, dev:loop2
> May 16 22:49:31 localhost kernel: disk 3, o:1, dev:loop3
> May 16 22:49:31 localhost kernel: disk 4, o:1, dev:loop4
> May 16 22:49:31 localhost kernel: RAID5 conf printout:
> May 16 22:49:31 localhost kernel: --- rd:5 wd:3 fd:2
> May 16 22:49:31 localhost kernel: disk 1, o:1, dev:loop1
> May 16 22:49:31 localhost kernel: disk 3, o:1, dev:loop3
> May 16 22:49:31 localhost kernel: disk 4, o:1, dev:loop4
> May 16 22:49:39 localhost kernel: Buffer I/O error on device md0,
> logical block 112
> May 16 22:49:39 localhost kernel: lost page write due to I/O error on md0
> May 16 22:49:39 localhost kernel: Aborting journal on device md0.
> May 16 22:49:44 localhost kernel: ext3_abort called.
> May 16 22:49:44 localhost kernel: EXT3-fs error (device md0):
> ext3_journal_start_sb: Detected aborted journal
> May 16 22:49:44 localhost kernel: Remounting filesystem read-only
> May 16 22:50:14 localhost kernel: Buffer I/O error on device md0,
> logical block 19
> May 16 22:50:14 localhost kernel: lost page write due to I/O error on md0
>
> So I guess I'm happy with that, remounting to read-only seems smart,
> that way the disks aren't messed up more.
> Now I added the disks back with
>
> mdadm --add /dev/loop0
> mdadm --add /dev/loop2
>
> and the (actual hard-) drive started chugging, the md0_raid5 process
> is sucking cpu and I don't know what it's trying to do... the system
> has become unresponsive, but the drive is still ticking. Is hot-adding
> the drives back in a bad thing to do?
>
> This is educational, at least... :-)
>
> /Patrik
>
> Guy wrote:
>
>> My guess is it will not change state until it needs to access a disk.
>> So, try some writes!
>>
>>
>>
>>
>>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
> !DSPAM:428989ab396844711317!
>
next prev parent reply other threads:[~2005-05-17 7:12 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20050516184049.90DDA11416@smtp.ucolick.org>
2005-05-16 21:54 ` Strange behaviour on "toy array" Patrik Jonsson
2005-05-17 2:28 ` Guy
2005-05-17 6:04 ` Patrik Jonsson
2005-05-17 7:12 ` Patrik Jonsson [this message]
2005-05-17 8:41 ` David Greaves
2005-04-22 10:45 MD bug or me being stupid? Molle Bestefich
2005-05-12 8:55 ` Molle Bestefich
2005-05-13 2:55 ` Neil Brown
2005-05-15 18:10 ` Molle Bestefich
2005-05-15 18:22 ` Strange behaviour on "toy array" Patrik Jonsson
2005-05-15 20:09 ` David Greaves
2005-05-15 20:55 ` Patrik Jonsson
2005-05-15 22:13 ` Guy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=42899947.6080107@ucolick.org \
--to=patrik@ucolick.org \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).