From: "Tor Arne Vestbø" <torarnv@gmail.com>
To: Neil Brown <neilb@suse.de>
Cc: linux-raid@vger.kernel.org
Subject: Re: Linux RAID autodetect partitions go missing from /dev, but fdisk can see them
Date: Thu, 18 Dec 2008 23:03:38 +0100 [thread overview]
Message-ID: <494AC8BA.7010704@gmail.com> (raw)
In-Reply-To: <18759.8246.568849.244513@notabene.brown>
Hi Neil!
Neil Brown wrote:
> On Wednesday December 10, torarnv@gmail.com wrote:
>> I have a very strange problem that I've been trying to debug for
>> days now. I had a RAID5 with four drives and one spare,
>> /dev/sd[bcde]1 + /dev/sdf1, and everything was working fine, until
>> one day one of the drives in the array (sdb) no longer had a
>> partition (sdb1). Letting the spare take over I ignored this for a
>> few days, but then it happened again, this time with sdc1.
>> I'm hoping someone on this list may have ran into this before, or
>> have any tips on how I can continue debugging this, because I have to
>> admit I'm a little lost...
>
> Yes, it does sound rather weird.
First of all, thank you so much for helping me out with this, as I'm
still very lost :)
In addition to the things listed in the first e-mail, I've also tried
installing the latest kernel from kernel.org, but that did not solve
anything. Also, in case it's relevant, I'm running openSUSE 10.3.
> Can you:
>
> mdadm -Esv
http://pastebin.com/d7b14d14e
For some reason it seems to think that /dev/sdc and /dev/sdb are part of
the array, while it really is /dev/sdc1 and /dev/sdb1. I'm guessing
since they are missing somehow from the device nodes in /dev mdadm
assumes the disk itself is the member?
> and
> mdadm --stop /dev/md0
> strace -o /tmp/str -s 200 mdadm --assemble --scan --verbose /dev/md0
http://pastebin.com/f2c1db2e4
The original array had sd[bcde]1 + sdf1 as spare. Then sdb1 went missing
and the spare kicked in, and then sdc1 went missing, leaving me with a
degraded array.
> Also the contents of /etc/mdadm.conf might help.
http://pastebin.com/f573346ef
Is there anything else I can run, cat, and/or paste that would shed
light over what's going on?
> Thanks,
Thank _you_ :)
Tor Arne
>> raid support in. The symptoms are:
>>
>> - The kernel seems to detect the partitions (lines 396 and 407 in the
>> dmesg [1])
>>
>> - But once the boot process finishes and the RAID is started, there is
>> no longer any sdc1 or sdb1, so the RAID fails to start (lines 550-576 in
>> dmesg [1])
>>
>> - Running fdisk -l shows that the drives in question (sdb and sdc) do
>> have similar partitions as the other working drives, namely one Linux
>> RAID autodetect partition each (see command output [2])
>>
>> - But, the partitions are missing from /proc/partitions (see [3])
>>
>> - Manually adding device nodes using mknod works, but doing file -sL
>> on the device gives "writable, no read permission", even though
>> permissions are the same as the other sd* nodes in /dev
>>
>> - Running 'partprobe -s' successfully finds the two missing partitions
>> and adds device nodes, and the nodes can be 'file -sL'ed, but when
>> trying to assemble the array again with these new nodes in the system,
>> I'm told that sdc1 is not found, and after the --assemble is done, the
>> device nodes are once again missing (!) see [4]
>>
>> - I've tried using the 'dmraid' command to look for fakeraid
>> partitions or meta data on the drives, which I was told could mess up
>> the auto-detection of Linux software ride partitions, but could not find
>> any issues.
>>
>>
>> As you can tell I've exhausted all my current options, so any help on
>> what I could try next would be very much appreciated. I am especially
>> curious as to why I lose the partitions when mdadm tries to assemble the
>> array?
>>
>> Thanks!
>>
>> Tor Arne Vestbø
>>
>> [1] http://pastebin.com/m15b9c275 dmesg
>> [2] http://pastebin.com/f50fb323a fdisk -l
>> [3] http://pastebin.com/f4547c2ca cat /proc/partitions
>> [4] http://pastebin.com/m4475c9ae partprobe + mdadm --assemble
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2008-12-18 22:03 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-12-10 20:56 Linux RAID autodetect partitions go missing from /dev, but fdisk can see them Tor Arne Vestbø
2008-12-16 3:27 ` Neil Brown
2008-12-18 22:03 ` Tor Arne Vestbø [this message]
2008-12-18 22:19 ` Tor Arne Vestbø
[not found] ` <18762.53424.819087.495066@notabene.brown>
2009-01-18 17:51 ` Tor Arne Vestbø
2009-01-19 16:18 ` Tor Arne Vestbø
2009-01-31 13:19 ` Tor Arne Vestbø
2009-01-31 18:50 ` Richard Scobie
2008-12-18 22:22 ` Tor Arne Vestbø
-- strict thread matches above, loose matches on Subject: below --
2008-12-11 6:41 Tor Arne Vestbø
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=494AC8BA.7010704@gmail.com \
--to=torarnv@gmail.com \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.