* Re: mdadm + intel matrix
[not found] <CAF1jjLtMKmvQrZqO9Uv5vW6VGSJ8OmJZ8G3QxNZ6oGXr-3kS_Q@mail.gmail.com>
@ 2013-04-10 4:53 ` NeilBrown
2013-04-10 11:48 ` NightStrike
0 siblings, 1 reply; 2+ messages in thread
From: NeilBrown @ 2013-04-10 4:53 UTC (permalink / raw)
To: NightStrike; +Cc: linux-raid
[-- Attachment #1: Type: text/plain, Size: 2443 bytes --]
On Tue, 9 Apr 2013 03:20:44 -1000 NightStrike <nightstrike@gmail.com> wrote:
> Neil,
>
> I have been talking to a lot of irc types, and ultimately, they all
> want me to just contact you directly (even people on
> oftc/#kernelnewbies).
The correct advice is to email linux-raid@vger.kernel.org (you don't need to
be subscribed) and to optionally Cc me.
I've Cc:ed this reply to linux-raid.
>
> Basically, using mdadm with the Intel Matrix fakeraid doesn't work
> right anymore. I can't tell you with what kernel it stopped, but I
> can tell you that it is broken with archlinux 3.8.5, which is a
> vanilla kernel plus a console logging patch. Here's what happens:
>
> 1) Create a new raid in the Option Rom
> 2) Boot up and see md126 and md127, respectively ArchRaid_0 and imsm0.
> 3) Run all sorts of read only things, like mdadm --detail-platform, all good
> 4) Run cfdisk /dev/md/ArchRaid_0 (which points to md126), set up my partitions
> 5) Hit "Write" to write the partition table
> 6) Everything hangs
I've had this reported on openSUSE too. I haven't yet had a chance to look
into it properly.
It sounds like "mdmon" is not running - or not working correctly.
On the first attempt to write to the array, md signals mdmon and waits for
the array to be marked "active".
mdmon should notice this, update the metadata on the array to record that the
array is active (so if a crash happens a resync will be force) and then tell
md that the array is active. md notices and allows the write.
Something in this sequence is not working.
Does everything "unhang" if you run
mdmon md127 &
??
>
> At this point, any program that tries to do anything to the raid, or
> to even mount other volumes, will hang. Here's a dmesg output of the
> blocked stack traces:
>
> http://sprunge.us/BBMM
>
> derRichard on #kernelnewbies looked at that and said that everything
> is stuck in __enqueue_entity, and that I have to contact the author of
> mdadm to find out why. :(
>
>
> Is there any way you can help? What other info do you need? And do
> you know how I can reboot gracefully? Because right now, I'll only be
> able to do a hard reset or an REISUB, because those threads are
> forever blocked.
Hard reset (or "echo b > /proc/sysrq-trigger") is fine. Nothing has been
written to those devices so on restart it will look just like it did before.
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: mdadm + intel matrix
2013-04-10 4:53 ` mdadm + intel matrix NeilBrown
@ 2013-04-10 11:48 ` NightStrike
0 siblings, 0 replies; 2+ messages in thread
From: NightStrike @ 2013-04-10 11:48 UTC (permalink / raw)
To: NeilBrown; +Cc: linux-raid
On Tue, Apr 9, 2013 at 6:53 PM, NeilBrown <neilb@suse.de> wrote:
> On Tue, 9 Apr 2013 03:20:44 -1000 NightStrike <nightstrike@gmail.com> wrote:
>
>> Neil,
>>
>> I have been talking to a lot of irc types, and ultimately, they all
>> want me to just contact you directly (even people on
>> oftc/#kernelnewbies).
>
> The correct advice is to email linux-raid@vger.kernel.org (you don't need to
> be subscribed) and to optionally Cc me.
> I've Cc:ed this reply to linux-raid.
Noted
>> Basically, using mdadm with the Intel Matrix fakeraid doesn't work
>> right anymore. I can't tell you with what kernel it stopped, but I
>> can tell you that it is broken with archlinux 3.8.5, which is a
>> vanilla kernel plus a console logging patch. Here's what happens:
>>
>> 1) Create a new raid in the Option Rom
>> 2) Boot up and see md126 and md127, respectively ArchRaid_0 and imsm0.
>> 3) Run all sorts of read only things, like mdadm --detail-platform, all good
>> 4) Run cfdisk /dev/md/ArchRaid_0 (which points to md126), set up my partitions
>> 5) Hit "Write" to write the partition table
>> 6) Everything hangs
>
> I've had this reported on openSUSE too. I haven't yet had a chance to look
> into it properly.
> It sounds like "mdmon" is not running - or not working correctly.
>
> On the first attempt to write to the array, md signals mdmon and waits for
> the array to be marked "active".
> mdmon should notice this, update the metadata on the array to record that the
> array is active (so if a crash happens a resync will be force) and then tell
> md that the array is active. md notices and allows the write.
>
> Something in this sequence is not working.
> Does everything "unhang" if you run
> mdmon md127 &
>
> ??
YES!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
THANK YOU!!!!!!!!!!!!!!!11111oneone!!!!!!!!!!
I eagerly await an official fix, and in the meantime, I will use your
workaround.
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2013-04-10 11:48 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <CAF1jjLtMKmvQrZqO9Uv5vW6VGSJ8OmJZ8G3QxNZ6oGXr-3kS_Q@mail.gmail.com>
2013-04-10 4:53 ` mdadm + intel matrix NeilBrown
2013-04-10 11:48 ` NightStrike
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox