* System freeze triggered by RAID 10 resync and high io
@ 2014-05-20 21:24 christian.schwarz
2014-05-20 23:26 ` NeilBrown
0 siblings, 1 reply; 2+ messages in thread
From: christian.schwarz @ 2014-05-20 21:24 UTC (permalink / raw)
To: linux-raid
Hello,
I have an issue with stalling writes that within a few seconds
ultimately lead to a full system freeze. Since all writes fail once the
condition has been triggered, there is no log output. Also dmesg remains
without errors.
My setup is as follows:
md127 : active raid10 sda1[0] sdd1[3] sdc1[2] sdb1[1]
3907023872 blocks super 1.2 512K chunks 2 near-copies [4/4]
[UUUU]
Linux 3.14.2-hardened-r1 #4 SMP PREEMPT
Dell Poweredge T20 Server
To trigger this condition, a resync of the array has to be running and
high disk IO needs to be performed in addition. The condition is
triggered regardless of the resync speed (tested with 1, 10 and 100
MB/s). I also tried different io schedulers and a non-preempt kernel.
Someone else over at Fedora reported a similiar problem:
http://www.spinics.net/linux/fedora/fedora-kernel/msg05163.html
How can I help with providing additional information so you can locate
the problem?
Thanks,
Christian
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: System freeze triggered by RAID 10 resync and high io
2014-05-20 21:24 System freeze triggered by RAID 10 resync and high io christian.schwarz
@ 2014-05-20 23:26 ` NeilBrown
0 siblings, 0 replies; 2+ messages in thread
From: NeilBrown @ 2014-05-20 23:26 UTC (permalink / raw)
To: christian.schwarz; +Cc: linux-raid
[-- Attachment #1: Type: text/plain, Size: 1699 bytes --]
On Tue, 20 May 2014 23:24:36 +0200 christian.schwarz@posteo.de wrote:
> Hello,
>
> I have an issue with stalling writes that within a few seconds
> ultimately lead to a full system freeze. Since all writes fail once the
> condition has been triggered, there is no log output. Also dmesg remains
> without errors.
>
> My setup is as follows:
>
> md127 : active raid10 sda1[0] sdd1[3] sdc1[2] sdb1[1]
> 3907023872 blocks super 1.2 512K chunks 2 near-copies [4/4]
> [UUUU]
>
> Linux 3.14.2-hardened-r1 #4 SMP PREEMPT
Fixed by upstream commit
commit cc13b1d1500656a20e41960668f3392dda9fa6e2
Author: NeilBrown <neilb@suse.de>
Date: Mon May 5 13:34:37 2014 +1000
md/raid10: call wait_barrier() for each request submitted.
which will be in 3.15, and hopefully will appear in the next 3.14.y release
(it isn't in 3.14.4).
NeilBrown
>
> Dell Poweredge T20 Server
>
>
> To trigger this condition, a resync of the array has to be running and
> high disk IO needs to be performed in addition. The condition is
> triggered regardless of the resync speed (tested with 1, 10 and 100
> MB/s). I also tried different io schedulers and a non-preempt kernel.
>
>
> Someone else over at Fedora reported a similiar problem:
> http://www.spinics.net/linux/fedora/fedora-kernel/msg05163.html
>
>
> How can I help with providing additional information so you can locate
> the problem?
>
>
> Thanks,
>
> Christian
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2014-05-20 23:26 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-05-20 21:24 System freeze triggered by RAID 10 resync and high io christian.schwarz
2014-05-20 23:26 ` NeilBrown
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).