From: Brad Campbell <brad@wasp.net.au>
To: Neil Brown <neilb@cse.unsw.edu.au>
Cc: lkml <linux-kernel@vger.kernel.org>,
RAID Linux <linux-raid@vger.kernel.org>
Subject: Re: Raid-6 hang on write.
Date: Tue, 01 Mar 2005 09:34:01 +0400 [thread overview]
Message-ID: <4223FEC9.8070306@wasp.net.au> (raw)
In-Reply-To: <16930.45319.682534.351648@cse.unsw.edu.au>
Neil Brown wrote:
> On Friday February 25, brad@wasp.net.au wrote:
>
>>Turning on debugging in raid6main.c and md.c make it much harder to hit. So I'm assuming something
>>timing related.
>>
>>raid6d --> md_check_recovery --> generic_make_request --> make_request --> get_active_stripe
>
>
> Yes, there is a real problem here. I see if I can figure out the best
> way to remedy it...
> However I think you reported this problem against a non "-mm" kernel,
> and the path from md_check_recovery to generic_make_requests only
> exists in "-mm".
>
> Could you please confirm if there is a problem with
> 2.6.11-rc4-bk4->bk10
There is (was). I have three kernels I was testing against. 2.6.11-rc4-bk4, 2.6.11-rc4-bk10 and
2.6.11-rc4-mm1. I moved onto 2.6.11-rc4-mm1 for my main debugging (inserting lots of printks and
generally doing stuff that was going to crash). I hope to reproduce the faults against the vanilla
2.6.11-rc4 kernels and I'm now testing with 2.6.11-rc5-bk2.
As per the original bug report, 2.6.11-rc4-bk(4/10) locked in [<c0268574>]
get_active_stripe+0x224/0x260. Although unlike -mm1 I'm not sure of the sequence of events that
caused it and it's not anywhere as easy to hit. I am willing to investigate as time allows however.
Testing 2.6.11-rc5-bk2 and it of course is flatly refusing to misbehave. I'll keep beating on it for
a couple of days and after writing 3TB with 2.6.11-rc5-bk2, I'll go back to the older kernels and
try and reproduce the failure there. It *did* lockup 4 times in a row in get_active_stripe on the
older non -mm kernels.
Regards,
Brad
--
"Human beings, who are almost unique in having the ability
to learn from the experience of others, are also remarkable
for their apparent disinclination to do so." -- Douglas Adams
next prev parent reply other threads:[~2005-03-01 5:34 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-02-24 14:50 Raid-6 hang on write Brad Campbell
2005-02-25 15:37 ` Brad Campbell
2005-02-28 5:49 ` Neil Brown
2005-02-28 9:42 ` Bigmem in Linux 2.6.8 Iwan Sanders
2005-02-28 12:23 ` Mathieu Segaud
2005-03-01 5:34 ` Brad Campbell [this message]
2005-03-01 6:58 ` Raid-6 hang on write Brad Campbell
2005-03-01 9:18 ` Neil Brown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4223FEC9.8070306@wasp.net.au \
--to=brad@wasp.net.au \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@cse.unsw.edu.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.