From: "Martin J. Bligh" <mbligh@aracnet.com>
To: "Rechenberg, Andrew" <ARechenberg@shermanfinancialgroup.com>,
linux-kernel@vger.kernel.org
Subject: Re: OOPS in do_try_to_free_pages with VERY large software RAID array
Date: Mon, 10 Mar 2003 11:27:48 -0800 [thread overview]
Message-ID: <6240000.1047324468@flay> (raw)
In-Reply-To: <8075D5C3061B9441944E1373776451180F0761@cinshrexc03.shermfin.com>
> Can anyone help me out please? I'm trying to create a monster software
> RAID array and the kernel is not behaving. On some test hardware I can
> get 17 RAID1 arrays to begin syncing and will sync with
> /proc/sys/dev/raid/speed_limit_max set to 100000 (the max allowed) with
> no problem.
>
> We wanted to use 26 RAID1 arrays and then stripe across them to get very
> high performance. When I tried to do that this weekend on our
> production box we started getting kernel panics when the RAID1 arrays
> started syncing. This was with speed_limit_max set to 10000 so the rate
> wasn't very high. Since we knew 34 disks worked we decided to put the
> box in to production with just 13 RAID1 arrays and striping across
> those. The performance is great compared to our hardware RAID, but I
> would like to get all the disks we purchase for this system working.
>
> This morning I connected 56 disks to our test hardware and tried to
> reproduce the problem. With the test hardware, the 26 RAID1 arrays were
> working OK at speed_limit_max 10000 however the kernel OOPSed when I
> 'less'ed /proc/mdstat. It wasn't a hard crash because I could still
> work. However when I upped the speed_limit_max to 30000 there was a
> hard crash.
At a wild guess (OK, I only looked for about 1 minute),
md_status_read_proc is generating more than 4K of information, and overwriting
the end of it's 4K page. Throw some debug in there, and get it to printk
how much of the buffer it thinks it's using (just printk sz every time it
changes it). If it's > 4K, convert it to the seq_file interface.
May not be it, but it seems likely given the unusual scale of what you're
doing, and it's easy to check.
M.
next prev parent reply other threads:[~2003-03-10 19:26 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2003-03-10 19:20 OOPS in do_try_to_free_pages with VERY large software RAID array Rechenberg, Andrew
2003-03-10 19:27 ` Martin J. Bligh [this message]
2003-03-10 19:48 ` Kevin P. Fleming
-- strict thread matches above, loose matches on Subject: below --
2003-03-10 21:23 Rechenberg, Andrew
2003-03-10 21:52 ` Martin J. Bligh
2003-03-11 17:38 Rechenberg, Andrew
2003-03-11 17:49 ` Randy.Dunlap
2003-03-11 19:38 Rechenberg, Andrew
2003-03-11 19:39 ` Martin J. Bligh
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6240000.1047324468@flay \
--to=mbligh@aracnet.com \
--cc=ARechenberg@shermanfinancialgroup.com \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox