From: Wolfgang Denk <wd@denx.de>
To: Dale Dunlea <ddunlea@gmail.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: RAID5 lockup with AMCC440 and async-tx
Date: Mon, 01 Oct 2007 12:32:16 +0200 [thread overview]
Message-ID: <20071001103216.433742408A@gemini.denx.de> (raw)
In-Reply-To: Your message of "Mon, 01 Oct 2007 10:16:00 BST." <8a24fb800710010216m21cd7734p4c19df1aa7dd5564@mail.gmail.com>
Dear Dale,
in message <8a24fb800710010216m21cd7734p4c19df1aa7dd5564@mail.gmail.com> you wrote:
>
> I have a board with an AMCC440 processor, running RAID5 using the
> async-tx interface. In general, it works well, but I have found a test
> case that consistently causes a hard lockup of the entire system.
Please make sure to use latest code - we found a bug recently.
> What makes this case odd is that I have only been able to generate it
> when accessing disks that are on two separate HBAs - in my case
> mpt-fusion based SAS HBAs. Once two HBAs are in use, the bug is
> trivial to repeat. I simply create a RAID5 using disks from each HBA,
> wait for it to resync, and then run
We saw similar problems, in our case they showed up only with a large
number of disks in combination with big kernel pages sizes (64 kB).
> Any pointers on how to debug this? It feels like a race condition of
> some description, but any serial port printing I enable causes the
> problem to go away, and I can't print silently to /var/log/messages as
> the system hangs before it can flush.
See above - please try current code.
Best regards,
Wolfgang Denk
--
DENX Software Engineering GmbH, MD: Wolfgang Denk & Detlev Zundel
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de
HR Manager to job candidate "I see you've had no computer training.
Although that qualifies you for upper management, it means you're
under-qualified for our entry level positions."
next prev parent reply other threads:[~2007-10-01 10:32 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-10-01 9:16 RAID5 lockup with AMCC440 and async-tx Dale Dunlea
2007-10-01 10:13 ` Justin Piszcz
2007-10-01 10:32 ` Wolfgang Denk [this message]
2007-10-01 11:02 ` Dale Dunlea
2007-10-01 17:39 ` Wolfgang Denk
2007-10-01 19:25 ` Dale Dunlea
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20071001103216.433742408A@gemini.denx.de \
--to=wd@denx.de \
--cc=ddunlea@gmail.com \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).