linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Wolfgang Denk <wd@denx.de>
To: Dale Dunlea <ddunlea@gmail.com>
Cc: linux-raid@vger.kernel.org
Subject: Re: RAID5 lockup with AMCC440 and async-tx
Date: Mon, 01 Oct 2007 12:32:16 +0200	[thread overview]
Message-ID: <20071001103216.433742408A@gemini.denx.de> (raw)
In-Reply-To: Your message of "Mon, 01 Oct 2007 10:16:00 BST." <8a24fb800710010216m21cd7734p4c19df1aa7dd5564@mail.gmail.com>

Dear Dale,

in message <8a24fb800710010216m21cd7734p4c19df1aa7dd5564@mail.gmail.com> you wrote:
> 
> I have a board with an AMCC440 processor, running RAID5 using the
> async-tx interface. In general, it works well, but I have found a test
> case that consistently causes a hard lockup of the entire system.

Please make sure to use latest code - we found a bug recently.

> What makes this case odd is that I have only been able to generate it
> when accessing disks that are on two separate HBAs - in my case
> mpt-fusion based SAS HBAs. Once two HBAs are in use, the bug is
> trivial to repeat. I simply create a RAID5 using disks from each HBA,
> wait for it to resync, and then run

We saw similar problems, in our case they showed up only with a large
number of disks in combination with big kernel pages sizes (64 kB).

> Any pointers on how to debug this? It feels like a race condition of
> some description, but any serial port printing I enable causes the
> problem to go away, and I can't print silently to /var/log/messages as
> the system hangs before it can flush.

See above - please try current code.

Best regards,

Wolfgang Denk

-- 
DENX Software Engineering GmbH,     MD: Wolfgang Denk & Detlev Zundel
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: wd@denx.de
HR Manager to job candidate "I see you've had no  computer  training.
Although  that  qualifies  you  for upper management, it means you're
under-qualified for our entry level positions."

  parent reply	other threads:[~2007-10-01 10:32 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-10-01  9:16 RAID5 lockup with AMCC440 and async-tx Dale Dunlea
2007-10-01 10:13 ` Justin Piszcz
2007-10-01 10:32 ` Wolfgang Denk [this message]
2007-10-01 11:02   ` Dale Dunlea
2007-10-01 17:39     ` Wolfgang Denk
2007-10-01 19:25       ` Dale Dunlea

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20071001103216.433742408A@gemini.denx.de \
    --to=wd@denx.de \
    --cc=ddunlea@gmail.com \
    --cc=linux-raid@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).