linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* RAID5 lockup with AMCC440 and async-tx
@ 2007-10-01  9:16 Dale Dunlea
  2007-10-01 10:13 ` Justin Piszcz
  2007-10-01 10:32 ` Wolfgang Denk
  0 siblings, 2 replies; 6+ messages in thread
From: Dale Dunlea @ 2007-10-01  9:16 UTC (permalink / raw)
  To: linux-raid

Hi,

I have a board with an AMCC440 processor, running RAID5 using the
async-tx interface. In general, it works well, but I have found a test
case that consistently causes a hard lockup of the entire system.

What makes this case odd is that I have only been able to generate it
when accessing disks that are on two separate HBAs - in my case
mpt-fusion based SAS HBAs. Once two HBAs are in use, the bug is
trivial to repeat. I simply create a RAID5 using disks from each HBA,
wait for it to resync, and then run

"dd if=/dev/zero of=/dev/md0 bs=512 count=100000".

By disabling CONFIG_DMA_ENGINE in my kernel config, the hang goes
away, but then so does my performance.

Any pointers on how to debug this? It feels like a race condition of
some description, but any serial port printing I enable causes the
problem to go away, and I can't print silently to /var/log/messages as
the system hangs before it can flush.

Regards,
Dale

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2007-10-01 19:25 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-10-01  9:16 RAID5 lockup with AMCC440 and async-tx Dale Dunlea
2007-10-01 10:13 ` Justin Piszcz
2007-10-01 10:32 ` Wolfgang Denk
2007-10-01 11:02   ` Dale Dunlea
2007-10-01 17:39     ` Wolfgang Denk
2007-10-01 19:25       ` Dale Dunlea

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).