public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* aacraid controller hangs if kernel uses non-default ASPM policy
@ 2011-11-11  9:42 Vasily Averin
  2011-11-11 12:27 ` James Bottomley
  2011-11-11 13:09 ` Matthew Garrett
  0 siblings, 2 replies; 9+ messages in thread
From: Vasily Averin @ 2011-11-11  9:42 UTC (permalink / raw)
  To: linux-scsi, Adaptec OEM Raid Solutions
  Cc: Matthew Garrett, James Bottomley, Mark Salyzyn

Aacraid controller can hang on some nodes if kernel uses non-default (powersave) ASPM policy.
Controller hangs shortly after successful load and hardware detection. Scsi error handler detects this hang and tries to restart hardware but it does not help.

Initially it was noticed on RHEL6-based openVZ kernel after backporting aacraid driver from mainline (RHEL6 kernel with original driver works well)
http://bugzilla.openvz.org/show_bug.cgi?id=2043

This issue happens because default ASPM policy was changed in Red Hat kernels. Therefore guys from Red Hat have noticed this problem long time ago:
on Fedora 12
 https://bugzilla.redhat.com/show_bug.cgi?id=540478
on Fedora 14
 https://bugzilla.redhat.com/show_bug.cgi?id=679385

In RHEL6 kernel this issue was fixed, ASPM was disabled in aacraid driver. In kernel changelog I've found that seems it was done by Matthew Garrett: 
- [scsi] aacraid: Disable ASPM by default (Matthew Garrett) [599735]

However seems this patch was not submitted to mainline. I've reproduced this issue on vanilla 3.1.0 kernel booted with "pcie_aspm.policy=powersave" option,
So I believe it makes sense to do it now.

I've reviewed similar issues and found that similar troubles happen with another hardware too. For example similar patch can be found in e1000 driver.

Btw. It's funny that this problem was not fixed even in newly released Fedora 16 kernel: default policy was changed, but driver was not patched.

Thank you,
	Vasily Averin

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2011-11-11 14:41 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-11-11  9:42 aacraid controller hangs if kernel uses non-default ASPM policy Vasily Averin
2011-11-11 12:27 ` James Bottomley
2011-11-11 13:11   ` Matthew Garrett
2011-11-11 13:47     ` Vasily Averin
2011-11-11 13:53       ` Matthew Garrett
2011-11-11 13:09 ` Matthew Garrett
2011-11-11 14:15   ` James Bottomley
2011-11-11 14:20     ` Mark Salyzyn
2011-11-11 14:41       ` James Bottomley

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox