public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
From: Vasily Averin <vvs@parallels.com>
To: linux-scsi@vger.kernel.org,
	Adaptec OEM Raid Solutions <aacraid@adaptec.com>
Cc: Matthew Garrett <mjg@redhat.com>,
	James Bottomley <jbottomley@parallels.com>,
	Mark Salyzyn <mark_salyzyn@us.xyratex.com>
Subject: aacraid controller hangs if kernel uses non-default ASPM policy
Date: Fri, 11 Nov 2011 13:42:05 +0400	[thread overview]
Message-ID: <4EBCEDED.7030907@parallels.com> (raw)

Aacraid controller can hang on some nodes if kernel uses non-default (powersave) ASPM policy.
Controller hangs shortly after successful load and hardware detection. Scsi error handler detects this hang and tries to restart hardware but it does not help.

Initially it was noticed on RHEL6-based openVZ kernel after backporting aacraid driver from mainline (RHEL6 kernel with original driver works well)
http://bugzilla.openvz.org/show_bug.cgi?id=2043

This issue happens because default ASPM policy was changed in Red Hat kernels. Therefore guys from Red Hat have noticed this problem long time ago:
on Fedora 12
 https://bugzilla.redhat.com/show_bug.cgi?id=540478
on Fedora 14
 https://bugzilla.redhat.com/show_bug.cgi?id=679385

In RHEL6 kernel this issue was fixed, ASPM was disabled in aacraid driver. In kernel changelog I've found that seems it was done by Matthew Garrett: 
- [scsi] aacraid: Disable ASPM by default (Matthew Garrett) [599735]

However seems this patch was not submitted to mainline. I've reproduced this issue on vanilla 3.1.0 kernel booted with "pcie_aspm.policy=powersave" option,
So I believe it makes sense to do it now.

I've reviewed similar issues and found that similar troubles happen with another hardware too. For example similar patch can be found in e1000 driver.

Btw. It's funny that this problem was not fixed even in newly released Fedora 16 kernel: default policy was changed, but driver was not patched.

Thank you,
	Vasily Averin

             reply	other threads:[~2011-11-11  9:42 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-11-11  9:42 Vasily Averin [this message]
2011-11-11 12:27 ` aacraid controller hangs if kernel uses non-default ASPM policy James Bottomley
2011-11-11 13:11   ` Matthew Garrett
2011-11-11 13:47     ` Vasily Averin
2011-11-11 13:53       ` Matthew Garrett
2011-11-11 13:09 ` Matthew Garrett
2011-11-11 14:15   ` James Bottomley
2011-11-11 14:20     ` Mark Salyzyn
2011-11-11 14:41       ` James Bottomley

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4EBCEDED.7030907@parallels.com \
    --to=vvs@parallels.com \
    --cc=aacraid@adaptec.com \
    --cc=jbottomley@parallels.com \
    --cc=linux-scsi@vger.kernel.org \
    --cc=mark_salyzyn@us.xyratex.com \
    --cc=mjg@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox