From: Brian King <brking@linux.vnet.ibm.com>
To: Tejun Heo <tj@kernel.org>
Cc: wenxiong@linux.vnet.ibm.com, jgarzik@pobox.com,
linux-ide@vger.kernel.org, Wen Xiong <wenxiong@us.ibm.com>
Subject: Re: [PATCH] ahci: Add support for EEH error recovery
Date: Thu, 14 May 2015 11:09:56 -0500
Message-ID: <5554C8D4.8080400@linux.vnet.ibm.com>
In-Reply-To: <20150514154804.GK11388@htj.duckdns.org>
On 05/14/2015 10:48 AM, Tejun Heo wrote:
> Hello, Brian.
>
> On Thu, May 14, 2015 at 10:44:18AM -0500, Brian King wrote:
>> So, on the Power platform, the pci_error_handlers map to our EEH recovery.
>
> What's EEH?
It stands for "Extended Error Handling".
http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/tree/Documentation/PCI/pci-error-recovery.txt
http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/tree/Documentation/powerpc/eeh-pci-error-recovery.txt
>
>> In that case, without this patch, if we hit any sort of PCIe error, we
>> won't be able to recover and we'll lose all access to the ahci disks.
>> This could be the adapter trying to access an invalid DMA address due
>> to a transient hardware issue, or it could be due to a driver bug giving
>> the adapter an invalid address. It could also be various other PCIe
>> errors that cause our PCIe bridge chip to isolate the device and
>> place it into the EEH "frozen" state. When this occurs, if the driver
>> associated with the hardware does not have these handlers registered,
>> powerpc arch kernel code will hotplug remove the adapter, recover the
>> adapter, then hotplug add it back. This works OK for some devices,
>> but generally not so well for storage devices with mounted filesystems,
>> which would tend to go read-only in this case.
>
> I think the above, with more details on how the error handling
> actually works (IOW what it does), should be in the patch description
> and comments. Wen, can you please update the patch with more
> information?
Agreed.
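As a rough illustration of the two paths described in the quoted text above,
here is a small userspace model. It is only a sketch: every name in it
(model_dev, model_handlers, model_platform_recover, and so on) is hypothetical
and is not a kernel or EEH API. It assumes a simplified flow in which the
platform notifies the driver, resets the slot, and then lets the driver resume,
and it only models the reset path.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace model; none of these names are kernel APIs. */
enum model_result {
    MODEL_NEED_RESET,   /* driver asks the platform to reset the slot */
    MODEL_RECOVERED,    /* driver considers the device usable again */
    MODEL_DISCONNECT    /* driver gives up on the device */
};

struct model_dev {
    bool frozen;         /* bridge has isolated the device */
    bool bound;          /* driver attached, so its disks still exist */
    int hotplug_cycles;  /* remove/add cycles forced by the platform */
    int resumes;         /* times the driver was told to resume I/O */
};

struct model_handlers {
    enum model_result (*error_detected)(struct model_dev *dev);
    enum model_result (*slot_reset)(struct model_dev *dev);
    void (*resume)(struct model_dev *dev);
};

/* Example driver callbacks: quiesce, reinitialize after reset, restart I/O. */
static enum model_result drv_error_detected(struct model_dev *dev)
{
    (void)dev;  /* a real driver would stop issuing I/O here */
    return MODEL_NEED_RESET;
}

static enum model_result drv_slot_reset(struct model_dev *dev)
{
    /* The model only recovers if the platform has already unfrozen the slot. */
    return dev->frozen ? MODEL_DISCONNECT : MODEL_RECOVERED;
}

static void drv_resume(struct model_dev *dev)
{
    dev->resumes++;
}

const struct model_handlers model_driver_handlers = {
    .error_detected = drv_error_detected,
    .slot_reset = drv_slot_reset,
    .resume = drv_resume,
};

/* Model of what the platform does once the bridge freezes the device. */
void model_platform_recover(struct model_dev *dev,
                            const struct model_handlers *h)
{
    assert(dev != NULL);
    dev->frozen = true;

    if (h == NULL) {
        /* No handlers registered: hotplug remove, reset, hotplug add. */
        dev->bound = false;
        dev->frozen = false;
        dev->hotplug_cycles++;
        dev->bound = true;
        return;
    }

    /* Handlers registered: only the reset path is modeled here. */
    if (h->error_detected(dev) == MODEL_NEED_RESET) {
        dev->frozen = false;  /* platform resets the slot */
        if (h->slot_reset(dev) == MODEL_RECOVERED) {
            h->resume(dev);
            return;
        }
    }
    dev->bound = false;  /* recovery failed: device is lost */
}
```

Calling model_platform_recover() with a NULL handler table takes the hotplug
path, where the device is briefly unbound; passing model_driver_handlers keeps
the driver bound throughout, which is the behavior the patch is after for
disks with mounted filesystems.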
Thanks,
Brian
--
Brian King
Power Linux I/O
IBM Linux Technology Center