linux-pci.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jon Masters <jcm@redhat.com>
To: Bjorn Helgaas <bhelgaas@google.com>, Duc Dang <dhdang@apm.com>
Cc: Tanmay Inamdar <tinamdar@apm.com>,
	"linux-pci@vger.kernel.org" <linux-pci@vger.kernel.org>,
	linux-arm <linux-arm-kernel@lists.infradead.org>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: X-Gene: Unhandled fault: synchronous external abort in pci_generic_config_read32
Date: Sat, 05 Sep 2015 16:22:02 -0400	[thread overview]
Message-ID: <55EB4EEA.2070409@redhat.com> (raw)
In-Reply-To: <55EB4CD8.8080405@redhat.com>

On 09/05/2015 04:13 PM, Jon Masters wrote:
> On 08/11/2015 03:28 PM, Bjorn Helgaas wrote:
>> On Mon, Aug 10, 2015 at 2:07 PM, Duc Dang <dhdang@apm.com> wrote:
>>> On Mon, Aug 10, 2015 at 10:42 AM, Bjorn Helgaas <bhelgaas@google.com> wrote:
>>>> On Mon, Aug 10, 2015 at 12:16 PM, Duc Dang <dhdang@apm.com> wrote:
>>>>> On Monday, August 10, 2015, Bjorn Helgaas <bhelgaas@google.com> wrote:
>>>>>>
>>>>>> On Fri, Jul 31, 2015 at 12:00 PM, Duc Dang <dhdang@apm.com> wrote:
>>>>>>> On Wed, Jul 29, 2015 at 8:55 AM, Bjorn Helgaas <bhelgaas@google.com>
>>>>>>> wrote:
>>>>>>>> On Tue, Jul 28, 2015 at 08:22:55PM -0500, Bjorn Helgaas wrote:
>>>>>>>>> On Tue, Jul 28, 2015 at 02:50:39PM -0700, Duc Dang wrote:
>>>>>>>>
>>>>>>>>>> Do you have another PCIe card to try on the same reboot test on this
>>>>>>>>>> board?
>>>>>>>>>
>>>>>>>>> I've seen this on at least two Mellanox cards.  I'm running similar
>>>>>>>>> tests
>>>>>>>>> on a different type of card now.
>>>>>>>>
>>>>>>>> FWIW, reboot tests on two machines with Mellanox cards failed, while
>>>>>>>> the
>>>>>>>> same test on a machine with a different proprietary card succeeded.
>>>>>>>
>>>>>>> Thanks, Bjorn.
>>>>>>>
>>>>>>> I don't have the same Mellanox card as yours, but I will also run
>>>>>>> similar reboot test to see if I hit the same issue with my card.
>>>>>>
>>>>>> Any more hints on this?  Nothing has changed on my end, so of course
>>>>>> I'm still seeing this, always on machines with Mellanox, and never on
>>>>>> other machines.  Could this be a hardware issue like a signal
>>>>>> integrity or margin issue?  I don't know where to go from here because
>>>>>> I'm not a hardware person, and I don't know anything to do in
>>>>>> software.
>>>>>
>>>>>
>>>>> Hi Bjorn,
>>>>>
>>>>> I tried to run similar reboot tests on 2 different Mellanox cards (Connect-X
>>>>> family, one card has 2 10G interfaces, the other one has 1 port that
>>>>> supports InfiniBand) with U-Boot 1.15.12 and linux 4.2-rc5 and I did not see
>>>>> the crash that you encounterred.
>>>>>
>>>>> Did you check if your Mellanox cards have latest firmware? I did see some
>>>>> link issues on my Mellanox cards with its old firmware before.
>>>>
>>>> Good idea; I'll check that, too.  Also, I just learned that these
>>>> cards on installed with an extender card because of some space issues,
>>>> so we're going to test again without the extender.
>>>
>>> Hi Bjorn,
>>>
>>> Are other cards that passed your test installed directly to the
>>> on-board PCIe slot?
>>> If yes, then this is a good data point and it will be useful to test
>>> the case where
>>> your Mellanox cards are directly installed into the on-board PCIe slot.
>>
>> The cards that passed the test were installed directly, with  no
>> extender.  We removed the extender from one of the machines with the
>> Mellanox card and have not seen this issue since then.  I think it's
>> very likely that the problem is related to using the extender.
> 
> If you're trying to use Mellanox cards in (for example) an APM Mustang
> like system with a PCIe extender card (for example a 90 degree angle
> adjustment for a low profile server case), you might want to ping me
> offline. I have procured a number of these over the past couple of years
> for my home lab and have found one that works (almost) reliably on that
> particular hardware platform and does 10G in my home lab.

Traveling for the holiday, but I guess it doesn't need to be a secret. I
think I have found some success with this one (but I have ordered many
different ones over the past year so will confirm next week):

http://www.amazon.com/gp/product/B00H8VVD00?psc=1&redirect=true&ref_=oh_aui_search_detailpage

Specifically, the fixed angle adapter brackets generally DO NOT work.

Jon.


  reply	other threads:[~2015-09-05 20:22 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-07-24 22:42 X-Gene: Unhandled fault: synchronous external abort in pci_generic_config_read32 Bjorn Helgaas
2015-07-25  0:05 ` Duc Dang
2015-07-27 11:36   ` Catalin Marinas
2015-07-28 17:39     ` Duc Dang
2015-07-28 18:36       ` Bjorn Helgaas
2015-07-28 16:43   ` Bjorn Helgaas
2015-07-28 17:45     ` Duc Dang
2015-07-28 21:29       ` Bjorn Helgaas
2015-07-28 21:50         ` Duc Dang
2015-07-29  1:22           ` Bjorn Helgaas
2015-07-29 15:55             ` Bjorn Helgaas
2015-07-31 17:00               ` Duc Dang
2015-08-10 16:18                 ` Bjorn Helgaas
2015-08-10 17:38                   ` Catalin Marinas
     [not found]                   ` <CADaLNDkUQHzGACfFmYDeJWnaNrKmJUDx4Rby60OWr4FzOjx3rA@mail.gmail.com>
2015-08-10 17:42                     ` Bjorn Helgaas
2015-08-10 19:07                       ` Duc Dang
2015-08-11 19:28                         ` Bjorn Helgaas
2015-09-05 20:13                           ` Jon Masters
2015-09-05 20:22                             ` Jon Masters [this message]
2016-04-13  9:58         ` Sudeep Holla
2016-04-13 13:21           ` Bjorn Helgaas
2016-04-13 13:29             ` Sudeep Holla
2016-04-13 22:17               ` Jon Masters
2015-07-28 14:37 ` Dall, Elizabeth J

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=55EB4EEA.2070409@redhat.com \
    --to=jcm@redhat.com \
    --cc=bhelgaas@google.com \
    --cc=dhdang@apm.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-pci@vger.kernel.org \
    --cc=tinamdar@apm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).