From: Jayachandran C <jnair@caviumnetworks.com>
To: Bjorn Helgaas <bhelgaas@google.com>
Cc: Bjorn Helgaas <helgaas@kernel.org>,
"linux-pci@vger.kernel.org" <linux-pci@vger.kernel.org>,
"open list:INTEL IOMMU (VT-d)" <iommu@lists.linux-foundation.org>,
Robin Murphy <robin.murphy@arm.com>, Jon Masters <jcm@redhat.com>,
linux-arm <linux-arm-kernel@lists.infradead.org>,
Joerg Roedel <joro@8bytes.org>
Subject: Re: [PATCH v5 2/2] PCI: quirks: Fix ThunderX2 dma alias handling
Date: Mon, 17 Apr 2017 17:47:13 +0000 [thread overview]
Message-ID: <20170417174712.GA7575@localhost> (raw)
In-Reply-To: <CAErSpo731xSzrnKF+G+8SA_UUHy7ROA1V9oFkpwKi85GQf9VAg@mail.gmail.com>
On Fri, Apr 14, 2017 at 09:00:06PM -0500, Bjorn Helgaas wrote:
> On Fri, Apr 14, 2017 at 4:06 PM, Jayachandran C
> <jnair@caviumnetworks.com> wrote:
> > On Thu, Apr 13, 2017 at 07:19:11PM -0500, Bjorn Helgaas wrote:
> >> I tentatively applied both patches to pci/host-thunder for v4.12.
> >>
> >> However, I am concerned about the topology here:
> >>
> >> On Thu, Apr 13, 2017 at 08:30:45PM +0000, Jayachandran C wrote:
> >> > On Cavium ThunderX2 arm64 SoCs (called Broadcom Vulcan earlier), the
> >> > PCI topology is slightly unusual. For a multi-node system, it looks
> >> > like:
> >> >
> >> > 00:00.0 [PCI] bridge to [bus 01-1e]
> >> > 01:0a.0 [PCI-PCIe bridge, type 8] bridge to [bus 02-04]
> >> > 02:00.0 [PCIe root port, type 4] bridge to [bus 03-04] (XLATE_ROOT)
> >> > 03:00.0 PCIe Endpoint
> >>
> >> A root port normally has a single PCIe link leading downstream.
> >> According to this, 02:00.0 is a root port that has the usual
> >> downstream link leading to 03:00.0, but it also has an upstream link
> >> to 01:0a.0.
> >
> > The PCI topology is a bit broken due to the way that the PCIe IP block
> > was integrated into SoC PCI bridges and devices. The current mechanism
> > of adding a PCI-PCIe bridge to glue these together is not ideal.
>
> Yeah, that's definitely broken.
>
> >> Maybe this example is omitting details that are not relevant to DMA
> >> aliases? The PCIe capability only contains one set of link-related
> >> registers, so I don't know how we could manage a single device that
> >> has two links.
> >
> > The root port is standard and has just one link to the EP (or whatever
> > is on the external PCIe slot). The fallout of the current hw design is
> > that the PCI-PCIe bridge has a link that does not follow standard and
> > does not have a counterpart (as you noted).
> >
> >> A device with two links would break things like ASPM. In
> >> set_pcie_port_type(), for example, we have this comment:
> >>
> >> * A Root Port or a PCI-to-PCIe bridge is always the upstream end
> >> * of a Link. No PCIe component has two Links. Two Links are
> >> * connected by a Switch that has a Port on each Link and internal
> >> * logic to connect the two Ports.
> >>
> >> The topology above breaks these assumptions, which will make
> >> pdev->has_secondary_link incorrect, which means ASPM won't work
> >> correctly.
> >
> > Given the current hardware, the pcieport driver seems to work reasonably
> > for the root port at 02:00.0, with AER support. I will take a look at the
> > ASPM part.
>
> I don't think pcieport itself cares much about links. ASPM does, but
> it looks like set_pcie_port_type() actually is smart enough to know
> that PCI-to-PCIe bridges and Root Ports always have links on their
> secondary sides. So has_secondary_link probably does get set
> correctly.
>
> But I think you'll hit the VIA "strange chipset" thing in
> pcie_aspm_init_link_state(), which will probably prevent us from doing
> ASPM on the link from 02:00.0 to 03:00.0.
>
> Could you collect "lspci -vv" output from this system? I'd like to
> archive that as background for this IOMMU issue and the ASPM tweaks I
> suspect we'll have to do. I *wish* we had more information about that
> VIA thing, because I suspect we could get rid of it if we had more
> details.
The full logs are slightly large, so I have kept them at:
https://github.com/jchandra-cavm/thunderx2/blob/master/logs/
The lspci -vv output is lspci-vv.txt and lspci -tvn output is lspci-tvn.txt
The output is from 2 socket system, the cards are not on the first slot
like the example above, so the bus and device numbers are different.
Looks like I have to spend some time on ASPM next.
Thanks,
JC.
next prev parent reply other threads:[~2017-04-17 17:47 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-04-13 20:30 [PATCH v5 0/2] Handle Cavium ThunderX2 PCI topology quirk Jayachandran C
2017-04-13 20:30 ` [PATCH v5 1/2] PCI: Add device flag PCI_DEV_FLAGS_BRIDGE_XLATE_ROOT Jayachandran C
2017-04-13 20:30 ` [PATCH v5 2/2] PCI: quirks: Fix ThunderX2 dma alias handling Jayachandran C
2017-04-14 0:19 ` Bjorn Helgaas
2017-04-14 21:06 ` Jayachandran C
2017-04-15 2:00 ` Bjorn Helgaas
2017-04-17 17:47 ` Jayachandran C [this message]
2017-04-17 19:51 ` Bjorn Helgaas
2017-04-21 15:48 ` Bjorn Helgaas
2017-04-21 17:05 ` Jayachandran C
2017-04-21 17:57 ` Bjorn Helgaas
2017-04-25 13:03 ` Jayachandran C
2017-04-25 13:37 ` Bjorn Helgaas
2017-04-19 23:38 ` Jon Masters
2017-04-20 0:25 ` Jon Masters
2017-04-20 13:20 ` Bjorn Helgaas
2017-04-20 15:12 ` Jon Masters
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170417174712.GA7575@localhost \
--to=jnair@caviumnetworks.com \
--cc=bhelgaas@google.com \
--cc=helgaas@kernel.org \
--cc=iommu@lists.linux-foundation.org \
--cc=jcm@redhat.com \
--cc=joro@8bytes.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-pci@vger.kernel.org \
--cc=robin.murphy@arm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).