From: Robin Murphy <robin.murphy@arm.com>
To: Sajid Dalvi <sdalvi@google.com>
Cc: "Ajay Agarwal" <ajayagarwal@google.com>,
"Jingoo Han" <jingoohan1@gmail.com>,
"Gustavo Pimentel" <gustavo.pimentel@synopsys.com>,
"Manivannan Sadhasivam" <mani@kernel.org>,
"Lorenzo Pieralisi" <lpieralisi@kernel.org>,
"Krzysztof Wilczyński" <kw@linux.com>,
"Rob Herring" <robh@kernel.org>,
"Bjorn Helgaas" <bhelgaas@google.com>,
"Manu Gautam" <manugautam@google.com>,
"William McVicker" <willmcvicker@google.com>,
"Serge Semin" <fancer.lancer@gmail.com>,
linux-pci@vger.kernel.org
Subject: Re: [PATCH v2] PCI: dwc: Strengthen the MSI address allocation logic
Date: Tue, 16 Jan 2024 21:02:48 +0000 [thread overview]
Message-ID: <5ef31b1c-3069-4da7-8124-44efba0ad718@arm.com> (raw)
In-Reply-To: <CAEbtx1=hoDTtpkavk7gp5tmcvdYj6euAuDsQYRPW=EDeVsbUqg@mail.gmail.com>
On 2024-01-16 5:18 pm, Sajid Dalvi wrote:
> On Tue, Jan 16, 2024 at 7:30 AM Robin Murphy <robin.murphy@arm.com> wrote:
>>
>> On 2024-01-11 4:21 am, Ajay Agarwal wrote:
>>> There can be platforms that do not use/have 32-bit DMA addresses
>>> but want to enumerate endpoints which support only 32-bit MSI
>>> address. The current implementation of 32-bit IOVA allocation can
>>> fail for such platforms, eventually leading to the probe failure.
>>>
>>> If there is a memory region reserved for the pci->dev, pick up
>>> the MSI data from this region. This can be used by the platforms
>>> described above.
>>>
>>> Else, if the memory region is not reserved, try to allocate a
>>> 32-bit IOVA. Additionally, if this allocation also fails, attempt
>>> a 64-bit allocation for probe to be successful. If the 64-bit MSI
>>> address is allocated, then the EPs supporting 32-bit MSI address
>>> will not work.
>>>
>>> Signed-off-by: Ajay Agarwal <ajayagarwal@google.com>
>>> ---
>>> Changelog since v1:
>>> - Use reserved memory, if it exists, to setup the MSI data
>>> - Fallback to 64-bit IOVA allocation if 32-bit allocation fails
>>>
>>> .../pci/controller/dwc/pcie-designware-host.c | 50 ++++++++++++++-----
>>> drivers/pci/controller/dwc/pcie-designware.h | 1 +
>>> 2 files changed, 39 insertions(+), 12 deletions(-)
>>>
>>> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c
> b/drivers/pci/controller/dwc/pcie-designware-host.c
>>> index 7991f0e179b2..8c7c77b49ca8 100644
>>> --- a/drivers/pci/controller/dwc/pcie-designware-host.c
>>> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c
>>> @@ -331,6 +331,8 @@ static int dw_pcie_msi_host_init(struct dw_pcie_rp
> *pp)
>>> u64 *msi_vaddr;
>>> int ret;
>>> u32 ctrl, num_ctrls;
>>> + struct device_node *np;
>>> + struct resource r;
>>>
>>> for (ctrl = 0; ctrl < MAX_MSI_CTRLS; ctrl++)
>>> pp->irq_mask[ctrl] = ~0;
>>> @@ -374,20 +376,44 @@ static int dw_pcie_msi_host_init(struct
> dw_pcie_rp *pp)
>>> * order not to miss MSI TLPs from those devices the MSI target
>>> * address has to be within the lowest 4GB.
>>> *
>>> - * Note until there is a better alternative found the reservation
> is
>>> - * done by allocating from the artificially limited DMA-coherent
>>> - * memory.
>>> + * Check if there is memory region reserved for this device. If
> yes,
>>> + * pick up the msi_data from this region. This will be helpful for
>>> + * platforms that do not use/have 32-bit DMA addresses but want
> to use
>>> + * endpoints which support only 32-bit MSI address.
>>> + * Else, if the memory region is not reserved, try to allocate a
> 32-bit
>>> + * IOVA. Additionally, if this allocation also fails, attempt a
> 64-bit
>>> + * allocation. If the 64-bit MSI address is allocated, then the
> EPs
>>> + * supporting 32-bit MSI address will not work.
>>> */
>>> - ret = dma_set_coherent_mask(dev, DMA_BIT_MASK(32));
>>> - if (ret)
>>> - dev_warn(dev, "Failed to set DMA mask to 32-bit. Devices
> with only 32-bit MSI support may not work properly\n");
>>> + np = of_parse_phandle(dev->of_node, "memory-region", 0);
>>> + if (np) {
>>> + ret = of_address_to_resource(np, 0, &r);
>>
>> This is incorrect in several ways - reserved-memory nodes represent
>> actual system memory, so can't be used to reserve arbitrary PCI memory
>> space (which may be different if DMA offsets are involved); the whole
>> purpose of going through the DMA API is to ensure we get a unique *bus*
>> address. Obviously we don't want to reserve actual memory for something
>> that functionally doesn't need it, but conversely having a
>> reserved-memory region for an address which isn't memory would be
>> nonsensical. And even if this *were* a viable approach, you haven't
>> updated the DWC binding to allow it, nor defined a reserved-memory
>> binding for the node itself.
>>
>> If it was reasonable to put something in DT at all, then the logical
>> thing would be a property expressing an MSI address directly on the
>> controller node itself, but even that would be dictating software policy
>> rather than describing an aspect of the platform itself. Plus this is
>> far from the only driver with this concern, so it wouldn't make much
>> sense to hack just one binding without all the others as well. The rest
>> of the DT already describes everything an OS needs to know in order to
>> decide an MSI address to use, it's just a matter of making this
>> particular OS do a better job of putting it all together.
>>
>> Thanks,
>> Robin.
>>
>
> Robin,
> Needed some clarification.
> It seems you are implying that the pcie device tree node should define a
> property for the MSI address within the PCIe address space.
> However, you also state that this would not be an ideal solution, and
> would prefer using existing device tree constructs.
> I am not sure what you mean by, " The rest of the DT already describes
> everything."
> Do you mean adding an "msi" reg to reg-names and defining the address
> in the reg list?
No, I'm saying the closest this should come to DT at all is the
possibility of the low-level driver hard-coding a platform-specific
value for pp->msi_data based on some platform-specific compatible, as
Serge pointed to on v1.
Otherwise, based on the system memory layout and dma-ranges of the
controller node we have enough information to figure out what PCI bus
address ranges can't collide with any valid DMA mapping of RAM, and thus
generate a suitable MSI address, but that really wants to be a generic
PCI-layer helper (which could also generically implement the various DMA
API tricks as a fallback if necessary).
Thanks,
Robin.
next prev parent reply other threads:[~2024-01-16 21:02 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-01-11 4:21 [PATCH v2] PCI: dwc: Strengthen the MSI address allocation logic Ajay Agarwal
2024-01-11 17:38 ` William McVicker
2024-01-11 18:02 ` Ajay Agarwal
2024-01-12 10:04 ` Serge Semin
2024-01-16 13:30 ` Robin Murphy
2024-01-16 20:47 ` Sajid Dalvi
[not found] ` <CAEbtx1=hoDTtpkavk7gp5tmcvdYj6euAuDsQYRPW=EDeVsbUqg@mail.gmail.com>
2024-01-16 21:02 ` Robin Murphy [this message]
2024-01-19 5:58 ` Ajay Agarwal
2024-01-30 17:22 ` Ajay Agarwal
2024-01-30 17:39 ` Robin Murphy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5ef31b1c-3069-4da7-8124-44efba0ad718@arm.com \
--to=robin.murphy@arm.com \
--cc=ajayagarwal@google.com \
--cc=bhelgaas@google.com \
--cc=fancer.lancer@gmail.com \
--cc=gustavo.pimentel@synopsys.com \
--cc=jingoohan1@gmail.com \
--cc=kw@linux.com \
--cc=linux-pci@vger.kernel.org \
--cc=lpieralisi@kernel.org \
--cc=mani@kernel.org \
--cc=manugautam@google.com \
--cc=robh@kernel.org \
--cc=sdalvi@google.com \
--cc=willmcvicker@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox