* [PATCH v4 1/4] arm64: dts: qcom: sc7280: Increase config size to 256MB for ECAM feature
2025-02-06 23:28 [PATCH v4 0/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
@ 2025-02-06 23:28 ` Krishna Chaitanya Chundru
2025-02-06 23:28 ` [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
` (2 subsequent siblings)
3 siblings, 0 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-06 23:28 UTC (permalink / raw)
To: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han
Cc: linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy, Krishna Chaitanya Chundru
PCIe ECAM(Enhanced Configuration Access Mechanism) feature requires
maximum of 256MB configuration space.
To enable this feature increase configuration space size to 256MB. If
the config space is increased, the BAR space needs to be truncated as
it resides in the same location. To avoid the bar space truncation move
config space, DBI, ELBI, iATU to upper PCIe region and use lower PCIe
iregion entirely for BAR region.
This depends on the commit: '10ba0854c5e6 ("PCI: qcom: Disable mirroring
of DBI and iATU register space in BAR region")'
Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
Reviewed-by: Manivannan Sadhasivam <manivannan.sadhasivam@linaro.org>
---
arch/arm64/boot/dts/qcom/sc7280.dtsi | 12 ++++++------
1 file changed, 6 insertions(+), 6 deletions(-)
diff --git a/arch/arm64/boot/dts/qcom/sc7280.dtsi b/arch/arm64/boot/dts/qcom/sc7280.dtsi
index 0f2caf36910b..64c46221d8bf 100644
--- a/arch/arm64/boot/dts/qcom/sc7280.dtsi
+++ b/arch/arm64/boot/dts/qcom/sc7280.dtsi
@@ -2201,10 +2201,10 @@ wifi: wifi@17a10040 {
pcie1: pcie@1c08000 {
compatible = "qcom,pcie-sc7280";
reg = <0 0x01c08000 0 0x3000>,
- <0 0x40000000 0 0xf1d>,
- <0 0x40000f20 0 0xa8>,
- <0 0x40001000 0 0x1000>,
- <0 0x40100000 0 0x100000>;
+ <4 0x00000000 0 0xf1d>,
+ <4 0x00000f20 0 0xa8>,
+ <4 0x10000000 0 0x1000>,
+ <4 0x00000000 0 0x10000000>;
reg-names = "parf", "dbi", "elbi", "atu", "config";
device_type = "pci";
@@ -2215,8 +2215,8 @@ pcie1: pcie@1c08000 {
#address-cells = <3>;
#size-cells = <2>;
- ranges = <0x01000000 0x0 0x00000000 0x0 0x40200000 0x0 0x100000>,
- <0x02000000 0x0 0x40300000 0x0 0x40300000 0x0 0x1fd00000>;
+ ranges = <0x01000000 0x0 0x00000000 0x0 0x40000000 0x0 0x100000>,
+ <0x02000000 0x0 0x40100000 0x0 0x40100000 0x0 0x1ff00000>;
interrupts = <GIC_SPI 307 IRQ_TYPE_LEVEL_HIGH>,
<GIC_SPI 308 IRQ_TYPE_LEVEL_HIGH>,
--
2.34.1
^ permalink raw reply related [flat|nested] 19+ messages in thread* [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration
2025-02-06 23:28 [PATCH v4 0/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
2025-02-06 23:28 ` [PATCH v4 1/4] arm64: dts: qcom: sc7280: Increase config size to 256MB for ECAM feature Krishna Chaitanya Chundru
@ 2025-02-06 23:28 ` Krishna Chaitanya Chundru
2025-02-10 8:37 ` Manivannan Sadhasivam
2025-03-05 18:14 ` Manivannan Sadhasivam
2025-02-06 23:28 ` [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver Krishna Chaitanya Chundru
2025-02-06 23:28 ` [PATCH v4 4/4] PCI: qcom: Enable ECAM feature Krishna Chaitanya Chundru
3 siblings, 2 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-06 23:28 UTC (permalink / raw)
To: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han
Cc: linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy, Krishna Chaitanya Chundru
The current implementation requires iATU for every configuration
space access which increases latency & cpu utilization.
Designware databook 5.20a, section 3.10.10.3 says about CFG Shift Feature,
which shifts/maps the BDF (bits [31:16] of the third header DWORD, which
would be matched against the Base and Limit addresses) of the incoming
CfgRd0/CfgWr0 down to bits[27:12]of the translated address.
Configuring iATU in config shift feature enables ECAM feature to access the
config space, which avoids iATU configuration for every config access.
Add "ctrl2" into struct dw_pcie_ob_atu_cfg to enable config shift feature.
As DBI comes under config space, this avoids remapping of DBI space
separately. Instead, it uses the mapped config space address returned from
ECAM initialization. Change the order of dw_pcie_get_resources() execution
to achieve this.
Enable the ECAM feature if the config space size is equal to size required
to represent number of buses in the bus range property, add a function
which checks this. The DWC glue drivers uses this function and decide to
enable ECAM mode or not.
Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
---
drivers/pci/controller/dwc/Kconfig | 1 +
drivers/pci/controller/dwc/pcie-designware-host.c | 133 +++++++++++++++++++---
drivers/pci/controller/dwc/pcie-designware.c | 2 +-
drivers/pci/controller/dwc/pcie-designware.h | 11 ++
4 files changed, 132 insertions(+), 15 deletions(-)
diff --git a/drivers/pci/controller/dwc/Kconfig b/drivers/pci/controller/dwc/Kconfig
index b6d6778b0698..73c3aed6b60a 100644
--- a/drivers/pci/controller/dwc/Kconfig
+++ b/drivers/pci/controller/dwc/Kconfig
@@ -9,6 +9,7 @@ config PCIE_DW
config PCIE_DW_HOST
bool
select PCIE_DW
+ select PCI_HOST_COMMON
config PCIE_DW_EP
bool
diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
index ffaded8f2df7..826ff9338646 100644
--- a/drivers/pci/controller/dwc/pcie-designware-host.c
+++ b/drivers/pci/controller/dwc/pcie-designware-host.c
@@ -418,6 +418,66 @@ static void dw_pcie_host_request_msg_tlp_res(struct dw_pcie_rp *pp)
}
}
+static int dw_pcie_config_ecam_iatu(struct dw_pcie_rp *pp)
+{
+ struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
+ struct dw_pcie_ob_atu_cfg atu = {0};
+ resource_size_t bus_range_max;
+ struct resource_entry *bus;
+ int ret;
+
+ bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
+
+ /*
+ * Root bus under the host bridge doesn't require any iATU configuration
+ * as DBI space will represent Root bus configuration space.
+ * Immediate bus under Root Bus, needs type 0 iATU configuration and
+ * remaining buses need type 1 iATU configuration.
+ */
+ atu.index = 0;
+ atu.type = PCIE_ATU_TYPE_CFG0;
+ atu.cpu_addr = pp->cfg0_base + SZ_1M;
+ atu.size = SZ_1M;
+ atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
+ ret = dw_pcie_prog_outbound_atu(pci, &atu);
+ if (ret)
+ return ret;
+
+ bus_range_max = resource_size(bus->res);
+
+ if (bus_range_max < 2)
+ return 0;
+
+ /* Configure remaining buses in type 1 iATU configuration */
+ atu.index = 1;
+ atu.type = PCIE_ATU_TYPE_CFG1;
+ atu.cpu_addr = pp->cfg0_base + SZ_2M;
+ atu.size = (SZ_1M * (bus_range_max - 2));
+ atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
+
+ return dw_pcie_prog_outbound_atu(pci, &atu);
+}
+
+static int dw_pcie_create_ecam_window(struct dw_pcie_rp *pp, struct resource *res)
+{
+ struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
+ struct device *dev = pci->dev;
+ struct resource_entry *bus;
+
+ bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
+ if (!bus)
+ return -ENODEV;
+
+ pp->cfg = pci_ecam_create(dev, res, bus->res, &pci_generic_ecam_ops);
+ if (IS_ERR(pp->cfg))
+ return PTR_ERR(pp->cfg);
+
+ pci->dbi_base = pp->cfg->win;
+ pci->dbi_phys_addr = res->start;
+
+ return 0;
+}
+
int dw_pcie_host_init(struct dw_pcie_rp *pp)
{
struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
@@ -431,10 +491,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
raw_spin_lock_init(&pp->lock);
- ret = dw_pcie_get_resources(pci);
- if (ret)
- return ret;
-
res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
if (!res) {
dev_err(dev, "Missing \"config\" reg space\n");
@@ -444,9 +500,28 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
pp->cfg0_size = resource_size(res);
pp->cfg0_base = res->start;
- pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
- if (IS_ERR(pp->va_cfg0_base))
- return PTR_ERR(pp->va_cfg0_base);
+ if (pp->ecam_mode) {
+ ret = dw_pcie_create_ecam_window(pp, res);
+ if (ret)
+ return ret;
+
+ bridge->ops = (struct pci_ops *)&pci_generic_ecam_ops.pci_ops;
+ pp->bridge->sysdata = pp->cfg;
+ pp->cfg->priv = pp;
+ } else {
+ pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
+ if (IS_ERR(pp->va_cfg0_base))
+ return PTR_ERR(pp->va_cfg0_base);
+
+ /* Set default bus ops */
+ bridge->ops = &dw_pcie_ops;
+ bridge->child_ops = &dw_child_pcie_ops;
+ bridge->sysdata = pp;
+ }
+
+ ret = dw_pcie_get_resources(pci);
+ if (ret)
+ return ret;
bridge = devm_pci_alloc_host_bridge(dev, 0);
if (!bridge)
@@ -462,14 +537,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
pp->io_base = pci_pio_to_address(win->res->start);
}
- /* Set default bus ops */
- bridge->ops = &dw_pcie_ops;
- bridge->child_ops = &dw_child_pcie_ops;
-
if (pp->ops->init) {
ret = pp->ops->init(pp);
if (ret)
- return ret;
+ goto err_free_ecam;
}
if (pci_msi_enabled()) {
@@ -504,6 +575,14 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
dw_pcie_iatu_detect(pci);
+ if (pp->ecam_mode) {
+ ret = dw_pcie_config_ecam_iatu(pp);
+ if (ret) {
+ dev_err(dev, "Failed to confuure iATU\n");
+ goto err_free_msi;
+ }
+ }
+
/*
* Allocate the resource for MSG TLP before programming the iATU
* outbound window in dw_pcie_setup_rc(). Since the allocation depends
@@ -539,8 +618,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
/* Ignore errors, the link may come up later */
dw_pcie_wait_for_link(pci);
- bridge->sysdata = pp;
-
ret = pci_host_probe(bridge);
if (ret)
goto err_stop_link;
@@ -564,6 +641,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
if (pp->ops->deinit)
pp->ops->deinit(pp);
+err_free_ecam:
+ if (pp->cfg)
+ pci_ecam_free(pp->cfg);
+
return ret;
}
EXPORT_SYMBOL_GPL(dw_pcie_host_init);
@@ -584,6 +665,9 @@ void dw_pcie_host_deinit(struct dw_pcie_rp *pp)
if (pp->ops->deinit)
pp->ops->deinit(pp);
+
+ if (pp->cfg)
+ pci_ecam_free(pp->cfg);
}
EXPORT_SYMBOL_GPL(dw_pcie_host_deinit);
@@ -999,3 +1083,24 @@ int dw_pcie_resume_noirq(struct dw_pcie *pci)
return ret;
}
EXPORT_SYMBOL_GPL(dw_pcie_resume_noirq);
+
+bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp)
+{
+ struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
+ struct platform_device *pdev = to_platform_device(pci->dev);
+ struct resource *config_res, *bus_range;
+ u64 bus_config_space_count;
+
+ bus_range = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS)->res;
+ if (!bus_range)
+ return false;
+
+ config_res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
+ if (!config_res)
+ return false;
+
+ bus_config_space_count = resource_size(config_res) >> PCIE_ECAM_BUS_SHIFT;
+
+ return !!(bus_config_space_count >= resource_size(bus_range));
+}
+EXPORT_SYMBOL_GPL(dw_pcie_ecam_supported);
diff --git a/drivers/pci/controller/dwc/pcie-designware.c b/drivers/pci/controller/dwc/pcie-designware.c
index 145e7f579072..523ca7f267fb 100644
--- a/drivers/pci/controller/dwc/pcie-designware.c
+++ b/drivers/pci/controller/dwc/pcie-designware.c
@@ -509,7 +509,7 @@ int dw_pcie_prog_outbound_atu(struct dw_pcie *pci,
val = dw_pcie_enable_ecrc(val);
dw_pcie_writel_atu_ob(pci, atu->index, PCIE_ATU_REGION_CTRL1, val);
- val = PCIE_ATU_ENABLE;
+ val = PCIE_ATU_ENABLE | atu->ctrl2;
if (atu->type == PCIE_ATU_TYPE_MSG) {
/* The data-less messages only for now */
val |= PCIE_ATU_INHIBIT_PAYLOAD | atu->code;
diff --git a/drivers/pci/controller/dwc/pcie-designware.h b/drivers/pci/controller/dwc/pcie-designware.h
index 501d9ddfea16..d0ba8855ba2a 100644
--- a/drivers/pci/controller/dwc/pcie-designware.h
+++ b/drivers/pci/controller/dwc/pcie-designware.h
@@ -20,6 +20,7 @@
#include <linux/irq.h>
#include <linux/msi.h>
#include <linux/pci.h>
+#include <linux/pci-ecam.h>
#include <linux/reset.h>
#include <linux/pci-epc.h>
@@ -171,6 +172,7 @@
#define PCIE_ATU_REGION_CTRL2 0x004
#define PCIE_ATU_ENABLE BIT(31)
#define PCIE_ATU_BAR_MODE_ENABLE BIT(30)
+#define PCIE_ATU_CFG_SHIFT_MODE_ENABLE BIT(28)
#define PCIE_ATU_INHIBIT_PAYLOAD BIT(22)
#define PCIE_ATU_FUNC_NUM_MATCH_EN BIT(19)
#define PCIE_ATU_LOWER_BASE 0x008
@@ -343,6 +345,7 @@ struct dw_pcie_ob_atu_cfg {
u8 func_no;
u8 code;
u8 routing;
+ u32 ctrl2;
u64 cpu_addr;
u64 pci_addr;
u64 size;
@@ -381,6 +384,8 @@ struct dw_pcie_rp {
int msg_atu_index;
struct resource *msg_res;
bool use_linkup_irq;
+ bool ecam_mode;
+ struct pci_config_window *cfg;
};
struct dw_pcie_ep_ops {
@@ -686,6 +691,7 @@ void dw_pcie_host_deinit(struct dw_pcie_rp *pp);
int dw_pcie_allocate_domains(struct dw_pcie_rp *pp);
void __iomem *dw_pcie_own_conf_map_bus(struct pci_bus *bus, unsigned int devfn,
int where);
+bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp);
#else
static inline int dw_pcie_suspend_noirq(struct dw_pcie *pci)
{
@@ -726,6 +732,11 @@ static inline void __iomem *dw_pcie_own_conf_map_bus(struct pci_bus *bus,
{
return NULL;
}
+
+static inline bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp)
+{
+ return 0;
+}
#endif
#ifdef CONFIG_PCIE_DW_EP
--
2.34.1
^ permalink raw reply related [flat|nested] 19+ messages in thread* Re: [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration
2025-02-06 23:28 ` [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
@ 2025-02-10 8:37 ` Manivannan Sadhasivam
2025-03-05 18:14 ` Manivannan Sadhasivam
1 sibling, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-02-10 8:37 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:57AM +0530, Krishna Chaitanya Chundru wrote:
> The current implementation requires iATU for every configuration
> space access which increases latency & cpu utilization.
>
> Designware databook 5.20a, section 3.10.10.3 says about CFG Shift Feature,
> which shifts/maps the BDF (bits [31:16] of the third header DWORD, which
> would be matched against the Base and Limit addresses) of the incoming
> CfgRd0/CfgWr0 down to bits[27:12]of the translated address.
>
> Configuring iATU in config shift feature enables ECAM feature to access the
> config space, which avoids iATU configuration for every config access.
>
> Add "ctrl2" into struct dw_pcie_ob_atu_cfg to enable config shift feature.
>
> As DBI comes under config space, this avoids remapping of DBI space
> separately. Instead, it uses the mapped config space address returned from
> ECAM initialization. Change the order of dw_pcie_get_resources() execution
> to achieve this.
>
> Enable the ECAM feature if the config space size is equal to size required
> to represent number of buses in the bus range property, add a function
> which checks this. The DWC glue drivers uses this function and decide to
> enable ECAM mode or not.
>
> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> ---
> drivers/pci/controller/dwc/Kconfig | 1 +
> drivers/pci/controller/dwc/pcie-designware-host.c | 133 +++++++++++++++++++---
> drivers/pci/controller/dwc/pcie-designware.c | 2 +-
> drivers/pci/controller/dwc/pcie-designware.h | 11 ++
> 4 files changed, 132 insertions(+), 15 deletions(-)
>
> diff --git a/drivers/pci/controller/dwc/Kconfig b/drivers/pci/controller/dwc/Kconfig
> index b6d6778b0698..73c3aed6b60a 100644
> --- a/drivers/pci/controller/dwc/Kconfig
> +++ b/drivers/pci/controller/dwc/Kconfig
> @@ -9,6 +9,7 @@ config PCIE_DW
> config PCIE_DW_HOST
> bool
> select PCIE_DW
> + select PCI_HOST_COMMON
>
> config PCIE_DW_EP
> bool
> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
> index ffaded8f2df7..826ff9338646 100644
> --- a/drivers/pci/controller/dwc/pcie-designware-host.c
> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c
> @@ -418,6 +418,66 @@ static void dw_pcie_host_request_msg_tlp_res(struct dw_pcie_rp *pp)
> }
> }
>
> +static int dw_pcie_config_ecam_iatu(struct dw_pcie_rp *pp)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct dw_pcie_ob_atu_cfg atu = {0};
> + resource_size_t bus_range_max;
> + struct resource_entry *bus;
> + int ret;
> +
> + bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
> +
> + /*
> + * Root bus under the host bridge doesn't require any iATU configuration
> + * as DBI space will represent Root bus configuration space.
> + * Immediate bus under Root Bus, needs type 0 iATU configuration and
> + * remaining buses need type 1 iATU configuration.
> + */
> + atu.index = 0;
> + atu.type = PCIE_ATU_TYPE_CFG0;
> + atu.cpu_addr = pp->cfg0_base + SZ_1M;
Why this 1MiB hole needed here? For DBI?
> + atu.size = SZ_1M;
Add a comment stating that this 1M corresponds to 1 bus, 32 devices and 8
functions. Like,
/* 1MiB is to cover 1 (bus) * 32 (devices) * 8 (functions) */
> + atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
> + ret = dw_pcie_prog_outbound_atu(pci, &atu);
> + if (ret)
> + return ret;
> +
> + bus_range_max = resource_size(bus->res);
> +
> + if (bus_range_max < 2)
> + return 0;
> +
> + /* Configure remaining buses in type 1 iATU configuration */
> + atu.index = 1;
> + atu.type = PCIE_ATU_TYPE_CFG1;
> + atu.cpu_addr = pp->cfg0_base + SZ_2M;
> + atu.size = (SZ_1M * (bus_range_max - 2));
Nit: Use,
atu.size = (SZ_1M * bus_range_max) - SZ_2M;
> + atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
> +
> + return dw_pcie_prog_outbound_atu(pci, &atu);
> +}
> +
> +static int dw_pcie_create_ecam_window(struct dw_pcie_rp *pp, struct resource *res)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct device *dev = pci->dev;
> + struct resource_entry *bus;
> +
> + bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
> + if (!bus)
> + return -ENODEV;
> +
> + pp->cfg = pci_ecam_create(dev, res, bus->res, &pci_generic_ecam_ops);
> + if (IS_ERR(pp->cfg))
> + return PTR_ERR(pp->cfg);
> +
> + pci->dbi_base = pp->cfg->win;
> + pci->dbi_phys_addr = res->start;
Why can't we use the existing 'dbi' region?
> +
> + return 0;
> +}
> +
> int dw_pcie_host_init(struct dw_pcie_rp *pp)
> {
> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> @@ -431,10 +491,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
>
> raw_spin_lock_init(&pp->lock);
>
> - ret = dw_pcie_get_resources(pci);
> - if (ret)
> - return ret;
> -
> res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
> if (!res) {
> dev_err(dev, "Missing \"config\" reg space\n");
> @@ -444,9 +500,28 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> pp->cfg0_size = resource_size(res);
> pp->cfg0_base = res->start;
>
> - pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
> - if (IS_ERR(pp->va_cfg0_base))
> - return PTR_ERR(pp->va_cfg0_base);
> + if (pp->ecam_mode) {
> + ret = dw_pcie_create_ecam_window(pp, res);
> + if (ret)
> + return ret;
> +
> + bridge->ops = (struct pci_ops *)&pci_generic_ecam_ops.pci_ops;
> + pp->bridge->sysdata = pp->cfg;
> + pp->cfg->priv = pp;
> + } else {
> + pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
> + if (IS_ERR(pp->va_cfg0_base))
> + return PTR_ERR(pp->va_cfg0_base);
> +
> + /* Set default bus ops */
> + bridge->ops = &dw_pcie_ops;
> + bridge->child_ops = &dw_child_pcie_ops;
> + bridge->sysdata = pp;
> + }
> +
> + ret = dw_pcie_get_resources(pci);
> + if (ret)
> + return ret;
>
> bridge = devm_pci_alloc_host_bridge(dev, 0);
> if (!bridge)
> @@ -462,14 +537,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> pp->io_base = pci_pio_to_address(win->res->start);
> }
>
> - /* Set default bus ops */
> - bridge->ops = &dw_pcie_ops;
> - bridge->child_ops = &dw_child_pcie_ops;
> -
> if (pp->ops->init) {
> ret = pp->ops->init(pp);
> if (ret)
> - return ret;
> + goto err_free_ecam;
> }
>
> if (pci_msi_enabled()) {
> @@ -504,6 +575,14 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
>
> dw_pcie_iatu_detect(pci);
>
> + if (pp->ecam_mode) {
> + ret = dw_pcie_config_ecam_iatu(pp);
> + if (ret) {
> + dev_err(dev, "Failed to confuure iATU\n");
*configure iATU in ECAM mode
> + goto err_free_msi;
> + }
> + }
> +
> /*
> * Allocate the resource for MSG TLP before programming the iATU
> * outbound window in dw_pcie_setup_rc(). Since the allocation depends
> @@ -539,8 +618,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> /* Ignore errors, the link may come up later */
> dw_pcie_wait_for_link(pci);
>
> - bridge->sysdata = pp;
> -
> ret = pci_host_probe(bridge);
> if (ret)
> goto err_stop_link;
> @@ -564,6 +641,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> if (pp->ops->deinit)
> pp->ops->deinit(pp);
>
> +err_free_ecam:
> + if (pp->cfg)
> + pci_ecam_free(pp->cfg);
> +
> return ret;
> }
> EXPORT_SYMBOL_GPL(dw_pcie_host_init);
> @@ -584,6 +665,9 @@ void dw_pcie_host_deinit(struct dw_pcie_rp *pp)
>
> if (pp->ops->deinit)
> pp->ops->deinit(pp);
> +
> + if (pp->cfg)
> + pci_ecam_free(pp->cfg);
> }
> EXPORT_SYMBOL_GPL(dw_pcie_host_deinit);
>
> @@ -999,3 +1083,24 @@ int dw_pcie_resume_noirq(struct dw_pcie *pci)
> return ret;
> }
> EXPORT_SYMBOL_GPL(dw_pcie_resume_noirq);
> +
> +bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp)
Add Kernel doc since this is an exported API. Clearly mention the requirement
needed (config region size) to support the ECAM mode.
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct platform_device *pdev = to_platform_device(pci->dev);
> + struct resource *config_res, *bus_range;
> + u64 bus_config_space_count;
> +
> + bus_range = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS)->res;
> + if (!bus_range)
> + return false;
> +
> + config_res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
> + if (!config_res)
> + return false;
> +
> + bus_config_space_count = resource_size(config_res) >> PCIE_ECAM_BUS_SHIFT;
> +
> + return !!(bus_config_space_count >= resource_size(bus_range));
> +}
> +EXPORT_SYMBOL_GPL(dw_pcie_ecam_supported);
> diff --git a/drivers/pci/controller/dwc/pcie-designware.c b/drivers/pci/controller/dwc/pcie-designware.c
> index 145e7f579072..523ca7f267fb 100644
> --- a/drivers/pci/controller/dwc/pcie-designware.c
> +++ b/drivers/pci/controller/dwc/pcie-designware.c
> @@ -509,7 +509,7 @@ int dw_pcie_prog_outbound_atu(struct dw_pcie *pci,
> val = dw_pcie_enable_ecrc(val);
> dw_pcie_writel_atu_ob(pci, atu->index, PCIE_ATU_REGION_CTRL1, val);
>
> - val = PCIE_ATU_ENABLE;
> + val = PCIE_ATU_ENABLE | atu->ctrl2;
> if (atu->type == PCIE_ATU_TYPE_MSG) {
> /* The data-less messages only for now */
> val |= PCIE_ATU_INHIBIT_PAYLOAD | atu->code;
> diff --git a/drivers/pci/controller/dwc/pcie-designware.h b/drivers/pci/controller/dwc/pcie-designware.h
> index 501d9ddfea16..d0ba8855ba2a 100644
> --- a/drivers/pci/controller/dwc/pcie-designware.h
> +++ b/drivers/pci/controller/dwc/pcie-designware.h
> @@ -20,6 +20,7 @@
> #include <linux/irq.h>
> #include <linux/msi.h>
> #include <linux/pci.h>
> +#include <linux/pci-ecam.h>
> #include <linux/reset.h>
>
> #include <linux/pci-epc.h>
> @@ -171,6 +172,7 @@
> #define PCIE_ATU_REGION_CTRL2 0x004
> #define PCIE_ATU_ENABLE BIT(31)
> #define PCIE_ATU_BAR_MODE_ENABLE BIT(30)
> +#define PCIE_ATU_CFG_SHIFT_MODE_ENABLE BIT(28)
> #define PCIE_ATU_INHIBIT_PAYLOAD BIT(22)
> #define PCIE_ATU_FUNC_NUM_MATCH_EN BIT(19)
> #define PCIE_ATU_LOWER_BASE 0x008
> @@ -343,6 +345,7 @@ struct dw_pcie_ob_atu_cfg {
> u8 func_no;
> u8 code;
> u8 routing;
> + u32 ctrl2;
> u64 cpu_addr;
> u64 pci_addr;
> u64 size;
> @@ -381,6 +384,8 @@ struct dw_pcie_rp {
> int msg_atu_index;
> struct resource *msg_res;
> bool use_linkup_irq;
> + bool ecam_mode;
> + struct pci_config_window *cfg;
> };
>
> struct dw_pcie_ep_ops {
> @@ -686,6 +691,7 @@ void dw_pcie_host_deinit(struct dw_pcie_rp *pp);
> int dw_pcie_allocate_domains(struct dw_pcie_rp *pp);
> void __iomem *dw_pcie_own_conf_map_bus(struct pci_bus *bus, unsigned int devfn,
> int where);
> +bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp);
> #else
> static inline int dw_pcie_suspend_noirq(struct dw_pcie *pci)
> {
> @@ -726,6 +732,11 @@ static inline void __iomem *dw_pcie_own_conf_map_bus(struct pci_bus *bus,
> {
> return NULL;
> }
> +
> +static inline bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp)
> +{
> + return 0;
return false
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration
2025-02-06 23:28 ` [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
2025-02-10 8:37 ` Manivannan Sadhasivam
@ 2025-03-05 18:14 ` Manivannan Sadhasivam
1 sibling, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-03-05 18:14 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:57AM +0530, Krishna Chaitanya Chundru wrote:
> The current implementation requires iATU for every configuration
> space access which increases latency & cpu utilization.
>
> Designware databook 5.20a, section 3.10.10.3 says about CFG Shift Feature,
> which shifts/maps the BDF (bits [31:16] of the third header DWORD, which
> would be matched against the Base and Limit addresses) of the incoming
> CfgRd0/CfgWr0 down to bits[27:12]of the translated address.
>
> Configuring iATU in config shift feature enables ECAM feature to access the
> config space, which avoids iATU configuration for every config access.
>
> Add "ctrl2" into struct dw_pcie_ob_atu_cfg to enable config shift feature.
>
> As DBI comes under config space, this avoids remapping of DBI space
> separately. Instead, it uses the mapped config space address returned from
> ECAM initialization. Change the order of dw_pcie_get_resources() execution
> to achieve this.
>
> Enable the ECAM feature if the config space size is equal to size required
> to represent number of buses in the bus range property, add a function
> which checks this. The DWC glue drivers uses this function and decide to
> enable ECAM mode or not.
>
> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> ---
> drivers/pci/controller/dwc/Kconfig | 1 +
> drivers/pci/controller/dwc/pcie-designware-host.c | 133 +++++++++++++++++++---
> drivers/pci/controller/dwc/pcie-designware.c | 2 +-
> drivers/pci/controller/dwc/pcie-designware.h | 11 ++
> 4 files changed, 132 insertions(+), 15 deletions(-)
>
> diff --git a/drivers/pci/controller/dwc/Kconfig b/drivers/pci/controller/dwc/Kconfig
> index b6d6778b0698..73c3aed6b60a 100644
> --- a/drivers/pci/controller/dwc/Kconfig
> +++ b/drivers/pci/controller/dwc/Kconfig
> @@ -9,6 +9,7 @@ config PCIE_DW
> config PCIE_DW_HOST
> bool
> select PCIE_DW
> + select PCI_HOST_COMMON
>
> config PCIE_DW_EP
> bool
> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
> index ffaded8f2df7..826ff9338646 100644
> --- a/drivers/pci/controller/dwc/pcie-designware-host.c
> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c
> @@ -418,6 +418,66 @@ static void dw_pcie_host_request_msg_tlp_res(struct dw_pcie_rp *pp)
> }
> }
>
> +static int dw_pcie_config_ecam_iatu(struct dw_pcie_rp *pp)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct dw_pcie_ob_atu_cfg atu = {0};
> + resource_size_t bus_range_max;
> + struct resource_entry *bus;
> + int ret;
> +
> + bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
> +
> + /*
> + * Root bus under the host bridge doesn't require any iATU configuration
> + * as DBI space will represent Root bus configuration space.
'as DBI region will be used to access root bus config space'
> + * Immediate bus under Root Bus, needs type 0 iATU configuration and
> + * remaining buses need type 1 iATU configuration.
> + */
> + atu.index = 0;
> + atu.type = PCIE_ATU_TYPE_CFG0;
> + atu.cpu_addr = pp->cfg0_base + SZ_1M;
> + atu.size = SZ_1M;
> + atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
> + ret = dw_pcie_prog_outbound_atu(pci, &atu);
> + if (ret)
> + return ret;
> +
> + bus_range_max = resource_size(bus->res);
> +
> + if (bus_range_max < 2)
> + return 0;
> +
> + /* Configure remaining buses in type 1 iATU configuration */
> + atu.index = 1;
> + atu.type = PCIE_ATU_TYPE_CFG1;
> + atu.cpu_addr = pp->cfg0_base + SZ_2M;
> + atu.size = (SZ_1M * (bus_range_max - 2));
> + atu.ctrl2 = PCIE_ATU_CFG_SHIFT_MODE_ENABLE;
> +
> + return dw_pcie_prog_outbound_atu(pci, &atu);
> +}
> +
> +static int dw_pcie_create_ecam_window(struct dw_pcie_rp *pp, struct resource *res)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct device *dev = pci->dev;
> + struct resource_entry *bus;
> +
> + bus = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS);
> + if (!bus)
> + return -ENODEV;
> +
> + pp->cfg = pci_ecam_create(dev, res, bus->res, &pci_generic_ecam_ops);
> + if (IS_ERR(pp->cfg))
> + return PTR_ERR(pp->cfg);
> +
> + pci->dbi_base = pp->cfg->win;
> + pci->dbi_phys_addr = res->start;
> +
> + return 0;
> +}
> +
> int dw_pcie_host_init(struct dw_pcie_rp *pp)
> {
> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> @@ -431,10 +491,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
>
> raw_spin_lock_init(&pp->lock);
>
> - ret = dw_pcie_get_resources(pci);
> - if (ret)
> - return ret;
> -
> res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
> if (!res) {
> dev_err(dev, "Missing \"config\" reg space\n");
> @@ -444,9 +500,28 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> pp->cfg0_size = resource_size(res);
> pp->cfg0_base = res->start;
>
> - pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
> - if (IS_ERR(pp->va_cfg0_base))
> - return PTR_ERR(pp->va_cfg0_base);
> + if (pp->ecam_mode) {
> + ret = dw_pcie_create_ecam_window(pp, res);
> + if (ret)
> + return ret;
> +
> + bridge->ops = (struct pci_ops *)&pci_generic_ecam_ops.pci_ops;
> + pp->bridge->sysdata = pp->cfg;
> + pp->cfg->priv = pp;
> + } else {
> + pp->va_cfg0_base = devm_pci_remap_cfg_resource(dev, res);
> + if (IS_ERR(pp->va_cfg0_base))
> + return PTR_ERR(pp->va_cfg0_base);
> +
> + /* Set default bus ops */
> + bridge->ops = &dw_pcie_ops;
> + bridge->child_ops = &dw_child_pcie_ops;
> + bridge->sysdata = pp;
> + }
So you dereference 'bridge' that is allocated only below? It doesn't matter
whether the upcoming commits allocate it earlier or not. This commit alone is
going to cause NULL ptr dereference.
> +
> + ret = dw_pcie_get_resources(pci);
> + if (ret)
> + return ret;
>
> bridge = devm_pci_alloc_host_bridge(dev, 0);
> if (!bridge)
> @@ -462,14 +537,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> pp->io_base = pci_pio_to_address(win->res->start);
> }
>
> - /* Set default bus ops */
> - bridge->ops = &dw_pcie_ops;
> - bridge->child_ops = &dw_child_pcie_ops;
> -
> if (pp->ops->init) {
> ret = pp->ops->init(pp);
> if (ret)
> - return ret;
> + goto err_free_ecam;
> }
>
> if (pci_msi_enabled()) {
> @@ -504,6 +575,14 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
>
> dw_pcie_iatu_detect(pci);
>
> + if (pp->ecam_mode) {
> + ret = dw_pcie_config_ecam_iatu(pp);
> + if (ret) {
> + dev_err(dev, "Failed to confuure iATU\n");
'configure'
> + goto err_free_msi;
> + }
> + }
> +
> /*
> * Allocate the resource for MSG TLP before programming the iATU
> * outbound window in dw_pcie_setup_rc(). Since the allocation depends
> @@ -539,8 +618,6 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> /* Ignore errors, the link may come up later */
> dw_pcie_wait_for_link(pci);
>
> - bridge->sysdata = pp;
> -
> ret = pci_host_probe(bridge);
> if (ret)
> goto err_stop_link;
> @@ -564,6 +641,10 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> if (pp->ops->deinit)
> pp->ops->deinit(pp);
>
> +err_free_ecam:
> + if (pp->cfg)
> + pci_ecam_free(pp->cfg);
> +
> return ret;
> }
> EXPORT_SYMBOL_GPL(dw_pcie_host_init);
> @@ -584,6 +665,9 @@ void dw_pcie_host_deinit(struct dw_pcie_rp *pp)
>
> if (pp->ops->deinit)
> pp->ops->deinit(pp);
> +
> + if (pp->cfg)
> + pci_ecam_free(pp->cfg);
> }
> EXPORT_SYMBOL_GPL(dw_pcie_host_deinit);
>
> @@ -999,3 +1083,24 @@ int dw_pcie_resume_noirq(struct dw_pcie *pci)
> return ret;
> }
> EXPORT_SYMBOL_GPL(dw_pcie_resume_noirq);
> +
> +bool dw_pcie_ecam_supported(struct dw_pcie_rp *pp)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct platform_device *pdev = to_platform_device(pci->dev);
> + struct resource *config_res, *bus_range;
> + u64 bus_config_space_count;
> +
> + bus_range = resource_list_first_type(&pp->bridge->windows, IORESOURCE_BUS)->res;
> + if (!bus_range)
> + return false;
> +
> + config_res = platform_get_resource_byname(pdev, IORESOURCE_MEM, "config");
> + if (!config_res)
> + return false;
> +
> + bus_config_space_count = resource_size(config_res) >> PCIE_ECAM_BUS_SHIFT;
s/bus_config_space_count/nr_buses
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread
* [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver
2025-02-06 23:28 [PATCH v4 0/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
2025-02-06 23:28 ` [PATCH v4 1/4] arm64: dts: qcom: sc7280: Increase config size to 256MB for ECAM feature Krishna Chaitanya Chundru
2025-02-06 23:28 ` [PATCH v4 2/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
@ 2025-02-06 23:28 ` Krishna Chaitanya Chundru
2025-02-10 9:20 ` Manivannan Sadhasivam
2025-03-05 18:18 ` Manivannan Sadhasivam
2025-02-06 23:28 ` [PATCH v4 4/4] PCI: qcom: Enable ECAM feature Krishna Chaitanya Chundru
3 siblings, 2 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-06 23:28 UTC (permalink / raw)
To: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han
Cc: linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy, Krishna Chaitanya Chundru
dw_pcie_ecam_supported() needs to read bus-range to find the maximum
bus range value. The devm_pci_alloc_host_bridge() is already reading
bus range and storing it in host bridge.If devm_pci_alloc_host_bridge()
moved to start of the controller probe, the dt reading can be avoided
and use values stored in the host bridge.
Allow DWC glue drivers to allocate the host bridge, avoiding redundant
device tree reads primarily in dw_pcie_ecam_supported().
Suggested-by: Bjorn Helgaas <bhelgaas@google.com>
Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
---
drivers/pci/controller/dwc/pcie-designware-host.c | 9 +++++++--
1 file changed, 7 insertions(+), 2 deletions(-)
diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
index 826ff9338646..a18cb1e411e4 100644
--- a/drivers/pci/controller/dwc/pcie-designware-host.c
+++ b/drivers/pci/controller/dwc/pcie-designware-host.c
@@ -484,8 +484,8 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
struct device *dev = pci->dev;
struct device_node *np = dev->of_node;
struct platform_device *pdev = to_platform_device(dev);
+ struct pci_host_bridge *bridge = pp->bridge;
struct resource_entry *win;
- struct pci_host_bridge *bridge;
struct resource *res;
int ret;
@@ -527,7 +527,12 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
if (!bridge)
return -ENOMEM;
- pp->bridge = bridge;
+ if (!pp->bridge) {
+ bridge = devm_pci_alloc_host_bridge(dev, 0);
+ if (!bridge)
+ return -ENOMEM;
+ pp->bridge = bridge;
+ }
/* Get the I/O range from DT */
win = resource_list_first_type(&bridge->windows, IORESOURCE_IO);
--
2.34.1
^ permalink raw reply related [flat|nested] 19+ messages in thread* Re: [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver
2025-02-06 23:28 ` [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver Krishna Chaitanya Chundru
@ 2025-02-10 9:20 ` Manivannan Sadhasivam
2025-03-05 18:18 ` Manivannan Sadhasivam
1 sibling, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-02-10 9:20 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:58AM +0530, Krishna Chaitanya Chundru wrote:
> dw_pcie_ecam_supported() needs to read bus-range to find the maximum
> bus range value. The devm_pci_alloc_host_bridge() is already reading
> bus range and storing it in host bridge.If devm_pci_alloc_host_bridge()
What do you mean by 'storig in host bridge' here? Mention the exact structure
name if that's what you are referring to.
> moved to start of the controller probe, the dt reading can be avoided
> and use values stored in the host bridge.
Same here.
>
> Allow DWC glue drivers to allocate the host bridge, avoiding redundant
> device tree reads primarily in dw_pcie_ecam_supported().
>
This makes little sense to me. By this change, you essentially want DWC glue
drivers to call devm_pci_alloc_host_bridge() just to get rid of one range
parsing.
I'd suggest to move dw_pcie_ecam_supported() inside dw_pcie_host_init() and call
after devm_pci_alloc_host_bridge(). This way, the glue drivers can rely on DWC
core to detect ECAM like other resources. And the API could be renamed as
dw_pcie_ecam_detect() to match other resource detection like iATU, eDMA.
More in patch 4.
- Mani
> Suggested-by: Bjorn Helgaas <bhelgaas@google.com>
> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> ---
> drivers/pci/controller/dwc/pcie-designware-host.c | 9 +++++++--
> 1 file changed, 7 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
> index 826ff9338646..a18cb1e411e4 100644
> --- a/drivers/pci/controller/dwc/pcie-designware-host.c
> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c
> @@ -484,8 +484,8 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> struct device *dev = pci->dev;
> struct device_node *np = dev->of_node;
> struct platform_device *pdev = to_platform_device(dev);
> + struct pci_host_bridge *bridge = pp->bridge;
> struct resource_entry *win;
> - struct pci_host_bridge *bridge;
> struct resource *res;
> int ret;
>
> @@ -527,7 +527,12 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> if (!bridge)
> return -ENOMEM;
>
> - pp->bridge = bridge;
> + if (!pp->bridge) {
> + bridge = devm_pci_alloc_host_bridge(dev, 0);
> + if (!bridge)
> + return -ENOMEM;
> + pp->bridge = bridge;
> + }
>
> /* Get the I/O range from DT */
> win = resource_list_first_type(&bridge->windows, IORESOURCE_IO);
>
> --
> 2.34.1
>
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver
2025-02-06 23:28 ` [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver Krishna Chaitanya Chundru
2025-02-10 9:20 ` Manivannan Sadhasivam
@ 2025-03-05 18:18 ` Manivannan Sadhasivam
1 sibling, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-03-05 18:18 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:58AM +0530, Krishna Chaitanya Chundru wrote:
> dw_pcie_ecam_supported() needs to read bus-range to find the maximum
> bus range value. The devm_pci_alloc_host_bridge() is already reading
> bus range and storing it in host bridge.If devm_pci_alloc_host_bridge()
> moved to start of the controller probe, the dt reading can be avoided
> and use values stored in the host bridge.
>
> Allow DWC glue drivers to allocate the host bridge, avoiding redundant
> device tree reads primarily in dw_pcie_ecam_supported().
>
> Suggested-by: Bjorn Helgaas <bhelgaas@google.com>
> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> ---
> drivers/pci/controller/dwc/pcie-designware-host.c | 9 +++++++--
> 1 file changed, 7 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c
> index 826ff9338646..a18cb1e411e4 100644
> --- a/drivers/pci/controller/dwc/pcie-designware-host.c
> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c
> @@ -484,8 +484,8 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> struct device *dev = pci->dev;
> struct device_node *np = dev->of_node;
> struct platform_device *pdev = to_platform_device(dev);
> + struct pci_host_bridge *bridge = pp->bridge;
> struct resource_entry *win;
> - struct pci_host_bridge *bridge;
> struct resource *res;
> int ret;
>
> @@ -527,7 +527,12 @@ int dw_pcie_host_init(struct dw_pcie_rp *pp)
> if (!bridge)
> return -ENOMEM;
>
> - pp->bridge = bridge;
> + if (!pp->bridge) {
'pp->bridge' is getting dereferenced above as I indicated in patch 1.
> + bridge = devm_pci_alloc_host_bridge(dev, 0);
> + if (!bridge)
> + return -ENOMEM;
> + pp->bridge = bridge;
There is already a previous devm_pci_alloc_host_bridge() call before this and
you are just duplicating the code here.
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread
* [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-06 23:28 [PATCH v4 0/4] PCI: dwc: Add ECAM support with iATU configuration Krishna Chaitanya Chundru
` (2 preceding siblings ...)
2025-02-06 23:28 ` [PATCH v4 3/4] PCI: dwc: Reduce DT reads by allocating host bridge via DWC glue driver Krishna Chaitanya Chundru
@ 2025-02-06 23:28 ` Krishna Chaitanya Chundru
2025-02-10 9:22 ` Manivannan Sadhasivam
2025-02-10 22:54 ` Bjorn Helgaas
3 siblings, 2 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-06 23:28 UTC (permalink / raw)
To: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han
Cc: linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy, Krishna Chaitanya Chundru
The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
gives us the offset from which ELBI starts. so use this offset and cfg
win to map these regions instead of doing the ioremap again.
On root bus, we have only the root port. Any access other than that
should not go out of the link and should return all F's. Since the iATU
is configured for the buses which starts after root bus, block the
transactions starting from function 1 of the root bus to the end of
the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
outside the link through ECAM blocker through PARF registers.
Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
---
drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
1 file changed, 73 insertions(+), 4 deletions(-)
diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
index e4d3366ead1f..84297b308e7e 100644
--- a/drivers/pci/controller/dwc/pcie-qcom.c
+++ b/drivers/pci/controller/dwc/pcie-qcom.c
@@ -52,6 +52,7 @@
#define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
#define PARF_Q2A_FLUSH 0x1ac
#define PARF_LTSSM 0x1b0
+#define PARF_SLV_DBI_ELBI 0x1b4
#define PARF_INT_ALL_STATUS 0x224
#define PARF_INT_ALL_CLEAR 0x228
#define PARF_INT_ALL_MASK 0x22c
@@ -61,6 +62,17 @@
#define PARF_DBI_BASE_ADDR_V2_HI 0x354
#define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
#define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
+#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
+#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
+#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
+#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
+#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
+#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
+#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
+#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
+#define PARF_ECAM_BASE 0x380
+#define PARF_ECAM_BASE_HI 0x384
+
#define PARF_NO_SNOOP_OVERIDE 0x3d4
#define PARF_ATU_BASE_ADDR 0x634
#define PARF_ATU_BASE_ADDR_HI 0x638
@@ -84,6 +96,7 @@
/* PARF_SYS_CTRL register fields */
#define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
+#define PCIE_ECAM_BLOCKER_EN BIT(26)
#define MST_WAKEUP_EN BIT(13)
#define SLV_WAKEUP_EN BIT(12)
#define MSTR_ACLK_CGC_DIS BIT(10)
@@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
}
+static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
+{
+ struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
+ struct qcom_pcie *pcie = to_qcom_pcie(pci);
+ u64 addr, addr_end;
+ u32 val;
+
+ /* Set the ECAM base */
+ writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
+ writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
+
+ /*
+ * The only device on root bus is the Root Port. Any access other than that
+ * should not go out of the link and should return all F's. Since the iATU
+ * is configured for the buses which starts after root bus, block the transactions
+ * starting from function 1 of the root bus to the end of the root bus (i.e from
+ * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
+ */
+ addr = pci->dbi_phys_addr + SZ_4K;
+ writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
+ writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
+
+ writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
+ writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
+
+ addr_end = pci->dbi_phys_addr + SZ_1M - 1;
+
+ writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
+ writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
+
+ writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
+ writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
+
+ val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
+ val |= PCIE_ECAM_BLOCKER_EN;
+ writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
+}
+
static int qcom_pcie_start_link(struct dw_pcie *pci)
{
struct qcom_pcie *pcie = to_qcom_pcie(pci);
@@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
qcom_pcie_common_set_16gt_lane_margining(pci);
}
+ if (pci->pp.ecam_mode)
+ qcom_pci_config_ecam(&pci->pp);
+
/* Enable Link Training state machine */
if (pcie->cfg->ops->ltssm_enable)
pcie->cfg->ops->ltssm_enable(pcie);
@@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
{
struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
struct qcom_pcie *pcie = to_qcom_pcie(pci);
+ u16 offset;
int ret;
qcom_ep_reset_assert(pcie);
@@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
if (ret)
return ret;
+ if (pp->ecam_mode) {
+ offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
+ pcie->elbi = pci->dbi_base + offset;
+ }
+
ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
if (ret)
goto err_deinit;
@@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
pci->ops = &dw_pcie_ops;
pp = &pci->pp;
+ pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
+ if (!pp->bridge) {
+ ret = -ENOMEM;
+ goto err_pm_runtime_put;
+ }
+
+ pci->pp.ecam_mode = dw_pcie_ecam_supported(pp);
pcie->pci = pci;
pcie->cfg = pcie_cfg;
@@ -1631,10 +1698,12 @@ static int qcom_pcie_probe(struct platform_device *pdev)
goto err_pm_runtime_put;
}
- pcie->elbi = devm_platform_ioremap_resource_byname(pdev, "elbi");
- if (IS_ERR(pcie->elbi)) {
- ret = PTR_ERR(pcie->elbi);
- goto err_pm_runtime_put;
+ if (!pp->ecam_mode) {
+ pcie->elbi = devm_platform_ioremap_resource_byname(pdev, "elbi");
+ if (IS_ERR(pcie->elbi)) {
+ ret = PTR_ERR(pcie->elbi);
+ goto err_pm_runtime_put;
+ }
}
/* MHI region is optional */
--
2.34.1
^ permalink raw reply related [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-06 23:28 ` [PATCH v4 4/4] PCI: qcom: Enable ECAM feature Krishna Chaitanya Chundru
@ 2025-02-10 9:22 ` Manivannan Sadhasivam
2025-02-10 9:34 ` Krishna Chaitanya Chundru
2025-02-10 22:54 ` Bjorn Helgaas
1 sibling, 1 reply; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-02-10 9:22 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> gives us the offset from which ELBI starts. so use this offset and cfg
> win to map these regions instead of doing the ioremap again.
>
> On root bus, we have only the root port. Any access other than that
> should not go out of the link and should return all F's. Since the iATU
> is configured for the buses which starts after root bus, block the
> transactions starting from function 1 of the root bus to the end of
> the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
> outside the link through ECAM blocker through PARF registers.
>
> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> ---
> drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
> 1 file changed, 73 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
> index e4d3366ead1f..84297b308e7e 100644
> --- a/drivers/pci/controller/dwc/pcie-qcom.c
> +++ b/drivers/pci/controller/dwc/pcie-qcom.c
> @@ -52,6 +52,7 @@
> #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
> #define PARF_Q2A_FLUSH 0x1ac
> #define PARF_LTSSM 0x1b0
> +#define PARF_SLV_DBI_ELBI 0x1b4
> #define PARF_INT_ALL_STATUS 0x224
> #define PARF_INT_ALL_CLEAR 0x228
> #define PARF_INT_ALL_MASK 0x22c
> @@ -61,6 +62,17 @@
> #define PARF_DBI_BASE_ADDR_V2_HI 0x354
> #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
> #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
> +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
> +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
> +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
> +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
> +#define PARF_ECAM_BASE 0x380
> +#define PARF_ECAM_BASE_HI 0x384
> +
> #define PARF_NO_SNOOP_OVERIDE 0x3d4
> #define PARF_ATU_BASE_ADDR 0x634
> #define PARF_ATU_BASE_ADDR_HI 0x638
> @@ -84,6 +96,7 @@
>
> /* PARF_SYS_CTRL register fields */
> #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
> +#define PCIE_ECAM_BLOCKER_EN BIT(26)
> #define MST_WAKEUP_EN BIT(13)
> #define SLV_WAKEUP_EN BIT(12)
> #define MSTR_ACLK_CGC_DIS BIT(10)
> @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
> usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
> }
>
> +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
> +{
> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> + struct qcom_pcie *pcie = to_qcom_pcie(pci);
> + u64 addr, addr_end;
> + u32 val;
> +
> + /* Set the ECAM base */
> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> +
> + /*
> + * The only device on root bus is the Root Port. Any access other than that
> + * should not go out of the link and should return all F's. Since the iATU
> + * is configured for the buses which starts after root bus, block the transactions
> + * starting from function 1 of the root bus to the end of the root bus (i.e from
> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
> + */
> + addr = pci->dbi_phys_addr + SZ_4K;
> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
> +
> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
> +
> + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
> +
> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
> +
> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
> +
> + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
> + val |= PCIE_ECAM_BLOCKER_EN;
> + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
> +}
> +
> static int qcom_pcie_start_link(struct dw_pcie *pci)
> {
> struct qcom_pcie *pcie = to_qcom_pcie(pci);
> @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
> qcom_pcie_common_set_16gt_lane_margining(pci);
> }
>
> + if (pci->pp.ecam_mode)
> + qcom_pci_config_ecam(&pci->pp);
> +
> /* Enable Link Training state machine */
> if (pcie->cfg->ops->ltssm_enable)
> pcie->cfg->ops->ltssm_enable(pcie);
> @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> {
> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> struct qcom_pcie *pcie = to_qcom_pcie(pci);
> + u16 offset;
> int ret;
>
> qcom_ep_reset_assert(pcie);
> @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> if (ret)
> return ret;
>
> + if (pp->ecam_mode) {
> + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
> + pcie->elbi = pci->dbi_base + offset;
> + }
If you use the existing 'elbi' register offset defined in DT, you can just rely
on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
patch 3.
> +
> ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
> if (ret)
> goto err_deinit;
> @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
> pci->ops = &dw_pcie_ops;
> pp = &pci->pp;
>
> + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
> + if (!pp->bridge) {
> + ret = -ENOMEM;
> + goto err_pm_runtime_put;
> + }
> +
This will also go away.
> + pci->pp.ecam_mode = dw_pcie_ecam_supported(pp);
This too.
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 9:22 ` Manivannan Sadhasivam
@ 2025-02-10 9:34 ` Krishna Chaitanya Chundru
2025-02-10 9:47 ` Manivannan Sadhasivam
0 siblings, 1 reply; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-10 9:34 UTC (permalink / raw)
To: Manivannan Sadhasivam
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
> On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
>> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
>> gives us the offset from which ELBI starts. so use this offset and cfg
>> win to map these regions instead of doing the ioremap again.
>>
>> On root bus, we have only the root port. Any access other than that
>> should not go out of the link and should return all F's. Since the iATU
>> is configured for the buses which starts after root bus, block the
>> transactions starting from function 1 of the root bus to the end of
>> the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
>> outside the link through ECAM blocker through PARF registers.
>>
>> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
>> ---
>> drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
>> 1 file changed, 73 insertions(+), 4 deletions(-)
>>
>> diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
>> index e4d3366ead1f..84297b308e7e 100644
>> --- a/drivers/pci/controller/dwc/pcie-qcom.c
>> +++ b/drivers/pci/controller/dwc/pcie-qcom.c
>> @@ -52,6 +52,7 @@
>> #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
>> #define PARF_Q2A_FLUSH 0x1ac
>> #define PARF_LTSSM 0x1b0
>> +#define PARF_SLV_DBI_ELBI 0x1b4
>> #define PARF_INT_ALL_STATUS 0x224
>> #define PARF_INT_ALL_CLEAR 0x228
>> #define PARF_INT_ALL_MASK 0x22c
>> @@ -61,6 +62,17 @@
>> #define PARF_DBI_BASE_ADDR_V2_HI 0x354
>> #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
>> #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
>> +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
>> +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
>> +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
>> +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
>> +#define PARF_ECAM_BASE 0x380
>> +#define PARF_ECAM_BASE_HI 0x384
>> +
>> #define PARF_NO_SNOOP_OVERIDE 0x3d4
>> #define PARF_ATU_BASE_ADDR 0x634
>> #define PARF_ATU_BASE_ADDR_HI 0x638
>> @@ -84,6 +96,7 @@
>>
>> /* PARF_SYS_CTRL register fields */
>> #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
>> +#define PCIE_ECAM_BLOCKER_EN BIT(26)
>> #define MST_WAKEUP_EN BIT(13)
>> #define SLV_WAKEUP_EN BIT(12)
>> #define MSTR_ACLK_CGC_DIS BIT(10)
>> @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
>> usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
>> }
>>
>> +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
>> +{
>> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>> + struct qcom_pcie *pcie = to_qcom_pcie(pci);
>> + u64 addr, addr_end;
>> + u32 val;
>> +
>> + /* Set the ECAM base */
>> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
>> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
>> +
>> + /*
>> + * The only device on root bus is the Root Port. Any access other than that
>> + * should not go out of the link and should return all F's. Since the iATU
>> + * is configured for the buses which starts after root bus, block the transactions
>> + * starting from function 1 of the root bus to the end of the root bus (i.e from
>> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
>> + */
>> + addr = pci->dbi_phys_addr + SZ_4K;
>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
>> +
>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
>> +
>> + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
>> +
>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
>> +
>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
>> +
>> + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
>> + val |= PCIE_ECAM_BLOCKER_EN;
>> + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
>> +}
>> +
>> static int qcom_pcie_start_link(struct dw_pcie *pci)
>> {
>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>> @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
>> qcom_pcie_common_set_16gt_lane_margining(pci);
>> }
>>
>> + if (pci->pp.ecam_mode)
>> + qcom_pci_config_ecam(&pci->pp);
>> +
>> /* Enable Link Training state machine */
>> if (pcie->cfg->ops->ltssm_enable)
>> pcie->cfg->ops->ltssm_enable(pcie);
>> @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>> {
>> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>> + u16 offset;
>> int ret;
>>
>> qcom_ep_reset_assert(pcie);
>> @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>> if (ret)
>> return ret;
>>
>> + if (pp->ecam_mode) {
>> + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
>> + pcie->elbi = pci->dbi_base + offset;
>> + }
>
> If you use the existing 'elbi' register offset defined in DT, you can just rely
> on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
> patch 3.
> >> +
>> ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
>> if (ret)
>> goto err_deinit;
>> @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
>> pci->ops = &dw_pcie_ops;
>> pp = &pci->pp;
>>
>> + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
>> + if (!pp->bridge) {
>> + ret = -ENOMEM;
>> + goto err_pm_runtime_put;
>> + }
>> +
>
> This will also go away.
>
Hi Mani,
I get your point but the problem is in ECAM mode the DBI address to
maximum of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
this ioremap of elbi ioremap in pci_ecam_create because we already
iormaped elbi which falls in dbi address to 256 MB region( as we can't
remap same region twice). so we need to skip doing ioremap for elbi
region.
To avoid this issue we are doing this way to figure out ecam can be
enabled or not before doing ioremap of elbi region.
- Krishna Chaitanya.
>> + pci->pp.ecam_mode = dw_pcie_ecam_supported(pp);
>
> This too.
>
> - Mani
>
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 9:34 ` Krishna Chaitanya Chundru
@ 2025-02-10 9:47 ` Manivannan Sadhasivam
2025-02-10 9:53 ` Krishna Chaitanya Chundru
0 siblings, 1 reply; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-02-10 9:47 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
>
>
> On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
> > On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> > > The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> > > gives us the offset from which ELBI starts. so use this offset and cfg
> > > win to map these regions instead of doing the ioremap again.
> > >
> > > On root bus, we have only the root port. Any access other than that
> > > should not go out of the link and should return all F's. Since the iATU
> > > is configured for the buses which starts after root bus, block the
> > > transactions starting from function 1 of the root bus to the end of
> > > the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
> > > outside the link through ECAM blocker through PARF registers.
> > >
> > > Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> > > ---
> > > drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
> > > 1 file changed, 73 insertions(+), 4 deletions(-)
> > >
> > > diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
> > > index e4d3366ead1f..84297b308e7e 100644
> > > --- a/drivers/pci/controller/dwc/pcie-qcom.c
> > > +++ b/drivers/pci/controller/dwc/pcie-qcom.c
> > > @@ -52,6 +52,7 @@
> > > #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
> > > #define PARF_Q2A_FLUSH 0x1ac
> > > #define PARF_LTSSM 0x1b0
> > > +#define PARF_SLV_DBI_ELBI 0x1b4
> > > #define PARF_INT_ALL_STATUS 0x224
> > > #define PARF_INT_ALL_CLEAR 0x228
> > > #define PARF_INT_ALL_MASK 0x22c
> > > @@ -61,6 +62,17 @@
> > > #define PARF_DBI_BASE_ADDR_V2_HI 0x354
> > > #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
> > > #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
> > > +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
> > > +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
> > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
> > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
> > > +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
> > > +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
> > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
> > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
> > > +#define PARF_ECAM_BASE 0x380
> > > +#define PARF_ECAM_BASE_HI 0x384
> > > +
> > > #define PARF_NO_SNOOP_OVERIDE 0x3d4
> > > #define PARF_ATU_BASE_ADDR 0x634
> > > #define PARF_ATU_BASE_ADDR_HI 0x638
> > > @@ -84,6 +96,7 @@
> > > /* PARF_SYS_CTRL register fields */
> > > #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
> > > +#define PCIE_ECAM_BLOCKER_EN BIT(26)
> > > #define MST_WAKEUP_EN BIT(13)
> > > #define SLV_WAKEUP_EN BIT(12)
> > > #define MSTR_ACLK_CGC_DIS BIT(10)
> > > @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
> > > usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
> > > }
> > > +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
> > > +{
> > > + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > + struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > + u64 addr, addr_end;
> > > + u32 val;
> > > +
> > > + /* Set the ECAM base */
> > > + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> > > + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> > > +
> > > + /*
> > > + * The only device on root bus is the Root Port. Any access other than that
> > > + * should not go out of the link and should return all F's. Since the iATU
> > > + * is configured for the buses which starts after root bus, block the transactions
> > > + * starting from function 1 of the root bus to the end of the root bus (i.e from
> > > + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
> > > + */
> > > + addr = pci->dbi_phys_addr + SZ_4K;
> > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
> > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
> > > +
> > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
> > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
> > > +
> > > + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
> > > +
> > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
> > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
> > > +
> > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
> > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
> > > +
> > > + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
> > > + val |= PCIE_ECAM_BLOCKER_EN;
> > > + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
> > > +}
> > > +
> > > static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > {
> > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > qcom_pcie_common_set_16gt_lane_margining(pci);
> > > }
> > > + if (pci->pp.ecam_mode)
> > > + qcom_pci_config_ecam(&pci->pp);
> > > +
> > > /* Enable Link Training state machine */
> > > if (pcie->cfg->ops->ltssm_enable)
> > > pcie->cfg->ops->ltssm_enable(pcie);
> > > @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > {
> > > struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > + u16 offset;
> > > int ret;
> > > qcom_ep_reset_assert(pcie);
> > > @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > if (ret)
> > > return ret;
> > > + if (pp->ecam_mode) {
> > > + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
> > > + pcie->elbi = pci->dbi_base + offset;
> > > + }
> >
> > If you use the existing 'elbi' register offset defined in DT, you can just rely
> > on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
> > patch 3.
> > >> +
> > > ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
> > > if (ret)
> > > goto err_deinit;
> > > @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
> > > pci->ops = &dw_pcie_ops;
> > > pp = &pci->pp;
> > > + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
> > > + if (!pp->bridge) {
> > > + ret = -ENOMEM;
> > > + goto err_pm_runtime_put;
> > > + }
> > > +
> >
> > This will also go away.
> >
> Hi Mani,
>
> I get your point but the problem is in ECAM mode the DBI address to maximum
> of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
> this ioremap of elbi ioremap in pci_ecam_create because we already
> iormaped elbi which falls in dbi address to 256 MB region( as we can't
> remap same region twice). so we need to skip doing ioremap for elbi
> region.
>
Then obviously, your DT entries are wrong. You cannot define overlapping regions
on purpose. Can't you leave the ELBI region and start the config region?
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 9:47 ` Manivannan Sadhasivam
@ 2025-02-10 9:53 ` Krishna Chaitanya Chundru
2025-02-10 10:08 ` Manivannan Sadhasivam
2025-03-05 18:26 ` Manivannan Sadhasivam
0 siblings, 2 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-10 9:53 UTC (permalink / raw)
To: Manivannan Sadhasivam, Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On 2/10/2025 3:17 PM, Manivannan Sadhasivam wrote:
> On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
>>
>>
>> On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
>>> On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
>>>> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
>>>> gives us the offset from which ELBI starts. so use this offset and cfg
>>>> win to map these regions instead of doing the ioremap again.
>>>>
>>>> On root bus, we have only the root port. Any access other than that
>>>> should not go out of the link and should return all F's. Since the iATU
>>>> is configured for the buses which starts after root bus, block the
>>>> transactions starting from function 1 of the root bus to the end of
>>>> the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
>>>> outside the link through ECAM blocker through PARF registers.
>>>>
>>>> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
>>>> ---
>>>> drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
>>>> 1 file changed, 73 insertions(+), 4 deletions(-)
>>>>
>>>> diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
>>>> index e4d3366ead1f..84297b308e7e 100644
>>>> --- a/drivers/pci/controller/dwc/pcie-qcom.c
>>>> +++ b/drivers/pci/controller/dwc/pcie-qcom.c
>>>> @@ -52,6 +52,7 @@
>>>> #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
>>>> #define PARF_Q2A_FLUSH 0x1ac
>>>> #define PARF_LTSSM 0x1b0
>>>> +#define PARF_SLV_DBI_ELBI 0x1b4
>>>> #define PARF_INT_ALL_STATUS 0x224
>>>> #define PARF_INT_ALL_CLEAR 0x228
>>>> #define PARF_INT_ALL_MASK 0x22c
>>>> @@ -61,6 +62,17 @@
>>>> #define PARF_DBI_BASE_ADDR_V2_HI 0x354
>>>> #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
>>>> #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
>>>> +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
>>>> +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
>>>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
>>>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
>>>> +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
>>>> +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
>>>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
>>>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
>>>> +#define PARF_ECAM_BASE 0x380
>>>> +#define PARF_ECAM_BASE_HI 0x384
>>>> +
>>>> #define PARF_NO_SNOOP_OVERIDE 0x3d4
>>>> #define PARF_ATU_BASE_ADDR 0x634
>>>> #define PARF_ATU_BASE_ADDR_HI 0x638
>>>> @@ -84,6 +96,7 @@
>>>> /* PARF_SYS_CTRL register fields */
>>>> #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
>>>> +#define PCIE_ECAM_BLOCKER_EN BIT(26)
>>>> #define MST_WAKEUP_EN BIT(13)
>>>> #define SLV_WAKEUP_EN BIT(12)
>>>> #define MSTR_ACLK_CGC_DIS BIT(10)
>>>> @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
>>>> usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
>>>> }
>>>> +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
>>>> +{
>>>> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>>>> + struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>> + u64 addr, addr_end;
>>>> + u32 val;
>>>> +
>>>> + /* Set the ECAM base */
>>>> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
>>>> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
>>>> +
>>>> + /*
>>>> + * The only device on root bus is the Root Port. Any access other than that
>>>> + * should not go out of the link and should return all F's. Since the iATU
>>>> + * is configured for the buses which starts after root bus, block the transactions
>>>> + * starting from function 1 of the root bus to the end of the root bus (i.e from
>>>> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
>>>> + */
>>>> + addr = pci->dbi_phys_addr + SZ_4K;
>>>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
>>>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
>>>> +
>>>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
>>>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
>>>> +
>>>> + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
>>>> +
>>>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
>>>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
>>>> +
>>>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
>>>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
>>>> +
>>>> + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
>>>> + val |= PCIE_ECAM_BLOCKER_EN;
>>>> + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
>>>> +}
>>>> +
>>>> static int qcom_pcie_start_link(struct dw_pcie *pci)
>>>> {
>>>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>> @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
>>>> qcom_pcie_common_set_16gt_lane_margining(pci);
>>>> }
>>>> + if (pci->pp.ecam_mode)
>>>> + qcom_pci_config_ecam(&pci->pp);
>>>> +
>>>> /* Enable Link Training state machine */
>>>> if (pcie->cfg->ops->ltssm_enable)
>>>> pcie->cfg->ops->ltssm_enable(pcie);
>>>> @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>>>> {
>>>> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>>>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>> + u16 offset;
>>>> int ret;
>>>> qcom_ep_reset_assert(pcie);
>>>> @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>>>> if (ret)
>>>> return ret;
>>>> + if (pp->ecam_mode) {
>>>> + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
>>>> + pcie->elbi = pci->dbi_base + offset;
>>>> + }
>>>
>>> If you use the existing 'elbi' register offset defined in DT, you can just rely
>>> on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
>>> patch 3.
>>> >> +
>>>> ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
>>>> if (ret)
>>>> goto err_deinit;
>>>> @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
>>>> pci->ops = &dw_pcie_ops;
>>>> pp = &pci->pp;
>>>> + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
>>>> + if (!pp->bridge) {
>>>> + ret = -ENOMEM;
>>>> + goto err_pm_runtime_put;
>>>> + }
>>>> +
>>>
>>> This will also go away.
>>>
>> Hi Mani,
>>
>> I get your point but the problem is in ECAM mode the DBI address to maximum
>> of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
>> this ioremap of elbi ioremap in pci_ecam_create because we already
>> iormaped elbi which falls in dbi address to 256 MB region( as we can't
>> remap same region twice). so we need to skip doing ioremap for elbi
>> region.
>>
>
> Then obviously, your DT entries are wrong. You cannot define overlapping regions
> on purpose. Can't you leave the ELBI region and start the config region?
>
> - Mani
ELBI is part of DBI space(present in the first 4kb of the dbi) we can't
relocate ELBI region to different location.
can we keep this elbi region as optional and remove elbi from the
devicetree and binding?
- Krishna Chaitanya.
>
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 9:53 ` Krishna Chaitanya Chundru
@ 2025-02-10 10:08 ` Manivannan Sadhasivam
2025-03-05 18:26 ` Manivannan Sadhasivam
1 sibling, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-02-10 10:08 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: Krishna Chaitanya Chundru, cros-qcom-dts-watchers,
Bjorn Andersson, Konrad Dybcio, Rob Herring, Krzysztof Kozlowski,
Conor Dooley, Lorenzo Pieralisi, Krzysztof Wilczyński,
Bjorn Helgaas, Jingoo Han, linux-arm-msm, devicetree,
linux-kernel, linux-pci, quic_vbadigan, quic_mrana, quic_vpernami,
mmareddy
On Mon, Feb 10, 2025 at 03:23:43PM +0530, Krishna Chaitanya Chundru wrote:
>
>
> On 2/10/2025 3:17 PM, Manivannan Sadhasivam wrote:
> > On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
> > >
> > >
> > > On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
> > > > On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> > > > > The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> > > > > gives us the offset from which ELBI starts. so use this offset and cfg
> > > > > win to map these regions instead of doing the ioremap again.
> > > > >
> > > > > On root bus, we have only the root port. Any access other than that
> > > > > should not go out of the link and should return all F's. Since the iATU
> > > > > is configured for the buses which starts after root bus, block the
> > > > > transactions starting from function 1 of the root bus to the end of
> > > > > the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
> > > > > outside the link through ECAM blocker through PARF registers.
> > > > >
> > > > > Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> > > > > ---
> > > > > drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
> > > > > 1 file changed, 73 insertions(+), 4 deletions(-)
> > > > >
> > > > > diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > index e4d3366ead1f..84297b308e7e 100644
> > > > > --- a/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > +++ b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > @@ -52,6 +52,7 @@
> > > > > #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
> > > > > #define PARF_Q2A_FLUSH 0x1ac
> > > > > #define PARF_LTSSM 0x1b0
> > > > > +#define PARF_SLV_DBI_ELBI 0x1b4
> > > > > #define PARF_INT_ALL_STATUS 0x224
> > > > > #define PARF_INT_ALL_CLEAR 0x228
> > > > > #define PARF_INT_ALL_MASK 0x22c
> > > > > @@ -61,6 +62,17 @@
> > > > > #define PARF_DBI_BASE_ADDR_V2_HI 0x354
> > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
> > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
> > > > > +#define PARF_ECAM_BASE 0x380
> > > > > +#define PARF_ECAM_BASE_HI 0x384
> > > > > +
> > > > > #define PARF_NO_SNOOP_OVERIDE 0x3d4
> > > > > #define PARF_ATU_BASE_ADDR 0x634
> > > > > #define PARF_ATU_BASE_ADDR_HI 0x638
> > > > > @@ -84,6 +96,7 @@
> > > > > /* PARF_SYS_CTRL register fields */
> > > > > #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
> > > > > +#define PCIE_ECAM_BLOCKER_EN BIT(26)
> > > > > #define MST_WAKEUP_EN BIT(13)
> > > > > #define SLV_WAKEUP_EN BIT(12)
> > > > > #define MSTR_ACLK_CGC_DIS BIT(10)
> > > > > @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
> > > > > usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
> > > > > }
> > > > > +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
> > > > > +{
> > > > > + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > + struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > + u64 addr, addr_end;
> > > > > + u32 val;
> > > > > +
> > > > > + /* Set the ECAM base */
> > > > > + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> > > > > + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> > > > > +
> > > > > + /*
> > > > > + * The only device on root bus is the Root Port. Any access other than that
> > > > > + * should not go out of the link and should return all F's. Since the iATU
> > > > > + * is configured for the buses which starts after root bus, block the transactions
> > > > > + * starting from function 1 of the root bus to the end of the root bus (i.e from
> > > > > + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
> > > > > + */
> > > > > + addr = pci->dbi_phys_addr + SZ_4K;
> > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
> > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
> > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
> > > > > +
> > > > > + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
> > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
> > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
> > > > > +
> > > > > + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
> > > > > + val |= PCIE_ECAM_BLOCKER_EN;
> > > > > + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
> > > > > +}
> > > > > +
> > > > > static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > {
> > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > qcom_pcie_common_set_16gt_lane_margining(pci);
> > > > > }
> > > > > + if (pci->pp.ecam_mode)
> > > > > + qcom_pci_config_ecam(&pci->pp);
> > > > > +
> > > > > /* Enable Link Training state machine */
> > > > > if (pcie->cfg->ops->ltssm_enable)
> > > > > pcie->cfg->ops->ltssm_enable(pcie);
> > > > > @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > {
> > > > > struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > + u16 offset;
> > > > > int ret;
> > > > > qcom_ep_reset_assert(pcie);
> > > > > @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > if (ret)
> > > > > return ret;
> > > > > + if (pp->ecam_mode) {
> > > > > + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
> > > > > + pcie->elbi = pci->dbi_base + offset;
> > > > > + }
> > > >
> > > > If you use the existing 'elbi' register offset defined in DT, you can just rely
> > > > on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
> > > > patch 3.
> > > > >> +
> > > > > ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
> > > > > if (ret)
> > > > > goto err_deinit;
> > > > > @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
> > > > > pci->ops = &dw_pcie_ops;
> > > > > pp = &pci->pp;
> > > > > + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
> > > > > + if (!pp->bridge) {
> > > > > + ret = -ENOMEM;
> > > > > + goto err_pm_runtime_put;
> > > > > + }
> > > > > +
> > > >
> > > > This will also go away.
> > > >
> > > Hi Mani,
> > >
> > > I get your point but the problem is in ECAM mode the DBI address to maximum
> > > of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
> > > this ioremap of elbi ioremap in pci_ecam_create because we already
> > > iormaped elbi which falls in dbi address to 256 MB region( as we can't
> > > remap same region twice). so we need to skip doing ioremap for elbi
> > > region.
> > >
> >
> > Then obviously, your DT entries are wrong. You cannot define overlapping regions
> > on purpose. Can't you leave the ELBI region and start the config region?
> >
> > - Mani
> ELBI is part of DBI space(present in the first 4kb of the dbi) we can't
> relocate ELBI region to different location.
I'm not asking you to relocate ELBI. I'm asking you if you can skip the first
4KiB of the config region to not overlap with DBI and ELBI. Either extend the
config region or just map 255 MiB since you can use DBI for accessing root port.
Like this,
+ <4 0x00000000 0 0xf1d>,
+ <4 0x00000f20 0 0xa8>,
+ <4 0x10000000 0 0x1000>,
+ <4 0x00100000 0 0xFF00000>; /* First 1MiB is part of DBI space */
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 9:53 ` Krishna Chaitanya Chundru
2025-02-10 10:08 ` Manivannan Sadhasivam
@ 2025-03-05 18:26 ` Manivannan Sadhasivam
2025-03-07 1:14 ` Krishna Chaitanya Chundru
1 sibling, 1 reply; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-03-05 18:26 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: Krishna Chaitanya Chundru, cros-qcom-dts-watchers,
Bjorn Andersson, Konrad Dybcio, Rob Herring, Krzysztof Kozlowski,
Conor Dooley, Lorenzo Pieralisi, Krzysztof Wilczyński,
Bjorn Helgaas, Jingoo Han, linux-arm-msm, devicetree,
linux-kernel, linux-pci, quic_vbadigan, quic_mrana, quic_vpernami,
mmareddy
On Mon, Feb 10, 2025 at 03:23:43PM +0530, Krishna Chaitanya Chundru wrote:
>
>
> On 2/10/2025 3:17 PM, Manivannan Sadhasivam wrote:
> > On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
> > >
> > >
> > > On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
> > > > On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> > > > > The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> > > > > gives us the offset from which ELBI starts. so use this offset and cfg
> > > > > win to map these regions instead of doing the ioremap again.
> > > > >
> > > > > On root bus, we have only the root port. Any access other than that
> > > > > should not go out of the link and should return all F's. Since the iATU
> > > > > is configured for the buses which starts after root bus, block the
> > > > > transactions starting from function 1 of the root bus to the end of
> > > > > the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
> > > > > outside the link through ECAM blocker through PARF registers.
> > > > >
> > > > > Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> > > > > ---
> > > > > drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
> > > > > 1 file changed, 73 insertions(+), 4 deletions(-)
> > > > >
> > > > > diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > index e4d3366ead1f..84297b308e7e 100644
> > > > > --- a/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > +++ b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > @@ -52,6 +52,7 @@
> > > > > #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
> > > > > #define PARF_Q2A_FLUSH 0x1ac
> > > > > #define PARF_LTSSM 0x1b0
> > > > > +#define PARF_SLV_DBI_ELBI 0x1b4
> > > > > #define PARF_INT_ALL_STATUS 0x224
> > > > > #define PARF_INT_ALL_CLEAR 0x228
> > > > > #define PARF_INT_ALL_MASK 0x22c
> > > > > @@ -61,6 +62,17 @@
> > > > > #define PARF_DBI_BASE_ADDR_V2_HI 0x354
> > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
> > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
> > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
> > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
> > > > > +#define PARF_ECAM_BASE 0x380
> > > > > +#define PARF_ECAM_BASE_HI 0x384
> > > > > +
> > > > > #define PARF_NO_SNOOP_OVERIDE 0x3d4
> > > > > #define PARF_ATU_BASE_ADDR 0x634
> > > > > #define PARF_ATU_BASE_ADDR_HI 0x638
> > > > > @@ -84,6 +96,7 @@
> > > > > /* PARF_SYS_CTRL register fields */
> > > > > #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
> > > > > +#define PCIE_ECAM_BLOCKER_EN BIT(26)
> > > > > #define MST_WAKEUP_EN BIT(13)
> > > > > #define SLV_WAKEUP_EN BIT(12)
> > > > > #define MSTR_ACLK_CGC_DIS BIT(10)
> > > > > @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
> > > > > usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
> > > > > }
> > > > > +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
> > > > > +{
> > > > > + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > + struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > + u64 addr, addr_end;
> > > > > + u32 val;
> > > > > +
> > > > > + /* Set the ECAM base */
> > > > > + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> > > > > + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> > > > > +
> > > > > + /*
> > > > > + * The only device on root bus is the Root Port. Any access other than that
> > > > > + * should not go out of the link and should return all F's. Since the iATU
> > > > > + * is configured for the buses which starts after root bus, block the transactions
> > > > > + * starting from function 1 of the root bus to the end of the root bus (i.e from
> > > > > + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
> > > > > + */
> > > > > + addr = pci->dbi_phys_addr + SZ_4K;
> > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
> > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
> > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
> > > > > +
> > > > > + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
> > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
> > > > > +
> > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
> > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
> > > > > +
> > > > > + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
> > > > > + val |= PCIE_ECAM_BLOCKER_EN;
> > > > > + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
> > > > > +}
> > > > > +
> > > > > static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > {
> > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > qcom_pcie_common_set_16gt_lane_margining(pci);
> > > > > }
> > > > > + if (pci->pp.ecam_mode)
> > > > > + qcom_pci_config_ecam(&pci->pp);
> > > > > +
> > > > > /* Enable Link Training state machine */
> > > > > if (pcie->cfg->ops->ltssm_enable)
> > > > > pcie->cfg->ops->ltssm_enable(pcie);
> > > > > @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > {
> > > > > struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > + u16 offset;
> > > > > int ret;
> > > > > qcom_ep_reset_assert(pcie);
> > > > > @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > if (ret)
> > > > > return ret;
> > > > > + if (pp->ecam_mode) {
> > > > > + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
> > > > > + pcie->elbi = pci->dbi_base + offset;
> > > > > + }
> > > >
> > > > If you use the existing 'elbi' register offset defined in DT, you can just rely
> > > > on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
> > > > patch 3.
> > > > >> +
> > > > > ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
> > > > > if (ret)
> > > > > goto err_deinit;
> > > > > @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
> > > > > pci->ops = &dw_pcie_ops;
> > > > > pp = &pci->pp;
> > > > > + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
> > > > > + if (!pp->bridge) {
> > > > > + ret = -ENOMEM;
> > > > > + goto err_pm_runtime_put;
> > > > > + }
> > > > > +
> > > >
> > > > This will also go away.
> > > >
> > > Hi Mani,
> > >
> > > I get your point but the problem is in ECAM mode the DBI address to maximum
> > > of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
> > > this ioremap of elbi ioremap in pci_ecam_create because we already
> > > iormaped elbi which falls in dbi address to 256 MB region( as we can't
> > > remap same region twice). so we need to skip doing ioremap for elbi
> > > region.
> > >
> >
> > Then obviously, your DT entries are wrong. You cannot define overlapping regions
> > on purpose. Can't you leave the ELBI region and start the config region?
> >
> > - Mani
> ELBI is part of DBI space(present in the first 4kb of the dbi) we can't
> relocate ELBI region to different location.
> can we keep this elbi region as optional and remove elbi from the
> devicetree and binding?
>
Since ELBI is a DWC generic region, you should move the resource get call to
dw_pcie_get_resources(). Also, it is an optional region, so you should open code
devm_platform_ioremap_resource_byname() to skip devm_ioremap_resource() if
platform_get_resource_byname() returns NULL. DT binding should make sure that
the DTS has the region specified if required.
And this should allow you to move the dw_pcie_ecam_supported() call inside DWC
core.
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-03-05 18:26 ` Manivannan Sadhasivam
@ 2025-03-07 1:14 ` Krishna Chaitanya Chundru
2025-03-07 17:40 ` Manivannan Sadhasivam
0 siblings, 1 reply; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-03-07 1:14 UTC (permalink / raw)
To: Manivannan Sadhasivam, Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Bjorn Helgaas, Jingoo Han,
linux-arm-msm, devicetree, linux-kernel, linux-pci, quic_vbadigan,
quic_mrana, quic_vpernami, mmareddy
On 3/5/2025 11:56 PM, Manivannan Sadhasivam wrote:
> On Mon, Feb 10, 2025 at 03:23:43PM +0530, Krishna Chaitanya Chundru wrote:
>>
>>
>> On 2/10/2025 3:17 PM, Manivannan Sadhasivam wrote:
>>> On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
>>>>
>>>>
>>>> On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
>>>>> On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
>>>>>> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
>>>>>> gives us the offset from which ELBI starts. so use this offset and cfg
>>>>>> win to map these regions instead of doing the ioremap again.
>>>>>>
>>>>>> On root bus, we have only the root port. Any access other than that
>>>>>> should not go out of the link and should return all F's. Since the iATU
>>>>>> is configured for the buses which starts after root bus, block the
>>>>>> transactions starting from function 1 of the root bus to the end of
>>>>>> the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
>>>>>> outside the link through ECAM blocker through PARF registers.
>>>>>>
>>>>>> Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
>>>>>> ---
>>>>>> drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
>>>>>> 1 file changed, 73 insertions(+), 4 deletions(-)
>>>>>>
>>>>>> diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
>>>>>> index e4d3366ead1f..84297b308e7e 100644
>>>>>> --- a/drivers/pci/controller/dwc/pcie-qcom.c
>>>>>> +++ b/drivers/pci/controller/dwc/pcie-qcom.c
>>>>>> @@ -52,6 +52,7 @@
>>>>>> #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
>>>>>> #define PARF_Q2A_FLUSH 0x1ac
>>>>>> #define PARF_LTSSM 0x1b0
>>>>>> +#define PARF_SLV_DBI_ELBI 0x1b4
>>>>>> #define PARF_INT_ALL_STATUS 0x224
>>>>>> #define PARF_INT_ALL_CLEAR 0x228
>>>>>> #define PARF_INT_ALL_MASK 0x22c
>>>>>> @@ -61,6 +62,17 @@
>>>>>> #define PARF_DBI_BASE_ADDR_V2_HI 0x354
>>>>>> #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
>>>>>> #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
>>>>>> +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
>>>>>> +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
>>>>>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
>>>>>> +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
>>>>>> +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
>>>>>> +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
>>>>>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
>>>>>> +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
>>>>>> +#define PARF_ECAM_BASE 0x380
>>>>>> +#define PARF_ECAM_BASE_HI 0x384
>>>>>> +
>>>>>> #define PARF_NO_SNOOP_OVERIDE 0x3d4
>>>>>> #define PARF_ATU_BASE_ADDR 0x634
>>>>>> #define PARF_ATU_BASE_ADDR_HI 0x638
>>>>>> @@ -84,6 +96,7 @@
>>>>>> /* PARF_SYS_CTRL register fields */
>>>>>> #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
>>>>>> +#define PCIE_ECAM_BLOCKER_EN BIT(26)
>>>>>> #define MST_WAKEUP_EN BIT(13)
>>>>>> #define SLV_WAKEUP_EN BIT(12)
>>>>>> #define MSTR_ACLK_CGC_DIS BIT(10)
>>>>>> @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
>>>>>> usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
>>>>>> }
>>>>>> +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
>>>>>> +{
>>>>>> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>>>>>> + struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>>>> + u64 addr, addr_end;
>>>>>> + u32 val;
>>>>>> +
>>>>>> + /* Set the ECAM base */
>>>>>> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
>>>>>> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
>>>>>> +
>>>>>> + /*
>>>>>> + * The only device on root bus is the Root Port. Any access other than that
>>>>>> + * should not go out of the link and should return all F's. Since the iATU
>>>>>> + * is configured for the buses which starts after root bus, block the transactions
>>>>>> + * starting from function 1 of the root bus to the end of the root bus (i.e from
>>>>>> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
>>>>>> + */
>>>>>> + addr = pci->dbi_phys_addr + SZ_4K;
>>>>>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
>>>>>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
>>>>>> +
>>>>>> + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
>>>>>> + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
>>>>>> +
>>>>>> + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
>>>>>> +
>>>>>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
>>>>>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
>>>>>> +
>>>>>> + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
>>>>>> + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
>>>>>> +
>>>>>> + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
>>>>>> + val |= PCIE_ECAM_BLOCKER_EN;
>>>>>> + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
>>>>>> +}
>>>>>> +
>>>>>> static int qcom_pcie_start_link(struct dw_pcie *pci)
>>>>>> {
>>>>>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>>>> @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
>>>>>> qcom_pcie_common_set_16gt_lane_margining(pci);
>>>>>> }
>>>>>> + if (pci->pp.ecam_mode)
>>>>>> + qcom_pci_config_ecam(&pci->pp);
>>>>>> +
>>>>>> /* Enable Link Training state machine */
>>>>>> if (pcie->cfg->ops->ltssm_enable)
>>>>>> pcie->cfg->ops->ltssm_enable(pcie);
>>>>>> @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>>>>>> {
>>>>>> struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
>>>>>> struct qcom_pcie *pcie = to_qcom_pcie(pci);
>>>>>> + u16 offset;
>>>>>> int ret;
>>>>>> qcom_ep_reset_assert(pcie);
>>>>>> @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
>>>>>> if (ret)
>>>>>> return ret;
>>>>>> + if (pp->ecam_mode) {
>>>>>> + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
>>>>>> + pcie->elbi = pci->dbi_base + offset;
>>>>>> + }
>>>>>
>>>>> If you use the existing 'elbi' register offset defined in DT, you can just rely
>>>>> on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
>>>>> patch 3.
>>>>> >> +
>>>>>> ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
>>>>>> if (ret)
>>>>>> goto err_deinit;
>>>>>> @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
>>>>>> pci->ops = &dw_pcie_ops;
>>>>>> pp = &pci->pp;
>>>>>> + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
>>>>>> + if (!pp->bridge) {
>>>>>> + ret = -ENOMEM;
>>>>>> + goto err_pm_runtime_put;
>>>>>> + }
>>>>>> +
>>>>>
>>>>> This will also go away.
>>>>>
>>>> Hi Mani,
>>>>
>>>> I get your point but the problem is in ECAM mode the DBI address to maximum
>>>> of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
>>>> this ioremap of elbi ioremap in pci_ecam_create because we already
>>>> iormaped elbi which falls in dbi address to 256 MB region( as we can't
>>>> remap same region twice). so we need to skip doing ioremap for elbi
>>>> region.
>>>>
>>>
>>> Then obviously, your DT entries are wrong. You cannot define overlapping regions
>>> on purpose. Can't you leave the ELBI region and start the config region?
>>>
>>> - Mani
>> ELBI is part of DBI space(present in the first 4kb of the dbi) we can't
>> relocate ELBI region to different location.
>> can we keep this elbi region as optional and remove elbi from the
>> devicetree and binding?
>>
>
> Since ELBI is a DWC generic region, you should move the resource get call to
> dw_pcie_get_resources(). Also, it is an optional region, so you should open code
> devm_platform_ioremap_resource_byname() to skip devm_ioremap_resource() if
> platform_get_resource_byname() returns NULL. DT binding should make sure that
> the DTS has the region specified if required.
>
Hi Mani,
even though elbi is a dwc region the registers in the elbi are specific
to the vendors. The ELBI register contents in the qcom might not match
with the other vendors. So we can skip this adding this in
dw_pcie_get_resources()
we will try to make it optional only in qcom driver, and remove from the
dt as suggested.
- Krishna Chaitanya.
> And this should allow you to move the dw_pcie_ecam_supported() call inside DWC
> core.
>
> - Mani
>
^ permalink raw reply [flat|nested] 19+ messages in thread* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-03-07 1:14 ` Krishna Chaitanya Chundru
@ 2025-03-07 17:40 ` Manivannan Sadhasivam
0 siblings, 0 replies; 19+ messages in thread
From: Manivannan Sadhasivam @ 2025-03-07 17:40 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: Krishna Chaitanya Chundru, cros-qcom-dts-watchers,
Bjorn Andersson, Konrad Dybcio, Rob Herring, Krzysztof Kozlowski,
Conor Dooley, Lorenzo Pieralisi, Krzysztof Wilczyński,
Bjorn Helgaas, Jingoo Han, linux-arm-msm, devicetree,
linux-kernel, linux-pci, quic_vbadigan, quic_mrana, quic_vpernami,
mmareddy
On Fri, Mar 07, 2025 at 06:44:53AM +0530, Krishna Chaitanya Chundru wrote:
>
>
> On 3/5/2025 11:56 PM, Manivannan Sadhasivam wrote:
> > On Mon, Feb 10, 2025 at 03:23:43PM +0530, Krishna Chaitanya Chundru wrote:
> > >
> > >
> > > On 2/10/2025 3:17 PM, Manivannan Sadhasivam wrote:
> > > > On Mon, Feb 10, 2025 at 03:04:43PM +0530, Krishna Chaitanya Chundru wrote:
> > > > >
> > > > >
> > > > > On 2/10/2025 2:52 PM, Manivannan Sadhasivam wrote:
> > > > > > On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> > > > > > > The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> > > > > > > gives us the offset from which ELBI starts. so use this offset and cfg
> > > > > > > win to map these regions instead of doing the ioremap again.
> > > > > > >
> > > > > > > On root bus, we have only the root port. Any access other than that
> > > > > > > should not go out of the link and should return all F's. Since the iATU
> > > > > > > is configured for the buses which starts after root bus, block the
> > > > > > > transactions starting from function 1 of the root bus to the end of
> > > > > > > the root bus (i.e from dbi_base + 4kb to dbi_base + 1MB) from going
> > > > > > > outside the link through ECAM blocker through PARF registers.
> > > > > > >
> > > > > > > Signed-off-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
> > > > > > > ---
> > > > > > > drivers/pci/controller/dwc/pcie-qcom.c | 77 ++++++++++++++++++++++++++++++++--
> > > > > > > 1 file changed, 73 insertions(+), 4 deletions(-)
> > > > > > >
> > > > > > > diff --git a/drivers/pci/controller/dwc/pcie-qcom.c b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > > > index e4d3366ead1f..84297b308e7e 100644
> > > > > > > --- a/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > > > +++ b/drivers/pci/controller/dwc/pcie-qcom.c
> > > > > > > @@ -52,6 +52,7 @@
> > > > > > > #define PARF_AXI_MSTR_WR_ADDR_HALT_V2 0x1a8
> > > > > > > #define PARF_Q2A_FLUSH 0x1ac
> > > > > > > #define PARF_LTSSM 0x1b0
> > > > > > > +#define PARF_SLV_DBI_ELBI 0x1b4
> > > > > > > #define PARF_INT_ALL_STATUS 0x224
> > > > > > > #define PARF_INT_ALL_CLEAR 0x228
> > > > > > > #define PARF_INT_ALL_MASK 0x22c
> > > > > > > @@ -61,6 +62,17 @@
> > > > > > > #define PARF_DBI_BASE_ADDR_V2_HI 0x354
> > > > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2 0x358
> > > > > > > #define PARF_SLV_ADDR_SPACE_SIZE_V2_HI 0x35c
> > > > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE 0x360
> > > > > > > +#define PARF_BLOCK_SLV_AXI_WR_BASE_HI 0x364
> > > > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT 0x368
> > > > > > > +#define PARF_BLOCK_SLV_AXI_WR_LIMIT_HI 0x36c
> > > > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE 0x370
> > > > > > > +#define PARF_BLOCK_SLV_AXI_RD_BASE_HI 0x374
> > > > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT 0x378
> > > > > > > +#define PARF_BLOCK_SLV_AXI_RD_LIMIT_HI 0x37c
> > > > > > > +#define PARF_ECAM_BASE 0x380
> > > > > > > +#define PARF_ECAM_BASE_HI 0x384
> > > > > > > +
> > > > > > > #define PARF_NO_SNOOP_OVERIDE 0x3d4
> > > > > > > #define PARF_ATU_BASE_ADDR 0x634
> > > > > > > #define PARF_ATU_BASE_ADDR_HI 0x638
> > > > > > > @@ -84,6 +96,7 @@
> > > > > > > /* PARF_SYS_CTRL register fields */
> > > > > > > #define MAC_PHY_POWERDOWN_IN_P2_D_MUX_EN BIT(29)
> > > > > > > +#define PCIE_ECAM_BLOCKER_EN BIT(26)
> > > > > > > #define MST_WAKEUP_EN BIT(13)
> > > > > > > #define SLV_WAKEUP_EN BIT(12)
> > > > > > > #define MSTR_ACLK_CGC_DIS BIT(10)
> > > > > > > @@ -294,6 +307,44 @@ static void qcom_ep_reset_deassert(struct qcom_pcie *pcie)
> > > > > > > usleep_range(PERST_DELAY_US, PERST_DELAY_US + 500);
> > > > > > > }
> > > > > > > +static void qcom_pci_config_ecam(struct dw_pcie_rp *pp)
> > > > > > > +{
> > > > > > > + struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > > > + struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > > > + u64 addr, addr_end;
> > > > > > > + u32 val;
> > > > > > > +
> > > > > > > + /* Set the ECAM base */
> > > > > > > + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> > > > > > > + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> > > > > > > +
> > > > > > > + /*
> > > > > > > + * The only device on root bus is the Root Port. Any access other than that
> > > > > > > + * should not go out of the link and should return all F's. Since the iATU
> > > > > > > + * is configured for the buses which starts after root bus, block the transactions
> > > > > > > + * starting from function 1 of the root bus to the end of the root bus (i.e from
> > > > > > > + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
> > > > > > > + */
> > > > > > > + addr = pci->dbi_phys_addr + SZ_4K;
> > > > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE);
> > > > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_WR_BASE_HI);
> > > > > > > +
> > > > > > > + writel_relaxed(lower_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE);
> > > > > > > + writel_relaxed(upper_32_bits(addr), pcie->parf + PARF_BLOCK_SLV_AXI_RD_BASE_HI);
> > > > > > > +
> > > > > > > + addr_end = pci->dbi_phys_addr + SZ_1M - 1;
> > > > > > > +
> > > > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT);
> > > > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_WR_LIMIT_HI);
> > > > > > > +
> > > > > > > + writel_relaxed(lower_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT);
> > > > > > > + writel_relaxed(upper_32_bits(addr_end), pcie->parf + PARF_BLOCK_SLV_AXI_RD_LIMIT_HI);
> > > > > > > +
> > > > > > > + val = readl_relaxed(pcie->parf + PARF_SYS_CTRL);
> > > > > > > + val |= PCIE_ECAM_BLOCKER_EN;
> > > > > > > + writel_relaxed(val, pcie->parf + PARF_SYS_CTRL);
> > > > > > > +}
> > > > > > > +
> > > > > > > static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > > > {
> > > > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > > > @@ -303,6 +354,9 @@ static int qcom_pcie_start_link(struct dw_pcie *pci)
> > > > > > > qcom_pcie_common_set_16gt_lane_margining(pci);
> > > > > > > }
> > > > > > > + if (pci->pp.ecam_mode)
> > > > > > > + qcom_pci_config_ecam(&pci->pp);
> > > > > > > +
> > > > > > > /* Enable Link Training state machine */
> > > > > > > if (pcie->cfg->ops->ltssm_enable)
> > > > > > > pcie->cfg->ops->ltssm_enable(pcie);
> > > > > > > @@ -1233,6 +1287,7 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > > > {
> > > > > > > struct dw_pcie *pci = to_dw_pcie_from_pp(pp);
> > > > > > > struct qcom_pcie *pcie = to_qcom_pcie(pci);
> > > > > > > + u16 offset;
> > > > > > > int ret;
> > > > > > > qcom_ep_reset_assert(pcie);
> > > > > > > @@ -1241,6 +1296,11 @@ static int qcom_pcie_host_init(struct dw_pcie_rp *pp)
> > > > > > > if (ret)
> > > > > > > return ret;
> > > > > > > + if (pp->ecam_mode) {
> > > > > > > + offset = readl(pcie->parf + PARF_SLV_DBI_ELBI);
> > > > > > > + pcie->elbi = pci->dbi_base + offset;
> > > > > > > + }
> > > > > >
> > > > > > If you use the existing 'elbi' register offset defined in DT, you can just rely
> > > > > > on the DWC core to call dw_pcie_ecam_supported() as I mentioned in my comment in
> > > > > > patch 3.
> > > > > > >> +
> > > > > > > ret = phy_set_mode_ext(pcie->phy, PHY_MODE_PCIE, PHY_MODE_PCIE_RC);
> > > > > > > if (ret)
> > > > > > > goto err_deinit;
> > > > > > > @@ -1615,6 +1675,13 @@ static int qcom_pcie_probe(struct platform_device *pdev)
> > > > > > > pci->ops = &dw_pcie_ops;
> > > > > > > pp = &pci->pp;
> > > > > > > + pp->bridge = devm_pci_alloc_host_bridge(dev, 0);
> > > > > > > + if (!pp->bridge) {
> > > > > > > + ret = -ENOMEM;
> > > > > > > + goto err_pm_runtime_put;
> > > > > > > + }
> > > > > > > +
> > > > > >
> > > > > > This will also go away.
> > > > > >
> > > > > Hi Mani,
> > > > >
> > > > > I get your point but the problem is in ECAM mode the DBI address to maximum
> > > > > of 256 MB will be ioremap by pci_ecam_create(). If we don't skip
> > > > > this ioremap of elbi ioremap in pci_ecam_create because we already
> > > > > iormaped elbi which falls in dbi address to 256 MB region( as we can't
> > > > > remap same region twice). so we need to skip doing ioremap for elbi
> > > > > region.
> > > > >
> > > >
> > > > Then obviously, your DT entries are wrong. You cannot define overlapping regions
> > > > on purpose. Can't you leave the ELBI region and start the config region?
> > > >
> > > > - Mani
> > > ELBI is part of DBI space(present in the first 4kb of the dbi) we can't
> > > relocate ELBI region to different location.
> > > can we keep this elbi region as optional and remove elbi from the
> > > devicetree and binding?
> > >
> >
> > Since ELBI is a DWC generic region, you should move the resource get call to
> > dw_pcie_get_resources(). Also, it is an optional region, so you should open code
> > devm_platform_ioremap_resource_byname() to skip devm_ioremap_resource() if
> > platform_get_resource_byname() returns NULL. DT binding should make sure that
> > the DTS has the region specified if required.
> >
> Hi Mani,
> even though elbi is a dwc region the registers in the elbi are specific
> to the vendors. The ELBI register contents in the qcom might not match
> with the other vendors. So we can skip this adding this in
> dw_pcie_get_resources()
>
No. I was just asking you to move the devm_platform_ioremap_resource_byname() of
ELBI to dw_pcie_get_resources(), like DBI, iATU. Then controller drivers can use
'dw_pcie::elbi' to access EBI specific registers with their own offset.
Since ELBI is DWC specific, the resource fetch code should belong to the DWC
core. And it will simplify your patch also.
- Mani
--
மணிவண்ணன் சதாசிவம்
^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-06 23:28 ` [PATCH v4 4/4] PCI: qcom: Enable ECAM feature Krishna Chaitanya Chundru
2025-02-10 9:22 ` Manivannan Sadhasivam
@ 2025-02-10 22:54 ` Bjorn Helgaas
2025-02-17 5:07 ` Krishna Chaitanya Chundru
1 sibling, 1 reply; 19+ messages in thread
From: Bjorn Helgaas @ 2025-02-10 22:54 UTC (permalink / raw)
To: Krishna Chaitanya Chundru
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han, linux-arm-msm, devicetree, linux-kernel, linux-pci,
quic_vbadigan, quic_mrana, quic_vpernami, mmareddy
On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
> gives us the offset from which ELBI starts. so use this offset and cfg
> win to map these regions instead of doing the ioremap again.
> + /* Set the ECAM base */
> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
> +
> + /*
> + * The only device on root bus is the Root Port. Any access other than that
> + * should not go out of the link and should return all F's. Since the iATU
> + * is configured for the buses which starts after root bus, block the transactions
> + * starting from function 1 of the root bus to the end of the root bus (i.e from
> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
99% of this file fits in 80 columns. Wrap comments to do the same.
The text doesn't quite make sense because accesses to devices on the
root bus *never* involve a link. Only Root Ports have links and the
links all lead to buses other than the root bus.
^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [PATCH v4 4/4] PCI: qcom: Enable ECAM feature
2025-02-10 22:54 ` Bjorn Helgaas
@ 2025-02-17 5:07 ` Krishna Chaitanya Chundru
0 siblings, 0 replies; 19+ messages in thread
From: Krishna Chaitanya Chundru @ 2025-02-17 5:07 UTC (permalink / raw)
To: Bjorn Helgaas
Cc: cros-qcom-dts-watchers, Bjorn Andersson, Konrad Dybcio,
Rob Herring, Krzysztof Kozlowski, Conor Dooley, Lorenzo Pieralisi,
Krzysztof Wilczyński, Manivannan Sadhasivam, Bjorn Helgaas,
Jingoo Han, linux-arm-msm, devicetree, linux-kernel, linux-pci,
quic_vbadigan, quic_mrana, quic_vpernami, mmareddy
On 2/11/2025 4:24 AM, Bjorn Helgaas wrote:
> On Fri, Feb 07, 2025 at 04:58:59AM +0530, Krishna Chaitanya Chundru wrote:
>> The ELBI registers falls after the DBI space, PARF_SLV_DBI_ELBI register
>> gives us the offset from which ELBI starts. so use this offset and cfg
>> win to map these regions instead of doing the ioremap again.
>
>> + /* Set the ECAM base */
>> + writel_relaxed(lower_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE);
>> + writel_relaxed(upper_32_bits(pci->dbi_phys_addr), pcie->parf + PARF_ECAM_BASE_HI);
>> +
>> + /*
>> + * The only device on root bus is the Root Port. Any access other than that
>> + * should not go out of the link and should return all F's. Since the iATU
>> + * is configured for the buses which starts after root bus, block the transactions
>> + * starting from function 1 of the root bus to the end of the root bus (i.e from
>> + * dbi_base + 4kb to dbi_base + 1MB) from going outside the link.
>
> 99% of this file fits in 80 columns. Wrap comments to do the same.
>
> The text doesn't quite make sense because accesses to devices on the
> root bus *never* involve a link. Only Root Ports have links and the
> links all lead to buses other than the root bus.
Hi Bjorn,
As part of enumeration PCIe sw will look read the vendor id's and device
id's under PCIe bus0 to see if there is any multi root ports etc..like
bus0 device1, bus0 device2.
In the first 1MB only first 4kb is used as config space for root port,
remaining memory acts as PCIe memory i.e if we access this memory the
transactions will go outside the link which can trigger some unknown
error.
if there is read request for vendor id for bus0 device2 PCIe sw will
try to access after 4kb region which can cause unknown errors.
So we need to block these transaction from going through PCIe link.
I will update the comment description in the next patch.
- Krishna Chaitanya.
^ permalink raw reply [flat|nested] 19+ messages in thread