linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
* [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell"
@ 2016-04-07  0:20 Guilherme G. Piccoli
  2016-04-07  0:20 ` [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism Guilherme G. Piccoli
  2016-04-07  0:35 ` [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Gavin Shan
  0 siblings, 2 replies; 5+ messages in thread
From: Guilherme G. Piccoli @ 2016-04-07  0:20 UTC (permalink / raw)
  To: linuxppc-dev, gwshan; +Cc: mpe, benh, paulus, nfont

This reverts commit 89a51df5ab1d38b257300b8ac940bbac3bb0eb9b.

The function eeh_add_device_early() is used to perform EEH initialization in
devices added later on the system, like in hotplug/DLPAR scenarios. Since the
commit 89a51df5ab1d ("powerpc/eeh: Fix crash in eeh_add_device_early() on Cell")
a new check was introduced in this function - Cell has no EEH capabilities
which led to kernel oops if hotplug was performed, so checking for
eeh_enabled() was introduced to avoid the issue.

However, in architectures that EEH is present like pSeries or PowerNV, we might
reach a case in which no PCI devices are present on boot time and so EEH is not
initialized. Then, if a device is added via DLPAR for example,
eeh_add_device_early() fails because eeh_enabled() is false, and EEH end up
not being enabled at all.

This reverts the aforementioned patch since a new verification was introduced by
the commit d91dafc02f42 ("powerpc/eeh: Delay probing EEH device during hotplug")
and so the original Cell issue does not happen anymore.

Signed-off-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com>
---
 arch/powerpc/kernel/eeh.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/arch/powerpc/kernel/eeh.c b/arch/powerpc/kernel/eeh.c
index 6544017..6758484 100644
--- a/arch/powerpc/kernel/eeh.c
+++ b/arch/powerpc/kernel/eeh.c
@@ -1068,7 +1068,7 @@ void eeh_add_device_early(struct pci_dn *pdn)
 	struct pci_controller *phb;
 	struct eeh_dev *edev = pdn_to_eeh_dev(pdn);
 
-	if (!edev || !eeh_enabled())
+	if (!edev)
 		return;
 
 	if (!eeh_has_flag(EEH_PROBE_MODE_DEVTREE))
-- 
2.1.0

^ permalink raw reply related	[flat|nested] 5+ messages in thread

* [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism
  2016-04-07  0:20 [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Guilherme G. Piccoli
@ 2016-04-07  0:20 ` Guilherme G. Piccoli
  2016-04-07  0:48   ` Gavin Shan
  2016-04-07  0:35 ` [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Gavin Shan
  1 sibling, 1 reply; 5+ messages in thread
From: Guilherme G. Piccoli @ 2016-04-07  0:20 UTC (permalink / raw)
  To: linuxppc-dev, gwshan; +Cc: mpe, benh, paulus, nfont

Commit 39baadbf36ce ("powerpc/eeh: Remove eeh information from pci_dn")
changed the pci_dn struct by removing its EEH-related members.
As part of this clean-up, DDW mechanism was modified to read the device
configuration address from eeh_dev struct.

As a consequence, now if we disable EEH mechanism on kernel command-line
for example, the DDW mechanism will fail, generating a kernel oops by
dereferencing a NULL pointer (which turns to be the eeh_dev pointer).

This patch just changes the configuration address calculation on DDW
functions to a manual calculation based on pci_dn members instead of
using eeh_dev-based address.

No functional changes were made. This was tested on pSeries, both
in PHyp and qemu guest.

Fixes: 39baadbf36ce ("powerpc/eeh: Remove eeh information from pci_dn")
Signed-off-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com>
---
 arch/powerpc/platforms/pseries/iommu.c | 24 ++++++++++++------------
 1 file changed, 12 insertions(+), 12 deletions(-)

diff --git a/arch/powerpc/platforms/pseries/iommu.c b/arch/powerpc/platforms/pseries/iommu.c
index bd98ce2..3db2cea 100644
--- a/arch/powerpc/platforms/pseries/iommu.c
+++ b/arch/powerpc/platforms/pseries/iommu.c
@@ -912,7 +912,8 @@ machine_arch_initcall(pseries, find_existing_ddw_windows);
 static int query_ddw(struct pci_dev *dev, const u32 *ddw_avail,
 			struct ddw_query_response *query)
 {
-	struct eeh_dev *edev;
+	struct device_node *dn;
+	struct pci_dn *pdn;
 	u32 cfg_addr;
 	u64 buid;
 	int ret;
@@ -923,11 +924,10 @@ static int query_ddw(struct pci_dev *dev, const u32 *ddw_avail,
 	 * Retrieve them from the pci device, not the node with the
 	 * dma-window property
 	 */
-	edev = pci_dev_to_eeh_dev(dev);
-	cfg_addr = edev->config_addr;
-	if (edev->pe_config_addr)
-		cfg_addr = edev->pe_config_addr;
-	buid = edev->phb->buid;
+	dn = pci_device_to_OF_node(dev);
+	pdn = PCI_DN(dn);
+	cfg_addr = (pdn->busno << 8) | (pdn->devfn);
+	buid = pdn->phb->buid;
 
 	ret = rtas_call(ddw_avail[0], 3, 5, (u32 *)query,
 		  cfg_addr, BUID_HI(buid), BUID_LO(buid));
@@ -941,7 +941,8 @@ static int create_ddw(struct pci_dev *dev, const u32 *ddw_avail,
 			struct ddw_create_response *create, int page_shift,
 			int window_shift)
 {
-	struct eeh_dev *edev;
+	struct device_node *dn;
+	struct pci_dn *pdn;
 	u32 cfg_addr;
 	u64 buid;
 	int ret;
@@ -952,11 +953,10 @@ static int create_ddw(struct pci_dev *dev, const u32 *ddw_avail,
 	 * Retrieve them from the pci device, not the node with the
 	 * dma-window property
 	 */
-	edev = pci_dev_to_eeh_dev(dev);
-	cfg_addr = edev->config_addr;
-	if (edev->pe_config_addr)
-		cfg_addr = edev->pe_config_addr;
-	buid = edev->phb->buid;
+	dn = pci_device_to_OF_node(dev);
+	pdn = PCI_DN(dn);
+	cfg_addr = (pdn->busno << 8) | (pdn->devfn);
+	buid = pdn->phb->buid;
 
 	do {
 		/* extra outputs are LIOBN and dma-addr (hi, lo) */
-- 
2.1.0

^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell"
  2016-04-07  0:20 [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Guilherme G. Piccoli
  2016-04-07  0:20 ` [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism Guilherme G. Piccoli
@ 2016-04-07  0:35 ` Gavin Shan
  1 sibling, 0 replies; 5+ messages in thread
From: Gavin Shan @ 2016-04-07  0:35 UTC (permalink / raw)
  To: Guilherme G. Piccoli
  Cc: linuxppc-dev, gwshan, mpe, benh, paulus, nfont, ruscur

On Wed, Apr 06, 2016 at 09:20:04PM -0300, Guilherme G. Piccoli wrote:
>This reverts commit 89a51df5ab1d38b257300b8ac940bbac3bb0eb9b.
>
>The function eeh_add_device_early() is used to perform EEH initialization in
>devices added later on the system, like in hotplug/DLPAR scenarios. Since the
>commit 89a51df5ab1d ("powerpc/eeh: Fix crash in eeh_add_device_early() on Cell")
>a new check was introduced in this function - Cell has no EEH capabilities
>which led to kernel oops if hotplug was performed, so checking for
>eeh_enabled() was introduced to avoid the issue.
>
>However, in architectures that EEH is present like pSeries or PowerNV, we might
>reach a case in which no PCI devices are present on boot time and so EEH is not
>initialized. Then, if a device is added via DLPAR for example,
>eeh_add_device_early() fails because eeh_enabled() is false, and EEH end up
>not being enabled at all.
>
>This reverts the aforementioned patch since a new verification was introduced by
>the commit d91dafc02f42 ("powerpc/eeh: Delay probing EEH device during hotplug")
>and so the original Cell issue does not happen anymore.
>
>Signed-off-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com>

Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com>

>---
> arch/powerpc/kernel/eeh.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
>diff --git a/arch/powerpc/kernel/eeh.c b/arch/powerpc/kernel/eeh.c
>index 6544017..6758484 100644
>--- a/arch/powerpc/kernel/eeh.c
>+++ b/arch/powerpc/kernel/eeh.c
>@@ -1068,7 +1068,7 @@ void eeh_add_device_early(struct pci_dn *pdn)
> 	struct pci_controller *phb;
> 	struct eeh_dev *edev = pdn_to_eeh_dev(pdn);
>
>-	if (!edev || !eeh_enabled())
>+	if (!edev)
> 		return;
>
> 	if (!eeh_has_flag(EEH_PROBE_MODE_DEVTREE))
>-- 
>2.1.0
>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism
  2016-04-07  0:20 ` [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism Guilherme G. Piccoli
@ 2016-04-07  0:48   ` Gavin Shan
  2016-04-07  1:55     ` Guilherme G. Piccoli
  0 siblings, 1 reply; 5+ messages in thread
From: Gavin Shan @ 2016-04-07  0:48 UTC (permalink / raw)
  To: Guilherme G. Piccoli; +Cc: linuxppc-dev, gwshan, mpe, benh, paulus, nfont

On Wed, Apr 06, 2016 at 09:20:05PM -0300, Guilherme G. Piccoli wrote:
>Commit 39baadbf36ce ("powerpc/eeh: Remove eeh information from pci_dn")
>changed the pci_dn struct by removing its EEH-related members.
>As part of this clean-up, DDW mechanism was modified to read the device
>configuration address from eeh_dev struct.
>
>As a consequence, now if we disable EEH mechanism on kernel command-line
>for example, the DDW mechanism will fail, generating a kernel oops by
>dereferencing a NULL pointer (which turns to be the eeh_dev pointer).
>
>This patch just changes the configuration address calculation on DDW
>functions to a manual calculation based on pci_dn members instead of
>using eeh_dev-based address.
>
>No functional changes were made. This was tested on pSeries, both
>in PHyp and qemu guest.
>
>Fixes: 39baadbf36ce ("powerpc/eeh: Remove eeh information from pci_dn")
>Signed-off-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com>

Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com>

Thanks, Guilherme. Please make sure if it needs to be backported
to stable kernel. I assume it probably needs to be in 3.10+. 

>---
> arch/powerpc/platforms/pseries/iommu.c | 24 ++++++++++++------------
> 1 file changed, 12 insertions(+), 12 deletions(-)
>
>diff --git a/arch/powerpc/platforms/pseries/iommu.c b/arch/powerpc/platforms/pseries/iommu.c
>index bd98ce2..3db2cea 100644
>--- a/arch/powerpc/platforms/pseries/iommu.c
>+++ b/arch/powerpc/platforms/pseries/iommu.c
>@@ -912,7 +912,8 @@ machine_arch_initcall(pseries, find_existing_ddw_windows);
> static int query_ddw(struct pci_dev *dev, const u32 *ddw_avail,
> 			struct ddw_query_response *query)
> {
>-	struct eeh_dev *edev;
>+	struct device_node *dn;
>+	struct pci_dn *pdn;
> 	u32 cfg_addr;
> 	u64 buid;
> 	int ret;
>@@ -923,11 +924,10 @@ static int query_ddw(struct pci_dev *dev, const u32 *ddw_avail,
> 	 * Retrieve them from the pci device, not the node with the
> 	 * dma-window property
> 	 */
>-	edev = pci_dev_to_eeh_dev(dev);
>-	cfg_addr = edev->config_addr;
>-	if (edev->pe_config_addr)
>-		cfg_addr = edev->pe_config_addr;
>-	buid = edev->phb->buid;
>+	dn = pci_device_to_OF_node(dev);
>+	pdn = PCI_DN(dn);
>+	cfg_addr = (pdn->busno << 8) | (pdn->devfn);
>+	buid = pdn->phb->buid;

No parentheses required on the second operand when getting @cfg_addr.
It would be nicer to initialize @buid and then @cfg_addr. When
developers look at the code, they usually check @buid and then
@cfg_addr.

>
> 	ret = rtas_call(ddw_avail[0], 3, 5, (u32 *)query,
> 		  cfg_addr, BUID_HI(buid), BUID_LO(buid));
>@@ -941,7 +941,8 @@ static int create_ddw(struct pci_dev *dev, const u32 *ddw_avail,
> 			struct ddw_create_response *create, int page_shift,
> 			int window_shift)
> {
>-	struct eeh_dev *edev;
>+	struct device_node *dn;
>+	struct pci_dn *pdn;
> 	u32 cfg_addr;
> 	u64 buid;
> 	int ret;
>@@ -952,11 +953,10 @@ static int create_ddw(struct pci_dev *dev, const u32 *ddw_avail,
> 	 * Retrieve them from the pci device, not the node with the
> 	 * dma-window property
> 	 */
>-	edev = pci_dev_to_eeh_dev(dev);
>-	cfg_addr = edev->config_addr;
>-	if (edev->pe_config_addr)
>-		cfg_addr = edev->pe_config_addr;
>-	buid = edev->phb->buid;
>+	dn = pci_device_to_OF_node(dev);
>+	pdn = PCI_DN(dn);
>+	cfg_addr = (pdn->busno << 8) | (pdn->devfn);
>+	buid = pdn->phb->buid;

Same comment as above.

Thanks,
Gavin
>
> 	do {
> 		/* extra outputs are LIOBN and dma-addr (hi, lo) */
>-- 
>2.1.0
>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism
  2016-04-07  0:48   ` Gavin Shan
@ 2016-04-07  1:55     ` Guilherme G. Piccoli
  0 siblings, 0 replies; 5+ messages in thread
From: Guilherme G. Piccoli @ 2016-04-07  1:55 UTC (permalink / raw)
  To: Gavin Shan; +Cc: paulus, nfont, linuxppc-dev

On 04/06/2016 09:48 PM, Gavin Shan wrote:
> On Wed, Apr 06, 2016 at 09:20:05PM -0300, Guilherme G. Piccoli wrote:
>> Fixes: 39baadbf36ce ("powerpc/eeh: Remove eeh information from pci_dn")
>> Signed-off-by: Guilherme G. Piccoli <gpiccoli@linux.vnet.ibm.com>
>
> Reviewed-by: Gavin Shan <gwshan@linux.vnet.ibm.com>
>
> Thanks, Guilherme. Please make sure if it needs to be backported
> to stable kernel. I assume it probably needs to be in 3.10+.

Thanks for your quick review Gavin! I guess we only need to backport 
this to kernels containing 39baadbf36ce. Stable is a good idea.


>> +	dn = pci_device_to_OF_node(dev);
>> +	pdn = PCI_DN(dn);
>> +	cfg_addr = (pdn->busno << 8) | (pdn->devfn);
>> +	buid = pdn->phb->buid;
>
> No parentheses required on the second operand when getting @cfg_addr.
> It would be nicer to initialize @buid and then @cfg_addr. When
> developers look at the code, they usually check @buid and then
> @cfg_addr.

OK, I'll change this in v3. buid first, cfg_addr second. Also I'll 
remove the parentheses - it's a bad habit of mine =)

Cheers,


Guilherme

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2016-04-07  1:55 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-04-07  0:20 [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Guilherme G. Piccoli
2016-04-07  0:20 ` [PATCH 2/2 v2] powerpc/iommu: Remove the dependency on EEH struct in DDW mechanism Guilherme G. Piccoli
2016-04-07  0:48   ` Gavin Shan
2016-04-07  1:55     ` Guilherme G. Piccoli
2016-04-07  0:35 ` [PATCH 1/2 v2] Revert "powerpc/eeh: Fix crash in eeh_add_device_early() on Cell" Gavin Shan

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).