* [PATCH] accel/amdxdna: fix IRQ vector leak in aie2_init()
@ 2026-04-14 12:10 Guangshuo Li
2026-04-14 16:06 ` Lizhi Hou
0 siblings, 1 reply; 3+ messages in thread
From: Guangshuo Li @ 2026-04-14 12:10 UTC (permalink / raw)
To: Min Ma, Lizhi Hou, Oded Gabbay, Jeff Hugo, George Yang,
Narendra Gutta, dri-devel, linux-kernel
Cc: Guangshuo Li, stable
aie2_init() allocates MSI-X vectors with pci_alloc_irq_vectors() before
creating the PSP handle, starting the hardware and initializing the
resolver.
When aie2m_psp_create(), aie2_hw_start() or xrsm_init() fails after IRQ
vectors have been allocated successfully, the function releases the
firmware and unwinds hardware state, but fails to free the allocated
IRQ vectors.
The issue was identified by a static analysis tool I developed and
confirmed by manual review. Add a dedicated error path to free the IRQ
vectors after pci_alloc_irq_vectors() succeeds.
Fixes: 8c9ff1b181ba ("accel/amdxdna: Add a new driver for AMD AI Engine")
Cc: stable@vger.kernel.org
Signed-off-by: Guangshuo Li <lgs201920130244@gmail.com>
---
drivers/accel/amdxdna/aie2_pci.c | 6 ++++--
1 file changed, 4 insertions(+), 2 deletions(-)
diff --git a/drivers/accel/amdxdna/aie2_pci.c b/drivers/accel/amdxdna/aie2_pci.c
index 4924a9da55b6..f05f49f691b5 100644
--- a/drivers/accel/amdxdna/aie2_pci.c
+++ b/drivers/accel/amdxdna/aie2_pci.c
@@ -591,14 +591,14 @@ static int aie2_init(struct amdxdna_dev *xdna)
if (!ndev->psp_hdl) {
XDNA_ERR(xdna, "failed to create psp");
ret = -ENOMEM;
- goto release_fw;
+ goto free_irq_vectors;
}
xdna->dev_handle = ndev;
ret = aie2_hw_start(xdna);
if (ret) {
XDNA_ERR(xdna, "start npu failed, ret %d", ret);
- goto release_fw;
+ goto free_irq_vectors;
}
xrs_cfg.clk_list.num_levels = ndev->max_dpm_level + 1;
@@ -623,6 +623,8 @@ static int aie2_init(struct amdxdna_dev *xdna)
stop_hw:
aie2_hw_stop(xdna);
+free_irq_vectors:
+ pci_free_irq_vectors(pdev);
release_fw:
release_firmware(fw);
--
2.43.0
^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [PATCH] accel/amdxdna: fix IRQ vector leak in aie2_init()
2026-04-14 12:10 [PATCH] accel/amdxdna: fix IRQ vector leak in aie2_init() Guangshuo Li
@ 2026-04-14 16:06 ` Lizhi Hou
2026-04-15 2:05 ` Guangshuo Li
0 siblings, 1 reply; 3+ messages in thread
From: Lizhi Hou @ 2026-04-14 16:06 UTC (permalink / raw)
To: Guangshuo Li, Min Ma, Oded Gabbay, Jeff Hugo, George Yang,
Narendra Gutta, dri-devel, linux-kernel
Cc: stable
On 4/14/26 05:10, Guangshuo Li wrote:
> [You don't often get email from lgs201920130244@gmail.com. Learn why this is important at https://aka.ms/LearnAboutSenderIdentification ]
>
> aie2_init() allocates MSI-X vectors with pci_alloc_irq_vectors() before
> creating the PSP handle, starting the hardware and initializing the
> resolver.
>
> When aie2m_psp_create(), aie2_hw_start() or xrsm_init() fails after IRQ
> vectors have been allocated successfully, the function releases the
> firmware and unwinds hardware state, but fails to free the allocated
> IRQ vectors.
aie2_init enables device via pcim_enable_device(), which sets managed
device. Thus the vectors are automatically cleanup. So NAK.
Lizhi
>
> The issue was identified by a static analysis tool I developed and
> confirmed by manual review. Add a dedicated error path to free the IRQ
> vectors after pci_alloc_irq_vectors() succeeds.
>
> Fixes: 8c9ff1b181ba ("accel/amdxdna: Add a new driver for AMD AI Engine")
> Cc: stable@vger.kernel.org
> Signed-off-by: Guangshuo Li <lgs201920130244@gmail.com>
> ---
> drivers/accel/amdxdna/aie2_pci.c | 6 ++++--
> 1 file changed, 4 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/accel/amdxdna/aie2_pci.c b/drivers/accel/amdxdna/aie2_pci.c
> index 4924a9da55b6..f05f49f691b5 100644
> --- a/drivers/accel/amdxdna/aie2_pci.c
> +++ b/drivers/accel/amdxdna/aie2_pci.c
> @@ -591,14 +591,14 @@ static int aie2_init(struct amdxdna_dev *xdna)
> if (!ndev->psp_hdl) {
> XDNA_ERR(xdna, "failed to create psp");
> ret = -ENOMEM;
> - goto release_fw;
> + goto free_irq_vectors;
> }
> xdna->dev_handle = ndev;
>
> ret = aie2_hw_start(xdna);
> if (ret) {
> XDNA_ERR(xdna, "start npu failed, ret %d", ret);
> - goto release_fw;
> + goto free_irq_vectors;
> }
>
> xrs_cfg.clk_list.num_levels = ndev->max_dpm_level + 1;
> @@ -623,6 +623,8 @@ static int aie2_init(struct amdxdna_dev *xdna)
>
> stop_hw:
> aie2_hw_stop(xdna);
> +free_irq_vectors:
> + pci_free_irq_vectors(pdev);
> release_fw:
> release_firmware(fw);
>
> --
> 2.43.0
>
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH] accel/amdxdna: fix IRQ vector leak in aie2_init()
2026-04-14 16:06 ` Lizhi Hou
@ 2026-04-15 2:05 ` Guangshuo Li
0 siblings, 0 replies; 3+ messages in thread
From: Guangshuo Li @ 2026-04-15 2:05 UTC (permalink / raw)
To: Lizhi Hou
Cc: Min Ma, Oded Gabbay, Jeff Hugo, George Yang, Narendra Gutta,
dri-devel, linux-kernel, stable
Hello,
Thanks for reviewing.
On Wed, 15 Apr 2026 at 00:06, Lizhi Hou <lizhi.hou@amd.com> wrote:
>
>
>
> aie2_init enables device via pcim_enable_device(), which sets managed
> device. Thus the vectors are automatically cleanup. So NAK.
>
>
> Lizhi
>
Sorry, I missed that pcim_enable_device() already makes the device
managed, so the IRQ vectors are cleaned up automatically. Thanks for
pointing it out.
Best regards,
Guangshuo
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2026-04-15 2:05 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-04-14 12:10 [PATCH] accel/amdxdna: fix IRQ vector leak in aie2_init() Guangshuo Li
2026-04-14 16:06 ` Lizhi Hou
2026-04-15 2:05 ` Guangshuo Li
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox