* [PATCH v2 0/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails @ 2025-07-16 11:15 Rick Wertenbroek 2025-07-16 11:15 ` [PATCH v2 1/1] " Rick Wertenbroek 2025-07-17 11:40 ` [PATCH v2 0/1] " Christoph Hellwig 0 siblings, 2 replies; 5+ messages in thread From: Rick Wertenbroek @ 2025-07-16 11:15 UTC (permalink / raw) Cc: rick.wertenbroek, dlemoal, alberto.dassatti, Rick Wertenbroek, stable, Christoph Hellwig, Sagi Grimberg, Chaitanya Kulkarni, Manivannan Sadhasivam, Keith Busch, Krzysztof Wilczyński, linux-nvme, linux-kernel Changes from v1 : - Updated comment for nvmet_pci_epf_queue_response() per Damien's suggestion. - Fixed typo in commit message. - Added 3 tags in commit message: Reviewed-by: Damien Le Moal <dlemoal@kernel.org> Fixes: 0faa0fe6f90e ("nvmet: New NVMe PCI endpoint function target driver") Cc: stable@vger.kernel.org Best regards, Rick Rick Wertenbroek (1): nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails drivers/nvme/target/pci-epf.c | 23 ++++++++++++++++------- 1 file changed, 16 insertions(+), 7 deletions(-) -- 2.25.1 ^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH v2 1/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails 2025-07-16 11:15 [PATCH v2 0/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails Rick Wertenbroek @ 2025-07-16 11:15 ` Rick Wertenbroek 2025-07-16 21:56 ` Chaitanya Kulkarni 2025-07-17 11:40 ` [PATCH v2 0/1] " Christoph Hellwig 1 sibling, 1 reply; 5+ messages in thread From: Rick Wertenbroek @ 2025-07-16 11:15 UTC (permalink / raw) Cc: rick.wertenbroek, dlemoal, alberto.dassatti, Rick Wertenbroek, stable, Christoph Hellwig, Sagi Grimberg, Chaitanya Kulkarni, Krzysztof Wilczyński, Manivannan Sadhasivam, Keith Busch, linux-nvme, linux-kernel Have nvmet_req_init() and req->execute() complete failed commands. Description of the problem: nvmet_req_init() calls __nvmet_req_complete() internally upon failure, e.g., unsupported opcode, which calls the "queue_response" callback, this results in nvmet_pci_epf_queue_response() being called, which will call nvmet_pci_epf_complete_iod() if data_len is 0 or if dma_dir is different from DMA_TO_DEVICE. This results in a double completion as nvmet_pci_epf_exec_iod_work() also calls nvmet_pci_epf_complete_iod() when nvmet_req_init() fails. Steps to reproduce: On the host send a command with an unsupported opcode with nvme-cli, For example the admin command "security receive" $ sudo nvme security-recv /dev/nvme0n1 -n1 -x4096 This triggers a double completion as nvmet_req_init() fails and nvmet_pci_epf_queue_response() is called, here iod->dma_dir is still in the default state of "DMA_NONE" as set by default in nvmet_pci_epf_alloc_iod(), so nvmet_pci_epf_complete_iod() is called. Because nvmet_req_init() failed nvmet_pci_epf_complete_iod() is also called in nvmet_pci_epf_exec_iod_work() leading to a double completion. This not only sends two completions to the host but also corrupts the state of the PCI NVMe target leading to kernel oops. This patch lets nvmet_req_init() and req->execute() complete all failed commands, and removes the double completion case in nvmet_pci_epf_exec_iod_work() therefore fixing the edge cases where double completions occurred. Signed-off-by: Rick Wertenbroek <rick.wertenbroek@gmail.com> Reviewed-by: Damien Le Moal <dlemoal@kernel.org> Fixes: 0faa0fe6f90e ("nvmet: New NVMe PCI endpoint function target driver") Cc: stable@vger.kernel.org --- drivers/nvme/target/pci-epf.c | 23 ++++++++++++++++------- 1 file changed, 16 insertions(+), 7 deletions(-) diff --git a/drivers/nvme/target/pci-epf.c b/drivers/nvme/target/pci-epf.c index a4295a5b8d28..9cd470938463 100644 --- a/drivers/nvme/target/pci-epf.c +++ b/drivers/nvme/target/pci-epf.c @@ -1242,8 +1242,11 @@ static void nvmet_pci_epf_queue_response(struct nvmet_req *req) iod->status = le16_to_cpu(req->cqe->status) >> 1; - /* If we have no data to transfer, directly complete the command. */ - if (!iod->data_len || iod->dma_dir != DMA_TO_DEVICE) { + /* + * If the command failed or we have no data to transfer, complete + * the command immediately. + */ + if (iod->status || !iod->data_len || iod->dma_dir != DMA_TO_DEVICE) { nvmet_pci_epf_complete_iod(iod); return; } @@ -1604,8 +1607,13 @@ static void nvmet_pci_epf_exec_iod_work(struct work_struct *work) goto complete; } + /* + * If nvmet_req_init() fails (e.g., unsupported opcode) it will call + * __nvmet_req_complete() internally which will call + * nvmet_pci_epf_queue_response() and will complete the command directly. + */ if (!nvmet_req_init(req, &iod->sq->nvme_sq, &nvmet_pci_epf_fabrics_ops)) - goto complete; + return; iod->data_len = nvmet_req_transfer_len(req); if (iod->data_len) { @@ -1643,10 +1651,11 @@ static void nvmet_pci_epf_exec_iod_work(struct work_struct *work) wait_for_completion(&iod->done); - if (iod->status == NVME_SC_SUCCESS) { - WARN_ON_ONCE(!iod->data_len || iod->dma_dir != DMA_TO_DEVICE); - nvmet_pci_epf_transfer_iod_data(iod); - } + if (iod->status != NVME_SC_SUCCESS) + return; + + WARN_ON_ONCE(!iod->data_len || iod->dma_dir != DMA_TO_DEVICE); + nvmet_pci_epf_transfer_iod_data(iod); complete: nvmet_pci_epf_complete_iod(iod); -- 2.25.1 ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v2 1/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails 2025-07-16 11:15 ` [PATCH v2 1/1] " Rick Wertenbroek @ 2025-07-16 21:56 ` Chaitanya Kulkarni 2025-07-16 22:42 ` Damien Le Moal 0 siblings, 1 reply; 5+ messages in thread From: Chaitanya Kulkarni @ 2025-07-16 21:56 UTC (permalink / raw) To: Rick Wertenbroek Cc: rick.wertenbroek@heig-vd.ch, dlemoal@kernel.org, alberto.dassatti@heig-vd.ch, stable@vger.kernel.org, Christoph Hellwig, Sagi Grimberg, Chaitanya Kulkarni, Krzysztof Wilczyński, Manivannan Sadhasivam, Keith Busch, linux-nvme@lists.infradead.org, linux-kernel@vger.kernel.org On 7/16/25 04:15, Rick Wertenbroek wrote: > Have nvmet_req_init() and req->execute() complete failed commands. > > Description of the problem: > nvmet_req_init() calls __nvmet_req_complete() internally upon failure, > e.g., unsupported opcode, which calls the "queue_response" callback, > this results in nvmet_pci_epf_queue_response() being called, which will > call nvmet_pci_epf_complete_iod() if data_len is 0 or if dma_dir is > different from DMA_TO_DEVICE. This results in a double completion as > nvmet_pci_epf_exec_iod_work() also calls nvmet_pci_epf_complete_iod() > when nvmet_req_init() fails. > > Steps to reproduce: > On the host send a command with an unsupported opcode with nvme-cli, > For example the admin command "security receive" > $ sudo nvme security-recv /dev/nvme0n1 -n1 -x4096 > > This triggers a double completion as nvmet_req_init() fails and > nvmet_pci_epf_queue_response() is called, here iod->dma_dir is still > in the default state of "DMA_NONE" as set by default in > nvmet_pci_epf_alloc_iod(), so nvmet_pci_epf_complete_iod() is called. > Because nvmet_req_init() failed nvmet_pci_epf_complete_iod() is also > called in nvmet_pci_epf_exec_iod_work() leading to a double completion. > This not only sends two completions to the host but also corrupts the > state of the PCI NVMe target leading to kernel oops. > > This patch lets nvmet_req_init() and req->execute() complete all failed > commands, and removes the double completion case in > nvmet_pci_epf_exec_iod_work() therefore fixing the edge cases where > double completions occurred. > > Signed-off-by: Rick Wertenbroek<rick.wertenbroek@gmail.com> > Reviewed-by: Damien Le Moal<dlemoal@kernel.org> > Fixes: 0faa0fe6f90e ("nvmet: New NVMe PCI endpoint function target driver") > Cc:stable@vger.kernel.org Good catch, looks good, I wish we have tests for this part of target to it will get tested on regular basis, not the requirement, just a thought. Reviewed-by: Chaitanya Kulkarni <kch@nvidia.com> -ck ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2 1/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails 2025-07-16 21:56 ` Chaitanya Kulkarni @ 2025-07-16 22:42 ` Damien Le Moal 0 siblings, 0 replies; 5+ messages in thread From: Damien Le Moal @ 2025-07-16 22:42 UTC (permalink / raw) To: Chaitanya Kulkarni, Rick Wertenbroek Cc: rick.wertenbroek@heig-vd.ch, alberto.dassatti@heig-vd.ch, stable@vger.kernel.org, Christoph Hellwig, Sagi Grimberg, Krzysztof Wilczyński, Manivannan Sadhasivam, Keith Busch, linux-nvme@lists.infradead.org, linux-kernel@vger.kernel.org, Shin'ichiro Kawasaki On 7/17/25 06:56, Chaitanya Kulkarni wrote: > Good catch, looks good, I wish we have tests for this part of target > to it will get tested on regular basis, not the requirement, just > a thought. qemu does not have a PCI endpoint capable controller device and you cannot link 2 VMs to communicate over PCIe (one VM as host the other as endpoint). So unless you get a PCIe-endpoint capable board, you cannot run this driver easily. We can add a blktest case for sending an unsupported command though. That is easy to do. But FYI, right now, running blktest/nvme group against a pci-epf device, we get a hang... Shin'ichiro is looking into that. -- Damien Le Moal Western Digital Research ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2 0/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails 2025-07-16 11:15 [PATCH v2 0/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails Rick Wertenbroek 2025-07-16 11:15 ` [PATCH v2 1/1] " Rick Wertenbroek @ 2025-07-17 11:40 ` Christoph Hellwig 1 sibling, 0 replies; 5+ messages in thread From: Christoph Hellwig @ 2025-07-17 11:40 UTC (permalink / raw) To: Rick Wertenbroek Cc: rick.wertenbroek, dlemoal, alberto.dassatti, stable, Christoph Hellwig, Sagi Grimberg, Chaitanya Kulkarni, Manivannan Sadhasivam, Keith Busch, Krzysztof Wilczyński, linux-nvme, linux-kernel Thanks, applied to nvme-6.17. ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2025-07-17 11:40 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2025-07-16 11:15 [PATCH v2 0/1] nvmet: pci-epf: Do not complete commands twice if nvmet_req_init() fails Rick Wertenbroek 2025-07-16 11:15 ` [PATCH v2 1/1] " Rick Wertenbroek 2025-07-16 21:56 ` Chaitanya Kulkarni 2025-07-16 22:42 ` Damien Le Moal 2025-07-17 11:40 ` [PATCH v2 0/1] " Christoph Hellwig
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).