* [PATCH] nvmet: loop: kill timeout handler
@ 2019-04-15 1:51 Ming Lei
2019-04-22 9:55 ` Ming Lei
` (3 more replies)
0 siblings, 4 replies; 5+ messages in thread
From: Ming Lei @ 2019-04-15 1:51 UTC (permalink / raw)
Firstly it doesn't make sense to handle timeout for loop: 1) for admin
queue, the request is always completed in code path of queuing IO. 2)
for normal IO request, the timeout on these IOs have been handled by
underlying queue already.
Secondly nvme-loop's timeout handler is simply broken, and easy to
cause issue: 1) no any sync/protection between timeout and normal
completion, and now it is driver's responsibility to deal with
that; 2) bad reset implementation, blk_mq_update_nr_hw_queues()
is called after all NSs's queue is stopped(quiesced), and easy
to trigger deadlock.
So kill the timeout handler.
Cc: Keith Busch <keith.busch at intel.com>
Cc: James Smart <jsmart2021 at gmail.com>
Cc: Hannes Reinecke <hare at suse.de>
Cc: Sagi Grimberg <sagi at grimberg.me>
Signed-off-by: Ming Lei <ming.lei at redhat.com>
---
drivers/nvme/target/loop.c | 16 ----------------
1 file changed, 16 deletions(-)
diff --git a/drivers/nvme/target/loop.c b/drivers/nvme/target/loop.c
index b9f623ab01f3..c44b425848fb 100644
--- a/drivers/nvme/target/loop.c
+++ b/drivers/nvme/target/loop.c
@@ -129,20 +129,6 @@ static void nvme_loop_execute_work(struct work_struct *work)
nvmet_req_execute(&iod->req);
}
-static enum blk_eh_timer_return
-nvme_loop_timeout(struct request *rq, bool reserved)
-{
- struct nvme_loop_iod *iod = blk_mq_rq_to_pdu(rq);
-
- /* queue error recovery */
- nvme_reset_ctrl(&iod->queue->ctrl->ctrl);
-
- /* fail with DNR on admin cmd timeout */
- nvme_req(rq)->status = NVME_SC_ABORT_REQ | NVME_SC_DNR;
-
- return BLK_EH_DONE;
-}
-
static blk_status_t nvme_loop_queue_rq(struct blk_mq_hw_ctx *hctx,
const struct blk_mq_queue_data *bd)
{
@@ -253,7 +239,6 @@ static const struct blk_mq_ops nvme_loop_mq_ops = {
.complete = nvme_loop_complete_rq,
.init_request = nvme_loop_init_request,
.init_hctx = nvme_loop_init_hctx,
- .timeout = nvme_loop_timeout,
};
static const struct blk_mq_ops nvme_loop_admin_mq_ops = {
@@ -261,7 +246,6 @@ static const struct blk_mq_ops nvme_loop_admin_mq_ops = {
.complete = nvme_loop_complete_rq,
.init_request = nvme_loop_init_request,
.init_hctx = nvme_loop_init_admin_hctx,
- .timeout = nvme_loop_timeout,
};
static void nvme_loop_destroy_admin_queue(struct nvme_loop_ctrl *ctrl)
--
2.9.5
^ permalink raw reply related [flat|nested] 5+ messages in thread
* [PATCH] nvmet: loop: kill timeout handler
2019-04-15 1:51 [PATCH] nvmet: loop: kill timeout handler Ming Lei
@ 2019-04-22 9:55 ` Ming Lei
2019-04-22 14:57 ` Keith Busch
` (2 subsequent siblings)
3 siblings, 0 replies; 5+ messages in thread
From: Ming Lei @ 2019-04-22 9:55 UTC (permalink / raw)
On Mon, Apr 15, 2019@9:52 AM Ming Lei <ming.lei@redhat.com> wrote:
>
> Firstly it doesn't make sense to handle timeout for loop: 1) for admin
> queue, the request is always completed in code path of queuing IO. 2)
> for normal IO request, the timeout on these IOs have been handled by
> underlying queue already.
>
> Secondly nvme-loop's timeout handler is simply broken, and easy to
> cause issue: 1) no any sync/protection between timeout and normal
> completion, and now it is driver's responsibility to deal with
> that; 2) bad reset implementation, blk_mq_update_nr_hw_queues()
> is called after all NSs's queue is stopped(quiesced), and easy
> to trigger deadlock.
>
> So kill the timeout handler.
>
> Cc: Keith Busch <keith.busch at intel.com>
> Cc: James Smart <jsmart2021 at gmail.com>
> Cc: Hannes Reinecke <hare at suse.de>
> Cc: Sagi Grimberg <sagi at grimberg.me>
> Signed-off-by: Ming Lei <ming.lei at redhat.com>
> ---
> drivers/nvme/target/loop.c | 16 ----------------
> 1 file changed, 16 deletions(-)
>
> diff --git a/drivers/nvme/target/loop.c b/drivers/nvme/target/loop.c
> index b9f623ab01f3..c44b425848fb 100644
> --- a/drivers/nvme/target/loop.c
> +++ b/drivers/nvme/target/loop.c
> @@ -129,20 +129,6 @@ static void nvme_loop_execute_work(struct work_struct *work)
> nvmet_req_execute(&iod->req);
> }
>
> -static enum blk_eh_timer_return
> -nvme_loop_timeout(struct request *rq, bool reserved)
> -{
> - struct nvme_loop_iod *iod = blk_mq_rq_to_pdu(rq);
> -
> - /* queue error recovery */
> - nvme_reset_ctrl(&iod->queue->ctrl->ctrl);
> -
> - /* fail with DNR on admin cmd timeout */
> - nvme_req(rq)->status = NVME_SC_ABORT_REQ | NVME_SC_DNR;
> -
> - return BLK_EH_DONE;
> -}
> -
> static blk_status_t nvme_loop_queue_rq(struct blk_mq_hw_ctx *hctx,
> const struct blk_mq_queue_data *bd)
> {
> @@ -253,7 +239,6 @@ static const struct blk_mq_ops nvme_loop_mq_ops = {
> .complete = nvme_loop_complete_rq,
> .init_request = nvme_loop_init_request,
> .init_hctx = nvme_loop_init_hctx,
> - .timeout = nvme_loop_timeout,
> };
>
> static const struct blk_mq_ops nvme_loop_admin_mq_ops = {
> @@ -261,7 +246,6 @@ static const struct blk_mq_ops nvme_loop_admin_mq_ops = {
> .complete = nvme_loop_complete_rq,
> .init_request = nvme_loop_init_request,
> .init_hctx = nvme_loop_init_admin_hctx,
> - .timeout = nvme_loop_timeout,
> };
>
> static void nvme_loop_destroy_admin_queue(struct nvme_loop_ctrl *ctrl)
> --
> 2.9.5
>
>
> _______________________________________________
> Linux-nvme mailing list
> Linux-nvme at lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/linux-nvme
Ping...
Thanks,
Ming Lei
^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH] nvmet: loop: kill timeout handler
2019-04-15 1:51 [PATCH] nvmet: loop: kill timeout handler Ming Lei
2019-04-22 9:55 ` Ming Lei
@ 2019-04-22 14:57 ` Keith Busch
2019-04-24 16:11 ` Sagi Grimberg
2019-04-25 14:52 ` Christoph Hellwig
3 siblings, 0 replies; 5+ messages in thread
From: Keith Busch @ 2019-04-22 14:57 UTC (permalink / raw)
On Sun, Apr 14, 2019@06:51:46PM -0700, Ming Lei wrote:
> Firstly it doesn't make sense to handle timeout for loop: 1) for admin
> queue, the request is always completed in code path of queuing IO. 2)
> for normal IO request, the timeout on these IOs have been handled by
> underlying queue already.
>
> Secondly nvme-loop's timeout handler is simply broken, and easy to
> cause issue: 1) no any sync/protection between timeout and normal
> completion, and now it is driver's responsibility to deal with
> that; 2) bad reset implementation, blk_mq_update_nr_hw_queues()
> is called after all NSs's queue is stopped(quiesced), and easy
> to trigger deadlock.
>
> So kill the timeout handler.
Makes sense to me, looks good.
Reviewd-by: Keith Busch <keith.busch at intel.com>
^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH] nvmet: loop: kill timeout handler
2019-04-15 1:51 [PATCH] nvmet: loop: kill timeout handler Ming Lei
2019-04-22 9:55 ` Ming Lei
2019-04-22 14:57 ` Keith Busch
@ 2019-04-24 16:11 ` Sagi Grimberg
2019-04-25 14:52 ` Christoph Hellwig
3 siblings, 0 replies; 5+ messages in thread
From: Sagi Grimberg @ 2019-04-24 16:11 UTC (permalink / raw)
Reviewed-by: Sagi Grimberg <sagi at grimberg.me>
^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH] nvmet: loop: kill timeout handler
2019-04-15 1:51 [PATCH] nvmet: loop: kill timeout handler Ming Lei
` (2 preceding siblings ...)
2019-04-24 16:11 ` Sagi Grimberg
@ 2019-04-25 14:52 ` Christoph Hellwig
3 siblings, 0 replies; 5+ messages in thread
From: Christoph Hellwig @ 2019-04-25 14:52 UTC (permalink / raw)
Thanks,
applied to nvme-5.2.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2019-04-25 14:52 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2019-04-15 1:51 [PATCH] nvmet: loop: kill timeout handler Ming Lei
2019-04-22 9:55 ` Ming Lei
2019-04-22 14:57 ` Keith Busch
2019-04-24 16:11 ` Sagi Grimberg
2019-04-25 14:52 ` Christoph Hellwig
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.