From: Oleksandr Natalenko <oleksandr@natalenko.name>
To: Ming Lei <ming.lei@redhat.com>
Cc: Jens Axboe <axboe@fb.com>,
linux-block@vger.kernel.org,
Christoph Hellwig <hch@infradead.org>,
linux-scsi@vger.kernel.org,
"Martin K . Petersen" <martin.petersen@oracle.com>,
"James E . J . Bottomley" <jejb@linux.vnet.ibm.com>,
Bart Van Assche <bart.vanassche@sandisk.com>,
Johannes Thumshirn <jthumshirn@suse.de>,
Cathy Avery <cavery@redhat.com>
Subject: Re: [PATCH V4 0/10] block/scsi: safe SCSI quiescing
Date: Mon, 11 Sep 2017 23:24:15 +0200 [thread overview]
Message-ID: <2499737.s63AaEEcmZ@natalenko.name> (raw)
In-Reply-To: <20170911111021.25810-1-ming.lei@redhat.com>
=46or v4 with regard to suspend/resume:
Tested-by: Oleksandr Natalenko <oleksandr@natalenko.name>
On pond=C4=9Bl=C3=AD 11. z=C3=A1=C5=99=C3=AD 2017 13:10:11 CEST Ming Lei wr=
ote:
> Hi,
>=20
> The current SCSI quiesce isn't safe and easy to trigger I/O deadlock.
>=20
> Once SCSI device is put into QUIESCE, no new request except for
> RQF_PREEMPT can be dispatched to SCSI successfully, and
> scsi_device_quiesce() just simply waits for completion of I/Os
> dispatched to SCSI stack. It isn't enough at all.
>=20
> Because new request still can be comming, but all the allocated
> requests can't be dispatched successfully, so request pool can be
> consumed up easily.
>=20
> Then request with RQF_PREEMPT can't be allocated and wait forever,
> meantime scsi_device_resume() waits for completion of RQF_PREEMPT,
> then system hangs forever, such as during system suspend or
> sending SCSI domain alidation.
>=20
> Both IO hang inside system suspend[1] or SCSI domain validation
> were reported before.
>=20
> This patch introduces preempt freeze, and solves the issue
> by preempt freezing block queue during SCSI quiesce, and allows
> to allocate request of RQF_PREEMPT when queue is in this state.
>=20
> Oleksandr verified that V3 does fix the hang during suspend/resume,
> and Cathy verified that revised V3 fixes hang in sending
> SCSI domain validation.
>=20
> Both SCSI and SCSI_MQ have this IO deadlock issue, this patch fixes
> them all by introducing/unifying blk_freeze_queue_preempt() and
> blk_unfreeze_queue_preempt(), and cleanup is done together.
>=20
> The patchset can be found in the following gitweb:
>=20
> https://github.com/ming1/linux/tree/blk_safe_scsi_quiesce_V4
>=20
> V4:
> - reorganize patch order to make it more reasonable
> - support nested preempt freeze, as required by SCSI transport spi
> - check preempt freezing in slow path of of blk_queue_enter()
> - add "SCSI: transport_spi: resume a quiesced device"
> - wake up freeze queue in setting dying for both blk-mq and legacy
> - rename blk_mq_[freeze|unfreeze]_queue() in one patch
> - rename .mq_freeze_wq and .mq_freeze_depth
> - improve comment
>=20
> V3:
> - introduce q->preempt_unfreezing to fix one bug of preempt freeze
> - call blk_queue_enter_live() only when queue is preempt frozen
> - cleanup a bit on the implementation of preempt freeze
> - only patch 6 and 7 are changed
>=20
> V2:
> - drop the 1st patch in V1 because percpu_ref_is_dying() is
> enough as pointed by Tejun
> - introduce preempt version of blk_[freeze|unfreeze]_queue
> - sync between preempt freeze and normal freeze
> - fix warning from percpu-refcount as reported by Oleksandr
>=20
>=20
> [1] https://marc.info/?t=3D150340250100013&r=3D3&w=3D2
>=20
>=20
> Thanks,
> Ming
>=20
>=20
> Ming Lei (10):
> blk-mq: only run hw queues for blk-mq
> block: tracking request allocation with q_usage_counter
> blk-mq: rename blk_mq_[freeze|unfreeze]_queue
> blk-mq: rename blk_mq_freeze_queue_wait as blk_freeze_queue_wait
> block: rename .mq_freeze_wq and .mq_freeze_depth
> block: pass flags to blk_queue_enter()
> block: introduce preempt version of blk_[freeze|unfreeze]_queue
> block: allow to allocate req with RQF_PREEMPT when queue is preempt
> frozen
> SCSI: transport_spi: resume a quiesced device
> SCSI: preempt freeze block queue when SCSI device is put into quiesce
>=20
> block/bfq-iosched.c | 2 +-
> block/blk-cgroup.c | 8 +-
> block/blk-core.c | 95 ++++++++++++++++----
> block/blk-mq.c | 180
> ++++++++++++++++++++++++++++---------- block/blk-mq.h =
|=20
> 1 -
> block/blk-timeout.c | 2 +-
> block/blk.h | 12 +++
> block/elevator.c | 4 +-
> drivers/block/loop.c | 24 ++---
> drivers/block/rbd.c | 2 +-
> drivers/nvme/host/core.c | 8 +-
> drivers/scsi/scsi_lib.c | 25 +++++-
> drivers/scsi/scsi_transport_spi.c | 3 +
> fs/block_dev.c | 4 +-
> include/linux/blk-mq.h | 15 ++--
> include/linux/blkdev.h | 32 +++++--
> 16 files changed, 313 insertions(+), 104 deletions(-)
WARNING: multiple messages have this Message-ID (diff)
From: Oleksandr Natalenko <oleksandr@natalenko.name>
To: Ming Lei <ming.lei@redhat.com>
Cc: Jens Axboe <axboe@fb.com>,
linux-block@vger.kernel.org,
Christoph Hellwig <hch@infradead.org>,
linux-scsi@vger.kernel.org,
"Martin K . Petersen" <martin.petersen@oracle.com>,
"James E . J . Bottomley" <jejb@linux.vnet.ibm.com>,
Bart Van Assche <bart.vanassche@sandisk.com>,
Johannes Thumshirn <jthumshirn@suse.de>,
Cathy Avery <cavery@redhat.com>
Subject: Re: [PATCH V4 0/10] block/scsi: safe SCSI quiescing
Date: Mon, 11 Sep 2017 23:24:15 +0200 [thread overview]
Message-ID: <2499737.s63AaEEcmZ@natalenko.name> (raw)
In-Reply-To: <20170911111021.25810-1-ming.lei@redhat.com>
For v4 with regard to suspend/resume:
Tested-by: Oleksandr Natalenko <oleksandr@natalenko.name>
On pondělí 11. září 2017 13:10:11 CEST Ming Lei wrote:
> Hi,
>
> The current SCSI quiesce isn't safe and easy to trigger I/O deadlock.
>
> Once SCSI device is put into QUIESCE, no new request except for
> RQF_PREEMPT can be dispatched to SCSI successfully, and
> scsi_device_quiesce() just simply waits for completion of I/Os
> dispatched to SCSI stack. It isn't enough at all.
>
> Because new request still can be comming, but all the allocated
> requests can't be dispatched successfully, so request pool can be
> consumed up easily.
>
> Then request with RQF_PREEMPT can't be allocated and wait forever,
> meantime scsi_device_resume() waits for completion of RQF_PREEMPT,
> then system hangs forever, such as during system suspend or
> sending SCSI domain alidation.
>
> Both IO hang inside system suspend[1] or SCSI domain validation
> were reported before.
>
> This patch introduces preempt freeze, and solves the issue
> by preempt freezing block queue during SCSI quiesce, and allows
> to allocate request of RQF_PREEMPT when queue is in this state.
>
> Oleksandr verified that V3 does fix the hang during suspend/resume,
> and Cathy verified that revised V3 fixes hang in sending
> SCSI domain validation.
>
> Both SCSI and SCSI_MQ have this IO deadlock issue, this patch fixes
> them all by introducing/unifying blk_freeze_queue_preempt() and
> blk_unfreeze_queue_preempt(), and cleanup is done together.
>
> The patchset can be found in the following gitweb:
>
> https://github.com/ming1/linux/tree/blk_safe_scsi_quiesce_V4
>
> V4:
> - reorganize patch order to make it more reasonable
> - support nested preempt freeze, as required by SCSI transport spi
> - check preempt freezing in slow path of of blk_queue_enter()
> - add "SCSI: transport_spi: resume a quiesced device"
> - wake up freeze queue in setting dying for both blk-mq and legacy
> - rename blk_mq_[freeze|unfreeze]_queue() in one patch
> - rename .mq_freeze_wq and .mq_freeze_depth
> - improve comment
>
> V3:
> - introduce q->preempt_unfreezing to fix one bug of preempt freeze
> - call blk_queue_enter_live() only when queue is preempt frozen
> - cleanup a bit on the implementation of preempt freeze
> - only patch 6 and 7 are changed
>
> V2:
> - drop the 1st patch in V1 because percpu_ref_is_dying() is
> enough as pointed by Tejun
> - introduce preempt version of blk_[freeze|unfreeze]_queue
> - sync between preempt freeze and normal freeze
> - fix warning from percpu-refcount as reported by Oleksandr
>
>
> [1] https://marc.info/?t=150340250100013&r=3&w=2
>
>
> Thanks,
> Ming
>
>
> Ming Lei (10):
> blk-mq: only run hw queues for blk-mq
> block: tracking request allocation with q_usage_counter
> blk-mq: rename blk_mq_[freeze|unfreeze]_queue
> blk-mq: rename blk_mq_freeze_queue_wait as blk_freeze_queue_wait
> block: rename .mq_freeze_wq and .mq_freeze_depth
> block: pass flags to blk_queue_enter()
> block: introduce preempt version of blk_[freeze|unfreeze]_queue
> block: allow to allocate req with RQF_PREEMPT when queue is preempt
> frozen
> SCSI: transport_spi: resume a quiesced device
> SCSI: preempt freeze block queue when SCSI device is put into quiesce
>
> block/bfq-iosched.c | 2 +-
> block/blk-cgroup.c | 8 +-
> block/blk-core.c | 95 ++++++++++++++++----
> block/blk-mq.c | 180
> ++++++++++++++++++++++++++++---------- block/blk-mq.h |
> 1 -
> block/blk-timeout.c | 2 +-
> block/blk.h | 12 +++
> block/elevator.c | 4 +-
> drivers/block/loop.c | 24 ++---
> drivers/block/rbd.c | 2 +-
> drivers/nvme/host/core.c | 8 +-
> drivers/scsi/scsi_lib.c | 25 +++++-
> drivers/scsi/scsi_transport_spi.c | 3 +
> fs/block_dev.c | 4 +-
> include/linux/blk-mq.h | 15 ++--
> include/linux/blkdev.h | 32 +++++--
> 16 files changed, 313 insertions(+), 104 deletions(-)
next prev parent reply other threads:[~2017-09-11 21:24 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-09-11 11:10 [PATCH V4 0/10] block/scsi: safe SCSI quiescing Ming Lei
2017-09-11 11:10 ` [PATCH V4 01/10] blk-mq: only run hw queues for blk-mq Ming Lei
2017-09-11 11:10 ` [PATCH V4 02/10] block: tracking request allocation with q_usage_counter Ming Lei
2017-09-11 11:10 ` [PATCH V4 03/10] blk-mq: rename blk_mq_[freeze|unfreeze]_queue Ming Lei
2017-09-11 11:10 ` [PATCH V4 04/10] blk-mq: rename blk_mq_freeze_queue_wait as blk_freeze_queue_wait Ming Lei
2017-09-11 11:10 ` [PATCH V4 05/10] block: rename .mq_freeze_wq and .mq_freeze_depth Ming Lei
2017-09-11 11:10 ` [PATCH V4 06/10] block: pass flags to blk_queue_enter() Ming Lei
2017-09-11 11:10 ` [PATCH V4 07/10] block: introduce preempt version of blk_[freeze|unfreeze]_queue Ming Lei
2017-09-11 11:10 ` [PATCH V4 08/10] block: allow to allocate req with RQF_PREEMPT when queue is preempt frozen Ming Lei
2017-09-11 16:03 ` Bart Van Assche
2017-09-11 16:03 ` Bart Van Assche
2017-09-12 3:40 ` Ming Lei
2017-09-13 16:48 ` Ming Lei
2017-09-13 17:28 ` Bart Van Assche
2017-09-13 17:28 ` Bart Van Assche
2017-09-13 17:48 ` Ming Lei
2017-09-13 19:07 ` Bart Van Assche
2017-09-13 19:07 ` Bart Van Assche
2017-09-14 1:15 ` Ming Lei
2017-09-14 13:37 ` Bart Van Assche
2017-09-14 13:37 ` Bart Van Assche
2017-09-14 16:18 ` Ming Lei
2017-09-11 11:10 ` [PATCH V4 09/10] SCSI: transport_spi: resume a quiesced device Ming Lei
2017-09-11 11:10 ` [PATCH V4 10/10] SCSI: preempt freeze block queue when SCSI device is put into quiesce Ming Lei
2017-09-11 21:24 ` Oleksandr Natalenko [this message]
2017-09-11 21:24 ` [PATCH V4 0/10] block/scsi: safe SCSI quiescing Oleksandr Natalenko
2017-09-12 19:03 ` Cathy Avery
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2499737.s63AaEEcmZ@natalenko.name \
--to=oleksandr@natalenko.name \
--cc=axboe@fb.com \
--cc=bart.vanassche@sandisk.com \
--cc=cavery@redhat.com \
--cc=hch@infradead.org \
--cc=jejb@linux.vnet.ibm.com \
--cc=jthumshirn@suse.de \
--cc=linux-block@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=martin.petersen@oracle.com \
--cc=ming.lei@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.