From: Christoph Hellwig <hch@infradead.org>
To: Ilya Dryomov <idryomov@gmail.com>
Cc: Christoph Hellwig <hch@infradead.org>,
"Martin K. Petersen" <martin.petersen@oracle.com>,
Hannes Reinecke <hare@suse.com>, Jens Axboe <axboe@fb.com>,
linux-block@vger.kernel.org
Subject: Re: [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
Date: Tue, 3 Oct 2017 01:04:44 -0700 [thread overview]
Message-ID: <20171003080444.GB8345@infradead.org> (raw)
In-Reply-To: <1506013972-23049-3-git-send-email-idryomov@gmail.com>
On Thu, Sep 21, 2017 at 07:12:52PM +0200, Ilya Dryomov wrote:
> sd_config_write_same() ignores ->max_ws_blocks == 0 and resets it to
> permit trying WRITE SAME on older SCSI devices, unless ->no_write_same
> is set. Because REQ_OP_WRITE_ZEROES is implemented in terms of WRITE
> SAME, blkdev_issue_zeroout() may fail with -EREMOTEIO:
>
> $ fallocate -zn -l 1k /dev/sdg
> fallocate: fallocate failed: Remote I/O error
> $ fallocate -zn -l 1k /dev/sdg # OK
> $ fallocate -zn -l 1k /dev/sdg # OK
>
> The following calls succeed because sd_done() sets ->no_write_same in
> response to a sense that would become BLK_STS_TARGET/-EREMOTEIO, causing
> __blkdev_issue_zeroout() to fall back to generating ZERO_PAGE bios.
>
> This means blkdev_issue_zeroout() must cope with WRITE ZEROES failing
> and fall back to manually zeroing, unless BLKDEV_ZERO_NOFALLBACK is
> specified. For BLKDEV_ZERO_NOFALLBACK case, return -EOPNOTSUPP if
> sd_done() has just set ->no_write_same thus indicating lack of offload
> support.
>
> Fixes: c20cfc27a473 ("block: stop using blkdev_issue_write_same for zeroing")
> Cc: Christoph Hellwig <hch@lst.de>
> Cc: "Martin K. Petersen" <martin.petersen@oracle.com>
> Cc: Hannes Reinecke <hare@suse.com>
> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
> ---
> block/blk-lib.c | 27 +++++++++++++++++++++------
> 1 file changed, 21 insertions(+), 6 deletions(-)
>
> diff --git a/block/blk-lib.c b/block/blk-lib.c
> index 6b97feb71065..1cb402beb983 100644
> --- a/block/blk-lib.c
> +++ b/block/blk-lib.c
> @@ -316,12 +316,6 @@ static void __blkdev_issue_zero_pages(struct block_device *bdev,
> * Zero-fill a block range, either using hardware offload or by explicitly
> * writing zeroes to the device.
> *
> - * Note that this function may fail with -EOPNOTSUPP if the driver signals
> - * zeroing offload support, but the device fails to process the command (for
> - * some devices there is no non-destructive way to verify whether this
> - * operation is actually supported). In this case the caller should call
> - * retry the call to blkdev_issue_zeroout() and the fallback path will be used.
> - *
> * If a device is using logical block provisioning, the underlying space will
> * not be released if %flags contains BLKDEV_ZERO_NOUNMAP.
> *
> @@ -374,6 +368,27 @@ int blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
> &bio, flags);
> if (ret == 0 && bio) {
> ret = submit_bio_wait(bio);
> + /*
> + * Fall back to a manual zeroout on any error, if allowed.
> + *
> + * Particularly, WRITE ZEROES may fail with -EREMOTEIO if the
> + * driver signals zeroing offload support, but the device
> + * fails to process the command (for some devices there is no
> + * non-destructive way to verify whether this operation is
> + * actually supported).
> + */
> + if (ret && bio_op(bio) == REQ_OP_WRITE_ZEROES) {
No need for the additional levels of indentation here. Also I
really do not like the logic, we shouldn't have to duplicate much
of the logic multiple times.
I'd more go for something like (sketched in mail):
bool try_write_zeroes = !!bdev_write_zeroes_sectors(bdev);
retry:
bio = NULL;
blk_start_plug(&plug);
if (try_write_zeroes)
ret = __blkdev_issue_write_zeroes(...)
else
ret = __blkdev_issue_zero_pages(...)
if (ret == 0 && bio) {
ret = submit_bio_wait(bio);
bio_put(bio);
}
blk_finish_plug(&plug);
if (ret && try_write_zeroes) {
try_write_zeroes = false;
goto retry;
}
next prev parent reply other threads:[~2017-10-03 8:04 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-09-21 17:12 [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
2017-10-03 7:56 ` Christoph Hellwig
2017-09-21 17:12 ` [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-10-03 8:04 ` Christoph Hellwig [this message]
2017-10-04 14:56 ` Ilya Dryomov
2017-10-02 6:25 ` [PATCH 0/2] " Ilya Dryomov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20171003080444.GB8345@infradead.org \
--to=hch@infradead.org \
--cc=axboe@fb.com \
--cc=hare@suse.com \
--cc=idryomov@gmail.com \
--cc=linux-block@vger.kernel.org \
--cc=martin.petersen@oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).