* [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
@ 2017-09-21 17:12 Ilya Dryomov
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Ilya Dryomov @ 2017-09-21 17:12 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Martin K. Petersen, Hannes Reinecke, Jens Axboe, linux-block
Hi Christoph, Martin,
blkdev_issue_zeroout() now checks for any error. This required a minor
refactor, so I dropped the stable tag, Jens can add it back if needed.
Previous patch and discussion at
https://marc.info/?l=linux-block&m=150471953327942&w=2
Thanks,
Ilya
Ilya Dryomov (2):
block: factor out __blkdev_issue_zero_pages()
block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
block/blk-lib.c | 85 +++++++++++++++++++++++++++++++++++----------------------
1 file changed, 53 insertions(+), 32 deletions(-)
--
2.4.3
^ permalink raw reply [flat|nested] 7+ messages in thread
* [PATCH 1/2] block: factor out __blkdev_issue_zero_pages()
2017-09-21 17:12 [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
@ 2017-09-21 17:12 ` Ilya Dryomov
2017-10-03 7:56 ` Christoph Hellwig
2017-09-21 17:12 ` [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-10-02 6:25 ` [PATCH 0/2] " Ilya Dryomov
2 siblings, 1 reply; 7+ messages in thread
From: Ilya Dryomov @ 2017-09-21 17:12 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Martin K. Petersen, Hannes Reinecke, Jens Axboe, linux-block
blkdev_issue_zeroout() will use this in !BLKDEV_ZERO_NOFALLBACK case.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
---
block/blk-lib.c | 58 +++++++++++++++++++++++++++++++--------------------------
1 file changed, 32 insertions(+), 26 deletions(-)
diff --git a/block/blk-lib.c b/block/blk-lib.c
index 62240f8832ca..6b97feb71065 100644
--- a/block/blk-lib.c
+++ b/block/blk-lib.c
@@ -274,6 +274,35 @@ static unsigned int __blkdev_sectors_to_bio_pages(sector_t nr_sects)
return min(pages, (sector_t)BIO_MAX_PAGES);
}
+static void __blkdev_issue_zero_pages(struct block_device *bdev,
+ sector_t sector, sector_t nr_sects, gfp_t gfp_mask,
+ struct bio **biop)
+{
+ struct bio *bio = *biop;
+ int bi_size = 0;
+ unsigned int sz;
+
+ while (nr_sects != 0) {
+ bio = next_bio(bio, __blkdev_sectors_to_bio_pages(nr_sects),
+ gfp_mask);
+ bio->bi_iter.bi_sector = sector;
+ bio_set_dev(bio, bdev);
+ bio_set_op_attrs(bio, REQ_OP_WRITE, 0);
+
+ while (nr_sects != 0) {
+ sz = min((sector_t) PAGE_SIZE, nr_sects << 9);
+ bi_size = bio_add_page(bio, ZERO_PAGE(0), sz, 0);
+ nr_sects -= bi_size >> 9;
+ sector += bi_size >> 9;
+ if (bi_size < sz)
+ break;
+ }
+ cond_resched();
+ }
+
+ *biop = bio;
+}
+
/**
* __blkdev_issue_zeroout - generate number of zero filed write bios
* @bdev: blockdev to issue
@@ -304,9 +333,6 @@ int __blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
unsigned flags)
{
int ret;
- int bi_size = 0;
- struct bio *bio = *biop;
- unsigned int sz;
sector_t bs_mask;
bs_mask = (bdev_logical_block_size(bdev) >> 9) - 1;
@@ -316,30 +342,10 @@ int __blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
ret = __blkdev_issue_write_zeroes(bdev, sector, nr_sects, gfp_mask,
biop, flags);
if (ret != -EOPNOTSUPP || (flags & BLKDEV_ZERO_NOFALLBACK))
- goto out;
-
- ret = 0;
- while (nr_sects != 0) {
- bio = next_bio(bio, __blkdev_sectors_to_bio_pages(nr_sects),
- gfp_mask);
- bio->bi_iter.bi_sector = sector;
- bio_set_dev(bio, bdev);
- bio_set_op_attrs(bio, REQ_OP_WRITE, 0);
-
- while (nr_sects != 0) {
- sz = min((sector_t) PAGE_SIZE, nr_sects << 9);
- bi_size = bio_add_page(bio, ZERO_PAGE(0), sz, 0);
- nr_sects -= bi_size >> 9;
- sector += bi_size >> 9;
- if (bi_size < sz)
- break;
- }
- cond_resched();
- }
+ return ret;
- *biop = bio;
-out:
- return ret;
+ __blkdev_issue_zero_pages(bdev, sector, nr_sects, gfp_mask, biop);
+ return 0;
}
EXPORT_SYMBOL(__blkdev_issue_zeroout);
--
2.4.3
^ permalink raw reply related [flat|nested] 7+ messages in thread
* [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
2017-09-21 17:12 [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
@ 2017-09-21 17:12 ` Ilya Dryomov
2017-10-03 8:04 ` Christoph Hellwig
2017-10-02 6:25 ` [PATCH 0/2] " Ilya Dryomov
2 siblings, 1 reply; 7+ messages in thread
From: Ilya Dryomov @ 2017-09-21 17:12 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Martin K. Petersen, Hannes Reinecke, Jens Axboe, linux-block
sd_config_write_same() ignores ->max_ws_blocks == 0 and resets it to
permit trying WRITE SAME on older SCSI devices, unless ->no_write_same
is set. Because REQ_OP_WRITE_ZEROES is implemented in terms of WRITE
SAME, blkdev_issue_zeroout() may fail with -EREMOTEIO:
$ fallocate -zn -l 1k /dev/sdg
fallocate: fallocate failed: Remote I/O error
$ fallocate -zn -l 1k /dev/sdg # OK
$ fallocate -zn -l 1k /dev/sdg # OK
The following calls succeed because sd_done() sets ->no_write_same in
response to a sense that would become BLK_STS_TARGET/-EREMOTEIO, causing
__blkdev_issue_zeroout() to fall back to generating ZERO_PAGE bios.
This means blkdev_issue_zeroout() must cope with WRITE ZEROES failing
and fall back to manually zeroing, unless BLKDEV_ZERO_NOFALLBACK is
specified. For BLKDEV_ZERO_NOFALLBACK case, return -EOPNOTSUPP if
sd_done() has just set ->no_write_same thus indicating lack of offload
support.
Fixes: c20cfc27a473 ("block: stop using blkdev_issue_write_same for zeroing")
Cc: Christoph Hellwig <hch@lst.de>
Cc: "Martin K. Petersen" <martin.petersen@oracle.com>
Cc: Hannes Reinecke <hare@suse.com>
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
---
block/blk-lib.c | 27 +++++++++++++++++++++------
1 file changed, 21 insertions(+), 6 deletions(-)
diff --git a/block/blk-lib.c b/block/blk-lib.c
index 6b97feb71065..1cb402beb983 100644
--- a/block/blk-lib.c
+++ b/block/blk-lib.c
@@ -316,12 +316,6 @@ static void __blkdev_issue_zero_pages(struct block_device *bdev,
* Zero-fill a block range, either using hardware offload or by explicitly
* writing zeroes to the device.
*
- * Note that this function may fail with -EOPNOTSUPP if the driver signals
- * zeroing offload support, but the device fails to process the command (for
- * some devices there is no non-destructive way to verify whether this
- * operation is actually supported). In this case the caller should call
- * retry the call to blkdev_issue_zeroout() and the fallback path will be used.
- *
* If a device is using logical block provisioning, the underlying space will
* not be released if %flags contains BLKDEV_ZERO_NOUNMAP.
*
@@ -374,6 +368,27 @@ int blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
&bio, flags);
if (ret == 0 && bio) {
ret = submit_bio_wait(bio);
+ /*
+ * Fall back to a manual zeroout on any error, if allowed.
+ *
+ * Particularly, WRITE ZEROES may fail with -EREMOTEIO if the
+ * driver signals zeroing offload support, but the device
+ * fails to process the command (for some devices there is no
+ * non-destructive way to verify whether this operation is
+ * actually supported).
+ */
+ if (ret && bio_op(bio) == REQ_OP_WRITE_ZEROES) {
+ if (flags & BLKDEV_ZERO_NOFALLBACK) {
+ if (!bdev_write_zeroes_sectors(bdev))
+ ret = -EOPNOTSUPP;
+ } else {
+ bio_put(bio);
+ bio = NULL;
+ __blkdev_issue_zero_pages(bdev, sector,
+ nr_sects, gfp_mask, &bio);
+ ret = submit_bio_wait(bio);
+ }
+ }
bio_put(bio);
}
blk_finish_plug(&plug);
--
2.4.3
^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
2017-09-21 17:12 [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
2017-09-21 17:12 ` [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
@ 2017-10-02 6:25 ` Ilya Dryomov
2 siblings, 0 replies; 7+ messages in thread
From: Ilya Dryomov @ 2017-10-02 6:25 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Martin K. Petersen, Hannes Reinecke, Jens Axboe, linux-block,
Christoph Hellwig
On Thu, Sep 21, 2017 at 7:12 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
> Hi Christoph, Martin,
>
> blkdev_issue_zeroout() now checks for any error. This required a minor
> refactor, so I dropped the stable tag, Jens can add it back if needed.
>
> Previous patch and discussion at
>
> https://marc.info/?l=linux-block&m=150471953327942&w=2
>
> Thanks,
>
> Ilya
>
>
> Ilya Dryomov (2):
> block: factor out __blkdev_issue_zero_pages()
> block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
>
> block/blk-lib.c | 85 +++++++++++++++++++++++++++++++++++----------------------
> 1 file changed, 53 insertions(+), 32 deletions(-)
Ping... Christoph, could you please take a look?
Thanks,
Ilya
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 1/2] block: factor out __blkdev_issue_zero_pages()
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
@ 2017-10-03 7:56 ` Christoph Hellwig
0 siblings, 0 replies; 7+ messages in thread
From: Christoph Hellwig @ 2017-10-03 7:56 UTC (permalink / raw)
To: Ilya Dryomov
Cc: Christoph Hellwig, Martin K. Petersen, Hannes Reinecke,
Jens Axboe, linux-block
Looks fine,
Reviewed-by: Christoph Hellwig <hch@lst.de>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
2017-09-21 17:12 ` [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
@ 2017-10-03 8:04 ` Christoph Hellwig
2017-10-04 14:56 ` Ilya Dryomov
0 siblings, 1 reply; 7+ messages in thread
From: Christoph Hellwig @ 2017-10-03 8:04 UTC (permalink / raw)
To: Ilya Dryomov
Cc: Christoph Hellwig, Martin K. Petersen, Hannes Reinecke,
Jens Axboe, linux-block
On Thu, Sep 21, 2017 at 07:12:52PM +0200, Ilya Dryomov wrote:
> sd_config_write_same() ignores ->max_ws_blocks == 0 and resets it to
> permit trying WRITE SAME on older SCSI devices, unless ->no_write_same
> is set. Because REQ_OP_WRITE_ZEROES is implemented in terms of WRITE
> SAME, blkdev_issue_zeroout() may fail with -EREMOTEIO:
>
> $ fallocate -zn -l 1k /dev/sdg
> fallocate: fallocate failed: Remote I/O error
> $ fallocate -zn -l 1k /dev/sdg # OK
> $ fallocate -zn -l 1k /dev/sdg # OK
>
> The following calls succeed because sd_done() sets ->no_write_same in
> response to a sense that would become BLK_STS_TARGET/-EREMOTEIO, causing
> __blkdev_issue_zeroout() to fall back to generating ZERO_PAGE bios.
>
> This means blkdev_issue_zeroout() must cope with WRITE ZEROES failing
> and fall back to manually zeroing, unless BLKDEV_ZERO_NOFALLBACK is
> specified. For BLKDEV_ZERO_NOFALLBACK case, return -EOPNOTSUPP if
> sd_done() has just set ->no_write_same thus indicating lack of offload
> support.
>
> Fixes: c20cfc27a473 ("block: stop using blkdev_issue_write_same for zeroing")
> Cc: Christoph Hellwig <hch@lst.de>
> Cc: "Martin K. Petersen" <martin.petersen@oracle.com>
> Cc: Hannes Reinecke <hare@suse.com>
> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
> ---
> block/blk-lib.c | 27 +++++++++++++++++++++------
> 1 file changed, 21 insertions(+), 6 deletions(-)
>
> diff --git a/block/blk-lib.c b/block/blk-lib.c
> index 6b97feb71065..1cb402beb983 100644
> --- a/block/blk-lib.c
> +++ b/block/blk-lib.c
> @@ -316,12 +316,6 @@ static void __blkdev_issue_zero_pages(struct block_device *bdev,
> * Zero-fill a block range, either using hardware offload or by explicitly
> * writing zeroes to the device.
> *
> - * Note that this function may fail with -EOPNOTSUPP if the driver signals
> - * zeroing offload support, but the device fails to process the command (for
> - * some devices there is no non-destructive way to verify whether this
> - * operation is actually supported). In this case the caller should call
> - * retry the call to blkdev_issue_zeroout() and the fallback path will be used.
> - *
> * If a device is using logical block provisioning, the underlying space will
> * not be released if %flags contains BLKDEV_ZERO_NOUNMAP.
> *
> @@ -374,6 +368,27 @@ int blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
> &bio, flags);
> if (ret == 0 && bio) {
> ret = submit_bio_wait(bio);
> + /*
> + * Fall back to a manual zeroout on any error, if allowed.
> + *
> + * Particularly, WRITE ZEROES may fail with -EREMOTEIO if the
> + * driver signals zeroing offload support, but the device
> + * fails to process the command (for some devices there is no
> + * non-destructive way to verify whether this operation is
> + * actually supported).
> + */
> + if (ret && bio_op(bio) == REQ_OP_WRITE_ZEROES) {
No need for the additional levels of indentation here. Also I
really do not like the logic, we shouldn't have to duplicate much
of the logic multiple times.
I'd more go for something like (sketched in mail):
bool try_write_zeroes = !!bdev_write_zeroes_sectors(bdev);
retry:
bio = NULL;
blk_start_plug(&plug);
if (try_write_zeroes)
ret = __blkdev_issue_write_zeroes(...)
else
ret = __blkdev_issue_zero_pages(...)
if (ret == 0 && bio) {
ret = submit_bio_wait(bio);
bio_put(bio);
}
blk_finish_plug(&plug);
if (ret && try_write_zeroes) {
try_write_zeroes = false;
goto retry;
}
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout()
2017-10-03 8:04 ` Christoph Hellwig
@ 2017-10-04 14:56 ` Ilya Dryomov
0 siblings, 0 replies; 7+ messages in thread
From: Ilya Dryomov @ 2017-10-04 14:56 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Martin K. Petersen, Hannes Reinecke, Jens Axboe, linux-block
On Tue, Oct 3, 2017 at 10:04 AM, Christoph Hellwig <hch@infradead.org> wrote:
> On Thu, Sep 21, 2017 at 07:12:52PM +0200, Ilya Dryomov wrote:
>> sd_config_write_same() ignores ->max_ws_blocks == 0 and resets it to
>> permit trying WRITE SAME on older SCSI devices, unless ->no_write_same
>> is set. Because REQ_OP_WRITE_ZEROES is implemented in terms of WRITE
>> SAME, blkdev_issue_zeroout() may fail with -EREMOTEIO:
>>
>> $ fallocate -zn -l 1k /dev/sdg
>> fallocate: fallocate failed: Remote I/O error
>> $ fallocate -zn -l 1k /dev/sdg # OK
>> $ fallocate -zn -l 1k /dev/sdg # OK
>>
>> The following calls succeed because sd_done() sets ->no_write_same in
>> response to a sense that would become BLK_STS_TARGET/-EREMOTEIO, causing
>> __blkdev_issue_zeroout() to fall back to generating ZERO_PAGE bios.
>>
>> This means blkdev_issue_zeroout() must cope with WRITE ZEROES failing
>> and fall back to manually zeroing, unless BLKDEV_ZERO_NOFALLBACK is
>> specified. For BLKDEV_ZERO_NOFALLBACK case, return -EOPNOTSUPP if
>> sd_done() has just set ->no_write_same thus indicating lack of offload
>> support.
>>
>> Fixes: c20cfc27a473 ("block: stop using blkdev_issue_write_same for zeroing")
>> Cc: Christoph Hellwig <hch@lst.de>
>> Cc: "Martin K. Petersen" <martin.petersen@oracle.com>
>> Cc: Hannes Reinecke <hare@suse.com>
>> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
>> ---
>> block/blk-lib.c | 27 +++++++++++++++++++++------
>> 1 file changed, 21 insertions(+), 6 deletions(-)
>>
>> diff --git a/block/blk-lib.c b/block/blk-lib.c
>> index 6b97feb71065..1cb402beb983 100644
>> --- a/block/blk-lib.c
>> +++ b/block/blk-lib.c
>> @@ -316,12 +316,6 @@ static void __blkdev_issue_zero_pages(struct block_device *bdev,
>> * Zero-fill a block range, either using hardware offload or by explicitly
>> * writing zeroes to the device.
>> *
>> - * Note that this function may fail with -EOPNOTSUPP if the driver signals
>> - * zeroing offload support, but the device fails to process the command (for
>> - * some devices there is no non-destructive way to verify whether this
>> - * operation is actually supported). In this case the caller should call
>> - * retry the call to blkdev_issue_zeroout() and the fallback path will be used.
>> - *
>> * If a device is using logical block provisioning, the underlying space will
>> * not be released if %flags contains BLKDEV_ZERO_NOUNMAP.
>> *
>> @@ -374,6 +368,27 @@ int blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
>> &bio, flags);
>> if (ret == 0 && bio) {
>> ret = submit_bio_wait(bio);
>> + /*
>> + * Fall back to a manual zeroout on any error, if allowed.
>> + *
>> + * Particularly, WRITE ZEROES may fail with -EREMOTEIO if the
>> + * driver signals zeroing offload support, but the device
>> + * fails to process the command (for some devices there is no
>> + * non-destructive way to verify whether this operation is
>> + * actually supported).
>> + */
>> + if (ret && bio_op(bio) == REQ_OP_WRITE_ZEROES) {
>
> No need for the additional levels of indentation here. Also I
> really do not like the logic, we shouldn't have to duplicate much
> of the logic multiple times.
>
> I'd more go for something like (sketched in mail):
>
> bool try_write_zeroes = !!bdev_write_zeroes_sectors(bdev);
>
> retry:
> bio = NULL;
> blk_start_plug(&plug);
> if (try_write_zeroes)
> ret = __blkdev_issue_write_zeroes(...)
> else
> ret = __blkdev_issue_zero_pages(...)
> if (ret == 0 && bio) {
> ret = submit_bio_wait(bio);
> bio_put(bio);
> }
> blk_finish_plug(&plug);
> if (ret && try_write_zeroes) {
> try_write_zeroes = false;
> goto retry;
> }
Yeah, I didn't like the code flow either but we are going to duplicate
some of it either way. In particular, !bdev_write_zeroes_sectors() ->
ret = -EOPNOTSUPP part is still needed to avoid propagating -EREMOTEIO
in BLKDEV_ZERO_NOFALLBACK case:
if (try_write_zeroes)
ret = __blkdev_issue_write_zeroes(...);
else if (!(flags & BLKDEV_ZERO_NOFALLBACK))
ret = __blkdev_issue_zero_pages(...);
else if (!bdev_write_zeroes_sectors(bdev))
ret = -EOPNOTSUPP;
bs_mask check from __blkdev_issue_zeroout() too.
I'll post v2 in a few.
Thanks,
Ilya
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2017-10-04 14:56 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-09-21 17:12 [PATCH 0/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-09-21 17:12 ` [PATCH 1/2] block: factor out __blkdev_issue_zero_pages() Ilya Dryomov
2017-10-03 7:56 ` Christoph Hellwig
2017-09-21 17:12 ` [PATCH 2/2] block: cope with WRITE ZEROES failing in blkdev_issue_zeroout() Ilya Dryomov
2017-10-03 8:04 ` Christoph Hellwig
2017-10-04 14:56 ` Ilya Dryomov
2017-10-02 6:25 ` [PATCH 0/2] " Ilya Dryomov
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).