* [PATCH --resend 0/2] fix writesame @ 2017-02-13 19:55 Shaohua Li 2017-02-13 19:55 ` [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 Shaohua Li 2017-02-13 19:55 ` [PATCH --resend 2/2] md/multipath: disable WRITE SAME if it fails for multipath Shaohua Li 0 siblings, 2 replies; 5+ messages in thread From: Shaohua Li @ 2017-02-13 19:55 UTC (permalink / raw) To: linux-raid; +Cc: neilb This is the writesame issue fix reported in bugzilla 118581. Sitsofe doesn't reply to me if he tested these, but they do work for me with a scsi_debug test. I'm going to apply them for 4.11 if no objection. Shaohua Li (2): md: disable WRITE SAME if it fails for linear/raid0 md/multipath: disable WRITE SAME if it fails for multipath drivers/md/linear.c | 6 +++++- drivers/md/md.c | 45 +++++++++++++++++++++++++++++++++++++++++++++ drivers/md/md.h | 2 ++ drivers/md/multipath.c | 4 ++++ drivers/md/raid0.c | 6 +++++- 5 files changed, 61 insertions(+), 2 deletions(-) -- 2.9.3 ^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 2017-02-13 19:55 [PATCH --resend 0/2] fix writesame Shaohua Li @ 2017-02-13 19:55 ` Shaohua Li 2017-02-13 23:42 ` NeilBrown 2017-02-13 19:55 ` [PATCH --resend 2/2] md/multipath: disable WRITE SAME if it fails for multipath Shaohua Li 1 sibling, 1 reply; 5+ messages in thread From: Shaohua Li @ 2017-02-13 19:55 UTC (permalink / raw) To: linux-raid; +Cc: neilb This makes md do the same thing as dm for write same IO failure. Please see 7eee4ae(dm: disable WRITE SAME if it fails) for details why we need this. Also reported here: https://bugzilla.kernel.org/show_bug.cgi?id=118581 Signed-off-by: Shaohua Li <shli@fb.com> --- drivers/md/linear.c | 6 +++++- drivers/md/md.c | 45 +++++++++++++++++++++++++++++++++++++++++++++ drivers/md/md.h | 2 ++ drivers/md/raid0.c | 6 +++++- 4 files changed, 57 insertions(+), 2 deletions(-) diff --git a/drivers/md/linear.c b/drivers/md/linear.c index 26a73b2..bebc834 100644 --- a/drivers/md/linear.c +++ b/drivers/md/linear.c @@ -291,7 +291,11 @@ static void linear_make_request(struct mddev *mddev, struct bio *bio) trace_block_bio_remap(bdev_get_queue(split->bi_bdev), split, disk_devt(mddev->gendisk), bio_sector); - generic_make_request(split); + if (bio_op(split) == REQ_OP_WRITE_SAME) + generic_make_request(md_writesame_setup(mddev, + split)); + else + generic_make_request(split); } } while (split != bio); return; diff --git a/drivers/md/md.c b/drivers/md/md.c index 13020e5..7354f0b 100644 --- a/drivers/md/md.c +++ b/drivers/md/md.c @@ -312,6 +312,51 @@ static blk_qc_t md_make_request(struct request_queue *q, struct bio *bio) return BLK_QC_T_NONE; } +struct md_writesame_data { + struct bio *orig_bio; + struct mddev *mddev; + struct bio cloned_bio; +}; + +static void md_writesame_endio(struct bio *bio) +{ + struct md_writesame_data *data = bio->bi_private; + + if (bio->bi_error == -EREMOTEIO && + !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) + data->mddev->queue->limits.max_write_same_sectors = 0; + + data->orig_bio->bi_error = bio->bi_error; + bio_endio(data->orig_bio); + + kfree(data); +} + +struct bio *md_writesame_setup(struct mddev *mddev, struct bio *bio) +{ + struct md_writesame_data *data; + struct bio *cloned_bio; + + /* + * this failure means we ignore a chance to handle writesame failure, + * which isn't critcal, we can handle the failure if new writesame IO + * comes + */ + data = kmalloc(sizeof(*data), GFP_NOIO | __GFP_NORETRY); + if (!data) + return bio; + cloned_bio = &data->cloned_bio; + data->mddev = mddev; + data->orig_bio = bio; + bio_init(cloned_bio, NULL, 0); + __bio_clone_fast(cloned_bio, bio); + + cloned_bio->bi_private = data; + cloned_bio->bi_end_io = md_writesame_endio; + return cloned_bio; +} +EXPORT_SYMBOL_GPL(md_writesame_setup); + /* mddev_suspend makes sure no new requests are submitted * to the device, and that any requests that have been submitted * are completely handled. diff --git a/drivers/md/md.h b/drivers/md/md.h index 2a51403..5c983e9 100644 --- a/drivers/md/md.h +++ b/drivers/md/md.h @@ -710,4 +710,6 @@ static inline void mddev_clear_unsupported_flags(struct mddev *mddev, { mddev->flags &= ~unsupported_flags; } + +extern struct bio *md_writesame_setup(struct mddev *mddev, struct bio *bio); #endif /* _MD_MD_H */ diff --git a/drivers/md/raid0.c b/drivers/md/raid0.c index 848365d..e38636e 100644 --- a/drivers/md/raid0.c +++ b/drivers/md/raid0.c @@ -503,7 +503,11 @@ static void raid0_make_request(struct mddev *mddev, struct bio *bio) trace_block_bio_remap(bdev_get_queue(split->bi_bdev), split, disk_devt(mddev->gendisk), bio_sector); - generic_make_request(split); + if (bio_op(split) == REQ_OP_WRITE_SAME) + generic_make_request(md_writesame_setup(mddev, + split)); + else + generic_make_request(split); } } while (split != bio); } -- 2.9.3 ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 2017-02-13 19:55 ` [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 Shaohua Li @ 2017-02-13 23:42 ` NeilBrown 2017-02-14 0:04 ` Shaohua Li 0 siblings, 1 reply; 5+ messages in thread From: NeilBrown @ 2017-02-13 23:42 UTC (permalink / raw) To: Shaohua Li; +Cc: linux-raid [-- Attachment #1: Type: text/plain, Size: 2568 bytes --] On Mon, Feb 13 2017, Shaohua Li wrote: > This makes md do the same thing as dm for write same IO failure. Please > see 7eee4ae(dm: disable WRITE SAME if it fails) for details why we need > this. > > Also reported here: https://bugzilla.kernel.org/show_bug.cgi?id=118581 > > Signed-off-by: Shaohua Li <shli@fb.com> > --- > drivers/md/linear.c | 6 +++++- > drivers/md/md.c | 45 +++++++++++++++++++++++++++++++++++++++++++++ > drivers/md/md.h | 2 ++ > drivers/md/raid0.c | 6 +++++- > 4 files changed, 57 insertions(+), 2 deletions(-) > > diff --git a/drivers/md/linear.c b/drivers/md/linear.c > index 26a73b2..bebc834 100644 > --- a/drivers/md/linear.c > +++ b/drivers/md/linear.c > @@ -291,7 +291,11 @@ static void linear_make_request(struct mddev *mddev, struct bio *bio) > trace_block_bio_remap(bdev_get_queue(split->bi_bdev), > split, disk_devt(mddev->gendisk), > bio_sector); > - generic_make_request(split); > + if (bio_op(split) == REQ_OP_WRITE_SAME) > + generic_make_request(md_writesame_setup(mddev, > + split)); > + else > + generic_make_request(split); > } > } while (split != bio); > return; > diff --git a/drivers/md/md.c b/drivers/md/md.c > index 13020e5..7354f0b 100644 > --- a/drivers/md/md.c > +++ b/drivers/md/md.c > @@ -312,6 +312,51 @@ static blk_qc_t md_make_request(struct request_queue *q, struct bio *bio) > return BLK_QC_T_NONE; > } > > +struct md_writesame_data { > + struct bio *orig_bio; > + struct mddev *mddev; > + struct bio cloned_bio; > +}; > + > +static void md_writesame_endio(struct bio *bio) > +{ > + struct md_writesame_data *data = bio->bi_private; > + > + if (bio->bi_error == -EREMOTEIO && > + !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) > + data->mddev->queue->limits.max_write_same_sectors = 0; What would be *really* nice is if a block device could send a reconfigure message to its 'holder' (bd_holder). This could include device size changes and, for this, changes to max_write_same_sectors. There are probably other changes that can usefully be propagated. But for this patch, wouldn't it be easier, and maybe more efficient, to do if (bio_op(split) == REQ_OP_WRITE_SAME && !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) mddev->queue->limits.max_write_same_sectors = 0; generic_make_request(split); ??? If there is some reason that can't work, then the patch as it stands look OK to me. Thanks, NeilBrown [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 832 bytes --] ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 2017-02-13 23:42 ` NeilBrown @ 2017-02-14 0:04 ` Shaohua Li 0 siblings, 0 replies; 5+ messages in thread From: Shaohua Li @ 2017-02-14 0:04 UTC (permalink / raw) To: NeilBrown; +Cc: Shaohua Li, linux-raid On Tue, Feb 14, 2017 at 10:42:32AM +1100, Neil Brown wrote: > On Mon, Feb 13 2017, Shaohua Li wrote: > > > This makes md do the same thing as dm for write same IO failure. Please > > see 7eee4ae(dm: disable WRITE SAME if it fails) for details why we need > > this. > > > > Also reported here: https://bugzilla.kernel.org/show_bug.cgi?id=118581 > > > > Signed-off-by: Shaohua Li <shli@fb.com> > > --- > > drivers/md/linear.c | 6 +++++- > > drivers/md/md.c | 45 +++++++++++++++++++++++++++++++++++++++++++++ > > drivers/md/md.h | 2 ++ > > drivers/md/raid0.c | 6 +++++- > > 4 files changed, 57 insertions(+), 2 deletions(-) > > > > diff --git a/drivers/md/linear.c b/drivers/md/linear.c > > index 26a73b2..bebc834 100644 > > --- a/drivers/md/linear.c > > +++ b/drivers/md/linear.c > > @@ -291,7 +291,11 @@ static void linear_make_request(struct mddev *mddev, struct bio *bio) > > trace_block_bio_remap(bdev_get_queue(split->bi_bdev), > > split, disk_devt(mddev->gendisk), > > bio_sector); > > - generic_make_request(split); > > + if (bio_op(split) == REQ_OP_WRITE_SAME) > > + generic_make_request(md_writesame_setup(mddev, > > + split)); > > + else > > + generic_make_request(split); > > } > > } while (split != bio); > > return; > > diff --git a/drivers/md/md.c b/drivers/md/md.c > > index 13020e5..7354f0b 100644 > > --- a/drivers/md/md.c > > +++ b/drivers/md/md.c > > @@ -312,6 +312,51 @@ static blk_qc_t md_make_request(struct request_queue *q, struct bio *bio) > > return BLK_QC_T_NONE; > > } > > > > +struct md_writesame_data { > > + struct bio *orig_bio; > > + struct mddev *mddev; > > + struct bio cloned_bio; > > +}; > > + > > +static void md_writesame_endio(struct bio *bio) > > +{ > > + struct md_writesame_data *data = bio->bi_private; > > + > > + if (bio->bi_error == -EREMOTEIO && > > + !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) > > + data->mddev->queue->limits.max_write_same_sectors = 0; > > What would be *really* nice is if a block device could send a > reconfigure message to its 'holder' (bd_holder). This could include > device size changes and, for this, changes to max_write_same_sectors. > There are probably other changes that can usefully be propagated. > > But for this patch, wouldn't it be easier, and maybe more efficient, to > do > > if (bio_op(split) == REQ_OP_WRITE_SAME && > !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) > mddev->queue->limits.max_write_same_sectors = 0; > generic_make_request(split); > > ??? > If there is some reason that can't work, then the patch as it stands > look OK to me. So we don't disable writesame in the first IO error and do it until a new writesame comes? Good idea and much simpler! Let me check if it works. Thanks, Shaohua ^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH --resend 2/2] md/multipath: disable WRITE SAME if it fails for multipath 2017-02-13 19:55 [PATCH --resend 0/2] fix writesame Shaohua Li 2017-02-13 19:55 ` [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 Shaohua Li @ 2017-02-13 19:55 ` Shaohua Li 1 sibling, 0 replies; 5+ messages in thread From: Shaohua Li @ 2017-02-13 19:55 UTC (permalink / raw) To: linux-raid; +Cc: neilb This is the part for multipath. Since multipatch already attaches private data into original bio, we just disable write same there. Signed-off-by: Shaohua Li <shli@fb.com> --- drivers/md/multipath.c | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/drivers/md/multipath.c b/drivers/md/multipath.c index aa8c4e5c..5ee9579 100644 --- a/drivers/md/multipath.c +++ b/drivers/md/multipath.c @@ -89,6 +89,10 @@ static void multipath_end_request(struct bio *bio) struct mpconf *conf = mp_bh->mddev->private; struct md_rdev *rdev = conf->multipaths[mp_bh->path].rdev; + if (bio_op(bio) == REQ_OP_WRITE_SAME && bio->bi_error == -EREMOTEIO && + !bdev_get_queue(bio->bi_bdev)->limits.max_write_same_sectors) + mp_bh->mddev->queue->limits.max_write_same_sectors = 0; + if (!bio->bi_error) multipath_end_bh_io(mp_bh, 0); else if (!(bio->bi_opf & REQ_RAHEAD)) { -- 2.9.3 ^ permalink raw reply related [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-02-14 0:04 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2017-02-13 19:55 [PATCH --resend 0/2] fix writesame Shaohua Li 2017-02-13 19:55 ` [PATCH --resend 1/2] md: disable WRITE SAME if it fails for linear/raid0 Shaohua Li 2017-02-13 23:42 ` NeilBrown 2017-02-14 0:04 ` Shaohua Li 2017-02-13 19:55 ` [PATCH --resend 2/2] md/multipath: disable WRITE SAME if it fails for multipath Shaohua Li
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).