From: Kent Overstreet <kent.overstreet@gmail.com>
To: Ming Lei <ming.lei@canonical.com>
Cc: Jens Axboe <axboe@fb.com>,
linux-kernel@vger.kernel.org, linux-block@vger.kernel.org,
Christoph Hellwig <hch@infradead.org>,
Eric Wheeler <bcache@lists.ewheeler.net>,
Sebastian Roesner <sroesner-kernelorg@roesner-online.de>,
"4.3+" <stable@vger.kernel.org>, Shaohua Li <shli@fb.com>
Subject: Re: [PATCH v1] block: make sure big bio is splitted into at most 256 bvecs
Date: Tue, 5 Apr 2016 19:46:28 -0800 [thread overview]
Message-ID: <20160406034628.GA25428@kmo-pixel> (raw)
In-Reply-To: <1459914212-9330-1-git-send-email-ming.lei@canonical.com>
On Wed, Apr 06, 2016 at 11:43:32AM +0800, Ming Lei wrote:
> After arbitrary bio size is supported, the incoming bio may
> be very big. We have to split the bio into small bios so that
> each holds at most BIO_MAX_PAGES bvecs for safety reason, such
> as bio_clone().
>
> This patch fixes the following kernel crash:
>
> > [ 172.660142] BUG: unable to handle kernel NULL pointer dereference at 0000000000000028
> > [ 172.660229] IP: [<ffffffff811e53b4>] bio_trim+0xf/0x2a
> > [ 172.660289] PGD 7faf3e067 PUD 7f9279067 PMD 0
> > [ 172.660399] Oops: 0000 [#1] SMP
> > [...]
> > [ 172.664780] Call Trace:
> > [ 172.664813] [<ffffffffa007f3be>] ? raid1_make_request+0x2e8/0xad7 [raid1]
> > [ 172.664846] [<ffffffff811f07da>] ? blk_queue_split+0x377/0x3d4
> > [ 172.664880] [<ffffffffa005fb5f>] ? md_make_request+0xf6/0x1e9 [md_mod]
> > [ 172.664912] [<ffffffff811eb860>] ? generic_make_request+0xb5/0x155
> > [ 172.664947] [<ffffffffa0445c89>] ? prio_io+0x85/0x95 [bcache]
> > [ 172.664981] [<ffffffffa0448252>] ? register_cache_set+0x355/0x8d0 [bcache]
> > [ 172.665016] [<ffffffffa04497d3>] ? register_bcache+0x1006/0x1174 [bcache]
>
> Fixes: 54efd50(block: make generic_make_request handle arbitrarily sized bios)
> Reported-by: Sebastian Roesner <sroesner-kernelorg@roesner-online.de>
> Reported-by: Eric Wheeler <bcache@lists.ewheeler.net>
> Cc: stable@vger.kernel.org (4.3+)
> Cc: Shaohua Li <shli@fb.com>
> Cc: Kent Overstreet <kent.overstreet@gmail.com>
> Signed-off-by: Ming Lei <ming.lei@canonical.com>
That'll work
Acked-by: Kent Overstreet <kent.overstreet@gmail.com>
> ---
> V1:
> - Kent pointed out that using max io size can't cover
> the case of non-full bvecs/pages
>
> The issue can be reproduced by the following approach:
> - create one raid1 over two virtio-blk
> - build bcache device over the above raid1 and another cache device
> and bucket size is set 2Mbytes
> - set cache mode as writeback
> - run random write over ext4 on the bcache device
> - then the crash can be triggered
>
> block/blk-merge.c | 19 +++++++++++++++++++
> 1 file changed, 19 insertions(+)
>
> diff --git a/block/blk-merge.c b/block/blk-merge.c
> index 2613531..7b96471 100644
> --- a/block/blk-merge.c
> +++ b/block/blk-merge.c
> @@ -94,8 +94,10 @@ static struct bio *blk_bio_segment_split(struct request_queue *q,
> bool do_split = true;
> struct bio *new = NULL;
> const unsigned max_sectors = get_max_io_size(q, bio);
> + unsigned bvecs = 0;
>
> bio_for_each_segment(bv, bio, iter) {
> + bvecs++;
> /*
> * If the queue doesn't support SG gaps and adding this
> * offset would create a gap, disallow it.
> @@ -103,6 +105,23 @@ static struct bio *blk_bio_segment_split(struct request_queue *q,
> if (bvprvp && bvec_gap_to_prev(q, bvprvp, bv.bv_offset))
> goto split;
>
> + /*
> + * With arbitrary bio size, the incoming bio may be very
> + * big. We have to split the bio into small bios so that
> + * each holds at most BIO_MAX_PAGES bvecs because
> + * bio_clone() can fail to allocate big bvecs.
> + *
> + * It should have been better to apply the limit per
> + * request queue in which bio_clone() is involved,
> + * instead of globally. The biggest blocker is
> + * bio_clone() in bio bounce.
> + *
> + * TODO: deal with bio bounce's bio_clone() gracefully
> + * and convert the global limit into per-queue limit.
> + */
> + if (bvecs >= BIO_MAX_PAGES)
> + goto split;
> +
> if (sectors + (bv.bv_len >> 9) > max_sectors) {
> /*
> * Consider this a new segment if we're splitting in
> --
> 1.9.1
>
next prev parent reply other threads:[~2016-04-06 3:46 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-04-06 3:43 [PATCH v1] block: make sure big bio is splitted into at most 256 bvecs Ming Lei
2016-04-06 3:46 ` Kent Overstreet [this message]
2016-08-11 6:33 ` [PATCH v2] " Eric Wheeler
2016-08-11 14:02 ` Christoph Hellwig
2016-08-12 11:12 ` Ming Lei
2016-08-12 16:36 ` Kent Overstreet
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160406034628.GA25428@kmo-pixel \
--to=kent.overstreet@gmail.com \
--cc=axboe@fb.com \
--cc=bcache@lists.ewheeler.net \
--cc=hch@infradead.org \
--cc=linux-block@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=ming.lei@canonical.com \
--cc=shli@fb.com \
--cc=sroesner-kernelorg@roesner-online.de \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.